Available snaps

This page contains a list of inference snaps and the available optimizations.

DeepSeek R1

deepseek-r1 snap deepseek-r1 code

DeepSeek R1 is a reasoning Large Language Model mainly meant for chat completions. Input and output is text-based.

This inference snap is optimized for the following hardware:

Arch

Optimization

Description

amd64

Intel GPU

Optimized for Intel integrated and discrete graphics

amd64

Intel NPU

Intel Neural Processing Unit acceleration

amd64

Intel CPU

Intel-specific CPU optimizations

amd64

NVIDIA GPU

CUDA-enabled GPU acceleration

arm64

Ampere Altra/One CPUs

Optimized for Ampere processors

Once installed, use list-engines and show-engine commands to explore the available engines.

Qwen VL

qwen-vl snap qwen-vl source

Qwen VL is a Vision Language Model which has the ability to process both visual and textual data. The input can be a combination of an image and text, with the output being text-based.

The inference snap for Qwen 2.5 VL has been optimized for the following:

Arch

Optimization

Description

amd64

Intel GPU

Optimized for Intel integrated and discrete graphics

amd64

Intel NPU

Intel Neural Processing Unit acceleration

amd64

Intel CPU

Intel-specific CPU optimizations

amd64

NVIDIA GPU

CUDA-enabled GPU acceleration

arm64

Ampere Altra/One CPUs

Optimized for Ampere processors

Once installed, use list-engines and show-engine commands to explore the available engines.