Supported Models

We support all models in safetensors format on the Hugging Face Hub for the following architectures.

Model      Size           HuggingFace Handle Example
---------  -------------  --------------------------
Gemma 2    2B, 9B, 27B    google/gemma-2-2b
Llama 3.1  8B, 70B, 405B  meta-llama/Llama-3.1-8B
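
Since support depends on the weights being stored as safetensors, it can help to check a repository's file listing before using a handle. The following is a minimal sketch, not part of this project's API: the helper name `has_safetensors_weights` and the sample file lists are hypothetical. In practice the listing could come from `huggingface_hub.list_repo_files(repo_id)`, assuming that package is installed and the repository is accessible; the sketch itself uses only the standard library.

```python
from typing import Iterable


def has_safetensors_weights(filenames: Iterable[str]) -> bool:
    """Return True if any file name ends with the .safetensors extension.

    Hypothetical helper for illustration; it inspects names only and
    does not download or validate the weight files themselves.
    """
    return any(name.endswith(".safetensors") for name in filenames)


# Made-up file listings, standing in for a real repository listing.
sharded_repo = [
    "config.json",
    "model-00001-of-00002.safetensors",
    "model-00002-of-00002.safetensors",
    "tokenizer.json",
]
legacy_repo = ["config.json", "pytorch_model.bin"]

print(has_safetensors_weights(sharded_repo))  # True
print(has_safetensors_weights(legacy_repo))   # False
```

A name-based check like this only confirms the file format; whether the architecture is one of those listed above still has to be confirmed separately, for example from the repository's `config.json`.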