Skip to content

Commit dfa3f9a

Browse files
Wauplinosanseviero
andauthored
Complete list of PRO-only models (huggingface#1419)
* Add more PRO models * Update docs/api-inference/supported-models.md Co-authored-by: Omar Sanseviero <[email protected]> * update --------- Co-authored-by: Omar Sanseviero <[email protected]>
1 parent c4efdd7 commit dfa3f9a

File tree

1 file changed

+4
-0
lines changed

1 file changed

+4
-0
lines changed

docs/api-inference/supported-models.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,14 +12,18 @@ You can find:
1212

1313
In addition to thousands of public models available in the Hub, PRO and Enterprise users get higher [rate limits](./rate-limits) and free access to the following models:
1414

15+
<!-- Manually maintained hard-coded list based on https://github.com/huggingface-internal/api-inference/blob/main/master-rs/custom_config.yml -->
1516

1617
| Model | Size | Supported Context Length | Use |
1718
|--------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------|--------------------------------------------------------------|
1819
| Meta Llama 3.1 Instruct | [8B](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B-Instruct), [70B](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct) | 70B: 32k tokens / 8B: 8k tokens | High quality multilingual chat model with large context length |
1920
| Meta Llama 3 Instruct | [8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct), [70B](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct) | 8k tokens | One of the best chat models |
21+
| Meta Llama Guard 3 | [8B](https://huggingface.co/meta-llama/Llama-Guard-3-8B) | 4k tokens | |
2022
| Llama 2 Chat | [7B](https://huggingface.co/meta-llama/Llama-2-7b-chat-hf), [13B](https://huggingface.co/meta-llama/Llama-2-13b-chat-hf), [70B](https://huggingface.co/meta-llama/Llama-2-70b-chat-hf) | 4k tokens | One of the best conversational models |
23+
| DeepSeek Coder v2 | [236B](https://huggingface.co/deepseek-ai/DeepSeek-Coder-V2-Instruct) | 16k tokens | A model with coding capabilities. |
2124
| Bark | [0.9B](https://huggingface.co/suno/bark) | - | Text to audio generation |
2225

26+
This list is not exhaustive and might be updated in the future.
2327

2428
## Running Private Models
2529

0 commit comments

Comments
 (0)