Models

All models available on LLMsChat

DeepSeek V3 (Active) DeepSeek-V3 is a Mixture-of-Experts (MoE) language model with 671 billion parameters.
DeepSeek R1 DeepSeek-R1 is a reasoning model driven by reinforcement learning (RL) that addresses problems with repetition and readability in model output.
DeepSeek-R1-Distill-Llama-70B A version of DeepSeek-R1 distilled into Llama-70B.
DeepSeek-R1-Distill-Qwen-32B The first reasoning model from DeepSeek, distilled into a 32B dense model. Outperforms o1-mini on multiple benchmarks.
Qwen/Qwen2.5-72B-Instruct The latest Qwen open model, with improved role-playing, long-text generation, and structured data understanding.
CohereForAI/c4ai-command-r-plus-08-2024 Cohere's largest language model, optimized for conversational interaction and tool use. Now with the 2024 update!