Models

All models available on LLMsChat

Active

DeepSeek V3 DeepSeek-V3 is a Mixture-of-Experts (MoE) language model with 671 billion parameters.

DeepSeek R1 DeepSeek-R1 is a reasoning model driven by reinforcement learning (RL) that addresses repetition and readability issues in the model.

DeepSeek-R1-Distill-Llama-70B The Llama-70B distilled version of DeepSeek-R1.

DeepSeek-R1-Distill-Qwen-32B The first reasoning model from DeepSeek, distilled into a 32B dense model. Outperforms o1-mini on multiple benchmarks.

Qwen/Qwen2.5-72B-Instruct The latest Qwen open model with improved role-playing, long-text generation, and structured-data understanding.

CohereForAI/c4ai-command-r-plus-08-2024 Cohere's largest language model, optimized for conversational interaction and tool use. Now with the 2024 update!