ApX 标志ApX 标志

趋近智

Command R

参数

35B

上下文长度

128K

模态

Text

架构

Dense

许可证

CC-BY-NC

发布日期

11 Mar 2024

训练数据截止日期

Jun 2024

技术规格

注意力结构

Multi-Head Attention

隐藏维度大小

8192

层数

40

注意力头

64

键值头

8

激活函数

SwigLU

归一化

Layer Normalization

位置嵌入

Absolute Position Embedding

Command R

Cohere Command R is a generative language model architected specifically for high-performance enterprise workloads, with an emphasis on long-context processing and tool-augmented workflows. Built on an optimized decoder-only Transformer framework, the model utilizes Grouped Query Attention (GQA) to maintain a significant 128,000-token context window while reducing the memory overhead typically associated with large-scale attention mechanisms. It is designed to facilitate the transition from experimental prototypes to production-grade deployments by offering a balance between inference efficiency and high-fidelity output.

The model undergoes a multi-stage training process including extensive pre-training on a diverse multilingual corpus and subsequent alignment via supervised fine-tuning and preference optimization. A defining architectural feature is its native training for grounded generation, which allows the model to produce responses with precise inline citations from external document sources. This makes it particularly effective for retrieval-augmented generation (RAG) pipelines, where maintaining factual consistency and source traceability is a primary requirement. Furthermore, Command R supports sophisticated multi-step tool use, enabling it to act as an agent that can reason through complex tasks by interacting with external APIs, databases, and software tools.

Optimized for global business applications, Command R provides native support for 10 languages and is trained on 23 in total, ensuring versatility across international markets. The architecture incorporates advanced components such as Rotary Positional Embeddings (RoPE) and Layer Normalization to ensure stability and coherence when handling massive input sequences. By focusing on practical utility in tasks like document summarization, complex reasoning, and structured data analysis, Command R serves as a scalable backbone for automated enterprise systems and intelligent agentic workflows.

关于 Command


其他 Command 模型

评估基准

排名

#111

基准分数排名

Web Development

WebDev Arena

1227

68

排名

排名

#111

编程排名

#81

模型透明度

总分

B

64 / 100

GPU 要求

完整计算器

选择模型权重的量化方法

上下文大小:1024 个令牌

1k
63k
125k

所需显存:

推荐 GPU

Command R:规格和 GPU 显存要求