Command R Plus: Specifications and GPU VRAM Requirements

Command R Plus

闭源

开放权重

参数

104B

上下文长度

128K

模态

Text

架构

Dense

许可证

CC-BY-NC

发布日期

4 Apr 2024

训练数据截止日期

Feb 2023

技术规格

注意力结构

Multi-Head Attention

隐藏维度大小

层数

注意力头

键值头

激活函数

归一化

位置嵌入

Absolute Position Embedding

系统要求

不同量化方法和上下文大小的显存要求

Command R Plus

Cohere Command R Plus is a large language model developed by Cohere, designed to support demanding enterprise applications. This model is optimized for conversational interactions and tasks requiring extensive context, such as advanced Retrieval Augmented Generation (RAG) and multi-step tool use. Its primary function is to enable organizations to deploy sophisticated AI capabilities in production environments by balancing computational efficiency with high accuracy. The model offers comprehensive multilingual support, having been trained and evaluated across ten key global business languages, with additional pre-training data from thirteen other languages, facilitating its applicability in diverse linguistic contexts.

From an architectural standpoint, Command R Plus utilizes an optimized transformer architecture. Following its pretraining phase, the model undergoes supervised fine-tuning and preference training processes. These alignment procedures are crucial for refining the model's behavior to meet human preferences for helpfulness and safety in its generative outputs. A notable technical characteristic is its support for an expansive context window, capable of processing up to 128,000 tokens. This significant context length is achieved through specialized design, incorporating innovations in positional encodings to manage long dependencies effectively.

In terms of operational performance and applications, Command R Plus is engineered for large-scale production workloads. It demonstrates proficiency in complex RAG workflows, delivering grounded responses with inline citations to enhance reliability and mitigate issues such as hallucination. The model's multi-step tool use capabilities enable it to automate intricate business processes, including structured data analysis and dynamic updates within systems like customer relationship management (CRM). Furthermore, it possesses the ability to perform self-correction in instances of tool failure, thereby increasing the overall success rate of automated tasks. An August 2024 update to the model introduced enhancements, resulting in approximately 50% higher throughput and 25% lower latencies while maintaining the same hardware footprint.

关于 Command

其他 Command 模型

评估基准

排名适用于本地LLM。

排名

#53

基准	分数	排名
General Knowledge MMLU	0.69	9
Refactoring Aider Refactoring	0.38	12
Coding Aider Coding	0.38	17
Agentic Coding LiveBench Agentic	0.02	18
Data Analysis LiveBench Data Analysis	0.49	25
Coding LiveBench Coding	0.27	27
Reasoning LiveBench Reasoning	0.22	29
Mathematics LiveBench Mathematics	0.23	32

排名

#53

编程排名

#41

GPU 要求

完整计算器

量化

选择模型权重的量化方法

上下文大小：1024 个令牌

63k

125k

所需显存:

资源

官方文档发布说明下载权重

Command R Plus

技术规格

系统要求

Command R Plus

关于 Command

其他 Command 模型

评估基准

排名

GPU 要求

所需显存:

推荐 GPU

资源