ApX 标志ApX 标志

趋近智

Grok 4.1 Fast Non-Reasoning

参数

-

上下文长度

128K

模态

Text

架构

Dense

许可证

Proprietary

发布日期

1 Jun 2025

训练数据截止日期

Aug 2025

技术规格

注意力结构

Multi-Head Attention

隐藏维度大小

-

层数

-

注意力头

-

键值头

-

激活函数

SwigLU

归一化

RMS Normalization

位置嵌入

Absolute Position Embedding

Grok 4.1 Fast Non-Reasoning

Grok 4.1 Fast Non-Reasoning is a high-throughput, multimodal large language model developed by xAI, specifically engineered for low-latency agentic workflows and real-time tool orchestration. As the speed-optimized variant of the Grok 4.1 series, this model is designed to bypass the extended chain-of-thought processing characteristic of reasoning models, delivering immediate response generation suitable for time-sensitive applications. It is trained using long-horizon reinforcement learning (RL) in simulated environments, which enhances its reliability in multi-turn tool-calling scenarios and autonomous task execution.

Technically, the model utilizes a dense transformer architecture that supports an expansive 2-million-token context window, one of the largest available in the frontier API landscape. This architecture integrates Rotary Positional Embeddings (RoPE) and SwiGLU activation functions, optimized for maintaining high retrieval accuracy and factual consistency across extremely long sequences. The model's dual-mode capability allows developers to toggle between reasoning and non-reasoning via API parameters, with the non-reasoning variant providing significantly higher tokens-per-second and a lower price point by eliminating thinking token overhead.

Primary use cases for Grok 4.1 Fast Non-Reasoning include large-scale document analysis, real-time customer support agents, and complex back-end research tasks that require processing massive datasets without the computational delay of deep deliberation. By focusing on pattern-matching efficiency and state-of-the-art tool-calling accuracy, the model serves as a robust engine for production-grade AI agents that must interact with external APIs, search live web data via the X ecosystem, and execute remote code sessions with minimal inference lag.

关于 Grok

xAI's conversational AI models with real-time knowledge access and strong performance across reasoning, coding, and language tasks. Features extended context windows, fast inference variants, and specialized coding versions. Known for direct communication style and integration with X platform. Includes reasoning variants and optimized versions for different latency requirements.


其他 Grok 模型

评估基准

排名

#104

基准分数排名

Agentic Coding

LiveBench Agentic

0.10

37

0.54

42

0.23

42

0.58

42

0.39

45

排名

排名

#104

编程排名

#94