Parameters
-
Context Length
-
Modality
Text
Architecture
Dense
License
Proprietary
Release Date
28 May 2026
Knowledge Cutoff
-
Attention
Attention Structure
Multi-Head Attention
Attention Heads
-
Key-Value Heads
-
Attention Head Dimension
-
Position Embedding
Absolute Position Embedding
RoPE Theta
-
Sliding Window Attention
-
Sliding Window Size
-
Normalization
-
Activation Function
-
Dimensions
Hidden Dimension Size
-
Number of Layers
-
FFN Intermediate Size (Dense)
-
Multi-Token Prediction Heads
-
Tokenizer
Vocabulary Size
-
StepFun's high-speed specialized model released May 28, 2026, engineered for rapid real-time multilingual tasks, low-latency agent coordination, and fast conversational inference. Optimized for speed-first deployments where sub-second response times and broad language coverage are paramount, Step 3.7 Flash serves as a lightweight backbone for multilingual pipelines, real-time translation workflows, and latency-sensitive multi-agent coordination tasks.
StepFun's Step 3.7 generation focuses on high-speed, specialized inference for real-time multilingual tasks, rapid agent coordination, and low-latency conversational applications. The Flash tier is optimized for speed-first deployments where rapid response time and multilingual versatility are the primary requirements.
No evaluation benchmarks for Step 3.7 Flash available.
Overall Rank
-
Coding Rank
-
APX AI
Online