Typhoon-2-8B

Parameters

8B

Context length

128K

Modality

Text

Architecture

Dense

License

Apache-2.0

Release date

1 Jun 2024

Training data cutoff

-

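Technical specifications

Attention structure

Multi-Head Attention

Hidden dimension size

-

Number of layers

-

Attention heads

-

Key-value heads

-

Activation function

-

Normalization

-

Position embedding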

Absolute Position Embedding

System requirements

VRAM requirements for different quantization methods and context sizes
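
As a rough illustration of how quantization changes the memory footprint, the sketch below estimates the VRAM needed for the model weights alone, using the 8B parameter count from this page. The precision labels and bits-per-weight values are nominal assumptions, not figures published for this model. The estimate excludes the KV cache and activations, which grow with context size and cannot be computed here because the layer and head counts are not listed.

```python
# Rough weights-only VRAM estimate for an 8B-parameter model.
# Assumption: the bits-per-weight values are nominal. Real quantized formats
# carry extra metadata (scales, zero points), and inference also needs memory
# for the KV cache and activations, which this sketch does not model.

PARAMS = 8e9  # 8B parameters, as listed on this page

# Hypothetical precision labels mapped to nominal bits per weight.
BITS_PER_WEIGHT = {"fp16": 16, "int8": 8, "int4": 4}


def weight_memory_gb(params: float, bits: int) -> float:
    """Return the memory needed to hold the weights, in GB (1 GB = 1e9 bytes)."""
    return params * bits / 8 / 1e9


if __name__ == "__main__":
    for name, bits in BITS_PER_WEIGHT.items():
        print(f"{name}: {weight_memory_gb(PARAMS, bits):.1f} GB")
```

Under these assumptions the weights take about 16 GB at 16-bit, 8 GB at 8-bit, and 4 GB at 4-bit precision. Actual requirements will be higher once the KV cache for long contexts is included.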

Typhoon-2-8B

Typhoon-2-8B is an 8-billion-parameter model optimized for Thai language processing. It features an expanded context length of 128,000 tokens and supports function calling. The model is trained to handle Thai cultural nuances and specific domains such as Thai law and local administration. It is released under the Apache 2.0 license.

About Typhoon

Typhoon is a Thai language model family developed by SCB 10X. It is specifically optimized for the Thai language, addressing complexities such as the lack of word delimiters and tonal nuances. The models are trained on Thai-centric datasets including legal, cultural, and historical documents to ensure localized context and knowledge.



Evaluation benchmarks

No evaluation benchmarks are available for Typhoon-2-8B.

Rankings

Overall rank

-

Coding rank

-
