Model Architecture: allenai/OLMo-2-1124-7B

📊 Model Parameters

Total Parameters 7,298,617,344

Context Length 4,096

Hidden Size 4096

Layers 32

Attention Heads 32

KV Heads 32

FP32 (Full) 27.19 GB

FP16 (Half) 13.59 GB

INT8 (Quantized) 6.80 GB

INT4 (Quantized) 3.40 GB

Per Token (FP16) 524.29 KB

Max Context FP32 4.00 GB

Max Context FP16 2.00 GB

Max Context INT8 1.00 GB

Vocabulary Size100,352

Hidden Size4,096

FFN Intermediate Size11,008

Number of Layers32

Attention Heads32

KV Heads32

Max Context Length4,096

Attention BiasNo

Attention Dropout0%

Tied EmbeddingsNo

Activation Functionsilu

RMSNorm Epsilon1e-06

Pad Token ID100,277

BOS Token IDNot set

EOS Token ID100257

Model Dtypefloat32

Layer Types:

Attention

MLP/FFN

Normalization

Embedding