google-t5/t5-base

📊 Model Parameters

Total Parameters 272,252,160
Context Length 2,048
Hidden Size 768
Layers 12
Attention Heads 12
KV Heads 12

💾 Memory Requirements

FP32 (Full) 1.01 GB
FP16 (Half) 519.3 MB
INT8 (Quantized) 259.6 MB
INT4 (Quantized) 129.8 MB

🔑 KV Cache (Inference)

Per Token (FP16) 36.86 KB
Max Context FP32 144.0 MB
Max Context FP16 72.0 MB
Max Context INT8 36.0 MB

⚙️ Model Configuration

Core Architecture

Vocabulary Size32,128
FFN Intermediate Size3,072
Number of Layers12
Attention Heads12

Context & Position

Max Context Length512

Attention Configuration

Tied EmbeddingsYes

Activation & Normalization

RMSNorm Epsilon1e-06
Activation Functionrelu

Dropout (Training)

Hidden Dropout10.0%

Special Tokens

BOS Token IDNot set
Pad Token ID0
EOS Token ID1

Data Type

Model DtypeNot set
Layer Types:
Attention
MLP/FFN
Normalization
Embedding