HuggingArch - Model Analyzer

Model Configuration

Model ID

Batch Size

Sequence Length

Activation Dtype

Analysis time: ms

Total

Embeddings

Transformer

FLOPs / token

Attention Type

Layers

Memory / token

Hidden Size

Data Type float16

Module Name	Class	Parameters	Category