LLM architecture diagram
Tokenizer
Attention
Feedforward
Transformer
Embedding
Output