Positive TinyStories GPT

Reinforcement Learning Alignment Pipeline: Pretraining → SFT → Policy Gradient RL

Model Size: 0.81M Params | Block Size: 64 | Char-Level GPT

What You Are Seeing