Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible experimentation and exploration.
machine-learning deep-learning transformers pytorch gpt attention-mechanisms gpt-2 position-embedding large-language-models llm llm-training hymba diffusion-llm dllm
-
Updated
Jan 26, 2026 - Python