🚀 AI / ML Learner focusing on Large Language Models (LLMs).
I focus on the full lifecycle of LLMs, from Pre-training and SFT to Alignment (RLHF) and Inference Optimization.
- 🔭 I’m currently working on LLM Alignment (RLHF, PPO, DPO) & RAG Systems
- 🌱 I’m currently learning Agentic Workflows, DeepSpeed & Model Quantization (AWQ/GPTQ)
- 🎓 Background: Qilu University of Technology (QLUT)
- 🔬 Research Interests: Reward Modeling, Context Window Extension, Chain-of-Thought (CoT)
- 📫 How to reach me: 867762462f@gmail.com


