s-JoL

Follow

s-JoL

Follow

65 followers · 14 following

Achievements

Achievements

Pinned Loading

verl-pipeline verl-pipeline Public

Forked from agentica-project/verl-pipeline

Async pipelined version of Verl

Python
Open-Llama Open-Llama Public

The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.

Python 44 6
huggingface/transformers huggingface/transformers Public

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 143k 28.6k
baichuan-inc/Baichuan-7B baichuan-inc/Baichuan-7B Public

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Python 5.7k 506
TITAN-RL TITAN-RL Public

TITAN-RL is a distributed reinforcement learning framework that separates policy rollout, experience storage, and training into independent microservices. This design enables flexible scaling and e…

Python