Pinned Loading
-
verl-pipeline
verl-pipeline PublicForked from agentica-project/verl-pipeline
Async pipelined version of Verl
Python
-
Open-Llama
Open-Llama PublicThe complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
-
huggingface/transformers
huggingface/transformers Public🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
-
baichuan-inc/Baichuan-7B
baichuan-inc/Baichuan-7B PublicA large-scale 7B pretraining language model developed by BaiChuan-Inc.
-
TITAN-RL
TITAN-RL PublicTITAN-RL is a distributed reinforcement learning framework that separates policy rollout, experience storage, and training into independent microservices. This design enables flexible scaling and e…
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.