Skip to content
View WooooDyy's full-sized avatar
🤡
🤡

Block or report WooooDyy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. AgentGym-RL AgentGym-RL Public

    Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.

    Python 438 41

  2. AgentGym AgentGym Public

    Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.

    Python 622 85

  3. LLM-Agent-Paper-List LLM-Agent-Paper-List Public

    The paper list of the 86-page SCIS cover paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

    7.9k 480

  4. LLM-Reverse-Curriculum-RL LLM-Reverse-Curriculum-RL Public

    Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" presented by Zhiheng Xi et al.

    Python 111 9

  5. MathCritique MathCritique Public

    Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".

    Python 56 1

  6. BMMR BMMR Public

    Python 14