Skip to content
@GAIR-NLP

Generative Artificial Intelligence Research Lab (GAIR)

Pinned Loading

  1. factool factool Public

    FacTool: Factuality Detection in Generative AI

    Python 842 63

Repositories

Showing 10 of 29 repositories
  • PC-Agent Public

    PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World

    GAIR-NLP/PC-Agent’s past year of commit activity
    Python 95 MIT 6 5 0 Updated Dec 25, 2024
  • ReasonEval Public

    [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy

    GAIR-NLP/ReasonEval’s past year of commit activity
    Python 39 2 1 0 Updated Dec 15, 2024
  • OlympicArena Public

    This is the official repository of the paper "OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI"

    GAIR-NLP/OlympicArena’s past year of commit activity
    JavaScript 89 4 1 0 Updated Dec 15, 2024
  • SimulateBench Public

    GPT as Human

    GAIR-NLP/SimulateBench’s past year of commit activity
    Python 18 2 0 0 Updated Dec 11, 2024
  • O1-Journey Public

    O1 Replication Journey: A Strategic Progress Report – Part I

    GAIR-NLP/O1-Journey’s past year of commit activity
    1,737 53 13 0 Updated Nov 30, 2024
  • MathPile Public

    [NeurlPS D&B 2024] Generative AI for Math: MathPile

    GAIR-NLP/MathPile’s past year of commit activity
    Python 400 Apache-2.0 21 0 0 Updated Oct 27, 2024
  • ProX Public

    Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"

    GAIR-NLP/ProX’s past year of commit activity
    Python 205 Apache-2.0 15 2 0 Updated Oct 16, 2024
  • walnut-plan Public

    The Walnut Plan

    GAIR-NLP/walnut-plan’s past year of commit activity
    11 0 0 0 Updated Oct 10, 2024
  • OpenResearcher Public

    OpenResearcher, an advanced Scientific Research Assistant

    GAIR-NLP/OpenResearcher’s past year of commit activity
    HTML 412 Apache-2.0 31 1 2 Updated Oct 10, 2024
  • math-evaluation-harness Public Forked from ZubinGou/math-evaluation-harness

    A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨

    GAIR-NLP/math-evaluation-harness’s past year of commit activity
    Python 2 MIT 11 0 0 Updated Oct 7, 2024