chengzr01

Zirui Cheng chengzr01

M.S. in Computer Science at UIUC | B.Eng. in Computer Science and Technology from Tsinghua University | Machine Learning, Human-Computer Interaction

13 followers · 4 following

Stars

hcnoh / gail-pytorch

A simple implementation of Generative Adversarial Imitation Learning with PyTorch

Python 143 27 Updated Mar 22, 2022

google-deepmind / open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

C++ 4,327 949 Updated Jan 6, 2025

WEIRDLabUW / vpl_llm

Forked from cassidylaidlaw/hidden-context

Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"

Python 14 1 Updated Aug 21, 2024

ucl-dark / llm_debate

Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"

Python 94 12 Updated Mar 22, 2024

eliahuhorwitz / Academic-project-page-template

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 2,410 367 Updated Dec 19, 2024

lwaekfjlk / gpu-bartender

Python 9 2 Updated Jan 13, 2025

soumyajit4419 / Portfolio

My self-coded personal website built with React.js

JavaScript 4,910 2,629 Updated Aug 21, 2024

hashirshoaeb / home

The personal website/portfolio template by Hashir Shoaib. Built using React and Bootstrap.

JavaScript 1,382 1,682 Updated Dec 24, 2024

ulab-uiuc / research-town

A platform for developers to simulate collaborative research activities

Python 131 20 Updated Jan 14, 2025

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python 4,896 423 Updated Nov 21, 2024

RUCAIBox / LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 10,801 844 Updated Aug 20, 2024

huggingface / trl

Train transformer language models with reinforcement learning.

Python 10,606 1,369 Updated Jan 15, 2025

eric-mitchell / direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Python 2,320 192 Updated Aug 11, 2024

joonspk-research / generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

18,133 2,378 Updated Aug 5, 2024

trokhymovych / KI_multilingual_training

Resources and code for reproducing the training procedure outlined in the research paper "Fair multilingual vandalism detection system for Wikipedia"

Jupyter Notebook 3 Updated Aug 30, 2023

alextamkin / generative-elicitation

Python 121 13 Updated Dec 1, 2023

OpenBMB / XAgent

An Autonomous LLM Agent for Complex Task Solving

Python 8,292 856 Updated Aug 12, 2024
