Skip to content
View chengzr01's full-sized avatar

Organizations

@ulab-uiuc

Block or report chengzr01

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A simple implementation of Generative Adversarial Imitation Learning with PyTorch

Python 143 27 Updated Mar 22, 2022

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

C++ 4,327 949 Updated Jan 6, 2025

Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"

Python 14 1 Updated Aug 21, 2024

Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"

Python 94 12 Updated Mar 22, 2024

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 2,410 367 Updated Dec 19, 2024
Python 9 2 Updated Jan 13, 2025

My self coded personal website build with React.js

JavaScript 4,910 2,629 Updated Aug 21, 2024

The personal website/portfolio template by Hashir Shoaib. Built using React and Bootstrap.

JavaScript 1,382 1,682 Updated Dec 24, 2024

A platform for developers to simulate collaborative research activities

Python 131 20 Updated Jan 14, 2025

Robust recipes to align language models with human and AI preferences

Python 4,896 423 Updated Nov 21, 2024

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 10,801 844 Updated Aug 20, 2024

Train transformer language models with reinforcement learning.

Python 10,606 1,369 Updated Jan 15, 2025

Reference implementation for DPO (Direct Preference Optimization)

Python 2,320 192 Updated Aug 11, 2024

Generative Agents: Interactive Simulacra of Human Behavior

18,133 2,378 Updated Aug 5, 2024

Resources and code for reproducing the training procedure outlined in the research paper "Fair multilingual vandalism detection system for Wikipedia"

Jupyter Notebook 3 Updated Aug 30, 2023

An Autonomous LLM Agent for Complex Task Solving

Python 8,292 856 Updated Aug 12, 2024
Showing results