Stars
A simple implementation of Generative Adversarial Imitation Learning with PyTorch
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
WEIRDLabUW / vpl_llm
Forked from cassidylaidlaw/hidden-context. Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"
Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
My self-coded personal website, built with React.js
The personal website/portfolio template by Hashir Shoaib. Built using React and Bootstrap.
A platform for developers to simulate collaborative research activities
Robust recipes to align language models with human and AI preferences
The official GitHub page for the survey paper "A Survey of Large Language Models".
Train transformer language models with reinforcement learning.
Reference implementation for DPO (Direct Preference Optimization)
Generative Agents: Interactive Simulacra of Human Behavior
Resources and code for reproducing the training procedure outlined in the research paper "Fair multilingual vandalism detection system for Wikipedia"
An Autonomous LLM Agent for Complex Task Solving