[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
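A minimal scoring sketch, assuming the `image-reward` pip package and the `RM.load` / `model.score` entry points documented in the repository README; the prompt and image paths are placeholders.

```python
# Sketch: scoring candidate images for a prompt with an ImageReward checkpoint.
# Assumes `pip install image-reward`; call names follow the repo's documented usage.
import ImageReward as RM

model = RM.load("ImageReward-v1.0")  # loads the pretrained human-preference reward model

prompt = "a painting of an astronaut riding a horse"
candidates = ["sample_0.png", "sample_1.png"]  # placeholder image paths

# score() returns one scalar per image; higher means closer to human preference
rewards = model.score(prompt, candidates)
print(rewards)
```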
A curated list of human preference datasets for LLM fine-tuning, RLHF, and evaluation.
A Toolkit for Distributional Control of Generative Models
SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enhance the helpfulness and harmlessness of Large Vision Models (LVMs).
Models, data, and code for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models
An implementation of Learning Online with trajectory Preference guidancE (LOPE) in PyTorch
Analysis of a dataset of Chatbot Arena conversations in which various LLMs respond to user prompts. The goal is to develop a model that makes chatbot responses align more closely with human preferences.
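As a generic illustration of the underlying technique (not code from the repository), a Bradley-Terry style pairwise preference loss is a common way to fit a model on "response A preferred over response B" comparisons such as Arena votes; all names below are hypothetical.

```python
# Sketch: Bradley-Terry pairwise preference loss, commonly used to train
# a reward/preference model from pairwise human comparisons.
import torch
import torch.nn.functional as F

def preference_loss(reward_chosen: torch.Tensor, reward_rejected: torch.Tensor) -> torch.Tensor:
    # P(chosen > rejected) = sigmoid(r_chosen - r_rejected); minimize the negative log-likelihood
    return -F.logsigmoid(reward_chosen - reward_rejected).mean()

# toy usage with scores from a hypothetical scoring model
r_chosen = torch.tensor([1.3, 0.2, 0.9])
r_rejected = torch.tensor([0.4, 0.5, -0.1])
print(preference_loss(r_chosen, r_rejected))  # scalar training loss
```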
Official repository for "Text2Interaction: Establishing Safe and Preferable Human-Robot Interaction," presented at CoRL 2024.
Fine-tuning Language Models with Conditioning on Two Human Preferences
Fine-tuning LLMs using conditional training to learn two human preferences. UCL Module Project: Statistical Natural Language Processing (COMP0087).
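As a rough illustration of conditional training on two preference signals (the exact control tokens and data format used in the project are not shown here), one common recipe prepends a control token for each preference label to every training example; the token names and dataset fields below are illustrative assumptions.

```python
# Sketch: prepending control tokens for two human-preference axes (e.g. helpfulness,
# harmlessness) so a causal LM can later be conditioned on them at inference time.
# Token names and dataset fields are assumptions, not the project's actual format.
def add_preference_tokens(example: dict) -> dict:
    helpful = "<|helpful|>" if example["helpful_label"] else "<|unhelpful|>"
    harmless = "<|harmless|>" if example["harmless_label"] else "<|harmful|>"
    example["text"] = f"{helpful}{harmless} {example['prompt']} {example['response']}"
    return example

sample = {
    "prompt": "How do I sort a list in Python?",
    "response": "Use sorted(my_list) or my_list.sort().",
    "helpful_label": True,
    "harmless_label": True,
}
print(add_preference_tokens(sample)["text"])
```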