Skip to content

Finetune LLMs to Predict Human Preference using Chatbot Arena conversations

Notifications You must be signed in to change notification settings

bhandeystruck/LLM-Classification-Finetuning-Submission

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

9 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ€– Chatbot Arena Preference Prediction

Kaggle Competition Submission
πŸ… Ranked #44 on the Public Leaderboard


πŸš€ Project Summary

This repository contains my solution for the Chatbot Arena Preference Prediction Kaggle competition.
The challenge was to predict which chatbot response a human judge preferred, given a prompt and two responses.

The submission ranked 44th on the public leaderboard out of hundreds of global teams 🌍.


πŸ“¦ Repo Structure

πŸ“‚ notebooks/         # Exploratory data analysis and experiments
πŸ“„ README.md          # Project overview (this file)

🧠 The Approach

  • ✨ Fine-tuned transformer-based models on human preference data.
  • πŸ” Analyzed semantic similarities between responses.
  • βš–οΈ Normalized token lengths and prompt alignment.
  • πŸ”€ Applied ensemble strategies to combine model strengths.
  • πŸ§ͺ Used stratified validation to handle subjectivity and avoid leakage.

🧰 Tech Stack

Tool Purpose
🐍 Python Programming Language
πŸ€— Transformers Pretrained NLP Models
πŸ”₯ PyTorch Deep Learning Framework
πŸ“Š Scikit-learn Metrics & Utilities
πŸ“š Pandas Data Manipulation
πŸ“ˆ Matplotlib Visualization

πŸ“ˆ Results

Metric Value
Leaderboard Rank πŸ₯‡ #44
Final Score [Insert final score]
Total Teams [Insert number]

πŸ’‘ Key Learnings

  • Human preferences in NLP are highly nuanced and often subjective.
  • Even small model tweaks (like input formatting or length balancing) had large effects on performance.
  • Ensembling and careful validation strategy were critical to climb the leaderboard.

πŸ™Œ Acknowledgments

Huge thanks to:

  • The Kaggle community for insightful discussions and open-source notebooks.
  • Competition organizers for an exciting and innovative challenge.
  • Open-source contributors to libraries like Hugging Face & PyTorch.

πŸ“¬ Contact

If you're interested in discussing the project or collaborating, please reach out at bhandeystruck@gmail.com


About

Finetune LLMs to Predict Human Preference using Chatbot Arena conversations

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published