Thibaud Perrin (thibaud-perrin)

2 followers · 1 following

Pinned

llm-research-toolbox Public

A curated list of repositories exploring various aspects of Large Language Model (LLM) development, including fine-tuning, dataset generation, multimodal models, and preference alignment.

transformer Public archive

A Transformer model developed from scratch for a translation task. The design is heavily inspired by the original Transformer described in the paper "Attention Is All You Need".

Jupyter Notebook

classic-control Public

This project demonstrates the use of Q-learning and Deep Q-Networks (DQN) to solve several classic control environments provided by OpenAI Gym. The project includes the following Jupyter notebooks

Jupyter Notebook

box2d Public

This project implements reinforcement learning agents that solve the Lunar Lander and Bipedal Walker environments, using DQN and DDPG respectively.

Jupyter Notebook 1

mini-gpt Public

This project implements a decoder-only transformer to recreate a mini version of GPT.

Jupyter Notebook

hibo-mistral-7b-fc Public archive

Dataset and model fine-tuning for function calling

Jupyter Notebook