Skip to content
View thibaud-perrin's full-sized avatar
👋
👋

Block or report thibaud-perrin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. llm-research-toolbox llm-research-toolbox Public

    A curated list of repositories exploring various aspects of Large Language Model (LLM) development, including fine-tuning, dataset generation, multimodal models, and preference alignment.

  2. transformer transformer Public archive

    Transformer model developed from scratch for a translation task. The design is heavily inspired by the original transformer model described in the seminal paper "Attention is All You Need".

    Jupyter Notebook

  3. classic-control classic-control Public

    This project demonstrates the use of Q-learning and Deep Q-Networks (DQN) to solve several classic control environments provided by OpenAI Gym. The project includes the following Jupyter notebooks

    Jupyter Notebook

  4. box2d box2d Public

    This project contains the implementation of reinforcement learning algorithms to solve the Lunar Lander and Bipedal Walker environments using the DQN and DDPG algorithms respectively.

    Jupyter Notebook 1

  5. mini-gpt mini-gpt Public

    The goal of this project was to implement the encoder only transformer in order to recreate a mini version of GPT.

    Jupyter Notebook

  6. hibo-mistral-7b-fc hibo-mistral-7b-fc Public archive

    Dataset and model fine-tuning for function calling

    Jupyter Notebook