View pr-Mais's full-sized avatar
👾

Organizations

@firebase @googlemaps @fluttercommunity @FlutterVikings @Thmanyah-LLC

Starred repositories

methods2test is a supervised dataset consisting of Test Cases and their corresponding Focal Methods from a set of Java software repositories

Python 145 39 Updated Dec 4, 2023

Analysis scripts for log data sets used in anomaly detection.

Python 57 6 Updated Jul 30, 2024

Firebase SDK for Cloud Functions

TypeScript 1,036 206 Updated Mar 11, 2025

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (bloomz-176B/bloom/gpt/bart/T5/MetaICL)

Python 552 59 Updated May 9, 2024

Mastering Diverse Domains through World Models

Python 1,543 260 Updated Feb 22, 2025

Train transformer language models with reinforcement learning.

Python 12,518 1,689 Updated Mar 15, 2025

Schedule-Free Optimization in PyTorch

Python 2,113 72 Updated Feb 28, 2025

Fine-tune LLM agents with online reinforcement learning

Python 1,082 50 Updated Mar 19, 2024

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 816 49 Updated Mar 8, 2025

Reference implementation for DPO (Direct Preference Optimization)

Python 2,445 201 Updated Aug 11, 2024

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Python 82,059 7,084 Updated Mar 10, 2025

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 10,100 1,790 Updated Mar 12, 2025

Powerful menu bar manager for macOS

Swift 17,347 307 Updated Jan 26, 2025
Python 1 Updated Jan 22, 2025

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,764 676 Updated Feb 15, 2025

Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" presented by Zhiheng Xi et al.

Python 93 6 Updated Feb 9, 2024

Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model training, and support for PPO and DPO algorithms with various c…

Python 144 12 Updated Mar 18, 2024

Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.

332 17 Updated Sep 12, 2024
Python 141 14 Updated May 2, 2024

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL, Text2API, Text2Vis and more.

2,430 168 Updated Feb 19, 2025

Data for paper "Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness"

Python 32 6 Updated May 3, 2023
Python 56 15 Updated Oct 25, 2024

✨A static blog template built with Astro.

Astro 2,002 486 Updated Jan 19, 2025

Sample code illustrating the VS Code extension API.

TypeScript 9,155 3,546 Updated Mar 14, 2025
Python 148 93 Updated Mar 14, 2025

🦌 Soothing pastel theme for VSCode & Azure Data Studio

TypeScript 1,608 56 Updated Mar 3, 2025

An innovative superfamily of fonts for code

TypeScript 15,554 269 Updated Mar 7, 2025

LLM101n: Let's build a Storyteller

32,590 1,781 Updated Aug 1, 2024

A Redis Plugin for GenKit that adds Redis for efficient state storage, trace storage, caching, and rate limiting.

TypeScript 6 Updated Jun 11, 2024