Mais Alheraki pr-Mais

👾

Software engineer

595 followers · 45 following

@Thmanyah-LLC
Dammam, Saudi Arabia
g.dev/mais
@pr_Mais
https://mais.codes

Achievements

x3 x3

Organizations

Lists (1)

Flutter

1 repository

Starred repositories

microsoft / methods2test

methods2test is a supervised dataset consisting of Test Cases and their corresponding Focal Methods from a set of Java software repositories

Python · 145 stars · 39 forks · Updated Dec 4, 2023

ait-aecid / anomaly-detection-log-datasets

Analysis scripts for log data sets used in anomaly detection.

Python · 57 stars · 6 forks · Updated Jul 30, 2024

firebase / firebase-functions

Firebase SDK for Cloud Functions

TypeScript · 1,036 stars · 206 forks · Updated Mar 11, 2025

voidful / TextRL

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (bloomz-176B/bloom/gpt/bart/T5/MetaICL)

Python · 552 stars · 59 forks · Updated May 9, 2024

danijar / dreamerv3

Mastering Diverse Domains through World Models

Python · 1,543 stars · 260 forks · Updated Feb 22, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python · 12,518 stars · 1,689 forks · Updated Mar 15, 2025

facebookresearch / schedule_free

Schedule-Free Optimization in PyTorch

Python · 2,113 stars · 72 forks · Updated Feb 28, 2025

KhoomeiK / LlamaGym

Fine-tune LLM agents with online reinforcement learning

Python · 1,082 stars · 50 forks · Updated Mar 19, 2024

ContextualAI / HALOs

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python · 816 stars · 49 forks · Updated Mar 8, 2025

eric-mitchell / direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Python · 2,445 stars · 201 forks · Updated Aug 11, 2024

fastapi / fastapi

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Python · 82,059 stars · 7,084 forks · Updated Mar 10, 2025

DLR-RM / stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python · 10,100 stars · 1,790 forks · Updated Mar 12, 2025

jordanbaird / Ice

Powerful menu bar manager for macOS

Swift · 17,347 stars · 307 forks · Updated Jan 26, 2025

ksaa-nlp / balsam-eval

Python 1 Updated Jan 22, 2025

lucidrains / PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python · 7,764 stars · 676 forks · Updated Feb 15, 2025

WooooDyy / LLM-Reverse-Curriculum-RL

Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" presented by Zhiheng Xi et al.

Python · 93 stars · 6 forks · Updated Feb 9, 2024

raghavc / LLM-RLHF-Tuning-with-PPO-and-DPO

Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model training, and support for PPO and DPO algorithms with various c…

Python · 144 stars · 12 forks · Updated Mar 18, 2024