- Lisbon, Portugal
- danieltrt.github.io
- @danieltrt7
Highlights
- Pro
Stars
Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.
Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Fully open reproduction of DeepSeek-R1
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Systems design is the process of defining the architecture, modules, interfaces, and data for a system to satisfy specified requirements. Systems design could be seen as the application of systems …
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
Playing Pokemon Red with Reinforcement Learning
AAAI 2025: Counterexample Guided Program Repair Using Zero-Shot Learning and MaxSAT-based Fault Localization
ROCm / triton
Forked from triton-lang/triton
Development repository for the Triton language and compiler
A high-throughput and memory-efficient inference and serving engine for LLMs
carolinacarreira / piranha
Forked from uber/piranha
A tool for refactoring code related to feature flag APIs
RepairLLaMA: Efficient Representations and Fine-Tuned Adapters for Program Repair http://arxiv.org/pdf/2312.15698
Mattermost is an open source platform for secure collaboration across the entire software development lifecycle.
An extremely fast Python package and project manager, written in Rust.
GUI analyzer for deep-diving into PDF files. Detect malicious payloads, understand object relationships, and extract key information for threat analysis.
An analysis tool for Python that blurs the line between testing and type systems.
The fuzzer afl++ is afl with community patches, qemu 5.1 upgrade, collision-free coverage, enhanced laf-intel & redqueen, AFLfast++ power schedules, MOpt mutators, unicorn_mode, and a lot more!
Community-led collection of essential ast-grep rules.