Skip to content
View justinthelaw's full-sized avatar

Block or report justinthelaw

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

My GitHub profile and personal website!

TypeScript 1 Updated Feb 24, 2025

Cost-efficient and pluggable Infrastructure components for GenAI inference

Jupyter Notebook 1,027 105 Updated Feb 24, 2025

Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.

Jupyter Notebook 432 31 Updated Feb 24, 2025

vCluster - Create fully functional virtual Kubernetes clusters - Each vcluster runs inside a namespace of the underlying k8s cluster. It's cheaper than creating separate full-blown clusters and it …

Go 9,246 463 Updated Feb 24, 2025

Multi-tenancy and policy-based framework for Kubernetes.

Go 1,754 169 Updated Feb 21, 2025

Deploy a Production Ready Kubernetes Cluster

Jinja 16,557 6,557 Updated Feb 24, 2025

A Python framework to write Kubernetes operators in just a few lines of code

Python 2,236 166 Updated Dec 13, 2024

🦄 Type safe K8s middleware for humans

TypeScript 105 3 Updated Feb 23, 2025

Blueprint by Mozilla.ai for answering questions about structured documents

Python 26 1 Updated Feb 12, 2025

Blueprint by Mozilla.ai for finetuning a Speech-To-Text model in your own language

Python 10 1 Updated Feb 24, 2025

Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster

Go 306 54 Updated Feb 22, 2025

Scale from single vLLM instance to distributed vLLM deployment without changing any application code.

Python 508 62 Updated Feb 22, 2025

Fully open reproduction of DeepSeek-R1

Python 21,300 1,871 Updated Feb 24, 2025

AllenAI's post-training codebase

Python 2,678 330 Updated Feb 24, 2025

🛏 An HTML to Markdown converter written in JavaScript

HTML 9,297 898 Updated Jul 30, 2024

A course on aligning smol models.

Jupyter Notebook 5,472 1,885 Updated Jan 24, 2025

Python tool for converting files and office documents to Markdown.

HTML 38,870 1,788 Updated Feb 21, 2025

Machine Learning Containers for NVIDIA Jetson and JetPack-L4T

Jupyter Notebook 2,719 544 Updated Feb 24, 2025

Terraform module for scalable GitHub action runners on AWS

HCL 2,707 645 Updated Feb 19, 2025

Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.

TypeScript 31,114 2,986 Updated Feb 24, 2025

Instant Terminal Sharing

C 5,750 311 Updated Oct 16, 2023

Get your documents ready for gen AI

Python 21,971 1,245 Updated Feb 24, 2025

A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.

TypeScript 7,097 526 Updated Feb 21, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 78,290 9,300 Updated Feb 24, 2025

Domain Adapted Language Modeling Toolkit - E2E RAG

Python 313 41 Updated Nov 8, 2024

Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models

Python 219 27 Updated Apr 23, 2024

An Open Source Toolkit For LLM Distillation

Python 508 53 Updated Jan 7, 2025

A toolkit to run Ray applications on Kubernetes

Go 1,495 471 Updated Feb 23, 2025

Giving Kubernetes Superpowers to everyone

Go 6,279 743 Updated Feb 24, 2025

A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data

Python 1,385 116 Updated Feb 3, 2025
Next
Showing results