-
Defense Unicorns
- New York City
-
09:47
- 5h behind - https://justinthelaw.github.io/justinthelaw
- in/justinwingchunglaw
Lists (2)
Sort Name ascending (A-Z)
Stars
My GitHub profile and personal website!
Cost-efficient and pluggable Infrastructure components for GenAI inference
Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.
vCluster - Create fully functional virtual Kubernetes clusters - Each vcluster runs inside a namespace of the underlying k8s cluster. It's cheaper than creating separate full-blown clusters and it …
Multi-tenancy and policy-based framework for Kubernetes.
Deploy a Production Ready Kubernetes Cluster
A Python framework to write Kubernetes operators in just a few lines of code
Blueprint by Mozilla.ai for answering questions about structured documents
Blueprint by Mozilla.ai for finetuning a Speech-To-Text model in your own language
Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster
Scale from single vLLM instance to distributed vLLM deployment without changing any application code.
Fully open reproduction of DeepSeek-R1
🛏 An HTML to Markdown converter written in JavaScript
A course on aligning smol models.
Python tool for converting files and office documents to Markdown.
Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
Terraform module for scalable GitHub action runners on AWS
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Domain Adapted Language Modeling Toolkit - E2E RAG
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
A toolkit to run Ray applications on Kubernetes
A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data