Skip to content

Fizzbb/LargeModel

Repository files navigation

LargeModel

Infra

imbue from baremetal to 70b model

Transformer

Training from scratch

Training framework performance

Fine-tune with single node

Inference explaination

Performance projection

Model reference

Tracing

Profiling

Trace analysis: https://github.com/facebookresearch/HolisticTraceAnalysis/tree/main/examples

Rewrite

Model Visulization

Training time, Flops estimation

GPU benchmarks

git clone https://github.com/te42kyfo/gpu-benches.git
cd gpu-benches/gpu-stream/
/usr/local/cuda/bin/nvcc -o stream main.cu
./stream

GPU foundamentals

Compilation

Chip architecture

About

large model practice

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published