AI Infra

Concepts intro

Hugging Face articles:

Some Best Practices

LLM

Optimization

  • Training Optimization: https://developer.nvidia.com/blog/mastering-llm-techniques-training
  • Inference Optimization: https://developer.nvidia.com/blog/mastering-llm-techniques-inference-optimization

Application

  • RAG: https://www.youtube.com/watch?v=YuRFba27_1w
  • Agent: https://www.youtube.com/watch?v=q1XFm21I-VQ

MLOps

Intro

https://madewithml.com/

Application

https://www.youtube.com/watch?v=45Zs12Xlg2g

Papers

Communication

Megatron-LM

ZeRO

MegaScale

Cautions

Something critical:

  • Do not try to use an LLM to enhance the learning process, for example by having it generate questions and answers. You will gain nothing from it.
  • Consult an expert. It saves more time than anything else.

Acknowledgement

Thanks to Liyue Zhang and Guangnan Feng