Finetuning an LLM (7B parameters or less) with a Mac M4 Mini #1647
-
I am an LLM developer deciding between a MacBook Air (M2, 16 GB) and a Mac M4 Mini (16 GB). I want to fine-tune small LLMs with QLoRA for experiment or research purposes. Since both machines have 16 GB of unified memory, I understand both can train and run these models; I am more interested in speed, both fine-tuning speed and inference speed. Can anyone suggest which device I should buy, and what kind of speed improvement I can expect from the M4 Mini? (P.S.: I have never used any Apple products before, so any review of how these devices actually perform for fine-tuning LLMs would be really helpful.)
-
You should be able to QLoRA fine-tune a 7B model on either machine. The speed depends on several factors.
In terms of inference speed, on the base M4 Mini you can expect roughly 25 tokens/sec or more with a 4-bit 7B model, and perhaps 15-20 tokens/sec on the M2 Air.
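To get a feel for what those throughput numbers mean in practice, here is a minimal back-of-envelope sketch. The tokens/sec figures are the rough estimates from this thread, not measurements, and the 512-token reply length is an arbitrary illustration:

```python
# Back-of-envelope: wall-clock time to generate a reply at a given
# steady decode throughput. Throughput figures are the rough estimates
# from this thread (base M4 Mini vs. M2 Air, 4-bit 7B), not benchmarks.

def seconds_to_generate(num_tokens: int, toks_per_sec: float) -> float:
    """Seconds to decode `num_tokens` at a constant `toks_per_sec` rate."""
    return num_tokens / toks_per_sec

m4_mini = seconds_to_generate(512, 25.0)  # ~25 tok/s estimate
m2_air = seconds_to_generate(512, 17.5)   # midpoint of the 15-20 tok/s range
print(f"512-token reply: M4 Mini ~{m4_mini:.0f}s, M2 Air ~{m2_air:.0f}s")
```

So the gap between the two machines is noticeable but not dramatic for short interactive generations; it compounds more for long fine-tuning runs.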
I would look around on the internet for estimates of the fp32/fp16 FLOPs of the machines you are interested in. The speed difference for LoRA/QLoRA training tends to follow the difference in peak FLOPs, since it is a very compute-bound workload.
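That scaling rule can be applied directly once you have one measured data point. A minimal sketch, where all the numbers (measured throughput and peak TFLOPs) are hypothetical placeholders you would replace with a real measurement and published chip specs:

```python
# Rough scaling rule from the reply above: for a compute-bound
# LoRA/QLoRA workload, training throughput scales roughly with the
# ratio of peak FLOPs. All numbers below are illustrative placeholders.

def scaled_throughput(known_toks_per_sec: float,
                      known_tflops: float,
                      target_tflops: float) -> float:
    """Estimate training throughput on a target chip by peak-FLOPs ratio."""
    return known_toks_per_sec * (target_tflops / known_tflops)

# Suppose you measured 120 tok/s of QLoRA training on a chip with a
# 3.5 TFLOPs fp16 peak, and the target chip peaks at 4.6 TFLOPs
# (both figures hypothetical):
est = scaled_throughput(120.0, 3.5, 4.6)
print(f"Estimated training throughput on the faster chip: ~{est:.0f} tok/s")
```

This is only a first-order estimate: memory bandwidth, thermals (the fanless Air throttles under sustained load), and software maturity can all push real numbers off the FLOPs ratio.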