-
Notifications
You must be signed in to change notification settings - Fork 308
Pull requests: AI-Hypercomputer/maxtext
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add option to skip initializing the jax distributed system
#1125
opened Dec 23, 2024 by
gobbleturk
Loading…
4 tasks done
Fix orbax to hf converter for Llama3.1-8B
#1123
opened Dec 23, 2024 by
khatwanimohit
Loading…
4 tasks done
Better save message for async checkpoint saving
pull ready
#1122
opened Dec 23, 2024 by
gobbleturk
Loading…
4 tasks done
Add smaler v5e topologies to train_compile
#1119
opened Dec 21, 2024 by
gobbleturk
Loading…
4 tasks done
Temporary solution to remove the unnecessary ar cache during prefill
#1115
opened Dec 20, 2024 by
zhihaoshan-google
•
Draft
4 tasks done
Refactor and parallelize MaxText test runner
#1113
opened Dec 20, 2024 by
shralex
Loading…
4 tasks done
Get all tests to pass locally with no special configuration
#1108
opened Dec 19, 2024 by
SamuelMarks
Loading…
4 tasks done
Fix typo in trillium llama2-70b script
#1105
opened Dec 19, 2024 by
wenxindongwork
Loading…
4 tasks done
Add Dockerfile and entrypoint script for the maxengine-server image
pull ready
#1092
opened Dec 10, 2024 by
vivianrwu
Loading…
4 tasks done
Manual test running for mem stats
pull ready
#1074
opened Dec 3, 2024 by
gobbleturk
Loading…
4 tasks done
Add Pallas GPU decode attention in Maxtext inference
#1066
opened Nov 26, 2024 by
tohaowu
Loading…
4 tasks done
Add Llama2-70b sparsecore collective model to trillium configs
#1042
opened Nov 15, 2024 by
Obliviour
Loading…
4 tasks done
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.