Skip to content

Pull requests: AI-Hypercomputer/maxtext

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

manually test swa
#1126 opened Dec 24, 2024 by gobbleturk Loading…
4 tasks done
Add option to skip initializing the jax distributed system
#1125 opened Dec 23, 2024 by gobbleturk Loading…
4 tasks done
Fix orbax to hf converter for Llama3.1-8B
#1123 opened Dec 23, 2024 by khatwanimohit Loading…
4 tasks done
Better save message for async checkpoint saving pull ready
#1122 opened Dec 23, 2024 by gobbleturk Loading…
4 tasks done
Add option to dump hlo
#1121 opened Dec 23, 2024 by gobbleturk Loading…
4 tasks done
Add smaler v5e topologies to train_compile
#1119 opened Dec 21, 2024 by gobbleturk Loading…
4 tasks done
Incorporate Orbax emergency replicator checkpoint manager
#1117 opened Dec 21, 2024 by xuefgu Draft
4 tasks done
Refactor and parallelize MaxText test runner
#1113 opened Dec 20, 2024 by shralex Loading…
4 tasks done
Get all tests to pass locally with no special configuration
#1108 opened Dec 19, 2024 by SamuelMarks Loading…
4 tasks done
Anisha ckpt2hf1
#1106 opened Dec 19, 2024 by A9isha Draft
4 tasks
Fix typo in trillium llama2-70b script
#1105 opened Dec 19, 2024 by wenxindongwork Loading…
4 tasks done
Add a Ray trainer for MaxText
#1098 opened Dec 13, 2024 by richardsliu Loading…
Add llama-405b configuration for v5p
#1095 opened Dec 10, 2024 by suexu1025 Loading…
4 tasks done
Add Pathways Support to Benchmark Runner
#1094 opened Dec 10, 2024 by SujeethJinesh Loading…
Add mixtral 8x7b config for gpu
#1090 opened Dec 9, 2024 by michelle-yooh Loading…
4 tasks done
Manual test running for mem stats pull ready
#1074 opened Dec 3, 2024 by gobbleturk Loading…
4 tasks done
Add Pallas GPU decode attention in Maxtext inference
#1066 opened Nov 26, 2024 by tohaowu Loading…
4 tasks done
Quantize megablox
#1062 opened Nov 25, 2024 by lenscloth Loading…
4 tasks done
[DO NOT MERGE] verify fix
#1045 opened Nov 16, 2024 by RissyRan Draft
Add Llama2-70b sparsecore collective model to trillium configs
#1042 opened Nov 15, 2024 by Obliviour Loading…
4 tasks done
Enable pathways workloads for v6e benchmarks
#1040 opened Nov 15, 2024 by sadikneipp Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.