
TP memory usage under Hybrid Parallel Plugin is higher than DeepSpeed with the same configuration? #6161

Open
duomicoding opened this issue Dec 17, 2024 · 5 comments

@duomicoding

Hello, why is the GPU memory usage of TP under the Hybrid Parallel Plugin higher than that of DeepSpeed under the same configuration?


@duomicoding
Author

It seems to be caused by GPU memory fragmentation, and it is quite severe. Are there any corresponding optimization or mitigation measures?

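For reference, a minimal diagnostic sketch (an assumption about the setup, not ColossalAI-specific code): with PyTorch's caching allocator, a large gap between allocated and reserved memory is a common sign of fragmentation, and `PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True` (PyTorch ≥ 2.0) is a common mitigation.

```python
import os
import torch

# Optional mitigation: let the caching allocator grow segments instead of
# reserving fixed-size blocks, which often reduces fragmentation.
# Must be set before the first CUDA allocation.
os.environ["PYTORCH_CUDA_ALLOC_CONF"] = "expandable_segments:True"

def report_fragmentation(device: int = 0) -> None:
    """Print allocated vs. reserved memory; a large gap suggests fragmentation."""
    allocated = torch.cuda.memory_allocated(device)
    reserved = torch.cuda.memory_reserved(device)
    print(f"allocated: {allocated / 2**30:.2f} GiB")
    print(f"reserved:  {reserved / 2**30:.2f} GiB")
    print(f"gap (cached/fragmented): {(reserved - allocated) / 2**30:.2f} GiB")

# Call this inside the training loop, e.g. every N steps:
# report_fragmentation()
```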

@ver217
Member

ver217 commented Feb 20, 2025

DeepSpeed ZeRO-3 fully shards the weights, whereas TP does not shard every layer (e.g., non-Linear/Embedding layers remain replicated on each GPU). When activations are small, this can make TP's memory usage exceed ZeRO-3's. Please provide more detailed information.

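A back-of-the-envelope sketch of this point (a toy model with hypothetical numbers, not ColossalAI's or DeepSpeed's actual memory accounting): TP shards Linear/Embedding weights by the TP degree but replicates the rest (e.g., LayerNorm), while ZeRO-3 shards everything by the data-parallel degree, so the replicated part is where TP can come out ahead in per-GPU memory.

```python
# Per-GPU parameter counts under TP degree `tp` vs. ZeRO-3 degree `dp`.
def per_gpu_params(linear_params: int, replicated_params: int,
                   tp: int, dp: int) -> tuple[float, float]:
    tp_mem = linear_params / tp + replicated_params        # TP: norms/biases replicated
    zero3_mem = (linear_params + replicated_params) / dp   # ZeRO-3: everything sharded
    return tp_mem, zero3_mem

# Hypothetical example: 100M shardable params, 1M replicated params, 8 GPUs.
tp_mem, zero3_mem = per_gpu_params(100_000_000, 1_000_000, tp=8, dp=8)
print(f"TP per GPU:     {tp_mem:,.0f} params")   # 13,500,000
print(f"ZeRO-3 per GPU: {zero3_mem:,.0f} params")  # 12,625,000
```

The gap widens with more replicated parameters and shrinks as activations dominate, which matches the observation above that the effect shows up when activations are small.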
