
torch.compile and flashattention implementation in sonar #51

Open
nitishpandey04 opened this issue Dec 19, 2024 · 2 comments

nitishpandey04 commented Dec 19, 2024

Are torch.compile and FlashAttention already used in the SONAR implementation? If not, how can they be applied to make inference faster?

nitishpandey04 changed the title from "Faster inference for sonar" to "torch.compile and flashattention implementation in sonar" on Dec 23, 2024

sl5035 commented Jan 2, 2025

Have you figured it out?

nitishpandey04 (Author) commented

I'm not using fairseq2, but there is an implementation of SONAR on Hugging Face published by one of the authors of the paper. There you can make whatever modifications you want through the transformers library API.

https://huggingface.co/cointegrated/SONAR_200_text_encoder
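A minimal sketch of how one might combine that checkpoint with torch.compile and a fused attention kernel. It assumes the checkpoint loads as an M2M100 encoder (as its model card describes), that your transformers version accepts the attn_implementation argument for M2M100, and that a CUDA GPU is available; the mean-pooling step mirrors the pooling shown on the model card:

```python
# Sketch only, not an official recipe for this repo.
import torch
from transformers import AutoTokenizer
from transformers.models.m2m_100.modeling_m2m_100 import M2M100Encoder

model_name = "cointegrated/SONAR_200_text_encoder"
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.src_lang = "eng_Latn"  # the model card sets the source language this way

# Request a fused attention kernel: "flash_attention_2" needs the flash-attn
# package and a supported GPU; "sdpa" is the portable PyTorch fallback.
encoder = M2M100Encoder.from_pretrained(
    model_name,
    torch_dtype=torch.float16,
    attn_implementation="sdpa",  # or "flash_attention_2" if flash-attn is installed
).to("cuda").eval()

# torch.compile wraps the module; the first forward pass pays the
# compilation cost, later passes reuse the compiled graph.
encoder = torch.compile(encoder)

batch = tokenizer(["Hello, world!"], return_tensors="pt", padding=True).to("cuda")
with torch.inference_mode():
    hidden = encoder(**batch).last_hidden_state
    # Mean-pool token states into a fixed-size sentence embedding,
    # masking out padding positions.
    mask = batch.attention_mask.unsqueeze(-1)
    embeddings = (hidden * mask).sum(1) / mask.sum(1)
```

Whether this actually speeds things up depends on batch size and sequence length, so it is worth benchmarking against the plain eager-mode encoder on your own workload.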
