🤦
Pinned
- EleutherAI/lm-evaluation-harness (Public): A framework for few-shot evaluation of language models.
- ZeRO-transformer (Public): Two implementations of ZeRO-1 optimizer sharding in JAX.
  Python 13
- tritonformer (Public): Differentiable transformer in Triton, matching the performance of PyTorch + cuDNN/cuBLAS.
  Python 4
- hawk-pytorch (Public): PyTorch implementation of Hawk from "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models" (https://arxiv.org/abs/2402.19427). Compatible with torch.compile.
  Python