Skip to content
Change the repository type filter

All

    Repositories list

    • Tile primitives for speedy kernels
      Cuda
      MIT License
      581.5k182Updated Oct 2, 2024Oct 2, 2024
    • WONDERBREAD benchmark + dataset for BPM tasks
      Jupyter Notebook
      31700Updated Sep 24, 2024Sep 24, 2024
    • An open science effort to benchmark legal reasoning in foundation models
      Python
      4333084Updated Aug 25, 2024Aug 25, 2024
    • based

      Public
      Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"
      Python
      Apache License 2.0
      1420940Updated Aug 16, 2024Aug 16, 2024
    • The simplest, fastest repository for training/finetuning medium-sized GPTs. Now, with kittens!
      Makefile
      MIT License
      5.7k4500Updated Aug 4, 2024Aug 4, 2024
    • Automating enterprise workflows with multimodal agents
      Jupyter Notebook
      Apache License 2.0
      148710Updated Jul 29, 2024Jul 29, 2024
    • hgcn

      Public
      Hyperbolic Graph Convolutional Networks in PyTorch.
      Python
      107593183Updated Jul 25, 2024Jul 25, 2024
    • manifest

      Public
      Prompt programming with FMs.
      Python
      Apache License 2.0
      4643852Updated Jul 22, 2024Jul 22, 2024
    • Python
      14210Updated Jul 9, 2024Jul 9, 2024
    • hyena-dna

      Public
      Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena
      Assembly
      Apache License 2.0
      82574285Updated Jun 15, 2024Jun 15, 2024
    • safari

      Public
      Convolutions for Sequence Modeling
      Assembly
      Apache License 2.0
      71864221Updated Jun 13, 2024Jun 13, 2024
    • A framework for few-shot evaluation of language models.
      Python
      MIT License
      1.7k800Updated Jun 8, 2024Jun 8, 2024
    • A framework for few-shot evaluation of language models.
      Python
      MIT License
      1.7k700Updated Jun 3, 2024Jun 3, 2024
    • axolive

      Public
      Go ahead and axolotl questions
      Python
      Apache License 2.0
      839000Updated Jun 3, 2024Jun 3, 2024
    • Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
      Jupyter Notebook
      1.8k000Updated Jun 3, 2024Jun 3, 2024
    • Python
      Apache License 2.0
      2816120Updated May 27, 2024May 27, 2024
    • m2

      Public
      Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"
      Assembly
      Apache License 2.0
      43535241Updated May 20, 2024May 20, 2024
    • zoology

      Public
      Understand and test language model architectures on synthetic tasks.
      Python
      Apache License 2.0
      2615751Updated May 1, 2024May 1, 2024
    • evaporate

      Public
      This repo contains data and code for the paper "Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes"
      Python
      4547692Updated Mar 26, 2024Mar 26, 2024
    • meerkat

      Public
      Creative interactive views of any dataset.
      Python
      Apache License 2.0
      4382492Updated Feb 25, 2024Feb 25, 2024
    • FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
      C++
      Apache License 2.0
      27270163Updated Feb 13, 2024Feb 13, 2024
    • Building blocks for foundation models.
      1336400Updated Jan 3, 2024Jan 3, 2024
    • Resources for Data Centric AI
      TeX
      Apache License 2.0
      1171.1k66Updated Dec 13, 2023Dec 13, 2023
    • skill-it

      Public
      Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models
      Jupyter Notebook
      Apache License 2.0
      73900Updated Oct 31, 2023Oct 31, 2023
    • domino

      Public
      Python
      Apache License 2.0
      2413340Updated Oct 30, 2023Oct 30, 2023
    • Structured matrices for compressing neural networks
      Python
      Apache License 2.0
      216551Updated Oct 5, 2023Oct 5, 2023
    • butterfly

      Public
      Butterfly matrix multiplication in PyTorch
      Python
      Apache License 2.0
      31161163Updated Oct 5, 2023Oct 5, 2023
    • HypHC

      Public
      Hyperbolic Hierarchical Clustering.
      Python
      2619151Updated Oct 5, 2023Oct 5, 2023
    • H3

      Public
      Language Modeling with the H3 State Space Model
      Assembly
      Apache License 2.0
      53511140Updated Sep 29, 2023Sep 29, 2023
    • embroid

      Public
      Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification
      Jupyter Notebook
      01100Updated Aug 12, 2023Aug 12, 2023