    Repositories list

    • MF2

      Public
      Python
      0300 Updated Jul 25, 2025
    • Code for the paper "Instituto de Telecomunicações at IWSLT 2025: Aligning Small-Scale Speech and Language Models for Speech-to-Text Learning"
      Python
      0100 Updated Jul 17, 2025
    • Python
      0100 Updated Jul 15, 2025
    • adasplash

      Public
      AdaSplash: Adaptive Sparse Flash Attention (aka Flash Entmax Attention)
      Python
      11910 Updated Jul 15, 2025
    • lmms-eval

      Public
      Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
      Python
      348000 Updated Jun 30, 2025
    • A fork of lm-eval-harness.
      Python
      2.6k000 Updated Jun 29, 2025
    • Python
      43000 Updated Jun 24, 2025
    • Ongoing research training transformer models at scale
      Python
      3k101 Updated Jun 20, 2025
    • From a+b to sparsemax(QK^T)V in Triton!
      Jupyter Notebook
      01500 Updated Jun 19, 2025
    • A package for sampling from Gibbs distributions during inference with LLMs.
      Python
      2810 Updated Jun 12, 2025
    • zsb

      Public
      Python
      0500 Updated Jun 9, 2025
    • Watching the Watchers: Exposing Gender Disparities in Machine Translation Quality Estimation
      0000 Updated May 26, 2025
    • treqa

      Public
      LLM-based QAG framework for MT Evaluation
      Python
      1311 Updated May 13, 2025
    • Repository containing code to reproduce results of the paper "Sparse Activations as Conformal Predictors".
      Jupyter Notebook
      0100 Updated Apr 27, 2025
    • A PyTorch native library for large model training
      Python
      455000 Updated Apr 1, 2025
    • fy-vi

      Public
      Jupyter Notebook
      0000 Updated Mar 21, 2025
    • doce

      Public
      This is the repo of DOCE
      Python
      1200 Updated Mar 14, 2025
    • latim

      Public
      Jupyter Notebook
      0500 Updated Feb 24, 2025
    • CHM-Net

      Public
      Modern Hopfield Networks with Continuous-Time Memories
      Python
      0000 Updated Feb 21, 2025
    • 0000 Updated Feb 17, 2025
    • ∞-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation
      Python
      01510 Updated Feb 14, 2025
    • ssm-mt

      Public
      Jupyter Notebook
      0100 Updated Feb 8, 2025
    • Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
      Python
      464000 Updated Feb 4, 2025
    • HFYN

      Public
      Hopfield-Fenchel-Young Networks: A Unified Framework for Associative Memory Retrieval
      Jupyter Notebook
      0100 Updated Jan 31, 2025
    • Jupyter Notebook
      1200 Updated Oct 15, 2024
    • Python
      0200 Updated Oct 10, 2024
    • axolotl

      Public
      Go ahead and axolotl questions
      Python
      1.1k000 Updated Sep 26, 2024
    • nanotron

      Public
      Minimalistic large language model 3D-parallelism training
      Python
      218000 Updated Sep 19, 2024
    • Python
      76627 Updated Aug 29, 2024
    • DeepSPIN's submission to SIGMORPHON 2020
      Python
      1511 Updated Jul 25, 2024