Long-RL: Scaling RL to Long Sequences
Updated Jul 25, 2025 - Python
A High-Efficiency System for Large Language Model-Based Search Agents
Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)
Dynamic Attention Mask (DAM) generates adaptive sparse attention masks per layer and head for Transformer models, enabling long-context inference with lower compute and memory overhead, without fine-tuning.
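DAM's actual mask-generation procedure is not shown here; as a rough sketch of the general idea, the PyTorch snippet below keeps only the top-k highest-scoring keys for each query, computed independently per head, so every layer and head ends up with its own sparse mask. The function names, the top-k heuristic, and the `keep_k` parameter are illustrative assumptions, not DAM's API.

```python
import torch

def topk_sparse_attention_mask(scores: torch.Tensor, k: int) -> torch.Tensor:
    """Hypothetical sketch: build a boolean mask that keeps the top-k keys
    per query. Because scores differ per head, each layer/head call yields
    a different (adaptive) mask.

    scores: (batch, heads, q_len, k_len) raw attention logits.
    Returns a boolean mask of the same shape (True = keep).
    """
    k = min(k, scores.size(-1))
    topk_idx = scores.topk(k, dim=-1).indices           # (..., k) kept key positions
    mask = torch.zeros_like(scores, dtype=torch.bool)
    mask.scatter_(-1, topk_idx, True)                   # mark kept positions
    return mask

def sparse_attention(q, k, v, keep_k=64):
    # q, k, v: (batch, heads, seq, dim). Masked-out scores are set to -inf
    # so they contribute zero weight after the softmax.
    scores = q @ k.transpose(-2, -1) / q.size(-1) ** 0.5
    mask = topk_sparse_attention_mask(scores, keep_k)
    scores = scores.masked_fill(~mask, float("-inf"))
    return scores.softmax(dim=-1) @ v

# Usage: each query attends to at most keep_k keys regardless of context length.
q = torch.randn(1, 8, 1024, 64)
out = sparse_attention(q, q, q, keep_k=64)
```

A dense mask like this still materializes the full score matrix; the point of the sketch is only the per-head adaptivity, whereas a production long-context kernel would exploit the sparsity to skip computation and memory.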