efficient-ai

Here are 40 public repositories matching this topic...

NVlabs / Long-RL

Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)

reinforcement-learning multi-modality long-sequence large-language-models sequence-parallelism efficient-ai

Updated Sep 24, 2025
Python

SimonZeng7108 / efficientsam3

EfficientSAM3 compresses SAM3 into lightweight, edge-friendly models via progressive knowledge distillation for fast promptable concept segmentation and tracking.

tracking computer-vision artificial-intelligence segmentation video-object-segmentation edge-ai video-object-tracking vllm segment-anything efficient-ai

Updated Jun 12, 2026
Jupyter Notebook

cokeshao / Awesome-Multimodal-Token-Compression

[TMLR 2026] Survey: https://arxiv.org/pdf/2507.20198

awesome-list model-acceleration long-context mllm efficient-ai token-compression efficient-mllm

Updated May 29, 2026

jeho-lee / Awesome-On-Device-AI-Systems

machine-learning edge-computing mobile-systems on-device-ai resource-constrained-devices efficient-ai

Updated Jun 21, 2026

tiannuo-yang / SearchAgent-X

A High-Efficiency System of Large Language Model Based Search Agents

agent information-retrieval ai approximate-nearest-neighbor-search post-training rag llm rlhf llm-serving vllm efficient-ai

Updated Jul 2, 2025
Python

BaiTheBest / SparseLLM

Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)

pruning model-compression inference-optimization alternating-optimization large-language-models efficient-ai

Updated Mar 27, 2025
Python

Liu-Hy / WMDD

Official PyTorch implementation of the paper "Dataset Distillation via the Wasserstein Metric" (ICCV 2025).

efficiency optimal-transport distillation dataset-distillation efficient-ai

Updated Mar 8, 2026
Python

erectbranch / MIT-Efficient-AI

TinyML and Efficient Deep Learning Computing | MIT 6.S965/6.5940

ai deep-learning lecture-notes tinyml efficient-ai mit-6s965 mit-65940

Updated Jun 12, 2026

fangvv / EdgeDI

Code for paper "Joint Architecture Design and Workload Partitioning for DNN Inference on Industrial IoT Clusters"

distributed-systems deep-learning bigdata distributed-computing vgg resnet pruning iot-device edge-computing heterogeneous-systems feature-map industrial-iot edge-intelligence parallel-inference efficient-ai

Updated Jun 17, 2026
Python

yifu-ding / Awesome-Edge-LLMs

This is a repository accompanying the survey Edge AI Meets LLM (coming soon), containing a comprehensive list of papers, codebases, toolchains, and open-source frameworks. It is intended to serve as a handbook for researchers and developers interested in Edge/Mobile LLMs.

edge-ai on-device-ai llm-inference efficient-ai edge-llm

Updated Jun 5, 2025

ResponsibleAILab / DAM

Dynamic Attention Mask (DAM) generates adaptive sparse attention masks per layer and head for Transformer models, enabling long-context inference with lower compute and memory overhead without fine-tuning.

inference-optimization sparse-attention efficient-ai

Updated Jun 16, 2025
Python

fangvv / EdgeKE

Code for paper "EdgeKE: An On-Demand Deep Learning IoT System for Cognitive Big Data on Industrial Edge Devices"

iot deep-neural-networks deep-learning inference resnet knowledge-distillation edge-computing edge-devices resnets inference-optimization early-exit jetson-nano branchynet edge-applications accuracy-requirements efficient-ai

Updated Jun 18, 2026
Python

yumozi / GUARD

Official PyTorch implementation of the paper "Towards Adversarially Robust Dataset Distillation by Curvature Regularization" (AAAI 2025).

computer-vision efficiency robustness distillation dataset-distillation efficient-ai aaai2025

Updated Oct 21, 2025
Python

Shikha-code36 / early-exit-cnn

A deep learning framework that implements Early Exit strategies in Convolutional Neural Networks (CNNs) using Deep Q-Learning (DQN). This project enhances computational efficiency by dynamically determining the optimal exit point in a neural network for image classification tasks on CIFAR-10.

reinforcement-learning deep-learning cnn pytorch dqn image-classification cifar10 cifar-10 pytorch-cnn cnn-pytorch cifar10-classification early-exit model-optimization efficient-ai

Updated Feb 23, 2025
Jupyter Notebook

Chenqing-Lin / FAIR-Pruner

Research-ready and production-friendly neural network pruning for PyTorch—transparent methods, reproducible baselines, and deployment metrics to compress models for real-world use.

machine-learning acceleration computer-vision deep-learning reproducible-research optimization latency torch pytorch energy-efficiency model-compression inference-optimization edge-ai structured-pruning green-ai unstructured-pruning neural-network-pruning efficient-ai low-latency-inference

Updated Jan 25, 2026
Python

Pro-GenAI / Gen-UI-Lang

⚡ Fast, concise, LLM-first Generative UI language

ai llms generative-ai gen-ai genai generative-ui efficient-ai ai-scalability gen-ai-app efficient-llms

Updated Dec 20, 2025
Python

LumGenLab / LumGPT

Transformer (GPT) implemented from scratch in C++. Runs on modest hardware with complete mathematical derivations and optimized tensor operations.

deep-learning transformer cpp17 gpt language-model efficient-ai opensource-llm lumgenlab

Updated Jan 6, 2026
C++

raphischer / ai-energy-validation

Ground-Truthing AI Energy Consumption: Validating CodeCarbon Against External Measurements

sustainability ai energy-consumption efficient-ai

Updated May 30, 2026
Python

EGen-V / Transformer-Hierarchical-Layers

A non-Transformer hierarchical recurrent network with differentiable Gumbel-Softmax routing and bounded memory slots. Runs 7B+ parameter models layer-by-layer on low-budget GPUs.

deep-learning pytorch recurrence attention-mechanism hardware-acceleration memory-augmented-neural-networks llm efficient-ai memory-augmented infinite-context

Updated Jan 15, 2026
Python

fangvv / ACS

Code for paper "Dynamic Deep Neural Network Inference via Adaptive Channel Skipping"

deep-neural-networks bigdata dnn neural-networks iot-application edge-computing group-convolution algorithm-optimization edge-intelligence ai-algorithms edge-inference efficient-ai

Updated Jun 18, 2026
Python
