token-pruning | Topic | Ecosyste.ms: Repos

Topic: "token-pruning"

📚 Collection of token-level model compression resources.

Size: 1.73 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 128 - Forks: 4

This is a collection of our research on efficient AI, covering hardware-aware NAS and model compression.

Language: Python - Size: 12 MB - Last synced at: 2 days ago - Pushed at: 9 months ago - Stars: 83 - Forks: 7

[CVPR 2025] DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models

Language: Python - Size: 11 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 18 - Forks: 0

Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".

Language: Python - Size: 1.04 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 18 - Forks: 0

An implementation of LazyLLM token pruning for LLaMa 2 model family.

Language: Python - Size: 27.3 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 11 - Forks: 0

😎 Awesome papers on token redundancy reduction

Size: 80.1 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 3 - Forks: 0

Language: Python - Size: 1.8 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0