GitHub topics: model-merging
arcee-ai/mergekit
Tools for merging pretrained large language models.
Language: Python - Size: 879 KB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 5,652 - Forks: 540

tommasomncttn/mergenetic
Flexible library for merging large language models (LLMs) via evolutionary optimization.
Language: Jupyter Notebook - Size: 10.8 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 21 - Forks: 0

EnnengYang/Awesome-Model-Merging-Methods-Theories-Applications
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.
Size: 2.04 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 378 - Forks: 17

tanganke/fusion_bench
FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion
Language: Python - Size: 50.9 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 124 - Forks: 12

tanguy8001/continual-learning-via-model-merging
[ETH Zürich] Continual Learning via Model Merging using Linear Mode Connectivity properties for minimum-loss curve finding
Language: Jupyter Notebook - Size: 790 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 1 - Forks: 0

bloomberg/dataless-model-merging
Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)
Language: Python - Size: 114 KB - Last synced at: 8 days ago - Pushed at: almost 2 years ago - Stars: 89 - Forks: 5

uiuctml/TaskVectorBasis
[Arxiv] Code repo for the paper entitled "Efficient Model Editing with Task Vector Bases: A Theoretical Framework and Scalable Approach"
Language: Python - Size: 290 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 0 - Forks: 0

fusion-bench/fusion-bench-project-template
This repository serves as a template for creating new projects based on FusionBench. It includes all the necessary configurations and boilerplate code to get started quickly.
Language: Python - Size: 23.4 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 0

declare-lab/della
DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling
Language: Python - Size: 160 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 29 - Forks: 2

Mamiglia/mergecraft
Mergecraft is a simple library to streamline model merging operations, with seamless integration with HuggingFace🤗
Language: Jupyter Notebook - Size: 1.11 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 3 - Forks: 0

Nithin-GK/MaxFusion
[ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models
Language: Jupyter Notebook - Size: 8.7 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 22 - Forks: 2

HelloZicky/CKI
[AAAI2025 (Oral)] PyTorch implementation of "Optimize Incompatible Parameters Through Compatibility-aware Knowledge Integration".
Language: Python - Size: 4.42 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

isaacus-dev/terge
An easy-to-use Python library for merging PyTorch models.
Language: Python - Size: 870 KB - Last synced at: 25 days ago - Pushed at: 11 months ago - Stars: 9 - Forks: 0

iurada/talos-task-arithmetic
Official repository of our work "Efficient Model Editing with Task-Localized Sparse Fine-tuning" accepted at ICLR 2025
Language: Python - Size: 84 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

flowritecom/flow-merge
flow-merge is a powerful Python library that enables seamless merging of multiple transformer-based language models using the most popular merge methods such as model soups, SLERP, ties-MERGING or DARE.
Language: Python - Size: 1.58 MB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 17 - Forks: 1

zjunlp/ModelKinship
Exploring Model Kinship for Merging Large Language Models
Language: Jupyter Notebook - Size: 3.07 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 23 - Forks: 2

naskio/mergeui
All-in-one UI for merged LLMs in Hugging Face
Language: Python - Size: 422 KB - Last synced at: 20 days ago - Pushed at: 11 months ago - Stars: 24 - Forks: 3

uiuctml/Localize-and-Stitch
Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic
Language: Python - Size: 176 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 22 - Forks: 0

zhangyikaii/LAMDA-ZhiJian
ZhiJian: A Unifying and Rapidly Deployable Toolbox for Pre-trained Model Reuse
Language: Python - Size: 13 MB - Last synced at: 26 days ago - Pushed at: over 1 year ago - Stars: 51 - Forks: 2

EnnengYang/AdaMerging
AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
Language: Python - Size: 678 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 47 - Forks: 2

nik-dim/tall_masks
Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]
Language: Python - Size: 538 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 34 - Forks: 0

EnnengYang/SurgeryV2
SurgeryV2: Bridging the Gap Between Model Merging and Multi-Task Learning with Deep Representation Surgery. Arxiv, 2024.
Language: Python - Size: 540 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

EnnengYang/RepresentationSurgery
Representation Surgery for Multi-Task Model Merging. ICML, 2024.
Language: Python - Size: 1.34 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 20 - Forks: 2

danielm1405/magmax
[ECCV 2024] MagMax: Leveraging Model Merging for Seamless Continual Learning (official repository)
Language: Python - Size: 187 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 3 - Forks: 0

fr3nz99/Ratatouille-Model-Merging
Advanced Transfer Learning project with the purpose of obtaining the best model by mixing three twice-fine-tuned models.
Language: Jupyter Notebook - Size: 2.22 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

RS2002/PianoBART2
PianoBART2: Task-oriented Music Generation Model Based on Adversarial Learning
Language: Python - Size: 24.4 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

one-some/lazy-transformers-merge
Merge transformers without using like a bajillion GB of RAM
Language: Python - Size: 44.9 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 0
