GitHub topics: mla
abdelfattah-lab/xKV
xKV: Cross-Layer SVD for KV-Cache Compression
Language: Python - Size: 30.9 MB - Last synced at: about 16 hours ago - Pushed at: about 17 hours ago - Stars: 27 - Forks: 2

fxmeng/TransMLA
TransMLA: Multi-Head Latent Attention Is All You Need
Language: Python - Size: 332 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 299 - Forks: 22

xlite-dev/Awesome-LLM-Inference
📚A curated list of Awesome LLM Inference Papers with Codes.
Language: Python - Size: 115 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 4,123 - Forks: 287

Bruce-Lee-LY/decoding_attention
Decoding Attention is specially optimized for MHA, MQA, GQA and MLA using CUDA cores for the decoding stage of LLM inference.
Language: C++ - Size: 867 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 37 - Forks: 4

scar-ai/The-Latentformer
Latentformer is a transformer model with latent attention designed for efficient training. It features learnable positional embeddings, rotary position encoding, and MLA to optimize speed and performance while maintaining model quality.
Language: Python - Size: 44.9 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 2 - Forks: 0

xlite-dev/ffpa-attn
📚FFPA(Split-D): Extend FlashAttention with Split-D for large headdim, O(1) GPU SRAM complexity, 1.8x~3x↑🎉 faster than SDPA EA.
Language: Cuda - Size: 4.21 MB - Last synced at: 9 days ago - Pushed at: about 1 month ago - Stars: 185 - Forks: 8

hahnec/plenopticam
Light-field imaging application for plenoptic cameras
Language: Python - Size: 227 MB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 222 - Forks: 38

hahnec/plenoptisign
Light field geometry estimator for plenoptic cameras
Language: Python - Size: 12.2 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 42 - Forks: 13

LemonAttn/mini_transformer
A minimal Transformer architecture for quickly building the various current Transformer-based models
Language: Python - Size: 389 KB - Last synced at: 6 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

junfanz1/MiniGPT-and-DeepSeek-MLA-Multi-Head-Latent-Attention
An efficient and scalable attention module designed to reduce memory usage and improve inference speed in large language models. Designed and implemented the Multi-Head Latent Attention (MLA) module as a drop-in replacement for traditional multi-head attention (MHA) in large language models.
Language: Python - Size: 74.2 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

shadowpa0327/Palu
Code for Palu: Compressing KV-Cache with Low-Rank Projection
Language: Python - Size: 337 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 76 - Forks: 4

aeksco/hardcider
:beer: CLI for quickly generating citations for websites and books
Language: JavaScript - Size: 429 KB - Last synced at: 24 days ago - Pushed at: over 6 years ago - Stars: 18 - Forks: 2

lemonyte/mla-terminal
A recreation of the terminal interface from the video game The Talos Principle.
Language: C# - Size: 35.2 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

James-E-A/mla69 📦
[ON HOLD] "Easily" write MLA papers in Markdown
Language: TeX - Size: 3.91 KB - Last synced at: 10 months ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

kdkasad/latex-mlareport
MLA-style document class for LaTeX
Language: TeX - Size: 12.7 KB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

kylecorry31/citation-helper 📦
An APA citation helper website (without ads!)
Language: JavaScript - Size: 12.7 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 1

GMMDMDIDEMS/titlecase-converter
Title case converter supporting AMA, AP, APA, Bluebook, Chicago, MLA, NY Times, and Wikipedia style.
Size: 3.91 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

ssterling/mla.cls
LaTeX class for MLA-style papers
Language: TeX - Size: 2.58 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

WillDev12/MLA-Helper
Provided is a Google Apps Script whose sole purpose is to help make MLA writing easier
Language: HTML - Size: 945 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 0

tifrueh/mlaatkst
Footnote helper for KST students
Language: C++ - Size: 1.75 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

tohid-yousefi/Predicting_the_Energy_Efficiency_of_Buildings
Predicting the energy efficiency of buildings with machine learning algorithms.
Language: Jupyter Notebook - Size: 191 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

jmclawson/biblatex-mla
MLA-style citations and bibliographies using Biblatex
Language: TeX - Size: 5.4 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 21 - Forks: 9

nahueespinosa/usb_midi_bridge
Implementation of a composite USB device containing a CDC interface and audio MIDI interface for PIC18F2550/4550 based on examples of the Microchip Libraries for Applications.
Language: C - Size: 11.8 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

justsoft/MLA-Tuner
MLA Remote Tuner
Language: C++ - Size: 1.47 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

finnbear/cite
Automatic, ad-free citations
Language: JavaScript - Size: 4.22 MB - Last synced at: 17 days ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 0

ariporad/schoolkit
A Simple Toolkit for Managing Schoolwork
Language: Shell - Size: 197 KB - Last synced at: 3 months ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

RGBA-CRT/PIC-SFC-PSX-Interface
USB SNES/PSX gamepad converter using PIC18F14K50
Language: C - Size: 312 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

MechaDragonX/hawkeye
An application that helps you create and manage citations for a research paper or other project. Named after Lt. Hawkeye, personal adjutant and bodyguard to Col. Mustang in the Fullmetal Alchemist manga and anime series.
Language: HTML - Size: 8.79 KB - Last synced at: 5 days ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

luciodj/PIC32MikromediaCode
Code examples from the Graphics, Touch, Sound and USB book ported to the PIC32Mikromedia board
Language: C - Size: 1.33 MB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 1

luciodj/PIC24MikromediaCode
This is the source code for the book: "Graphic, Touch, Sound and USB". This repository contains the PIC24 Mikromedia specific version.
Language: C - Size: 1.44 MB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 1
