GitHub topics: cross-attention
oppolla/Self-Organizing-Virtual-Lifeform
SOVL System (Self-Organizing Virtual Lifeform): A complex, purpose-agnostic autonomous agent with continuous, asynchronous learning capabilities via a dynamic scaffolded LLM and a frozen base LLM
Language: Python - Size: 4.3 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 6 - Forks: 1

HaozheLiu-ST/T-GATE
T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!
Language: Python - Size: 54.8 MB - Last synced at: about 5 hours ago - Pushed at: 3 months ago - Stars: 396 - Forks: 24

wooyeolbaek/attention-map-diffusers
🚀 Cross attention map tools for huggingface/diffusers
Language: Python - Size: 7.89 MB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 283 - Forks: 21

autonomousvision/unimatch
[TPAMI'23] Unifying Flow, Stereo and Depth Estimation
Language: Python - Size: 21.4 MB - Last synced at: 2 days ago - Pushed at: 4 months ago - Stars: 1,246 - Forks: 124

bloc97/CrossAttentionControl
Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion
Language: Jupyter Notebook - Size: 62.3 MB - Last synced at: about 19 hours ago - Pushed at: over 2 years ago - Stars: 1,329 - Forks: 88

unum-cloud/uform
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
Language: Python - Size: 669 KB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 1,131 - Forks: 66

ntat/Class-Conditional-Diffusion
Conditional Diffuser from scratch, applied on CelebA-HQ, Cifar10 and MNIST.
Size: 4.36 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

srinadh99/AstroFormer
Photometry Guided Cross Attention Transformers for Astronomical Image Processing
Language: Jupyter Notebook - Size: 22.2 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

lanl/EPBD-BERT
Transcription factor binding site prediction for novel DNA sequence data aiding in mutation identification and drug discovery
Language: Jupyter Notebook - Size: 4.17 MB - Last synced at: about 18 hours ago - Pushed at: 9 months ago - Stars: 7 - Forks: 1

laowu-code/iTansformer_LSTM_CrossAttention_KAN
This is the implementation of the paper Enhanced Photovoltaic Power Forecasting: An iTransformer and LSTM-Based Model Integrating Temporal and Covariate Interactions
Language: Python - Size: 81.9 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 28 - Forks: 2

david-gimeno/interpreting-ssl-parkinson-speech
Official source code of the paper: "Unveiling Interpretability in Self-Supervised Speech Representations for Parkinson’s Diagnosis"
Language: Jupyter Notebook - Size: 3.78 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 4 - Forks: 1

EnergyAttention/Energy-Based-CrossAttention
The official repository of "Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models".
Language: Python - Size: 15.3 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 48 - Forks: 3

lucidrains/CALM-pytorch
Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind
Language: Python - Size: 939 KB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 174 - Forks: 11

prasunroy/mcma
:fire: Exploring Mutual Cross-Modal Attention for Context-Aware Human Affordance Generation (official code).
Size: 8.79 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

evNLP/SelfAttention
Transformer Model Implementation in PyTorch
Language: Jupyter Notebook - Size: 771 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

praveena2j/Dynamic-CrossAttention
IEEE ICME : "Cross-Attention is not always needed: Dynamic Cross-Attention for Audio-Visual Dimensional Emotion Recognition"
Language: Python - Size: 2.26 MB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

jhaayush2004/My-Transformer
Code implementation of Transformer Model in "Attention is All You Need" in PyTorch.
Language: Jupyter Notebook - Size: 5.25 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

cent664/SSRIW
Tensorflow implementation of 'Robust Image Watermarking based on Cross-Attention and Invariant Domain Learning'
Language: Jupyter Notebook - Size: 4.39 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 10 - Forks: 0

abelchai/Cross-Learning-Vision-Transformer-CL-ViT
Pytorch implementation of CL-ViT and FF-ViT models
Language: Python - Size: 3.74 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Caoxuheng/FeafusFormer
TGRS: Code for "Unsupervised Hybrid Network of Transformer and CNN for Blind Hyperspectral and RGB Image Fusion"
Language: Python - Size: 124 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 5 - Forks: 0

tasinislam21/FashionFlow
This model synthesises high-fidelity fashion videos from single images featuring spontaneous and believable movements.
Language: Python - Size: 6.7 MB - Last synced at: 10 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

Anne-Andresen/Multi-Modal-cuda-C-GAN
Raw C/cuda implementation of 3d GAN
Language: Cuda - Size: 156 KB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 1

gatsby2016/PhiHER2
PhiHER2 model, a phenotype-informed weakly supervised model for HER2 status prediction from pathological images
Language: Jupyter Notebook - Size: 13.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

sfu-mial/SLiMe
1-shot image segmentation using Stable Diffusion
Language: Python - Size: 23.5 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

lokeshmeesala/clickbait_detection
Clickbait detection using custom cross attention transformer model
Language: Python - Size: 1.5 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

aliasgharkhani/SLiMe
1-shot image segmentation using Stable Diffusion
Language: Python - Size: 31.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 95 - Forks: 8

timbroed/HRFuser
[ITSC-2023] HRFuser: A Multi-resolution Sensor Fusion Architecture for 2D Object Detection
Language: Python - Size: 47.4 MB - Last synced at: 11 months ago - Pushed at: almost 2 years ago - Stars: 29 - Forks: 2

jwings1/H2O
3D Human-Object Interaction in Video A New Approach to Object Tracking via Cross-Modal Attention
Language: Python - Size: 800 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Cominclip/Crossattention_map
Language: Python - Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

akashe/Multimodal-action-recognition
Code on selecting an action based on multimodal inputs. Here in this case inputs are voice and text.
Language: Python - Size: 64.7 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 69 - Forks: 11

augustwester/transformer-xl
A lightweight PyTorch implementation of the Transformer-XL architecture proposed by Dai et al. (2019)
Language: Python - Size: 440 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 21 - Forks: 1
