GitHub topics: cross-attention

Repositories

oppolla/Self-Organizing-Virtual-Lifeform

SOVL System (Self-Organizing Virtual Lifeform): A complex, purpose-agnostic autonomous agent with continuous, asynchronous learning capabilities via a dynamic scaffolded LLM and a frozen base LLM

Language: Python - Size: 4.3 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 6 - Forks: 1

HaozheLiu-ST/T-GATE

T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!

Language: Python - Size: 54.8 MB - Last synced at: about 5 hours ago - Pushed at: 3 months ago - Stars: 396 - Forks: 24

wooyeolbaek/attention-map-diffusers

🚀 Cross attention map tools for huggingface/diffusers

Language: Python - Size: 7.89 MB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 283 - Forks: 21

autonomousvision/unimatch

[TPAMI'23] Unifying Flow, Stereo and Depth Estimation

Language: Python - Size: 21.4 MB - Last synced at: 2 days ago - Pushed at: 4 months ago - Stars: 1,246 - Forks: 124

bloc97/CrossAttentionControl

Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion

Language: Jupyter Notebook - Size: 62.3 MB - Last synced at: about 19 hours ago - Pushed at: over 2 years ago - Stars: 1,329 - Forks: 88

unum-cloud/uform

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

Language: Python - Size: 669 KB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 1,131 - Forks: 66

ntat/Class-Conditional-Diffusion

Conditional Diffuser from scratch, applied on CelebA-HQ, Cifar10 and MNIST.

Size: 4.36 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

srinadh99/AstroFormer

Photometry Guided Cross Attention Transformers for Astronomical Image Processing

Language: Jupyter Notebook - Size: 22.2 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

lanl/EPBD-BERT

Transcription factor binding site prediction for novel DNA sequence data aiding in mutation identification and drug discovery

Language: Jupyter Notebook - Size: 4.17 MB - Last synced at: about 18 hours ago - Pushed at: 9 months ago - Stars: 7 - Forks: 1

laowu-code/iTansformer_LSTM_CrossAttention_KAN

This is the implementation of the paper Enhanced Photovoltaic Power Forecasting: An iTransformer and LSTM-Based Model Integrating Temporal and Covariate Interactions

Language: Python - Size: 81.9 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 28 - Forks: 2

david-gimeno/interpreting-ssl-parkinson-speech

Official source code of the paper: "Unveiling Interpretability in Self-Supervised Speech Representations for Parkinson’s Diagnosis"

Language: Jupyter Notebook - Size: 3.78 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 4 - Forks: 1

EnergyAttention/Energy-Based-CrossAttention

The official repository of "Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models".

Language: Python - Size: 15.3 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 48 - Forks: 3

lucidrains/CALM-pytorch

Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind

Language: Python - Size: 939 KB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 174 - Forks: 11

prasunroy/mcma

:fire: Exploring Mutual Cross-Modal Attention for Context-Aware Human Affordance Generation (official code).

Size: 8.79 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

evNLP/SelfAttention

Transformer Model Implementation in PyTorch

Language: Jupyter Notebook - Size: 771 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

praveena2j/Dynamic-CrossAttention

IEEE ICME : "Cross-Attention is not always needed: Dynamic Cross-Attention for Audio-Visual Dimensional Emotion Recognition"

Language: Python - Size: 2.26 MB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

jhaayush2004/My-Transformer

Code implementation of Transformer Model in "Attention is All You Need" in PyTorch.

Language: Jupyter Notebook - Size: 5.25 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

cent664/SSRIW

Tensorflow implementation of 'Robust Image Watermarking based on Cross-Attention and Invariant Domain Learning'

Language: Jupyter Notebook - Size: 4.39 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 10 - Forks: 0

abelchai/Cross-Learning-Vision-Transformer-CL-ViT

Pytorch implementation of CL-ViT and FF-ViT models

Language: Python - Size: 3.74 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Caoxuheng/FeafusFormer

TGRS: Code for "Unsupervised Hybrid Network of Transformer and CNN for Blind Hyperspectral and RGB Image Fusion"

Language: Python - Size: 124 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 5 - Forks: 0

tasinislam21/FashionFlow

This model synthesises high-fidelity fashion videos from single images featuring spontaneous and believable movements.

Language: Python - Size: 6.7 MB - Last synced at: 10 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

Anne-Andresen/Multi-Modal-cuda-C-GAN

Raw C/cuda implementation of 3d GAN

Language: Cuda - Size: 156 KB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 1

gatsby2016/PhiHER2

PhiHER2 model, a phenotype-informed weakly supervised model for HER2 status prediction from pathological images

Language: Jupyter Notebook - Size: 13.4 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

sfu-mial/SLiMe

1-shot image segmentation using Stable Diffusion

Language: Python - Size: 23.5 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

lokeshmeesala/clickbait_detection

Clickbait detection using custom cross attention transformer model

Language: Python - Size: 1.5 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

aliasgharkhani/SLiMe

1-shot image segmentation using Stable Diffusion

Language: Python - Size: 31.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 95 - Forks: 8

timbroed/HRFuser

[ITSC-2023] HRFuser: A Multi-resolution Sensor Fusion Architecture for 2D Object Detection

Language: Python - Size: 47.4 MB - Last synced at: 11 months ago - Pushed at: almost 2 years ago - Stars: 29 - Forks: 2

jwings1/H2O

3D Human-Object Interaction in Video A New Approach to Object Tracking via Cross-Modal Attention

Language: Python - Size: 800 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Cominclip/Crossattention_map

Language: Python - Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

akashe/Multimodal-action-recognition

Code on selecting an action based on multimodal inputs. Here in this case inputs are voice and text.

Language: Python - Size: 64.7 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 69 - Forks: 11

augustwester/transformer-xl

A lightweight PyTorch implementation of the Transformer-XL architecture proposed by Dai et al. (2019)

Language: Python - Size: 440 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 21 - Forks: 1

Related Keywords

cross-attention 31 pytorch 11 self-attention 7 transformer 6 deep-learning 5 stable-diffusion 5 transformers 4 attention-mechanism 4 computer-vision 3 python 3 1-shot-segmentation 2 text-to-image 2 few-shot-segmentation 2 cross-attention-map 2 image-segmentation 2 multimodal-deep-learning 2 was-attention 2 vision-transformer 2 multimodal-learning 2 diffusion 2 diffusers 2 cross-attention-diffusers 2 bert 2 ai 2 diffusion-models 2 self-supervised-learning 2 super-resolution 1 blind-fusion 1 supervised-learning 1 positional-encoding 1 plant-disease-identification 1 transformer-architecture 1 invariant-domain-representation 1 robust-image-watermarking 1 feedforward-neural-network 1 encoder-decoder 1 emotion-recognition 1 audio-visual-learning 1 attention-model 1 attention 1 affective-computing 1 vae 1 human-pose 1 affordance-generation 1 attention-mechanisms 1 artificial-intelligence 1 energy-based-model 1 ebm 1 speech-analysis 1 xlnet 1 transformer-xl 1 nlp 1 multimodality 1 multimodal-fusion 1 multimodal-data 1 multimodal-action-recognition 1 generative-model 1 object-tracking 1 human-object-interaction 1 sensor-fusion 1 object-detection 1 adverse-weather-condition 1 text-classification 1 semi-supervised-learning 1 prototype-pattern 1 pathological-images 1 multi-instance-learning 1 transformers-c 1 transformer-pytorch 1 medical-imaging 1 low-level-programming 1 gan-models 1 gan 1 cuda 1 cross-attention-c 1 c 1 3d-models 1 3d 1 video-synthesis 1 latent-diffusion 1 fashion 1 autonomous-agents 1 visualization 1 correspondence 1 depth 1 matching 1 optical-flow 1 stereo 1 unified-model 1 clip 1 clustering 1 contrastive-learning 1 huggingface-transformers 1 image-search 1 language-vision 1 llava 1 multi-lingual 1 multimodal 1 confidence-based-learning 1 dreaming-ai 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos