Topic: "transformer-models"
cmhungsteve/Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Size: 5.65 MB - Last synced at: 11 days ago - Pushed at: 10 months ago - Stars: 4,848 - Forks: 492

kyegomez/swarms
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai
Language: Python - Size: 104 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 4,844 - Forks: 557

OpenNMT/CTranslate2
Fast inference engine for Transformer models
Language: C++ - Size: 14.5 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 3,785 - Forks: 354

sovrasov/flops-counter.pytorch
FLOPs counter for neural networks in the PyTorch framework
Language: Python - Size: 179 KB - Last synced at: about 8 hours ago - Pushed at: 4 months ago - Stars: 2,891 - Forks: 309

VITA-Group/TransGAN
[NeurIPS 2021] "TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up", Yifan Jiang, Shiyu Chang, Zhangyang Wang
Language: Python - Size: 139 MB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 1,671 - Forks: 204

vturrisi/solo-learn
solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning
Language: Python - Size: 5.04 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 1,479 - Forks: 190

uncbiag/Awesome-Foundation-Models
A curated list of foundation models for vision and language tasks
Size: 358 KB - Last synced at: 12 days ago - Pushed at: 16 days ago - Stars: 989 - Forks: 46

harleyszhang/llm_note
LLM notes covering model inference, transformer model structure, and LLM framework code analysis.
Language: Python - Size: 177 MB - Last synced at: about 6 hours ago - Pushed at: about 7 hours ago - Stars: 765 - Forks: 78

daiquocnguyen/Graph-Transformer
Universal Graph Transformer Self-Attention Networks (TheWebConf WWW 2022) (Pytorch and Tensorflow)
Language: Python - Size: 109 MB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 622 - Forks: 77

ScalaConsultants/Aspect-Based-Sentiment-Analysis
💭 Aspect-Based-Sentiment-Analysis: Transformer & Explainable ML (TensorFlow)
Language: Python - Size: 1.8 MB - Last synced at: 14 days ago - Pushed at: about 2 years ago - Stars: 568 - Forks: 90

imperial-qore/TranAD
[VLDB'22] Anomaly Detection using Transformers, self-conditioning and adversarial training.
Language: Python - Size: 133 MB - Last synced at: 5 months ago - Pushed at: 10 months ago - Stars: 526 - Forks: 160

cuiziteng/Illumination-Adaptive-Transformer
🌕 [BMVC 2022] You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure Correction. SOTA for low-light enhancement; runs in 0.004 seconds, so try it for pre-processing.
Language: Python - Size: 29.4 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 513 - Forks: 49

philipturner/metal-flash-attention
FlashAttention (Metal Port)
Language: Swift - Size: 9.26 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 459 - Forks: 23

usefulsensors/useful-transformers
Efficient Inference of Transformer models
Language: C++ - Size: 135 MB - Last synced at: about 19 hours ago - Pushed at: 9 months ago - Stars: 432 - Forks: 42

HHousen/TransformerSum
Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive summarization datasets to the extractive task.
Language: Python - Size: 11.7 MB - Last synced at: 13 days ago - Pushed at: about 2 years ago - Stars: 430 - Forks: 57

RetroCirce/HTS-Audio-Transformer
The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"
Language: Python - Size: 896 KB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 407 - Forks: 68

audeering/w2v2-how-to
How to use our public wav2vec2 dimensional emotion model
Language: Jupyter Notebook - Size: 98.6 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 398 - Forks: 47

ThilinaRajapakse/pytorch-transformers-classification
Based on the Pytorch-Transformers library by HuggingFace. To be used as a starting point for employing Transformer models in text classification tasks. Contains code to easily train BERT, XLNet, RoBERTa, and XLM models for text classification.
Language: Jupyter Notebook - Size: 182 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 289 - Forks: 101

dpressel/mint
MinT: Minimal Transformer Library and Tutorials
Language: Python - Size: 123 KB - Last synced at: 4 days ago - Pushed at: almost 3 years ago - Stars: 254 - Forks: 14

xashru/punctuation-restoration
Punctuation Restoration using Transformer Models for High-and Low-Resource Languages
Language: Python - Size: 10.7 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 206 - Forks: 67

RetroCirce/Zero_Shot_Audio_Source_Separation
The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022
Language: Python - Size: 684 KB - Last synced at: 5 days ago - Pushed at: almost 3 years ago - Stars: 199 - Forks: 33

yizhongw/Tk-Instruct
Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.
Language: Python - Size: 9.93 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 179 - Forks: 28

csinva/imodelsX
Interpret text data using LLMs (scikit-learn compatible).
Language: Python - Size: 35 MB - Last synced at: 5 days ago - Pushed at: about 2 months ago - Stars: 163 - Forks: 26

OpenMachine-ai/transformer-tricks
A collection of tricks and tools to speed up transformer models
Language: TeX - Size: 10.2 MB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 152 - Forks: 8

davidnvq/grit
GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)
Language: Python - Size: 84.2 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 151 - Forks: 20

AnkurDeria/MFT
Pytorch implementation of Multimodal Fusion Transformer for Remote Sensing Image Classification.
Language: Jupyter Notebook - Size: 2.13 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 130 - Forks: 8

catherinesyeh/attention-viz
Visualizing query-key interactions in language + vision transformers
Language: HTML - Size: 1.01 GB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 103 - Forks: 12

kyegomez/Algorithm-Of-Thoughts
My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"
Language: Python - Size: 294 KB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 98 - Forks: 15

crockwell/rel_pose
[3DV 2022] The 8-Point Algorithm as an Inductive Bias for Relative Pose Prediction by ViTs
Language: Python - Size: 1.49 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 89 - Forks: 5

Bindwell/PLAPT
Codebase and CLI for PLAPT: A state-of-the-art protein-ligand binding affinity model for drug discovery
Language: Mathematica - Size: 93 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 88 - Forks: 10

Sea-Snell/grokking
unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"
Language: Python - Size: 1.82 MB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 77 - Forks: 15

aofrancani/TSformer-VO
Implementation of the paper "Transformer-based model for monocular visual odometry: a video understanding approach".
Language: Python - Size: 303 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 74 - Forks: 11

cptq/SignNet-BasisNet
SignNet and BasisNet
Language: Python - Size: 2.55 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 72 - Forks: 13

GuoLanqing/ShadowFormer
ShadowFormer (AAAI2023), Pytorch implementation
Language: Python - Size: 1.12 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 69 - Forks: 9

sagizty/Multi-Stage-Hybrid-Transformer
Official release of MSHT: Multi-stage Hybrid Transformer for the ROSE Image Analysis of Pancreatic Cancer https://ieeexplore.ieee.org/document/10006398
Language: Jupyter Notebook - Size: 128 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 67 - Forks: 7

ayanglab/SwinMR
This is the official implementation of our proposed SwinMR
Language: Python - Size: 349 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 58 - Forks: 9

banglanlp/bnlp-resources
Awesome datasets for Bangla language computing.
Language: Python - Size: 242 MB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 57 - Forks: 21

voidful/TFkit
🤖📇 Handling multiple NLP tasks in one pipeline
Language: Python - Size: 15.8 MB - Last synced at: 20 days ago - Pushed at: about 2 years ago - Stars: 56 - Forks: 6

rayabhisek123/CFAT
[CVPR 2024] "CFAT: Unleashing Triangular Windows for Image Super-resolution"
Language: Python - Size: 3.48 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 52 - Forks: 3

marslanm/Multimodality-Representation-Learning
This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which are cited and discussed in the recently accepted survey: https://dl.acm.org/doi/abs/10.1145/3617833
Size: 63.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 51 - Forks: 7

sviperm/neuro-comma
🇷🇺 Punctuation restoration production-ready model for Russian language 🇷🇺
Language: Python - Size: 552 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 51 - Forks: 9

sovit-123/vision_transformers
Vision Transformers for image classification, image segmentation, and object detection.
Language: Python - Size: 44.1 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 49 - Forks: 9

cerbrec/graphbook
Expedite Discovery. Cerbrec Graphbook is a graphical AI platform for everyone to build bespoke AI solutions.
Size: 16.1 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 47 - Forks: 19

KittenCN/predict_Lottery_ticket_pytorch
Lottery prediction based on Transformer / LSTM models in PyTorch
Language: Python - Size: 147 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 46 - Forks: 17

wq2012/SpeakerRecognitionFromScratch
Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家
Language: Python - Size: 9.2 MB - Last synced at: 30 days ago - Pushed at: about 1 year ago - Stars: 44 - Forks: 14

ZhouYuxuanYX/Hyperformer
This is the official implementation of our paper "Hypergraph Transformer for Skeleton-based Action Recognition."
Language: Python - Size: 1.25 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 42 - Forks: 11

andrerochow/fsrt
Official implementation of the CVPR 2024 paper "FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features"
Language: Python - Size: 22.5 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 39 - Forks: 1

Soham-Deshpande/Stock-TFT
Stock price prediction using a Temporal Fusion Transformer
Language: TeX - Size: 81.9 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 38 - Forks: 13

oliverguhr/spelling
This is a neural spell checker
Language: Python - Size: 63.4 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 33 - Forks: 6

MasterHow/FlowLens
PyTorch implementation of FlowLens (https://arxiv.org/pdf/2211.11293)
Language: Python - Size: 35.3 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 30 - Forks: 1

CEC-Agent/CEC
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"
Language: Python - Size: 450 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 29 - Forks: 4

tobna/WhatTransformerToFavor
GitHub repository for the paper "Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers".
Language: Python - Size: 2.85 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 27 - Forks: 7

januverma/transformers-stuff
Codes, scripts, and notebooks on various aspects of transformer models.
Language: Jupyter Notebook - Size: 473 KB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 27 - Forks: 4

cankocagil/SwinDetr
Integration of Swin Transformer to DETR for Robust Object Detection (DEMO)
Language: Jupyter Notebook - Size: 2.81 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 26 - Forks: 5

pbcquoc/transformer
Machine translation between English and Vietnamese
Language: Jupyter Notebook - Size: 239 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 26 - Forks: 20

dweam-team/awesome-world-model-games
List of world model games and where to find them!
Size: 66.4 KB - Last synced at: 1 day ago - Pushed at: 27 days ago - Stars: 22 - Forks: 0

julienkay/com.doji.transformers
A Unity package to run pretrained transformer models with Unity Sentis
Language: C# - Size: 714 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 22 - Forks: 2

mohd-faizy/06P_Sentiment-Analysis-With-Deep-Learning-Using-BERT
Finetuning BERT in PyTorch for sentiment analysis.
Language: Jupyter Notebook - Size: 20.6 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 22 - Forks: 9

RyanLauQF/BloomBERT
Task Complexity Classifier using Transformer-based NLP model based on Bloom's Taxonomy
Language: Jupyter Notebook - Size: 1010 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 19 - Forks: 5

kyegomez/GPT3
An implementation of the base GPT-3 model architecture from the OpenAI paper "Language Models are Few-Shot Learners"
Language: Python - Size: 236 KB - Last synced at: 5 days ago - Pushed at: 11 months ago - Stars: 19 - Forks: 2

pratham16cse/AggForecaster
Code for "Coherent Probabilistic Aggregate Queries on Long-horizon Forecasts", IJCAI 2022
Language: Python - Size: 89.4 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 18 - Forks: 4

soarsmu/AutoPruner
AutoPruner: Transformer-based Call Graph Pruning (ESEC/FSE 2022, Research Track)
Language: Python - Size: 950 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 17 - Forks: 3

lu-wo/DETRtime
DETRtime, a framework for time-series segmentation
Language: Python - Size: 102 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 17 - Forks: 0

rbitr/ferrite
Simple, lightweight transformers in Fortran
Language: Fortran - Size: 28.3 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 1

firojalam/crisis_datasets_benchmarks
Crisis Dataset for Benchmarks Experiments
Language: Python - Size: 1.41 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 16 - Forks: 5

domyounglee/Transformer-Summarization
An optimized Transformer based abstractive summarization model with Tensorflow
Language: Python - Size: 42.8 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 16 - Forks: 2

rahul-jha98/JustJoking.ai
Using a Transformer for learning the Language Model and Generate Short Jokes
Language: Jupyter Notebook - Size: 1.12 MB - Last synced at: about 1 month ago - Pushed at: almost 5 years ago - Stars: 13 - Forks: 2

programmer290399/pyqna
A simple python package for question answering.
Language: Python - Size: 3.95 MB - Last synced at: 9 months ago - Pushed at: almost 2 years ago - Stars: 11 - Forks: 5

wbsg-uni-mannheim/productCategorization
This repository contains code and data download instructions for the workshop paper "Improving Hierarchical Product Classification using Domain-specific Language Modelling" by Alexander Brinkmann and Christian Bizer.
Language: Python - Size: 453 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 11 - Forks: 2

kyegomez/ShallowFF
Zeta implementation of "Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers"
Language: Python - Size: 36.2 MB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 10 - Forks: 1

usualheart/PRformer
The official repository of the PRformer paper: "PRformer: Pyramidal Recurrent Transformer for Multivariate Time Series Forecasting." This work is developed by the Lab of Professor Feiping Nie and Xuelong Li, Northwestern Polytechnical University.
Language: Python - Size: 3.69 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 10 - Forks: 0

Moeinh77/Virus-DNA-classification-BERT
Classification of 6 viruses including covid-19 based on their DNA sequences using Transformers
Language: Jupyter Notebook - Size: 6.81 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 10 - Forks: 3

Starscream-11813/MathBot
MathBot is a transformer-based Math Word Problem (MWP) solver made as the Lab project for CSE 4622: Machine Learning Lab.
Language: Jupyter Notebook - Size: 24.7 MB - Last synced at: 11 months ago - Pushed at: almost 3 years ago - Stars: 10 - Forks: 2

Merterm/Modeling-Intensification-for-SLG
Public repo for the paper: "Modeling Intensification for Sign Language Generation: A Computational Approach" by Mert Inan*, Yang Zhong*, Sabit Hassan*, Lorna Quandt, Malihe Alikhani
Language: Python - Size: 7 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 10 - Forks: 2

LazerLambda/Promptzl
Turn LLMs into zero-shot PyTorch classifiers!
Language: Python - Size: 527 KB - Last synced at: 23 days ago - Pushed at: 4 months ago - Stars: 9 - Forks: 0

GabMartino/TransformerForDummies
Annotated implementation of vanilla Transformers to guide through all the ambiguities.
Language: Python - Size: 4.57 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 8 - Forks: 0

MohanKrishnaGR/Infosys_Text-Summarization
This repository contains the implementation of a Transformer-based model for abstractive text summarization and a rule-based approach for extractive text summarization.
Language: Jupyter Notebook - Size: 15.9 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 8 - Forks: 2

smitkiri/news-qa
Reading comprehension based question-answering model for news articles.
Language: Jupyter Notebook - Size: 25.6 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 8 - Forks: 2

rafaljanwojcik/SentenceBERT_vs_SiameseLSTM
Master's thesis repository with evaluation of BERT-based models on Quora Question Dataset, in comparison to Siamese LSTM models
Language: Jupyter Notebook - Size: 32.9 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 8 - Forks: 2

kyegomez/LongVit
A simplistic pytorch implementation of LongVit using my previous implementation of LongNet as a foundation.
Language: Shell - Size: 2.15 MB - Last synced at: 5 days ago - Pushed at: 6 months ago - Stars: 7 - Forks: 0

banglanlp/bangla-sentiment-classification
Bangla Sentiment Classification
Size: 8.08 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 3

lizhh268/ShadowMaskFormer
[TAI 2025] Official implementation of TAI-accepted paper: ShadowMaskFormer: Mask Augmented Patch Embedding for Shadow Removal
Language: Python - Size: 4.09 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 6 - Forks: 0

alphagov/govuk-content-metadata
GovNER: an encoder-based language model (RoBERTa) fine-tuned to perform Named Entity Recognition (NER) on GOV.UK content
Language: Python - Size: 21.2 MB - Last synced at: 3 days ago - Pushed at: about 2 months ago - Stars: 6 - Forks: 1

djene-mengistu/dseg_models
This repo contains implementation of deep learning-based steel surface defect segmentation models.
Language: Python - Size: 13.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 1

harmanpreet93/low-resource-machine-translation
Low resource machine translation using Transformers and Iterative Back translation
Language: Python - Size: 7.34 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 6 - Forks: 1

szczyglis-dev/gpt3-py
[Python] "Bring-Your-Own-Key" terminal-based application for interacting with OpenAI's GPT-3. It provides a chat mode and code generation in Python, C++, C#, Java, JavaScript, TypeScript, PHP, Assembly, SQL, Bash, Ruby, Go, Perl, R, Matlab, Q# and more.
Language: Python - Size: 98.6 KB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 5 - Forks: 2

NSLab-CUK/Context-Aware-Residual-Transformer
Context-Aware Residual Transformer (CART) is a kiosk recommendation system that uses self-supervised learning techniques tailored to kiosks in an offline retail environment. It was developed as a collaboration between NS Lab @ CUK and IIP Lab @ Gachon University on a pure PyTorch backend.
Language: Python - Size: 9.55 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 2

kamrulhasanrony/Vision-Transformer-based-Food-Classification
Vision Transformer Based Food-101 Classification
Language: Python - Size: 1020 KB - Last synced at: 9 months ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 1

abhaskumarsinha/Keras-implementation-of-Transformer-Architecture
This repository presents a Python-based implementation of the Transformer architecture on Keras TensorFlow library, as proposed by Vaswani et al. in their 2017 paper "Attention is all you need."
Language: Jupyter Notebook - Size: 223 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 3

rishabkr/Attention-Is-All-You-Need-Explained-PyTorch
A from-scratch paper implementation and tutorial that combines various great resources for implementing the Transformer discussed in the "Attention Is All You Need" paper, applied to German-to-English translation.
Language: Jupyter Notebook - Size: 84.7 MB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 5 - Forks: 2

gallipoligiuseppe/TST-CycleGAN
This repository contains the code for the paper "Self-supervised Text Style Transfer using Cycle-Consistent Adversarial Networks".
Language: Python - Size: 5.85 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 4 - Forks: 0

KwokHing/AI-Planet-LLM-Bootcamp-Challenge
An LLM challenge to (i) fine-tune pre-trained HuggingFace transformer model to build a Code Generation language model, and (ii) build a retrieval-augmented generation (RAG) application using LangChain
Language: Jupyter Notebook - Size: 874 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

naokishibuya/simple_transformer
A Transformer Implementation that is easy to understand and customizable.
Language: Python - Size: 46.9 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 2

Merterm/COSMic
Public repo for the paper: "COSMic: A Coherence-Aware Generation Metric for Image Descriptions" by Mert İnan, Piyush Sharma, Baber Khalid, Radu Soricut, Matthew Stone, Malihe Alikhani
Language: Python - Size: 396 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 0

Jmkernes/Shakespeare-Translator
Dost thou readeth thine description? An English->Shakespearian translator in TensorFlow 2.0+
Language: Jupyter Notebook - Size: 101 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 1

kreasof-ai/lawful-diffusion
Lawful Diffusion: an ethical way to address copyright violation in text-to-image generative models.
Language: Python - Size: 1.43 MB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 3 - Forks: 1

ksm26/Embedding-Models-From-Architecture-to-Implementation
Understand and build embedding models, focusing on word and sentence embeddings, dual encoder architectures. Learn to train embedding models using contrastive loss, implement them in semantic search and RAG systems.
Language: Jupyter Notebook - Size: 2 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 3 - Forks: 0

liuzwin98/DSCMT
code released
Language: Python - Size: 64.5 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

swechhasingh/nlp-from-scratch
Implementing language models in PyTorch, progressing from simple NLP techniques to advanced Transformer-based models
Language: Jupyter Notebook - Size: 7.82 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

retkowsky/many_models_image_classification
Many-models image classification using Transformer models
Language: Jupyter Notebook - Size: 5.46 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0
