An open API service providing repository metadata for many open source software ecosystems.

Topic: "transformer-models"

cmhungsteve/Awesome-Transformer-Attention

An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

Size: 5.65 MB - Last synced at: 11 days ago - Pushed at: 10 months ago - Stars: 4,848 - Forks: 492

kyegomez/swarms

The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai

Language: Python - Size: 104 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 4,844 - Forks: 557

OpenNMT/CTranslate2

Fast inference engine for Transformer models

Language: C++ - Size: 14.5 MB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 3,785 - Forks: 354

sovrasov/flops-counter.pytorch

Flops counter for neural networks in pytorch framework

Language: Python - Size: 179 KB - Last synced at: about 8 hours ago - Pushed at: 4 months ago - Stars: 2,891 - Forks: 309

VITA-Group/TransGAN

[NeurIPS 2021] "TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up", Yifan Jiang, Shiyu Chang, Zhangyang Wang

Language: Python - Size: 139 MB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 1,671 - Forks: 204

vturrisi/solo-learn

solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning

Language: Python - Size: 5.04 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 1,479 - Forks: 190

uncbiag/Awesome-Foundation-Models

A curated list of foundation models for vision and language tasks

Size: 358 KB - Last synced at: 12 days ago - Pushed at: 16 days ago - Stars: 989 - Forks: 46

harleyszhang/llm_note

LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.

Language: Python - Size: 177 MB - Last synced at: about 6 hours ago - Pushed at: about 7 hours ago - Stars: 765 - Forks: 78

daiquocnguyen/Graph-Transformer

Universal Graph Transformer Self-Attention Networks (TheWebConf WWW 2022) (Pytorch and Tensorflow)

Language: Python - Size: 109 MB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 622 - Forks: 77

ScalaConsultants/Aspect-Based-Sentiment-Analysis

💭 Aspect-Based-Sentiment-Analysis: Transformer & Explainable ML (TensorFlow)

Language: Python - Size: 1.8 MB - Last synced at: 14 days ago - Pushed at: about 2 years ago - Stars: 568 - Forks: 90

imperial-qore/TranAD

[VLDB'22] Anomaly Detection using Transformers, self-conditioning and adversarial training.

Language: Python - Size: 133 MB - Last synced at: 5 months ago - Pushed at: 10 months ago - Stars: 526 - Forks: 160

cuiziteng/Illumination-Adaptive-Transformer

🌕 [BMVC 2022] You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure Correction. SOTA for low-light enhancement; at 0.004 seconds, try this for pre-processing.

Language: Python - Size: 29.4 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 513 - Forks: 49

philipturner/metal-flash-attention

FlashAttention (Metal Port)

Language: Swift - Size: 9.26 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 459 - Forks: 23

usefulsensors/useful-transformers

Efficient Inference of Transformer models

Language: C++ - Size: 135 MB - Last synced at: about 19 hours ago - Pushed at: 9 months ago - Stars: 432 - Forks: 42

HHousen/TransformerSum

Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive summarization datasets to the extractive task.

Language: Python - Size: 11.7 MB - Last synced at: 13 days ago - Pushed at: about 2 years ago - Stars: 430 - Forks: 57

RetroCirce/HTS-Audio-Transformer

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

Language: Python - Size: 896 KB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 407 - Forks: 68

audeering/w2v2-how-to

How to use our public wav2vec2 dimensional emotion model

Language: Jupyter Notebook - Size: 98.6 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 398 - Forks: 47

ThilinaRajapakse/pytorch-transformers-classification

Based on the Pytorch-Transformers library by HuggingFace. To be used as a starting point for employing Transformer models in text classification tasks. Contains code to easily train BERT, XLNet, RoBERTa, and XLM models for text classification.

Language: Jupyter Notebook - Size: 182 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 289 - Forks: 101

dpressel/mint

MinT: Minimal Transformer Library and Tutorials

Language: Python - Size: 123 KB - Last synced at: 4 days ago - Pushed at: almost 3 years ago - Stars: 254 - Forks: 14

xashru/punctuation-restoration

Punctuation Restoration using Transformer Models for High-and Low-Resource Languages

Language: Python - Size: 10.7 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 206 - Forks: 67

RetroCirce/Zero_Shot_Audio_Source_Separation

The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022

Language: Python - Size: 684 KB - Last synced at: 5 days ago - Pushed at: almost 3 years ago - Stars: 199 - Forks: 33

yizhongw/Tk-Instruct

Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.

Language: Python - Size: 9.93 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 179 - Forks: 28

csinva/imodelsX

Interpret text data using LLMs (scikit-learn compatible).

Language: Python - Size: 35 MB - Last synced at: 5 days ago - Pushed at: about 2 months ago - Stars: 163 - Forks: 26

OpenMachine-ai/transformer-tricks

A collection of tricks and tools to speed up transformer models

Language: TeX - Size: 10.2 MB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 152 - Forks: 8

davidnvq/grit

GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)

Language: Python - Size: 84.2 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 151 - Forks: 20

AnkurDeria/MFT

Pytorch implementation of Multimodal Fusion Transformer for Remote Sensing Image Classification.

Language: Jupyter Notebook - Size: 2.13 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 130 - Forks: 8

catherinesyeh/attention-viz

Visualizing query-key interactions in language + vision transformers

Language: HTML - Size: 1.01 GB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 103 - Forks: 12

kyegomez/Algorithm-Of-Thoughts

My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"

Language: Python - Size: 294 KB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 98 - Forks: 15

crockwell/rel_pose

[3DV 2022] The 8-Point Algorithm as an Inductive Bias for Relative Pose Prediction by ViTs

Language: Python - Size: 1.49 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 89 - Forks: 5

Bindwell/PLAPT

Codebase and CLI for PLAPT: A state-of-the-art protein-ligand binding affinity model for drug discovery

Language: Mathematica - Size: 93 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 88 - Forks: 10

Sea-Snell/grokking

unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"

Language: Python - Size: 1.82 MB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 77 - Forks: 15

aofrancani/TSformer-VO

Implementation of the paper "Transformer-based model for monocular visual odometry: a video understanding approach".

Language: Python - Size: 303 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 74 - Forks: 11

cptq/SignNet-BasisNet

SignNet and BasisNet

Language: Python - Size: 2.55 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 72 - Forks: 13

GuoLanqing/ShadowFormer

ShadowFormer (AAAI2023), Pytorch implementation

Language: Python - Size: 1.12 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 69 - Forks: 9

sagizty/Multi-Stage-Hybrid-Transformer

Official release of MSHT: Multi-stage Hybrid Transformer for the ROSE Image Analysis of Pancreatic Cancer https://ieeexplore.ieee.org/document/10006398

Language: Jupyter Notebook - Size: 128 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 67 - Forks: 7

ayanglab/SwinMR

This is the official implementation of our proposed SwinMR

Language: Python - Size: 349 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 58 - Forks: 9

banglanlp/bnlp-resources

Awesome datasets for Bangla language computing.

Language: Python - Size: 242 MB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 57 - Forks: 21

voidful/TFkit

🤖📇 Handling multiple NLP tasks in one pipeline

Language: Python - Size: 15.8 MB - Last synced at: 20 days ago - Pushed at: about 2 years ago - Stars: 56 - Forks: 6

rayabhisek123/CFAT

[CVPR 2024] "CFAT: Unleashing Triangular Windows for Image Super-resolution"

Language: Python - Size: 3.48 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 52 - Forks: 3

marslanm/Multimodality-Representation-Learning

This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which are cited and discussed in the recently accepted survey: https://dl.acm.org/doi/abs/10.1145/3617833

Size: 63.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 51 - Forks: 7

sviperm/neuro-comma

🇷🇺 Punctuation restoration production-ready model for Russian language 🇷🇺

Language: Python - Size: 552 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 51 - Forks: 9

sovit-123/vision_transformers

Vision Transformers for image classification, image segmentation, and object detection.

Language: Python - Size: 44.1 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 49 - Forks: 9

cerbrec/graphbook

Expedite Discovery. Cerbrec Graphbook is a graphical AI platform for everyone to build bespoke AI solutions.

Size: 16.1 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 47 - Forks: 19

KittenCN/predict_Lottery_ticket_pytorch

Lottery prediction based on Transformer / LSTM models in PyTorch

Language: Python - Size: 147 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 46 - Forks: 17

wq2012/SpeakerRecognitionFromScratch

Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家

Language: Python - Size: 9.2 MB - Last synced at: 30 days ago - Pushed at: about 1 year ago - Stars: 44 - Forks: 14

ZhouYuxuanYX/Hyperformer

This is the official implementation of our paper "Hypergraph Transformer for Skeleton-based Action Recognition."

Language: Python - Size: 1.25 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 42 - Forks: 11

andrerochow/fsrt

Official implementation of the CVPR 2024 paper "FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features"

Language: Python - Size: 22.5 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 39 - Forks: 1

Soham-Deshpande/Stock-TFT

Stock price prediction using a Temporal Fusion Transformer

Language: TeX - Size: 81.9 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 38 - Forks: 13

oliverguhr/spelling

This is a neural spell checker

Language: Python - Size: 63.4 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 33 - Forks: 6

MasterHow/FlowLens

PyTorch implementation of FlowLens (https://arxiv.org/pdf/2211.11293)

Language: Python - Size: 35.3 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 30 - Forks: 1

CEC-Agent/CEC

Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"

Language: Python - Size: 450 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 29 - Forks: 4

tobna/WhatTransformerToFavor

Github repository for the paper Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers.

Language: Python - Size: 2.85 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 27 - Forks: 7

januverma/transformers-stuff

Codes, scripts, and notebooks on various aspects of transformer models.

Language: Jupyter Notebook - Size: 473 KB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 27 - Forks: 4

cankocagil/SwinDetr

Integration of Swin Transformer to DETR for Robust Object Detection (DEMO)

Language: Jupyter Notebook - Size: 2.81 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 26 - Forks: 5

pbcquoc/transformer

Machine translation between English and Vietnamese

Language: Jupyter Notebook - Size: 239 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 26 - Forks: 20

dweam-team/awesome-world-model-games

List of world model games and where to find them!

Size: 66.4 KB - Last synced at: 1 day ago - Pushed at: 27 days ago - Stars: 22 - Forks: 0

julienkay/com.doji.transformers

A Unity package to run pretrained transformer models with Unity Sentis

Language: C# - Size: 714 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 22 - Forks: 2

mohd-faizy/06P_Sentiment-Analysis-With-Deep-Learning-Using-BERT

Finetuning BERT in PyTorch for sentiment analysis.

Language: Jupyter Notebook - Size: 20.6 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 22 - Forks: 9

RyanLauQF/BloomBERT

Task Complexity Classifier using Transformer-based NLP model based on Bloom's Taxonomy

Language: Jupyter Notebook - Size: 1010 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 19 - Forks: 5

kyegomez/GPT3

An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"

Language: Python - Size: 236 KB - Last synced at: 5 days ago - Pushed at: 11 months ago - Stars: 19 - Forks: 2

pratham16cse/AggForecaster

Code for "Coherent Probabilistic Aggregate Queries on Long-horizon Forecasts", IJCAI 2022

Language: Python - Size: 89.4 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 18 - Forks: 4

soarsmu/AutoPruner

AutoPruner: Transformer-based Call Graph Pruning (ESEC/FSE 2022, Research Track)

Language: Python - Size: 950 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 17 - Forks: 3

lu-wo/DETRtime

DETRtime, a framework for time-series segmentation

Language: Python - Size: 102 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 17 - Forks: 0

rbitr/ferrite

Simple, lightweight transformers in Fortran

Language: Fortran - Size: 28.3 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 1

firojalam/crisis_datasets_benchmarks

Crisis Dataset for Benchmarks Experiments

Language: Python - Size: 1.41 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 16 - Forks: 5

domyounglee/Transformer-Summarization

An optimized Transformer based abstractive summarization model with Tensorflow

Language: Python - Size: 42.8 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 16 - Forks: 2

rahul-jha98/JustJoking.ai

Using a Transformer to learn a language model and generate short jokes

Language: Jupyter Notebook - Size: 1.12 MB - Last synced at: about 1 month ago - Pushed at: almost 5 years ago - Stars: 13 - Forks: 2

programmer290399/pyqna

A simple python package for question answering.

Language: Python - Size: 3.95 MB - Last synced at: 9 months ago - Pushed at: almost 2 years ago - Stars: 11 - Forks: 5

wbsg-uni-mannheim/productCategorization

This repository contains code and data download instructions for the workshop paper "Improving Hierarchical Product Classification using Domain-specific Language Modelling" by Alexander Brinkmann and Christian Bizer.

Language: Python - Size: 453 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 11 - Forks: 2

kyegomez/ShallowFF

Zeta implementation of "Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers"

Language: Python - Size: 36.2 MB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 10 - Forks: 1

usualheart/PRformer

The official repository of the PRformer paper: "PRformer: Pyramidal Recurrent Transformer for Multivariate Time Series Forecasting." This work is developed by the Lab of Professor Feiping Nie and Xuelong Li, Northwestern Polytechnical University.

Language: Python - Size: 3.69 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 10 - Forks: 0

Moeinh77/Virus-DNA-classification-BERT

Classification of 6 viruses including covid-19 based on their DNA sequences using Transformers

Language: Jupyter Notebook - Size: 6.81 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 10 - Forks: 3

Starscream-11813/MathBot

MathBot is a transformer-based Math Word Problem (MWP) solver made as the Lab project for CSE 4622: Machine Learning Lab.

Language: Jupyter Notebook - Size: 24.7 MB - Last synced at: 11 months ago - Pushed at: almost 3 years ago - Stars: 10 - Forks: 2

Merterm/Modeling-Intensification-for-SLG

Public repo for the paper: "Modeling Intensification for Sign Language Generation: A Computational Approach" by Mert Inan*, Yang Zhong*, Sabit Hassan*, Lorna Quandt, Malihe Alikhani

Language: Python - Size: 7 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 10 - Forks: 2

LazerLambda/Promptzl

Turn LLMs into zero-shot PyTorch classifiers!

Language: Python - Size: 527 KB - Last synced at: 23 days ago - Pushed at: 4 months ago - Stars: 9 - Forks: 0

GabMartino/TransformerForDummies

Annotated implementation of vanilla Transformers to guide through all the ambiguities.

Language: Python - Size: 4.57 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 8 - Forks: 0

MohanKrishnaGR/Infosys_Text-Summarization

This repository contains the implementation of a Transformer-based model for abstractive text summarization and a rule-based approach for extractive text summarization.

Language: Jupyter Notebook - Size: 15.9 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 8 - Forks: 2

smitkiri/news-qa

Reading comprehension based question-answering model for news articles.

Language: Jupyter Notebook - Size: 25.6 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 8 - Forks: 2

rafaljanwojcik/SentenceBERT_vs_SiameseLSTM

Master's thesis repository with evaluation of BERT-based models on Quora Question Dataset, in comparison to Siamese LSTM models

Language: Jupyter Notebook - Size: 32.9 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 8 - Forks: 2

kyegomez/LongVit

A simplistic pytorch implementation of LongVit using my previous implementation of LongNet as a foundation.

Language: Shell - Size: 2.15 MB - Last synced at: 5 days ago - Pushed at: 6 months ago - Stars: 7 - Forks: 0

banglanlp/bangla-sentiment-classification

Bangla Sentiment Classification

Size: 8.08 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 3

lizhh268/ShadowMaskFormer

[TAI 2025] Official implementation of TAI-accepted paper: ShadowMaskFormer: Mask Augmented Patch Embedding for Shadow Removal

Language: Python - Size: 4.09 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 6 - Forks: 0

alphagov/govuk-content-metadata

GovNER: an encoder-based language model (RoBERTa) fine-tuned to perform Named Entity Recognition (NER) on GOV.UK content

Language: Python - Size: 21.2 MB - Last synced at: 3 days ago - Pushed at: about 2 months ago - Stars: 6 - Forks: 1

djene-mengistu/dseg_models

This repo contains implementation of deep learning-based steel surface defect segmentation models.

Language: Python - Size: 13.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 1

harmanpreet93/low-resource-machine-translation

Low resource machine translation using Transformers and Iterative Back translation

Language: Python - Size: 7.34 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 6 - Forks: 1

szczyglis-dev/gpt3-py

[Python] "Bring-Your-Own-Key" terminal-based application for interacting with OpenAI's GPT-3. It provides a chat mode and code generation in Python, C++, C#, Java, JavaScript, TypeScript, PHP, Assembly, SQL, Bash, Ruby, Go, Perl, R, Matlab, Q# and more.

Language: Python - Size: 98.6 KB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 5 - Forks: 2

NSLab-CUK/Context-Aware-Residual-Transformer

Context-Aware Residual Transformer (CART) is a kiosk recommendation system that uses self-supervised learning techniques tailored to kiosks in an offline retail environment. It was developed in a collaboration between NS Lab @ CUK and IIP Lab @ Gachon University on a pure PyTorch backend.

Language: Python - Size: 9.55 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 2

kamrulhasanrony/Vision-Transformer-based-Food-Classification

Vision Transformer Based Food-101 Classification

Language: Python - Size: 1020 KB - Last synced at: 9 months ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 1

abhaskumarsinha/Keras-implementation-of-Transformer-Architecture

This repository presents a Python-based implementation of the Transformer architecture on Keras TensorFlow library, as proposed by Vaswani et al. in their 2017 paper "Attention is all you need."

Language: Jupyter Notebook - Size: 223 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 3

rishabkr/Attention-Is-All-You-Need-Explained-PyTorch

A from-scratch paper implementation and tutorial, combining various great resources, for the Transformer discussed in the "Attention Is All You Need" paper, applied to German-to-English translation.

Language: Jupyter Notebook - Size: 84.7 MB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 5 - Forks: 2

gallipoligiuseppe/TST-CycleGAN

This repository contains the code for the paper "Self-supervised Text Style Transfer using Cycle-Consistent Adversarial Networks".

Language: Python - Size: 5.85 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 4 - Forks: 0

KwokHing/AI-Planet-LLM-Bootcamp-Challenge

An LLM challenge to (i) fine-tune a pre-trained HuggingFace transformer model to build a code generation language model, and (ii) build a retrieval-augmented generation (RAG) application using LangChain

Language: Jupyter Notebook - Size: 874 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

naokishibuya/simple_transformer

A Transformer Implementation that is easy to understand and customizable.

Language: Python - Size: 46.9 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 2

Merterm/COSMic

Public repo for the paper: "COSMic: A Coherence-Aware Generation Metric for Image Descriptions" by Mert İnan, Piyush Sharma, Baber Khalid, Radu Soricut, Matthew Stone, Malihe Alikhani

Language: Python - Size: 396 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 0

Jmkernes/Shakespeare-Translator

Dost thou readeth thine description? An English->Shakespearian translator in TensorFlow 2.0+

Language: Jupyter Notebook - Size: 101 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 1

kreasof-ai/lawful-diffusion

Lawful Diffusion: an ethical way to address copyright violations in text-to-image generative models.

Language: Python - Size: 1.43 MB - Last synced at: 30 days ago - Pushed at: 7 months ago - Stars: 3 - Forks: 1

ksm26/Embedding-Models-From-Architecture-to-Implementation

Understand and build embedding models, focusing on word and sentence embeddings, dual encoder architectures. Learn to train embedding models using contrastive loss, implement them in semantic search and RAG systems.

Language: Jupyter Notebook - Size: 2 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 3 - Forks: 0

liuzwin98/DSCMT

code released

Language: Python - Size: 64.5 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

swechhasingh/nlp-from-scratch

Implementing language models using simple NLP techniques to advance Transformer architecture based models in pytorch

Language: Jupyter Notebook - Size: 7.82 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

retkowsky/many_models_image_classification

Many models image classification using Transformers models

Language: Jupyter Notebook - Size: 5.46 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

Related Topics
deep-learning (70), nlp (48), machine-learning (46), pytorch (45), transformers (38), transformer (36), natural-language-processing (31), transformer-architecture (28), python (27), bert (22), artificial-intelligence (19), computer-vision (18), attention-mechanism (16), nlp-machine-learning (15), ai (15), llm (15), text-classification (13), vision-transformer (13), tensorflow (13), bert-model (12), gpt (11), huggingface (11), sentiment-analysis (9), large-language-models (9), huggingface-transformers (8), generative-ai (8), machine-translation (8), named-entity-recognition (7), cnn (7), classification (7), neural-machine-translation (7), self-attention (6), fine-tuning (6), transformer-pytorch (6), neural-networks (6), neural-network (6), time-series-forecasting (6), lstm-neural-networks (6), lstm (5), distilbert (5), multimodal-deep-learning (5), ml (5), streamlit-webapp (5), tensorflow2 (5), time-series (5), bert-embeddings (5), transformers-models (5), gpt-2 (5), text-generation (5), attention-is-all-you-need (5), deep-neural-networks (5), language-model (5), image-captioning (5), vit (4), prompt-engineering (4), dataset (4), streamlit (4), sentence-embeddings (4), langchain (4), encoder-decoder-model (4), convolutional-neural-networks (4), transformer-encoder (4), ner (4), attention (4), deeplearning (4), summarization (4), chatbot (4), t5-model (4), awesome-list (4), fastapi (4), attention-model (4), audio-classification (3), attention-mechanisms (3), speech-emotion-recognition (3), data-visualization (3), gru (3), generative-model (3), object-detection (3), swin-transformer (3), hyperparameter-tuning (3), pre-trained-model (3), multimodal-datasets (3), natural-language-generation (3), topic-modeling (3), social-media (3), large-language-model (3), anomaly-detection (3), stable-diffusion (3), few-shot-learning (3), docker (3), gpt4 (3), image-restoration (3), question-answering (3), llms (3), pytorch-lightning (3), roberta (3), t5 (3), text-summarization (3), gan (3), graph-neural-networks (3)