An open API service providing repository metadata for many open source software ecosystems.

Topic: "decoder-model"

shivendrra/SmallLanguageModel

An LLM cookbook for building your own model from scratch, all the way from gathering data to training a model

Language: Jupyter Notebook - Size: 66.7 MB - Last synced at: 9 months ago - Pushed at: over 1 year ago - Stars: 139 - Forks: 19

logic-OT/Decoder-Only-LLM

This repository features a custom-built decoder-only large language model (LLM) with a total of 37 million parameters 🔥. I train the model to ask questions from a given context.

Language: Jupyter Notebook - Size: 396 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 3

partarstu/transformers-in-java

Experimental project for AI and NLP based on Transformer Architecture

Language: Java - Size: 443 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 7 - Forks: 1

SharathHebbar/Transformers

Transformers Intuition

Language: Jupyter Notebook - Size: 34.3 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 6 - Forks: 1

HxCodeWarrior/StellarByte

Implement a basic Transformer decoder-only model from scratch, then upgrade the model to build an LLM of your own

Language: Python - Size: 17.2 MB - Last synced at: 17 days ago - Pushed at: 19 days ago - Stars: 5 - Forks: 1

aiden200/GPT3_Implementation

Implementation of the GPT-3 paper: Language Models are Few-Shot Learners

Language: Python - Size: 763 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 4 - Forks: 0

LaurentVeyssier/Image-Captioning-Project-with-full-Encoder-Decoder-model

Generate captions for images using a CNN encoder and LSTM decoder structure

Language: Jupyter Notebook - Size: 2.34 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 2

mbnczy/GenAI4SeqCls

Generative AI fine-tuning and inference for sequence classification tasks

Language: Python - Size: 255 KB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0

shivendrra/enigma

DNA sequence generation and classification using transformers

Language: Jupyter Notebook - Size: 14.3 MB - Last synced at: 5 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

edwinthomas444/cheese_advertisement_generator

An LLM-based tool for generating cheese advertisements

Language: Jupyter Notebook - Size: 7.59 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

Aryan0419/Image-Captioning-CNN-LSTM

🖼️ Generate descriptive captions for images using a CNN-LSTM model, combining computer vision and NLP for effective storytelling.

Language: Python - Size: 1.37 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

dipankarsrirag/lordd

Code and dataset used to train dialect adapters for decoder models.

Language: Python - Size: 1.25 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

gatorduck/Creating_Custom_Decoder_Transformer

Custom decoder Transformer that treats a patient's medical journey like a story told through diagnosis codes instead of words.

Language: Jupyter Notebook - Size: 43.5 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

muhammadhussain-2009/Building-A-Transformer-From-Scratch

Coding A Decoder Only Transformer Like ChatGPT From Scratch

Language: Jupyter Notebook - Size: 2.84 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Shuyib/HF_model_preview

Using LLMs from Hugging Face for sentiment analysis, translation, summarization, and extractive question answering

Language: Jupyter Notebook - Size: 156 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

sea-rod/minigpt

A mini version of GPT implemented on Shakespeare using BPE

Language: Jupyter Notebook - Size: 432 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

SLotAbr/Decoder_model

Decoder model for language modelling

Language: Python - Size: 48.8 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

KempnerInstitute/minOLMo

An explainable and simplified version of the OLMo model

Language: Jupyter Notebook - Size: 94.1 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

hardaatbaath/multimodal_vision_model

A multimodal vision model that takes in an image and a prompt query and outputs the answer

Size: 0 Bytes - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

ahmedelsayed968/Arabic-Text-Summarizer

Build a text summarizer for the Arabic language

Language: Jupyter Notebook - Size: 5.05 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

DaniyalAhmedKhan1234/Academic-Text-Simplification

This project aims to simplify texts from research papers using advanced natural language processing (NLP) techniques, making them more accessible to a broader audience

Language: Jupyter Notebook - Size: 37.1 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

JasonShao55/NLP-Transformer-Implementattion

Language: Python - Size: 192 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Muhammad-Ibrahim-Khan/minigpt

A miniGPT inspired by the original nanoGPT released by Andrej Karpathy. This is a notebook that walks through the decoder part of the transformer architecture, with details outlined.

Language: Jupyter Notebook - Size: 13.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Related Topics
transformer (10), llm (7), gpt (4), encoder-decoder-model (4), nlp (4), pytorch (4), machine-learning (3), attention-mechanism (3), python (3), transformers (3), rnn-encoder-decoder (2), encoder (2), nlp-machine-learning (2), embeddings (2), encoder-model (2), bleu-score (2), sequence-to-sequence (2), ai (2), rnn-lstm (2), large-language-models (2), classification (2), deep-learning (2), fine-tuning (2), llms (2), bert-model (2), keras (1), inceptionv3-model (1), imagecaptioning (1), lstm-neural-networks (1), language-modelling (1), image-processing (1), convolutional-neural-networks (1), api (1), ai-project (1), tokeniser (1), softmax (1), pytorch-lightning (1), positional-encoding (1), chatgpt (1), ai-implementations (1), ai-architecture (1), transfomer (1), minigpt (1), decoder-only (1), translation (1), dialects (1), dialect-adaptation (1), vision-transformer (1), text-summarization (1), text-analysis (1), pytroch (1), olmo (1), text-processing (1), sari (1), research-paper (1), paraphrase-generation (1), jupyter-notebook (1), pretrained-models (1), neural-network (1), keras-tensorflow (1), healthcare (1), rnn (1), pytorch-implementation (1), portfolio-project (1), multimodal (1), finetune-llms (1), lstm (1), image-captioning (1), caption-generation (1), gpt-3 (1), tokenization (1), semantic-similarity (1), masked-language-models (1), causal-language-modeling (1), attention-is-all-you-need (1), self-attention (1), samediff (1), java (1), encoder-network (1), dl4j (1), small-models (1), inference (1), computer-vision (1), neural-networks (1), llm-training (1), llm-cookbook (1), summarization (1), sentiment-analysis (1), qwen2-5 (1), question-answering (1), llm-inference (1), helsinki-nlp (1), facebook-bart (1), extractive-question-answering (1), explainable-ai (1), text-generation (1), generative-modeling (1), encoder-decoder-architecture (1), data-to-text (1), advertisement-generation (1)