Topic: "decoder-model"
shivendrra/SmallLanguageModel
An LLM cookbook for building your own model from scratch, all the way from gathering data to training a model
Language: Jupyter Notebook - Size: 66.7 MB - Last synced at: 9 months ago - Pushed at: over 1 year ago - Stars: 139 - Forks: 19
logic-OT/Decoder-Only-LLM
This repository features a custom-built decoder-only language model (LLM) with a total of 37 million parameters 🔥. The model is trained to ask questions from a given context
Language: Jupyter Notebook - Size: 396 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 3
partarstu/transformers-in-java
Experimental project for AI and NLP based on Transformer Architecture
Language: Java - Size: 443 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 7 - Forks: 1
SharathHebbar/Transformers
Transformers Intuition
Language: Jupyter Notebook - Size: 34.3 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 6 - Forks: 1
HxCodeWarrior/StellarByte
Implement a basic Transformer decoder-only model from scratch, then upgrade the model to build an LLM of your own
Language: Python - Size: 17.2 MB - Last synced at: 17 days ago - Pushed at: 19 days ago - Stars: 5 - Forks: 1
aiden200/GPT3_Implementation
Implementation of the GPT-3 paper: Language Models are Few-Shot Learners
Language: Python - Size: 763 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 4 - Forks: 0
LaurentVeyssier/Image-Captioning-Project-with-full-Encoder-Decoder-model
Generate captions for images using a CNN encoder-LSTM decoder structure
Language: Jupyter Notebook - Size: 2.34 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 2
mbnczy/GenAI4SeqCls
Generative AI fine-tuning and inference for sequence classification tasks
Language: Python - Size: 255 KB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0
shivendrra/enigma
DNA sequence generation/classification using transformers
Language: Jupyter Notebook - Size: 14.3 MB - Last synced at: 5 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0
edwinthomas444/cheese_advertisement_generator
An LLM-based tool for generating cheese advertisements
Language: Jupyter Notebook - Size: 7.59 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0
Aryan0419/Image-Captioning-CNN-LSTM
🖼️ Generate descriptive captions for images using a CNN-LSTM model, combining computer vision and NLP for effective storytelling.
Language: Python - Size: 1.37 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0
dipankarsrirag/lordd
Code and dataset used to train dialect adapters for decoder models.
Language: Python - Size: 1.25 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0
gatorduck/Creating_Custom_Decoder_Transformer
Custom decoder Transformer that treats a patient's medical journey like a story told through diagnosis codes instead of words.
Language: Jupyter Notebook - Size: 43.5 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0
muhammadhussain-2009/Building-A-Transformer-From-Scratch
Coding A Decoder-Only Transformer Like ChatGPT From Scratch
Language: Jupyter Notebook - Size: 2.84 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0
Shuyib/HF_model_preview
Using LLMs from Hugging Face for sentiment analysis, translation, summarization and extractive question answering
Language: Jupyter Notebook - Size: 156 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0
sea-rod/minigpt
A mini version of GPT implemented on Shakespeare using BPE
Language: Jupyter Notebook - Size: 432 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0
SLotAbr/Decoder_model
Decoder model for language modelling
Language: Python - Size: 48.8 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0
KempnerInstitute/minOLMo
An explainable and simplified version of the OLMo model
Language: Jupyter Notebook - Size: 94.1 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0
hardaatbaath/multimodal_vision_model
A multimodal vision model that takes in an image and a prompt query, and outputs the answer
Size: 0 Bytes - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0
ahmedelsayed968/Arabic-Text-Summarizer
Build a text summarizer for the Arabic language
Language: Jupyter Notebook - Size: 5.05 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0
DaniyalAhmedKhan1234/Academic-Text-Simplification
This project aims to simplify texts from research papers using advanced natural language processing (NLP) techniques, making them more accessible to a broader audience
Language: Jupyter Notebook - Size: 37.1 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
JasonShao55/NLP-Transformer-Implementattion
Language: Python - Size: 192 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
Muhammad-Ibrahim-Khan/minigpt
A miniGPT inspired by the original nanoGPT released by Andrej Karpathy. This is a notebook that walks through the decoder part of the transformer architecture, with details outlined.
Language: Jupyter Notebook - Size: 13.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0