Topic: "causal-language-modeling"
hogru/MolReactGen
Auto-regressive causal language model for molecule (SMILES) and reaction template (SMARTS) generation based on the Hugging Face implementation of OpenAI's GPT-2 transformer decoder model
Language: Jupyter Notebook - Size: 22.9 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 17 - Forks: 3
rhubarbwu/linguistic-collapse
Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models [NeurIPS 2024] [arXiv:2405.17767]
Language: Python - Size: 196 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 11 - Forks: 1
DunnBC22/NLP_Projects
Repository for My HuggingFace Natural Language Processing Projects
Language: Jupyter Notebook - Size: 6.89 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 7 - Forks: 3
SharathHebbar/Transformers
Transformers Intuition
Language: Jupyter Notebook - Size: 34.3 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 6 - Forks: 1
Anonym0usWork1221/python-code-docstring-scraper
A multi-threaded GitHub scraper to collect Python code with docstrings from public repositories, creating a well-documented dataset for the JaraConverse LLM model.
Language: Python - Size: 454 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0
Cyrilvallez/LLM_playground
A quick and easy way to interact with open-source LLMs.
Language: Python - Size: 332 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1
nnilayy/llm-app
Language: Jupyter Notebook - Size: 959 KB - Last synced at: 11 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0
tranquoctrinh/huggingface-transformers-examples
Fine-tuning (or training from scratch) the library models for language modeling on a text dataset for GPT, GPT-2, ALBERT, BERT, DitilBERT, RoBERTa, XLNet... GPT and GPT-2 are trained or fine-tuned using a causal language modeling (CLM) loss while ALBERT, BERT, DistilBERT and RoBERTa are trained or fine-tuned using a masked language modeling (MLM) loss.
Language: Python - Size: 38.1 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0
NJUxlj/bert-based-autoregressive-model
Change the Bert model to a GPT-style autoregressive decoder.
Language: Python - Size: 43.9 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0
JersonGB22/CausalLanguageModeling-TensorFlow
Language: Jupyter Notebook - Size: 680 KB - Last synced at: 8 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
ShiningLab/PromptSub
This repository is for the paper Lexical Substitution as Causal Language Modeling. In Proceedings of the 13th Joint Conference on Lexical and Computational Semantics (*SEM 2024), Mexico City, Mexico. Association for Computational Linguistics.
Language: Python - Size: 4.32 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
saagar-parikh/ASR_LLM_Rescoring Fork of jdannem6/ASR_LLM_Rescoring
Rescoring Automatic Speech Recognition using Large Language Models
Language: Jupyter Notebook - Size: 25 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
thibaud-perrin/hibo-mistral-7b-fc
Dataset and model fine-tuning for function calling
Language: Jupyter Notebook - Size: 1.08 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
samyak24jain/gpt2-intent-classification
Causal language modeling and intent classification using GPT-2.
Language: Python - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
Jayveersinh-Raj/LoRA_implementation
This is the implementation of low rank adaptation (LoRA) which is a subset of parameter efficient fine tuning (PEFT).
Language: Jupyter Notebook - Size: 39.1 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0
aneesh-aparajit/picturebook.ai
An AI generated picturebook.
Language: Python - Size: 2.56 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0
PastelBelem8/ml4nlp-cogsci-summer22
Course materials for the Machine Learning for NLP course taught by Sameer Singh for the Cognitive Science summer school 2022.
Language: Jupyter Notebook - Size: 647 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0