An open API service providing repository metadata for many open source software ecosystems.

Topic: "causal-language-modeling"

hogru/MolReactGen

Auto-regressive causal language model for molecule (SMILES) and reaction template (SMARTS) generation based on the Hugging Face implementation of OpenAI's GPT-2 transformer decoder model

Language: Jupyter Notebook - Size: 22.9 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 17 - Forks: 3

rhubarbwu/linguistic-collapse

Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models [NeurIPS 2024] [arXiv:2405.17767]

Language: Python - Size: 196 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 11 - Forks: 1

DunnBC22/NLP_Projects

Repository for My HuggingFace Natural Language Processing Projects

Language: Jupyter Notebook - Size: 6.89 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 7 - Forks: 3

SharathHebbar/Transformers

Transformers Intuition

Language: Jupyter Notebook - Size: 34.3 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 6 - Forks: 1

Anonym0usWork1221/python-code-docstring-scraper

A multi-threaded GitHub scraper to collect Python code with docstrings from public repositories, creating a well-documented dataset for the JaraConverse LLM model.

Language: Python - Size: 454 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

Cyrilvallez/LLM_playground

A quick and easy way to interact with open-source LLMs.

Language: Python - Size: 332 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

nnilayy/llm-app

Language: Jupyter Notebook - Size: 959 KB - Last synced at: 11 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

tranquoctrinh/huggingface-transformers-examples

Fine-tuning (or training from scratch) the library models for language modeling on a text dataset for GPT, GPT-2, ALBERT, BERT, DitilBERT, RoBERTa, XLNet... GPT and GPT-2 are trained or fine-tuned using a causal language modeling (CLM) loss while ALBERT, BERT, DistilBERT and RoBERTa are trained or fine-tuned using a masked language modeling (MLM) loss.

Language: Python - Size: 38.1 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

NJUxlj/bert-based-autoregressive-model

Change the Bert model to a GPT-style autoregressive decoder.

Language: Python - Size: 43.9 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

JersonGB22/CausalLanguageModeling-TensorFlow

Language: Jupyter Notebook - Size: 680 KB - Last synced at: 8 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ShiningLab/PromptSub

This repository is for the paper Lexical Substitution as Causal Language Modeling. In Proceedings of the 13th Joint Conference on Lexical and Computational Semantics (*SEM 2024), Mexico City, Mexico. Association for Computational Linguistics.

Language: Python - Size: 4.32 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

saagar-parikh/ASR_LLM_Rescoring Fork of jdannem6/ASR_LLM_Rescoring

Rescoring Automatic Speech Recognition using Large Language Models

Language: Jupyter Notebook - Size: 25 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

thibaud-perrin/hibo-mistral-7b-fc

Dataset and model fine-tuning for function calling

Language: Jupyter Notebook - Size: 1.08 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

samyak24jain/gpt2-intent-classification

Causal language modeling and intent classification using GPT-2.

Language: Python - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Jayveersinh-Raj/LoRA_implementation

This is the implementation of low rank adaptation (LoRA) which is a subset of parameter efficient fine tuning (PEFT).

Language: Jupyter Notebook - Size: 39.1 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

aneesh-aparajit/picturebook.ai

An AI generated picturebook.

Language: Python - Size: 2.56 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

PastelBelem8/ml4nlp-cogsci-summer22

Course materials for the Machine Learning for NLP course taught by Sameer Singh for the Cognitive Science summer school 2022.

Language: Jupyter Notebook - Size: 647 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0