An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: pretrained-language-model

Separius/awesome-sentence-embedding 📦

A curated list of pretrained sentence and word embedding models

Language: Python - Size: 282 KB - Last synced at: 1 day ago - Pushed at: about 4 years ago - Stars: 2,258 - Forks: 262

AndrewZhe/lawyer-llama

中文法律LLaMA (LLaMA for Chinese legel domain)

Language: Python - Size: 6.85 MB - Last synced at: 3 days ago - Pushed at: 10 months ago - Stars: 946 - Forks: 130

SuperBruceJia/Awesome-LLM-Self-Consistency

Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models

Size: 149 KB - Last synced at: 4 days ago - Pushed at: 10 months ago - Stars: 99 - Forks: 8

microsoft/torchscale

Foundation Architecture for (M)LLMs

Language: Python - Size: 361 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 3,082 - Forks: 219

Clarifai/examples

Examples for Clarifai Python SDK and Integrations. Give the repo a star ⭐

Language: Jupyter Notebook - Size: 168 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 20 - Forks: 3

hyintell/awesome-refreshing-llms

EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining.

Size: 2.31 MB - Last synced at: about 4 hours ago - Pushed at: over 1 year ago - Stars: 133 - Forks: 10

qianlima-lab/awesome-lifelong-learning-methods-for-llm Fork of zzz47zzz/awesome-lifelong-learning-methods-for-llm

This repository collects awesome survey, resource, and paper for Lifelong Learning for Large Language Models. (Updated Regularly)

Size: 428 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 52 - Forks: 1

zzz47zzz/awesome-lifelong-learning-methods-for-llm

[ACM Computing Surveys 2025] This repository collects awesome survey, resource, and paper for Lifelong Learning with Large Language Models. (Updated Regularly)

Size: 428 KB - Last synced at: 16 days ago - Pushed at: 17 days ago - Stars: 129 - Forks: 6

thunlp/OpenDelta

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

Language: Python - Size: 42 MB - Last synced at: 17 days ago - Pushed at: 9 months ago - Stars: 1,028 - Forks: 83

Kitsunp/Small-lenguaje-Model-Hybrid-Norm-Furier-Formers

A compact language model implementing HybridNorm and Fourier-based attention. Combines CoLA (low-rank projections), FANformer, and hybrid normalization to create an efficient decoder-only transformer. Leverages periodicity modeling and gated residuals to enhance performance while maintaining a small parameter footprint.

Language: Python - Size: 4.6 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 4 - Forks: 0

vyasdeepti/Text-to-Image-Generator-using-GANs

This repository implements a Text-to-Image Generator using Generative Adversarial Networks (GANs). The project takes textual descriptions as input and generates corresponding images, demonstrating the capability of deep learning models to bridge the gap between natural language and visual content.

Language: Jupyter Notebook - Size: 2.15 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

hojat72elect/IMDB_storyline_summaries_database

The database IMDB storylines and their summaries

Size: 1.21 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

cumc-dbmi/cehrbert

CEHR-BERT: Incorporating temporal information from structured EHR data to improve prediction tasks

Language: Python - Size: 17.5 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 35 - Forks: 11

DC-research/TEMPO

The official code for "TEMPO: Prompt-based Generative Pre-trained Transformer for Time Series Forecasting (ICLR 2024)". TEMPO is one of the very first open source Time Series Foundation Models for forecasting task v1.0 version.

Language: Python - Size: 1.82 MB - Last synced at: 15 days ago - Pushed at: 4 months ago - Stars: 109 - Forks: 15

ajs7270/EFE-Reasoner

The implementation of the Explicit Feature Extraction (EFE) Reasoner, a model designed to improve reasoning about numerical magnitudes in math word problems.

Language: Python - Size: 7.15 MB - Last synced at: 8 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

azminewasi/Awesome-LLMs-ICLR-24

It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) in 2024.

Size: 821 KB - Last synced at: about 5 hours ago - Pushed at: about 1 year ago - Stars: 62 - Forks: 3

microsoft/COCO-LM

[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining

Language: Python - Size: 4.1 MB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 117 - Forks: 13

THUDM/P-tuning-v2

An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

Language: Python - Size: 1.41 MB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 2,037 - Forks: 203

ZhaohanM/FusionGDA

we propose a novel FusionGDA model, which utilises a pre-training phase with a fusion module to enrich the gene and disease semantic representations encoded by pre-trained language models.

Language: Jupyter Notebook - Size: 1.61 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 2

wenge-research/YAYI2 📦

YAYI 2 是中科闻歌研发的新一代开源大语言模型,采用了超过 2 万亿 Tokens 的高质量、多语言语料进行预训练。(Repo for YaYi 2 Chinese LLMs)

Language: Python - Size: 1.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 3,624 - Forks: 19

RenzeLou/awesome-instruction-learning

Papers and Datasets on Instruction Tuning and Following. ✨✨✨

Language: Python - Size: 6.25 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 493 - Forks: 24

hoadm-net/FTVPLM

Tinh chỉnh mô hình ngôn ngữ lớn tiếng Việt cho một số tác vụ xử lý ngôn ngữ tự nhiên.

Language: Python - Size: 0 Bytes - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

AM-Ankitgit/Complete-Deep-Learning-Algorithms

deep-learning machine-learning

Language: Jupyter Notebook - Size: 196 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

allenai/dont-stop-pretraining

Code associated with the Don't Stop Pretraining ACL 2020 paper

Language: Python - Size: 554 KB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 529 - Forks: 73

xcfcode/Summarization-Papers 📦

Summarization Papers

Language: TeX - Size: 40.1 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 1,010 - Forks: 144

yukito0209/sentiment-analysis-of-taptap-game-user-reviews

A Data-Driven Study on Sentiment Analysis of TapTap Game User Reviews

Language: Python - Size: 4.88 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

heraclex12/NLP2SPARQL

Translate Natural Language Processing to SPARQL Query and vice versa

Language: Python - Size: 223 KB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 51 - Forks: 12

OpenBMB/CPM-Live

Live Training for Open-source Big Models

Language: Python - Size: 1.11 MB - Last synced at: 7 days ago - Pushed at: about 2 years ago - Stars: 506 - Forks: 39

Hzfinfdu/Diffusion-BERT

ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models

Language: Python - Size: 1.69 MB - Last synced at: 26 days ago - Pushed at: over 1 year ago - Stars: 307 - Forks: 24

apsinghAnalytics/FinRAGify_App

An LLM app leveraging RAG with LangChain and GPT-4 mini to analyze earnings call transcripts, assess company performance, using natural language queries (NLP), FAISS (vector database), and Hugging Face re-ranking models.

Language: Jupyter Notebook - Size: 4.85 MB - Last synced at: 2 months ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

wxl1999/UniCRS

[KDD22] Official PyTorch implementation for "Towards Unified Conversational Recommender Systems via Knowledge-Enhanced Prompt Learning".

Language: Python - Size: 13 MB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 27 - Forks: 19

gaoisbest/NLP-Projects

word2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, information extraction (i.e., entity, relation and event extraction), knowledge graph, text generation, network embedding

Language: OpenEdge ABL - Size: 384 MB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 544 - Forks: 151

GanjinZero/CODER

CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]

Language: Python - Size: 5.63 MB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 78 - Forks: 5

EagleW/Multimedia-Generative-Script-Learning

Official implementation of the ACL Findings 2023 paper: Multimedia Generative Script Learning for Task Planning

Language: Python - Size: 45.9 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 8 - Forks: 0

thunlp/Prompt-Transferability

On Transferability of Prompt Tuning for Natural Language Processing

Language: Python - Size: 629 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 99 - Forks: 11

imSanko/Image_Caption_Generator_With_Transformers 📦

This repository contains code for generating captions for images using a Transformer-based model. The model used is the `VisionEncoderDecoderModel` from the Hugging Face Transformers library, specifically the `nlpconnect/vit-gpt2-image-captioning` model.

Language: Jupyter Notebook - Size: 233 KB - Last synced at: 2 months ago - Pushed at: 10 months ago - Stars: 12 - Forks: 1

SJTU-IPADS/Bamboo

Bamboo-7B Large Language Model

Size: 223 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 92 - Forks: 1

megagonlabs/cocosum

:coconut: Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)

Language: Python - Size: 864 KB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 21 - Forks: 2

thunlp/CokeBERT

CokeBERT: Contextual Knowledge Selection and Embedding towards Enhanced Pre-Trained Language Models

Language: Python - Size: 101 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 31 - Forks: 9

Evfidiw/LMs_NLU

Exploring different language models on text classification tasks.

Language: Python - Size: 21.1 MB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 3 - Forks: 0

zzz47zzz/codebase-for-incremental-learning-with-llm

[ACL2024] A Codebase for Incremental Learning with Large Language Models; Official released code for "Learn or Recall? Revisiting Incremental Learning with Pre-trained Language Models (ACL 2024)", "Incremental Sequence Labeling: A Tale of Two Shifts (ACL 2024 Findings)", and "Concept-1K: A Novel Benchmark for Instance Incremental Learning (arxiv)"

Language: Python - Size: 2.88 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 30 - Forks: 7

XCollab/HuggingFace

This repository provides an overview of Hugging Face's Transformers library, a powerful tool for natural language processing (NLP) and machine learning tasks.

Language: Jupyter Notebook - Size: 1.57 MB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 3 - Forks: 1

EagleW/Scientific-Inspiration-Machines-Optimized-for-Novelty

Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty

Language: Python - Size: 340 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 71 - Forks: 11

EngineeringSoftware/CoditT5

CoditT5: Pretraining for Source Code and Natural Language Editing

Language: Python - Size: 91.9 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 28 - Forks: 3

CAI991108/Machine-Learning-and-Language-Model

This project explores GPT-2 and Llama models through pre-training, fine-tuning, and Chain-of-Thought (CoT) prompting. It includes memory-efficient optimizations (SGD, LoRA, BAdam) and evaluations on math datasets (GSM8K, NumGLUE, StimulEq, SVAMP).

Language: Python - Size: 54.5 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

LYH-YF/MWPToolkit

MWPToolkit is an open-source framework for math word problem(MWP) solvers.

Language: Python - Size: 59.8 MB - Last synced at: 16 days ago - Pushed at: over 2 years ago - Stars: 163 - Forks: 37

FranxYao/PoincareProbe

Implementation of ICLR 21 paper: Probing BERT in Hyperbolic Spaces

Language: Jupyter Notebook - Size: 5.94 MB - Last synced at: about 2 months ago - Pushed at: about 4 years ago - Stars: 58 - Forks: 5

RUCAIBox/UniCRS Fork of wxl1999/UniCRS

[KDD22] Official PyTorch implementation for "Towards Unified Conversational Recommender Systems via Knowledge-Enhanced Prompt Learning".

Language: Python - Size: 13 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 83 - Forks: 14

shreydan/masked-language-modeling

Transformers Pre-Training with MLM objective — implemented encoder-only model and trained from scratch on Wikipedia dataset.

Language: Jupyter Notebook - Size: 221 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 2

ZhengZixiang/ATPapers

Worth-reading papers and related resources on attention mechanism, Transformer and pretrained language model (PLM) such as BERT. 值得一读的注意力机制、Transformer和预训练语言模型论文与相关资源集合

Size: 648 KB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 133 - Forks: 13

theblackcat102/unify-learning-paradigms

data collator for UL2 and U-PaLM

Language: Python - Size: 158 KB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 29 - Forks: 1

eric11eca/reckoning-metakg

RECKONING is a bi-level learning algorithm that improves language models' reasoning ability by folding contextual knowledge into parametric knowledge through back-propagation.

Language: Python - Size: 21.4 MB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 5 - Forks: 1

SreeEswaran/Train-your-LLM

This repository contains code and resources for training, fine-tuning, and deploying large language models using Hugging Face's Transformers library.

Language: Python - Size: 33.2 KB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 3 - Forks: 2

GanjinZero/BioBART

BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model [ACL-BioNLP 2022]

Language: Python - Size: 117 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 52 - Forks: 4

ImKeTT/ReSee

[EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue PyTorch Implementation

Language: Python - Size: 464 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 0

SrulyRosenblat/Detecting-Pretraining-Data-Using-Probability-Slopes

A new method for recognizing text that is included in an LLM's training data.

Language: Jupyter Notebook - Size: 1.38 MB - Last synced at: 3 days ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

20101301-Alina-Hasan/Robust-Fake-Review-Detection-using-Uncertainty-Aware-LSTM-and-BERT

Our study utilizes BERT and LSTM models alongside Monte Carlo Dropout (MCD) on the Yelp Labelled Dataset. MCD bolsters robustness by introducing uncertainty through neuron dropout. The BERT-embedded MCD achieves an impressive 91.75% accuracy, surpassing the LSTM model.

Size: 1.36 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

zerohd4869/CIFM

The official repository for ACL 2024 paper "Representation Learning with Conditional Information Flow Maximization"

Language: Python - Size: 5.42 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 4 - Forks: 1

wxl1999/CFCRS

[KDD23] Official PyTorch implementation for "Improving Conversational Recommendation Systems via Counterfactual Data Simulation".

Language: Python - Size: 10.5 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 8 - Forks: 3

lgalke/text-clf-baselines

WideMLP for Text Classification

Language: Python - Size: 78.1 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 27 - Forks: 3

HySonLab/Protein_Pretrain

Multimodal Pretraining for Unsupervised Protein Representation Learning

Language: Python - Size: 241 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 12 - Forks: 2

git-disl/BERT4ETH

BERT4ETH: A Pre-trained Transformer for Ethereum Fraud Detection (WWW23)

Language: Python - Size: 6.88 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 95 - Forks: 15

ZobayerAkib/Transfer-Learning-for-NLP-with-TensorFlow-Hub

This project demonstrates the use of various pre-trained models for transfer learning in NLP using TensorFlow Hub.

Language: Jupyter Notebook - Size: 3.05 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

vgaraujov/Seq2Seq-Spanish-PLMs

Sequence-to-Sequence Spanish Pre-trained Language Models

Language: Python - Size: 115 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

microsoft/AMOS

[ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators

Language: Python - Size: 3.93 MB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 24 - Forks: 2

DooPhiLong/Emotion-classification

Emotion classification base on short texts

Size: 2.93 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

OpenMatch/COCO-DR

[EMNLP 2022] This is the code repo for our EMNLP‘22 paper "COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning".

Language: Python - Size: 2.2 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 45 - Forks: 4

RUCAIBox/CFCRS Fork of wxl1999/CFCRS

[KDD23] Official PyTorch implementation for "Improving Conversational Recommendation Systems via Counterfactual Data Simulation".

Language: Python - Size: 10.5 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 11 - Forks: 0

GVanave/Langchain-Chatbot

Langchain Chatbot Project utilizes Langchain and Streamlit to develop interactive chatbots. Leveraging natural language processing, the project demonstrates two approaches: a CSV-based chatbot and a Llama pretrained model.

Language: Jupyter Notebook - Size: 15.6 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

KaikePing/SynPL

SynPL: a zero-shot prompt language model to process multiple-choice questions on synonyms

Language: Jupyter Notebook - Size: 988 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

expertailab/An-Empirical-study-on-Pre-trained-Embeddings-and-Language-Models-for-Bot-Detection

Code used in An Empirical study on Pre-trained Embeddings and Language Models for Bot Detection.

Language: Jupyter Notebook - Size: 725 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 1

ChangwenXu98/TransPolymer

Implementation of "TransPolymer: a Transformer-based language model for polymer property predictions" in PyTorch

Language: Python - Size: 1.65 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 43 - Forks: 12

wutaiqiang/WID-NAACL2024

Code for paper: Weight-Inherited Distillation for Task-Agnostic BERT Compression

Language: Python - Size: 351 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

juyongjiang/Awesome-ANCE

Implementation of paper "Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval"

Language: Python - Size: 997 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 11 - Forks: 0

yingyuankai/AiSpace

AiSpace: Better practices for deep learning model development and deployment For Tensorflow 2.0

Language: Python - Size: 806 KB - Last synced at: 8 months ago - Pushed at: over 2 years ago - Stars: 26 - Forks: 4

zjukg/DUET

[Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning

Language: Python - Size: 7.63 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 35 - Forks: 8

RUCAIBox/ELMER

This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text Generation

Language: Python - Size: 6.24 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 24 - Forks: 3

jeffhj/VER

The official repo for "VER: Unifying Verbalizing Entities and Relations" (Findings of EMNLP '23)

Language: Python - Size: 3.07 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

jeffhj/LM_PersonalInfoLeak

The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)

Language: Python - Size: 2.6 MB - Last synced at: 11 months ago - Pushed at: over 2 years ago - Stars: 18 - Forks: 4

zzz47zzz/CET

[ACL2023] Preserving Commonsense Knowledge from Pre-trained Language Models via Causal Inference

Language: Python - Size: 443 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 1

yueyu1030/AttrPrompt

[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.

Language: Python - Size: 705 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 94 - Forks: 5

bigai-nlco/CDBert Fork of patrick-tssn/CDBert

[ACL2023-Findings] Shuo Wen Jie Zi is a new learning paradigm that enhances the semantics understanding ability of the Chinese PLMs with dictionary knowledge and structure of Chinese characters

Language: Python - Size: 5.06 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

yzhan238/PIEClass

The source code used for paper "PIEClass: Weakly-Supervised Text Classification with Prompting and Noise-Robust Iterative Ensemble Training", published in EMNLP 2023.

Language: Python - Size: 16.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

yumeng5/SuperGen

[NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding

Language: Python - Size: 47.4 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 54 - Forks: 10

youlandasu/Choice-Fusion

Choice Fusion as Knowledge for Zero-Shot Dialogue State Tracking (ICASSP 2023)

Language: Python - Size: 98.6 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

Strong-AI-Lab/Logical-Reasoning-Reading-Comprehension-ReClor

The source code for #5 in the Logical Reasoning Reading Comprehension Leaderboard `ReClor`.

Language: Python - Size: 11.9 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 1

alexandra-chron/hierarchical-domain-adaptation

Code of NAACL 2022 "Efficient Hierarchical Domain Adaptation for Pretrained Language Models" paper.

Language: Python - Size: 5.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 23 - Forks: 3

irenepisani/Key_Point_Analysis

Key Point Analysis: implementation of two-component system for performing Key Point Matching and Key Point Generation task with multiple PLMs.

Language: Jupyter Notebook - Size: 387 KB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 1

patrick-tssn/CDBert

[ACL2023] Shuo Wen Jie Zi is a new learning paradigm that enhances the semantics understanding ability of the Chinese PLMs with dictionary knowledge and structure of Chinese characters

Language: Python - Size: 5.05 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

juletx/gpt2-eus

Pretraining GPT2 model on Basque language

Language: Python - Size: 12.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

etetteh/bio-electra

BioMedical Language Processing with ELECTRA

Language: Python - Size: 2.64 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

cheneydon/hrkd

This repository contains the code for the paper in EMNLP 2021: "HRKD: Hierarchical Relational Knowledge Distillation for Cross-domain Language Model Compression".

Language: Python - Size: 37.1 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 2

cheneydon/efficient-bert

This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge Distillation".

Language: Python - Size: 120 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 31 - Forks: 4

WeSeewy/Chinese-Clickbait

[CSCWD'23] Detecting Clickbait in Chinese Social Media by Prompt Learning

Language: Python - Size: 238 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

DAMO-NLP-SG/PeerDA

Source code of "PeerDA: Data Augmentation via Modeling Peer Relation for Span Identification Tasks" (ACL23)

Language: Python - Size: 6.39 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 1

dmhyun/MSRP

Official repository of Generating Multiple-Length Summaries via Reinforcement Learning for Unsupervised Sentence Summarization [EMNLP'22 Findings]

Language: Python - Size: 74.1 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

jeffhj/S-TEST

The implementation for "Can Language Models Be Specific? How?"

Language: Python - Size: 540 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

AndyCheang/TempoSum

TempoSum: Evaluating the Temporal Generalization of Abstractive Summarization

Size: 6.84 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

umanlp/Multi2WOZ Fork of chiachienhung/Multi2WOZ

Multi2WOZ: A Robust Multilingual Dataset and Conversational Pretraining for Task-Oriented Dialog

Language: Python - Size: 135 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 5 - Forks: 0

chiachienhung/Multi2WOZ

Multi2WOZ: A Robust Multilingual Dataset and Conversational Pretraining for Task-Oriented Dialog

Language: Python - Size: 135 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 1

Related Keywords
pretrained-language-model 124 natural-language-processing 32 nlp 25 bert 18 pretrained-models 17 deep-learning 16 language-model 15 large-language-models 14 pytorch 12 transformers 12 llm 12 text-generation 11 dialogue-systems 10 transformer 10 natural-language-understanding 9 dialog 9 machine-learning 9 text-classification 8 python 7 pretraining 7 gpt 5 fine-tuning 5 prompt 5 llms 5 transfer-learning 5 natural-language-generation 5 artificial-intelligence 5 contrastive-learning 5 parameter-efficient-learning 5 prompt-tuning 4 datasets 4 continual-learning 4 retrieval-augmented-generation 4 large-language-model 4 knowledge-distillation 4 bert-model 4 dialogue 4 recommendation 4 recommender-system 4 llama 4 conversational-bots 4 information-retrieval 4 conversation 4 conversational-ai 4 incremental-learning 3 prompts 3 llm-training 3 recommendation-system 3 data-augmentation 3 task-oriented-dialogue 3 question-answering 3 data-science 3 representation-learning 3 tensorflow 3 domain-adaptation 3 zero-shot 3 awesome-list 3 zero-shot-learning 3 transformers-models 2 temporal-data 2 math-word-problem 2 huggingface-transformers 2 multilingual 2 language-adaptation 2 xlmroberta 2 roberta 2 gpt4 2 llm-inference 2 model-compression 2 chinese 2 non-autoregressive-translation 2 named-entity-recognition 2 sentiment-analysis 2 acl2023 2 embeddings 2 tensorflow2 2 machine-reading-comprehension 2 knowledge-graph 2 clustering 2 prompt-learning 2 lstm 2 masked-language-models 2 chatgpt 2 conversational-recommendation 2 conversational-recommender-system 2 data-augmentation-strategies 2 unsupervised-learning 2 survey 2 lifelong-learning 2 computer-vision 2 bert-embeddings 2 pretrained-embedding 2 dataset 2 semantics 2 ai 2 awesome 2 gpt2 2 domain-specific-models 2 personal-privacy 1 xlnet 1