An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: pre-trained-language-models

thunlp/OpenPrompt

An Open-Source Framework for Prompt-Learning.

Language: Python - Size: 14.4 MB - Last synced at: about 1 hour ago - Pushed at: 11 months ago - Stars: 4,646 - Forks: 471

RUCAIBox/LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Language: Python - Size: 43.1 MB - Last synced at: about 17 hours ago - Pushed at: 4 months ago - Stars: 11,605 - Forks: 907

zjunlp/KnowLM

An Open-sourced Knowledgable Large Language Model Framework.

Language: Python - Size: 38.7 MB - Last synced at: about 16 hours ago - Pushed at: 5 months ago - Stars: 1,327 - Forks: 132

zjunlp/KnowledgeEditingPapers

Must-read Papers on Knowledge Editing for Large Language Models.

Size: 154 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,100 - Forks: 76

ddangelov/Top2Vec

Top2Vec learns jointly embedded topic, document and word vectors.

Language: Python - Size: 83.4 MB - Last synced at: about 6 hours ago - Pushed at: 7 months ago - Stars: 3,060 - Forks: 376

ymcui/Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Language: Python - Size: 23 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 18,859 - Forks: 1,891

airaria/TextPruner

A PyTorch-based model pruning toolkit for pre-trained language models

Language: Python - Size: 10.5 MB - Last synced at: 5 days ago - Pushed at: almost 2 years ago - Stars: 387 - Forks: 35

cedrickchee/awesome-transformer-nlp

A curated list of NLP resources focused on Transformer networks, attention mechanism, GPT, BERT, ChatGPT, LLMs, and transfer learning.

Size: 905 KB - Last synced at: about 17 hours ago - Pushed at: 8 months ago - Stars: 1,097 - Forks: 131

zjunlp/MKG_Analogy

[ICLR 2023] Multimodal Analogical Reasoning over Knowledge Graphs

Language: Python - Size: 27.9 MB - Last synced at: 11 days ago - Pushed at: 11 months ago - Stars: 119 - Forks: 12

zjunlp/ChatCell

ChatCell: Facilitating Single-Cell Analysis with Natural Language

Language: Python - Size: 17 MB - Last synced at: 11 days ago - Pushed at: 19 days ago - Stars: 50 - Forks: 10

lyy1994/awesome-data-contamination

The Paper List on Data Contamination for Large Language Models Evaluation.

Size: 106 KB - Last synced at: about 3 hours ago - Pushed at: 3 months ago - Stars: 95 - Forks: 3

zjunlp/DART

[ICLR 2022] Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners

Language: Python - Size: 74.2 KB - Last synced at: 3 days ago - Pushed at: over 2 years ago - Stars: 132 - Forks: 17

brightmart/roberta_zh

RoBERTa中文预训练模型: RoBERTa for Chinese

Language: Python - Size: 308 KB - Last synced at: 26 days ago - Pushed at: 11 months ago - Stars: 2,718 - Forks: 413

Shaun-le/ViQAG

Question and answer generation (QAG) is a natural language processing (NLP) task that generates a question and an answer in the same time by using context information. The input context can be represented in form of structured information in a database or raw text. The outputs of QAG systems can be directly applied to several NLP applications...

Language: Python - Size: 218 KB - Last synced at: about 17 hours ago - Pushed at: about 2 months ago - Stars: 13 - Forks: 2

zjunlp/MolGen

[ICLR 2024] Domain-Agnostic Molecular Generation with Chemical Feedback

Language: Python - Size: 16.4 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 162 - Forks: 14

THUDM/P-tuning

A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.

Language: Python - Size: 5.98 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 933 - Forks: 112

wjn1996/HugNLP

HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformer. Please hugging for NLP now!😊 HugNLP will released to @HugAILab

Language: Python - Size: 3.71 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 247 - Forks: 13

pat-jj/TagReal

[ACL'23] Open KG Completion with PLM (Bridging Text Mining and Prompt Engineering)

Language: Python - Size: 287 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 43 - Forks: 2

sangheek16/dialogue-response-dynamics

Code and material used for evaluating PLMs on dialogue response dynamics

Language: Python - Size: 22.5 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 1

thunlp/PromptPapers

Must-read papers on prompt-based tuning for pre-trained language models.

Size: 194 KB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 4,178 - Forks: 384

Mohannadcse/AlloySpecRepair

An Empirical Evaluation of Pre-trained Large Language Models for Repairing Declarative Formal Specifications

Language: Alloy - Size: 12.9 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 1

Navy10021/SLS

SLS : Neural Information Retrieval(IR)-based Semantic Search model

Language: Jupyter Notebook - Size: 1.83 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 13 - Forks: 4

zjunlp/knowledge-rumination

[EMNLP 2023] Knowledge Rumination for Pre-trained Language Models

Language: Python - Size: 14 MB - Last synced at: 11 days ago - Pushed at: almost 2 years ago - Stars: 17 - Forks: 1

anoopkdcs/NLPBias

Towards Comprehensive Understanding of Bias in Pre-trained Neural Language Models: A Survey with Special Emphasis on Affective Bias

Size: 75.3 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 3 - Forks: 1

yuzhimanhua/SeeTopic

Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds (NAACL'22)

Language: C - Size: 80.1 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 17 - Forks: 2

iSEngLab/RetriGen

[2025 TOSEM] Improving Deep Assertion Generation via Fine-Tuning Retrieval-Augmented Pre-trained Language Models

Language: Python - Size: 343 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

iSEngLab/AG-RAG

[2025 TSE] Improving Retrieval-Augmented Deep Assertion Generation via Joint Optimization

Language: Python - Size: 99.2 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

WangRongsheng/Chinese-LLaMA-Alpaca-Usage

📔 对Chinese-LLaMA-Alpaca进行使用说明和核心代码注解

Language: Jupyter Notebook - Size: 655 KB - Last synced at: about 18 hours ago - Pushed at: about 2 years ago - Stars: 50 - Forks: 7

nkcs-iclab/linglong

LingLong (玲珑): a small-scale Chinese pretrained language model

Language: Python - Size: 1.39 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 18 - Forks: 1

anas-zafar/LLM-Survey

The official GitHub page for the survey paper "Large language models: a comprehensive survey of its applications, challenges, limitations, and future prospects"

Size: 29.1 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 24 - Forks: 5

datvo06/VRDSynth

Synthesizing programs to link visually-rich document entities. This is the replication code for VRDSynth paper, accepted in ISSTA'24

Language: Python - Size: 3.19 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 5 - Forks: 0

ankur-tutlani/RAG-Enhanced-QA

A cutting-edge system using the RAG approach to enhance document retrieval and question answering with pre-trained models.

Language: Python - Size: 11.7 KB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

deepmancer/vit-gpt2-image-captioning

Fine-tuning an encoder-decoder transformer (ViT-Base-Patch16-224-In21k and DistilGPT2) for image captioning on the COCO dataset

Language: Jupyter Notebook - Size: 8.41 MB - Last synced at: 9 months ago - Pushed at: 10 months ago - Stars: 4 - Forks: 0

yuzhimanhua/SciMult

Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding (Findings of EMNLP'23)

Language: Python - Size: 173 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 9 - Forks: 0

mojtabasajjadi/FarSSiBERT

A Pre-trained Language Model for Semantic Similarity Measurement of Persian Informal Short Texts

Language: Python - Size: 522 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

txsun1997/LMaaS-Papers

Awesome papers on Language-Model-as-a-Service (LMaaS)

Size: 878 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 518 - Forks: 32

rayhung1/NLP-Headline-Generation

Study Uses LLMs to craft accurate, engaging headlines from Reddit Posts

Language: Jupyter Notebook - Size: 138 MB - Last synced at: 8 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

constantjxyz/PromptLink

PromptLink: Leveraging Large Language Models for Cross-Source Biomedical Concept Linking

Language: Python - Size: 597 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

zjunlp/Revisit-KNN

[CCL 2023] Revisiting k-NN for Fine-tuning Pre-trained Language Models

Language: Python - Size: 205 KB - Last synced at: 11 days ago - Pushed at: about 2 years ago - Stars: 10 - Forks: 1

westlake-repl/Multimodal-recommendation-datasets

A repository listing important datasets for multimodal recommender systems

Size: 20.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1

laihuiyuan/Figurative-Language-Generation

A Survey on Automatic Generation of Figurative Language: From Rule-based Systems to Large Language Models

Size: 562 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

winleexz/Question-and-Answer-Chatbot-with-Dialogflow

Question Answering Chatbot with DistilRoBERTa Sentence Embeddings, Dialogflow and Ngrok

Language: Jupyter Notebook - Size: 144 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 2

Yangyi-Chen/LM-TOAST

Source code for ACL 2023 Findings paper "Making Pre-trained Language Models both Task-solvers and Self-calibrators"

Language: Python - Size: 322 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 0

sunyilgdx/SIFRank_zh

Keyphrase or Keyword Extraction 基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model 的中文版代码)

Language: Python - Size: 2.38 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 404 - Forks: 78

sunyilgdx/SIFRank

The code of our paper "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model"

Language: Python - Size: 5.81 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 120 - Forks: 20

omermohammed9/AnswerQuestion

Small Web Project with TF

Size: 88.9 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

dariush-saberi/formalwriter

The source code for the project FormalWriter.com

Language: Cython - Size: 13.8 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

Das-Boot/bot_or_not

Code and data for paper 'Towards human-compatible autonomous car: A study of non-verbal Turing test in automated driving with affective transition modelling'

Language: Jupyter Notebook - Size: 3.7 GB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

WebGao/Prompt-Learning

Learning notes of Prompt.

Size: 53.4 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

xuanyuan14/ARES

SIGIR'22 paper: Axiomatically Regularized Pre-training for Ad hoc Search

Language: Python - Size: 6.42 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 25 - Forks: 2

AndyChiangSH/CDGP

Code for Findings of EMNLP 2022 short paper "CDGP: Automatic Cloze Distractor Generation based on Pre-trained Language Model".

Language: Jupyter Notebook - Size: 102 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 9 - Forks: 2

hienhuynhtdn/GPT2andImplicitCausality

Evaluation of the ability of GPT-2 to learn human biases in implicit causality.

Language: Jupyter Notebook - Size: 2.8 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 1

TinyTalks/Butterfly

Butterfly: An Open NLP Research Project for Chinese Danmaku,B站弹幕NLP社区建设

Size: 6.52 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

haitao9833/haitaoPython

Jupyter Notebook & 深度学习 & 快速上手使用教程

Language: Jupyter Notebook - Size: 9.96 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

Victorwz/VaLM

VaLM: Visually-augmented Language Modeling. ICLR 2023.

Language: Python - Size: 13.6 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 46 - Forks: 2

ai2-ner-project/pytorch-ko-ner

PLM 기반 한국어 개체명 인식 (NER)

Language: Python - Size: 108 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 12 - Forks: 2

TobiasLee/Awesome-Efficient-PLM

Must-read papers on improving efficiency for pre-trained language models.

Size: 74.2 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 91 - Forks: 12

lancopku/DynamicKD

Code for EMNLP 2021 main conference paper "Dynamic Knowledge Distillation for Pre-trained Language Models"

Language: Python - Size: 34.2 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 31 - Forks: 6

Hanlard/Electra_CRF_NER

We start a company-name recognition task with a small scale and low quality training data, then using skills to enhanced model training speed and predicting performance with least artificial participation. The methods we use involve lite pre-training models such as Albert-small or Electra-small with financial corpus, knowledge of distillation and multi-stage learning. The result is that we improve the recall rate of company names recognition task from 0.73 to 0.92 and get 4 times as fast as BERT-Bilstm-CRF model.

Language: Python - Size: 5.06 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 73 - Forks: 14

lancopku/CascadeBERT

Code for CascadeBERT, Findings of EMNLP 2021

Language: Python - Size: 22.5 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 12 - Forks: 1

hmohebbi/ValueZeroing

The official repo for the EACL 2023 paper "Quantifying Context Mixing in Transformers"

Language: Python - Size: 64.5 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 1

XingLuxi/Cal-FLOPs-for-PLM

Calculating FLOPs of Pre-trained Models in NLP

Language: Python - Size: 9.77 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 13 - Forks: 3

cliang1453/super-structured-lottery-tickets

Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization (ACL 2021)

Language: Python - Size: 683 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 14 - Forks: 1

edchengg/GigaBERT

Arabic Relation extraction system, named entity recognition, IE

Language: Python - Size: 31 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 7 - Forks: 2

RUCwenxiang/Answer-Selection

2020年贝壳找房问答匹配竞赛

Language: Python - Size: 2.71 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

lanwuwei/GigaBERT

Zero-shot Transfer Learning from English to Arabic

Language: Python - Size: 33.2 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 26 - Forks: 5

josemarcosrf/BiDAF-example-notebook

Usage example for the AllenNLP BiDAF pre-trained model

Language: Jupyter Notebook - Size: 2.93 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 3

KelleyYin/XLM-Plus

Language: Python - Size: 231 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 10 - Forks: 1

Related Keywords
pre-trained-language-models 68 natural-language-processing 18 large-language-models 14 nlp 13 bert 10 pytorch 9 language-model 8 llm 7 deep-learning 6 transformers 6 transformer 5 prompt-learning 5 gpt-3 5 pre-trained-model 5 llms 4 llama 4 paper-list 4 prompt 4 ai 3 few-shot-learning 3 relation-extraction 3 prompt-tuning 3 lora 3 machine-learning 3 topic-modeling 3 word-embeddings 3 chatgpt 3 gpt-2 3 pre-training 3 fine-tuning 3 prompt-engineering 2 named-entity-recognition 2 model-compression 2 software-engineering 2 unit-testing 2 neural-networks 2 knowledge-graph 2 alpaca 2 jupyter-notebook 2 evaluation 2 awesome-list 2 model-pruning 2 iclr 2 survey 2 attention-mechanism 2 prompt-toolkit 2 chinese 2 sif 2 roberta 2 bert-model 2 arabic-nlp 2 pre-trained 2 knowlm 2 instruction-tuning 2 nlp-machine-learning 2 huggingface 2 natural-language-understanding 2 reasoning 2 foundation-models 2 keyphrase-extraction 2 text-classification 2 elmo 2 keywords-extraction 1 dialogflow 1 sifrank 1 cosinesimilarity 1 conversational-bots 1 python3 1 chatbot 1 stanfordcorenlp 1 simile 1 dialogflow-agent 1 dialogflow-chatbot 1 dialogflow-fulfillment 1 sentence-embeddings 1 python36 1 distilroberta 1 ngrok 1 pre-trained-embeddings 1 retrieval-based-dialog-system 1 webhook 1 keyword-extraction 1 calibration 1 sarcasm 1 image-recommendation 1 foundation-recommendation-model 1 cross-domain-recommendation 1 retrieval-augmented 1 knn 1 glue 1 ccl2023 1 zero-shot-learning 1 retrieve-and-rank 1 large-language-model 1 biomedical-concept-linking 1 summarization 1 rouge-metric 1 llm-training 1 headline-generation 1 bleu-score 1