GitHub topics: pre-trained-language-models
thunlp/OpenPrompt
An Open-Source Framework for Prompt-Learning.
Language: Python - Size: 14.4 MB - Last synced at: about 1 hour ago - Pushed at: 11 months ago - Stars: 4,646 - Forks: 471

RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
Language: Python - Size: 43.1 MB - Last synced at: about 17 hours ago - Pushed at: 4 months ago - Stars: 11,605 - Forks: 907

zjunlp/KnowLM
An Open-sourced Knowledgable Large Language Model Framework.
Language: Python - Size: 38.7 MB - Last synced at: about 16 hours ago - Pushed at: 5 months ago - Stars: 1,327 - Forks: 132

zjunlp/KnowledgeEditingPapers
Must-read Papers on Knowledge Editing for Large Language Models.
Size: 154 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,100 - Forks: 76

ddangelov/Top2Vec
Top2Vec learns jointly embedded topic, document and word vectors.
Language: Python - Size: 83.4 MB - Last synced at: about 6 hours ago - Pushed at: 7 months ago - Stars: 3,060 - Forks: 376

ymcui/Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Language: Python - Size: 23 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 18,859 - Forks: 1,891

airaria/TextPruner
A PyTorch-based model pruning toolkit for pre-trained language models
Language: Python - Size: 10.5 MB - Last synced at: 5 days ago - Pushed at: almost 2 years ago - Stars: 387 - Forks: 35

cedrickchee/awesome-transformer-nlp
A curated list of NLP resources focused on Transformer networks, attention mechanism, GPT, BERT, ChatGPT, LLMs, and transfer learning.
Size: 905 KB - Last synced at: about 17 hours ago - Pushed at: 8 months ago - Stars: 1,097 - Forks: 131

zjunlp/MKG_Analogy
[ICLR 2023] Multimodal Analogical Reasoning over Knowledge Graphs
Language: Python - Size: 27.9 MB - Last synced at: 11 days ago - Pushed at: 11 months ago - Stars: 119 - Forks: 12

zjunlp/ChatCell
ChatCell: Facilitating Single-Cell Analysis with Natural Language
Language: Python - Size: 17 MB - Last synced at: 11 days ago - Pushed at: 19 days ago - Stars: 50 - Forks: 10

lyy1994/awesome-data-contamination
The Paper List on Data Contamination for Large Language Models Evaluation.
Size: 106 KB - Last synced at: about 3 hours ago - Pushed at: 3 months ago - Stars: 95 - Forks: 3

zjunlp/DART
[ICLR 2022] Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners
Language: Python - Size: 74.2 KB - Last synced at: 3 days ago - Pushed at: over 2 years ago - Stars: 132 - Forks: 17

brightmart/roberta_zh
RoBERTa中文预训练模型: RoBERTa for Chinese
Language: Python - Size: 308 KB - Last synced at: 26 days ago - Pushed at: 11 months ago - Stars: 2,718 - Forks: 413

Shaun-le/ViQAG
Question and answer generation (QAG) is a natural language processing (NLP) task that generates a question and an answer in the same time by using context information. The input context can be represented in form of structured information in a database or raw text. The outputs of QAG systems can be directly applied to several NLP applications...
Language: Python - Size: 218 KB - Last synced at: about 17 hours ago - Pushed at: about 2 months ago - Stars: 13 - Forks: 2

zjunlp/MolGen
[ICLR 2024] Domain-Agnostic Molecular Generation with Chemical Feedback
Language: Python - Size: 16.4 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 162 - Forks: 14

THUDM/P-tuning
A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.
Language: Python - Size: 5.98 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 933 - Forks: 112

wjn1996/HugNLP
HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformer. Please hugging for NLP now!😊 HugNLP will released to @HugAILab
Language: Python - Size: 3.71 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 247 - Forks: 13

pat-jj/TagReal
[ACL'23] Open KG Completion with PLM (Bridging Text Mining and Prompt Engineering)
Language: Python - Size: 287 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 43 - Forks: 2

sangheek16/dialogue-response-dynamics
Code and material used for evaluating PLMs on dialogue response dynamics
Language: Python - Size: 22.5 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 1

thunlp/PromptPapers
Must-read papers on prompt-based tuning for pre-trained language models.
Size: 194 KB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 4,178 - Forks: 384

Mohannadcse/AlloySpecRepair
An Empirical Evaluation of Pre-trained Large Language Models for Repairing Declarative Formal Specifications
Language: Alloy - Size: 12.9 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 1

Navy10021/SLS
SLS : Neural Information Retrieval(IR)-based Semantic Search model
Language: Jupyter Notebook - Size: 1.83 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 13 - Forks: 4

zjunlp/knowledge-rumination
[EMNLP 2023] Knowledge Rumination for Pre-trained Language Models
Language: Python - Size: 14 MB - Last synced at: 11 days ago - Pushed at: almost 2 years ago - Stars: 17 - Forks: 1

anoopkdcs/NLPBias
Towards Comprehensive Understanding of Bias in Pre-trained Neural Language Models: A Survey with Special Emphasis on Affective Bias
Size: 75.3 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 3 - Forks: 1

yuzhimanhua/SeeTopic
Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds (NAACL'22)
Language: C - Size: 80.1 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 17 - Forks: 2

iSEngLab/RetriGen
[2025 TOSEM] Improving Deep Assertion Generation via Fine-Tuning Retrieval-Augmented Pre-trained Language Models
Language: Python - Size: 343 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

iSEngLab/AG-RAG
[2025 TSE] Improving Retrieval-Augmented Deep Assertion Generation via Joint Optimization
Language: Python - Size: 99.2 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

WangRongsheng/Chinese-LLaMA-Alpaca-Usage
📔 对Chinese-LLaMA-Alpaca进行使用说明和核心代码注解
Language: Jupyter Notebook - Size: 655 KB - Last synced at: about 18 hours ago - Pushed at: about 2 years ago - Stars: 50 - Forks: 7

nkcs-iclab/linglong
LingLong (玲珑): a small-scale Chinese pretrained language model
Language: Python - Size: 1.39 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 18 - Forks: 1

anas-zafar/LLM-Survey
The official GitHub page for the survey paper "Large language models: a comprehensive survey of its applications, challenges, limitations, and future prospects"
Size: 29.1 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 24 - Forks: 5

datvo06/VRDSynth
Synthesizing programs to link visually-rich document entities. This is the replication code for VRDSynth paper, accepted in ISSTA'24
Language: Python - Size: 3.19 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 5 - Forks: 0

ankur-tutlani/RAG-Enhanced-QA
A cutting-edge system using the RAG approach to enhance document retrieval and question answering with pre-trained models.
Language: Python - Size: 11.7 KB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

deepmancer/vit-gpt2-image-captioning
Fine-tuning an encoder-decoder transformer (ViT-Base-Patch16-224-In21k and DistilGPT2) for image captioning on the COCO dataset
Language: Jupyter Notebook - Size: 8.41 MB - Last synced at: 9 months ago - Pushed at: 10 months ago - Stars: 4 - Forks: 0

yuzhimanhua/SciMult
Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding (Findings of EMNLP'23)
Language: Python - Size: 173 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 9 - Forks: 0

mojtabasajjadi/FarSSiBERT
A Pre-trained Language Model for Semantic Similarity Measurement of Persian Informal Short Texts
Language: Python - Size: 522 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

txsun1997/LMaaS-Papers
Awesome papers on Language-Model-as-a-Service (LMaaS)
Size: 878 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 518 - Forks: 32

rayhung1/NLP-Headline-Generation
Study Uses LLMs to craft accurate, engaging headlines from Reddit Posts
Language: Jupyter Notebook - Size: 138 MB - Last synced at: 8 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

constantjxyz/PromptLink
PromptLink: Leveraging Large Language Models for Cross-Source Biomedical Concept Linking
Language: Python - Size: 597 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

zjunlp/Revisit-KNN
[CCL 2023] Revisiting k-NN for Fine-tuning Pre-trained Language Models
Language: Python - Size: 205 KB - Last synced at: 11 days ago - Pushed at: about 2 years ago - Stars: 10 - Forks: 1

westlake-repl/Multimodal-recommendation-datasets
A repository listing important datasets for multimodal recommender systems
Size: 20.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1

laihuiyuan/Figurative-Language-Generation
A Survey on Automatic Generation of Figurative Language: From Rule-based Systems to Large Language Models
Size: 562 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

winleexz/Question-and-Answer-Chatbot-with-Dialogflow
Question Answering Chatbot with DistilRoBERTa Sentence Embeddings, Dialogflow and Ngrok
Language: Jupyter Notebook - Size: 144 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 2

Yangyi-Chen/LM-TOAST
Source code for ACL 2023 Findings paper "Making Pre-trained Language Models both Task-solvers and Self-calibrators"
Language: Python - Size: 322 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 0

sunyilgdx/SIFRank_zh
Keyphrase or Keyword Extraction 基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model 的中文版代码)
Language: Python - Size: 2.38 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 404 - Forks: 78

sunyilgdx/SIFRank
The code of our paper "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model"
Language: Python - Size: 5.81 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 120 - Forks: 20

omermohammed9/AnswerQuestion
Small Web Project with TF
Size: 88.9 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

dariush-saberi/formalwriter
The source code for the project FormalWriter.com
Language: Cython - Size: 13.8 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

Das-Boot/bot_or_not
Code and data for paper 'Towards human-compatible autonomous car: A study of non-verbal Turing test in automated driving with affective transition modelling'
Language: Jupyter Notebook - Size: 3.7 GB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

WebGao/Prompt-Learning
Learning notes of Prompt.
Size: 53.4 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

xuanyuan14/ARES
SIGIR'22 paper: Axiomatically Regularized Pre-training for Ad hoc Search
Language: Python - Size: 6.42 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 25 - Forks: 2

AndyChiangSH/CDGP
Code for Findings of EMNLP 2022 short paper "CDGP: Automatic Cloze Distractor Generation based on Pre-trained Language Model".
Language: Jupyter Notebook - Size: 102 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 9 - Forks: 2

hienhuynhtdn/GPT2andImplicitCausality
Evaluation of the ability of GPT-2 to learn human biases in implicit causality.
Language: Jupyter Notebook - Size: 2.8 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 1

TinyTalks/Butterfly
Butterfly: An Open NLP Research Project for Chinese Danmaku,B站弹幕NLP社区建设
Size: 6.52 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

haitao9833/haitaoPython
Jupyter Notebook & 深度学习 & 快速上手使用教程
Language: Jupyter Notebook - Size: 9.96 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

Victorwz/VaLM
VaLM: Visually-augmented Language Modeling. ICLR 2023.
Language: Python - Size: 13.6 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 46 - Forks: 2

ai2-ner-project/pytorch-ko-ner
PLM 기반 한국어 개체명 인식 (NER)
Language: Python - Size: 108 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 12 - Forks: 2

TobiasLee/Awesome-Efficient-PLM
Must-read papers on improving efficiency for pre-trained language models.
Size: 74.2 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 91 - Forks: 12

lancopku/DynamicKD
Code for EMNLP 2021 main conference paper "Dynamic Knowledge Distillation for Pre-trained Language Models"
Language: Python - Size: 34.2 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 31 - Forks: 6

Hanlard/Electra_CRF_NER
We start a company-name recognition task with a small scale and low quality training data, then using skills to enhanced model training speed and predicting performance with least artificial participation. The methods we use involve lite pre-training models such as Albert-small or Electra-small with financial corpus, knowledge of distillation and multi-stage learning. The result is that we improve the recall rate of company names recognition task from 0.73 to 0.92 and get 4 times as fast as BERT-Bilstm-CRF model.
Language: Python - Size: 5.06 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 73 - Forks: 14

lancopku/CascadeBERT
Code for CascadeBERT, Findings of EMNLP 2021
Language: Python - Size: 22.5 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 12 - Forks: 1

hmohebbi/ValueZeroing
The official repo for the EACL 2023 paper "Quantifying Context Mixing in Transformers"
Language: Python - Size: 64.5 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 1

XingLuxi/Cal-FLOPs-for-PLM
Calculating FLOPs of Pre-trained Models in NLP
Language: Python - Size: 9.77 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 13 - Forks: 3

cliang1453/super-structured-lottery-tickets
Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization (ACL 2021)
Language: Python - Size: 683 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 14 - Forks: 1

edchengg/GigaBERT
Arabic Relation extraction system, named entity recognition, IE
Language: Python - Size: 31 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 7 - Forks: 2

RUCwenxiang/Answer-Selection
2020年贝壳找房问答匹配竞赛
Language: Python - Size: 2.71 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

lanwuwei/GigaBERT
Zero-shot Transfer Learning from English to Arabic
Language: Python - Size: 33.2 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 26 - Forks: 5

josemarcosrf/BiDAF-example-notebook
Usage example for the AllenNLP BiDAF pre-trained model
Language: Jupyter Notebook - Size: 2.93 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 3

KelleyYin/XLM-Plus
Language: Python - Size: 231 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 10 - Forks: 1
