GitHub topics: pretrained-language-model
Separius/awesome-sentence-embedding 📦
A curated list of pretrained sentence and word embedding models
Language: Python - Size: 282 KB - Last synced at: 1 day ago - Pushed at: about 4 years ago - Stars: 2,258 - Forks: 262

AndrewZhe/lawyer-llama
中文法律LLaMA (LLaMA for Chinese legel domain)
Language: Python - Size: 6.85 MB - Last synced at: 3 days ago - Pushed at: 10 months ago - Stars: 946 - Forks: 130

SuperBruceJia/Awesome-LLM-Self-Consistency
Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models
Size: 149 KB - Last synced at: 4 days ago - Pushed at: 10 months ago - Stars: 99 - Forks: 8

microsoft/torchscale
Foundation Architecture for (M)LLMs
Language: Python - Size: 361 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 3,082 - Forks: 219

Clarifai/examples
Examples for Clarifai Python SDK and Integrations. Give the repo a star ⭐
Language: Jupyter Notebook - Size: 168 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 20 - Forks: 3

hyintell/awesome-refreshing-llms
EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining.
Size: 2.31 MB - Last synced at: about 4 hours ago - Pushed at: over 1 year ago - Stars: 133 - Forks: 10

qianlima-lab/awesome-lifelong-learning-methods-for-llm Fork of zzz47zzz/awesome-lifelong-learning-methods-for-llm
This repository collects awesome survey, resource, and paper for Lifelong Learning for Large Language Models. (Updated Regularly)
Size: 428 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 52 - Forks: 1

zzz47zzz/awesome-lifelong-learning-methods-for-llm
[ACM Computing Surveys 2025] This repository collects awesome survey, resource, and paper for Lifelong Learning with Large Language Models. (Updated Regularly)
Size: 428 KB - Last synced at: 16 days ago - Pushed at: 17 days ago - Stars: 129 - Forks: 6

thunlp/OpenDelta
A plug-and-play library for parameter-efficient-tuning (Delta Tuning)
Language: Python - Size: 42 MB - Last synced at: 17 days ago - Pushed at: 9 months ago - Stars: 1,028 - Forks: 83

Kitsunp/Small-lenguaje-Model-Hybrid-Norm-Furier-Formers
A compact language model implementing HybridNorm and Fourier-based attention. Combines CoLA (low-rank projections), FANformer, and hybrid normalization to create an efficient decoder-only transformer. Leverages periodicity modeling and gated residuals to enhance performance while maintaining a small parameter footprint.
Language: Python - Size: 4.6 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 4 - Forks: 0

vyasdeepti/Text-to-Image-Generator-using-GANs
This repository implements a Text-to-Image Generator using Generative Adversarial Networks (GANs). The project takes textual descriptions as input and generates corresponding images, demonstrating the capability of deep learning models to bridge the gap between natural language and visual content.
Language: Jupyter Notebook - Size: 2.15 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

hojat72elect/IMDB_storyline_summaries_database
The database IMDB storylines and their summaries
Size: 1.21 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

cumc-dbmi/cehrbert
CEHR-BERT: Incorporating temporal information from structured EHR data to improve prediction tasks
Language: Python - Size: 17.5 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 35 - Forks: 11

DC-research/TEMPO
The official code for "TEMPO: Prompt-based Generative Pre-trained Transformer for Time Series Forecasting (ICLR 2024)". TEMPO is one of the very first open source Time Series Foundation Models for forecasting task v1.0 version.
Language: Python - Size: 1.82 MB - Last synced at: 15 days ago - Pushed at: 4 months ago - Stars: 109 - Forks: 15

ajs7270/EFE-Reasoner
The implementation of the Explicit Feature Extraction (EFE) Reasoner, a model designed to improve reasoning about numerical magnitudes in math word problems.
Language: Python - Size: 7.15 MB - Last synced at: 8 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

azminewasi/Awesome-LLMs-ICLR-24
It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) in 2024.
Size: 821 KB - Last synced at: about 5 hours ago - Pushed at: about 1 year ago - Stars: 62 - Forks: 3

microsoft/COCO-LM
[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining
Language: Python - Size: 4.1 MB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 117 - Forks: 13

THUDM/P-tuning-v2
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
Language: Python - Size: 1.41 MB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 2,037 - Forks: 203

ZhaohanM/FusionGDA
we propose a novel FusionGDA model, which utilises a pre-training phase with a fusion module to enrich the gene and disease semantic representations encoded by pre-trained language models.
Language: Jupyter Notebook - Size: 1.61 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 2

wenge-research/YAYI2 📦
YAYI 2 是中科闻歌研发的新一代开源大语言模型,采用了超过 2 万亿 Tokens 的高质量、多语言语料进行预训练。(Repo for YaYi 2 Chinese LLMs)
Language: Python - Size: 1.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 3,624 - Forks: 19

RenzeLou/awesome-instruction-learning
Papers and Datasets on Instruction Tuning and Following. ✨✨✨
Language: Python - Size: 6.25 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 493 - Forks: 24

hoadm-net/FTVPLM
Tinh chỉnh mô hình ngôn ngữ lớn tiếng Việt cho một số tác vụ xử lý ngôn ngữ tự nhiên.
Language: Python - Size: 0 Bytes - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

AM-Ankitgit/Complete-Deep-Learning-Algorithms
deep-learning machine-learning
Language: Jupyter Notebook - Size: 196 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

allenai/dont-stop-pretraining
Code associated with the Don't Stop Pretraining ACL 2020 paper
Language: Python - Size: 554 KB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 529 - Forks: 73

xcfcode/Summarization-Papers 📦
Summarization Papers
Language: TeX - Size: 40.1 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 1,010 - Forks: 144

yukito0209/sentiment-analysis-of-taptap-game-user-reviews
A Data-Driven Study on Sentiment Analysis of TapTap Game User Reviews
Language: Python - Size: 4.88 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

heraclex12/NLP2SPARQL
Translate Natural Language Processing to SPARQL Query and vice versa
Language: Python - Size: 223 KB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 51 - Forks: 12

OpenBMB/CPM-Live
Live Training for Open-source Big Models
Language: Python - Size: 1.11 MB - Last synced at: 7 days ago - Pushed at: about 2 years ago - Stars: 506 - Forks: 39

Hzfinfdu/Diffusion-BERT
ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models
Language: Python - Size: 1.69 MB - Last synced at: 26 days ago - Pushed at: over 1 year ago - Stars: 307 - Forks: 24

apsinghAnalytics/FinRAGify_App
An LLM app leveraging RAG with LangChain and GPT-4 mini to analyze earnings call transcripts, assess company performance, using natural language queries (NLP), FAISS (vector database), and Hugging Face re-ranking models.
Language: Jupyter Notebook - Size: 4.85 MB - Last synced at: 2 months ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

wxl1999/UniCRS
[KDD22] Official PyTorch implementation for "Towards Unified Conversational Recommender Systems via Knowledge-Enhanced Prompt Learning".
Language: Python - Size: 13 MB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 27 - Forks: 19

gaoisbest/NLP-Projects
word2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, information extraction (i.e., entity, relation and event extraction), knowledge graph, text generation, network embedding
Language: OpenEdge ABL - Size: 384 MB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 544 - Forks: 151

GanjinZero/CODER
CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]
Language: Python - Size: 5.63 MB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 78 - Forks: 5

EagleW/Multimedia-Generative-Script-Learning
Official implementation of the ACL Findings 2023 paper: Multimedia Generative Script Learning for Task Planning
Language: Python - Size: 45.9 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 8 - Forks: 0

thunlp/Prompt-Transferability
On Transferability of Prompt Tuning for Natural Language Processing
Language: Python - Size: 629 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 99 - Forks: 11

imSanko/Image_Caption_Generator_With_Transformers 📦
This repository contains code for generating captions for images using a Transformer-based model. The model used is the `VisionEncoderDecoderModel` from the Hugging Face Transformers library, specifically the `nlpconnect/vit-gpt2-image-captioning` model.
Language: Jupyter Notebook - Size: 233 KB - Last synced at: 2 months ago - Pushed at: 10 months ago - Stars: 12 - Forks: 1

SJTU-IPADS/Bamboo
Bamboo-7B Large Language Model
Size: 223 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 92 - Forks: 1

megagonlabs/cocosum
:coconut: Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)
Language: Python - Size: 864 KB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 21 - Forks: 2

thunlp/CokeBERT
CokeBERT: Contextual Knowledge Selection and Embedding towards Enhanced Pre-Trained Language Models
Language: Python - Size: 101 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 31 - Forks: 9

Evfidiw/LMs_NLU
Exploring different language models on text classification tasks.
Language: Python - Size: 21.1 MB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 3 - Forks: 0

zzz47zzz/codebase-for-incremental-learning-with-llm
[ACL2024] A Codebase for Incremental Learning with Large Language Models; Official released code for "Learn or Recall? Revisiting Incremental Learning with Pre-trained Language Models (ACL 2024)", "Incremental Sequence Labeling: A Tale of Two Shifts (ACL 2024 Findings)", and "Concept-1K: A Novel Benchmark for Instance Incremental Learning (arxiv)"
Language: Python - Size: 2.88 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 30 - Forks: 7

XCollab/HuggingFace
This repository provides an overview of Hugging Face's Transformers library, a powerful tool for natural language processing (NLP) and machine learning tasks.
Language: Jupyter Notebook - Size: 1.57 MB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 3 - Forks: 1

EagleW/Scientific-Inspiration-Machines-Optimized-for-Novelty
Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty
Language: Python - Size: 340 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 71 - Forks: 11

EngineeringSoftware/CoditT5
CoditT5: Pretraining for Source Code and Natural Language Editing
Language: Python - Size: 91.9 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 28 - Forks: 3

CAI991108/Machine-Learning-and-Language-Model
This project explores GPT-2 and Llama models through pre-training, fine-tuning, and Chain-of-Thought (CoT) prompting. It includes memory-efficient optimizations (SGD, LoRA, BAdam) and evaluations on math datasets (GSM8K, NumGLUE, StimulEq, SVAMP).
Language: Python - Size: 54.5 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

LYH-YF/MWPToolkit
MWPToolkit is an open-source framework for math word problem(MWP) solvers.
Language: Python - Size: 59.8 MB - Last synced at: 16 days ago - Pushed at: over 2 years ago - Stars: 163 - Forks: 37

FranxYao/PoincareProbe
Implementation of ICLR 21 paper: Probing BERT in Hyperbolic Spaces
Language: Jupyter Notebook - Size: 5.94 MB - Last synced at: about 2 months ago - Pushed at: about 4 years ago - Stars: 58 - Forks: 5

RUCAIBox/UniCRS Fork of wxl1999/UniCRS
[KDD22] Official PyTorch implementation for "Towards Unified Conversational Recommender Systems via Knowledge-Enhanced Prompt Learning".
Language: Python - Size: 13 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 83 - Forks: 14

shreydan/masked-language-modeling
Transformers Pre-Training with MLM objective — implemented encoder-only model and trained from scratch on Wikipedia dataset.
Language: Jupyter Notebook - Size: 221 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 2

ZhengZixiang/ATPapers
Worth-reading papers and related resources on attention mechanism, Transformer and pretrained language model (PLM) such as BERT. 值得一读的注意力机制、Transformer和预训练语言模型论文与相关资源集合
Size: 648 KB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 133 - Forks: 13

theblackcat102/unify-learning-paradigms
data collator for UL2 and U-PaLM
Language: Python - Size: 158 KB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 29 - Forks: 1

eric11eca/reckoning-metakg
RECKONING is a bi-level learning algorithm that improves language models' reasoning ability by folding contextual knowledge into parametric knowledge through back-propagation.
Language: Python - Size: 21.4 MB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 5 - Forks: 1

SreeEswaran/Train-your-LLM
This repository contains code and resources for training, fine-tuning, and deploying large language models using Hugging Face's Transformers library.
Language: Python - Size: 33.2 KB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 3 - Forks: 2

GanjinZero/BioBART
BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model [ACL-BioNLP 2022]
Language: Python - Size: 117 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 52 - Forks: 4

ImKeTT/ReSee
[EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue PyTorch Implementation
Language: Python - Size: 464 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 0

SrulyRosenblat/Detecting-Pretraining-Data-Using-Probability-Slopes
A new method for recognizing text that is included in an LLM's training data.
Language: Jupyter Notebook - Size: 1.38 MB - Last synced at: 3 days ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

20101301-Alina-Hasan/Robust-Fake-Review-Detection-using-Uncertainty-Aware-LSTM-and-BERT
Our study utilizes BERT and LSTM models alongside Monte Carlo Dropout (MCD) on the Yelp Labelled Dataset. MCD bolsters robustness by introducing uncertainty through neuron dropout. The BERT-embedded MCD achieves an impressive 91.75% accuracy, surpassing the LSTM model.
Size: 1.36 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

zerohd4869/CIFM
The official repository for ACL 2024 paper "Representation Learning with Conditional Information Flow Maximization"
Language: Python - Size: 5.42 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 4 - Forks: 1

wxl1999/CFCRS
[KDD23] Official PyTorch implementation for "Improving Conversational Recommendation Systems via Counterfactual Data Simulation".
Language: Python - Size: 10.5 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 8 - Forks: 3

lgalke/text-clf-baselines
WideMLP for Text Classification
Language: Python - Size: 78.1 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 27 - Forks: 3

HySonLab/Protein_Pretrain
Multimodal Pretraining for Unsupervised Protein Representation Learning
Language: Python - Size: 241 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 12 - Forks: 2

git-disl/BERT4ETH
BERT4ETH: A Pre-trained Transformer for Ethereum Fraud Detection (WWW23)
Language: Python - Size: 6.88 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 95 - Forks: 15

ZobayerAkib/Transfer-Learning-for-NLP-with-TensorFlow-Hub
This project demonstrates the use of various pre-trained models for transfer learning in NLP using TensorFlow Hub.
Language: Jupyter Notebook - Size: 3.05 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

vgaraujov/Seq2Seq-Spanish-PLMs
Sequence-to-Sequence Spanish Pre-trained Language Models
Language: Python - Size: 115 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

microsoft/AMOS
[ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators
Language: Python - Size: 3.93 MB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 24 - Forks: 2

DooPhiLong/Emotion-classification
Emotion classification base on short texts
Size: 2.93 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

OpenMatch/COCO-DR
[EMNLP 2022] This is the code repo for our EMNLP‘22 paper "COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning".
Language: Python - Size: 2.2 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 45 - Forks: 4

RUCAIBox/CFCRS Fork of wxl1999/CFCRS
[KDD23] Official PyTorch implementation for "Improving Conversational Recommendation Systems via Counterfactual Data Simulation".
Language: Python - Size: 10.5 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 11 - Forks: 0

GVanave/Langchain-Chatbot
Langchain Chatbot Project utilizes Langchain and Streamlit to develop interactive chatbots. Leveraging natural language processing, the project demonstrates two approaches: a CSV-based chatbot and a Llama pretrained model.
Language: Jupyter Notebook - Size: 15.6 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

KaikePing/SynPL
SynPL: a zero-shot prompt language model to process multiple-choice questions on synonyms
Language: Jupyter Notebook - Size: 988 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

expertailab/An-Empirical-study-on-Pre-trained-Embeddings-and-Language-Models-for-Bot-Detection
Code used in An Empirical study on Pre-trained Embeddings and Language Models for Bot Detection.
Language: Jupyter Notebook - Size: 725 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 1

ChangwenXu98/TransPolymer
Implementation of "TransPolymer: a Transformer-based language model for polymer property predictions" in PyTorch
Language: Python - Size: 1.65 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 43 - Forks: 12

wutaiqiang/WID-NAACL2024
Code for paper: Weight-Inherited Distillation for Task-Agnostic BERT Compression
Language: Python - Size: 351 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

juyongjiang/Awesome-ANCE
Implementation of paper "Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval"
Language: Python - Size: 997 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 11 - Forks: 0

yingyuankai/AiSpace
AiSpace: Better practices for deep learning model development and deployment For Tensorflow 2.0
Language: Python - Size: 806 KB - Last synced at: 8 months ago - Pushed at: over 2 years ago - Stars: 26 - Forks: 4

zjukg/DUET
[Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning
Language: Python - Size: 7.63 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 35 - Forks: 8

RUCAIBox/ELMER
This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text Generation
Language: Python - Size: 6.24 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 24 - Forks: 3

jeffhj/VER
The official repo for "VER: Unifying Verbalizing Entities and Relations" (Findings of EMNLP '23)
Language: Python - Size: 3.07 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

jeffhj/LM_PersonalInfoLeak
The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)
Language: Python - Size: 2.6 MB - Last synced at: 11 months ago - Pushed at: over 2 years ago - Stars: 18 - Forks: 4

zzz47zzz/CET
[ACL2023] Preserving Commonsense Knowledge from Pre-trained Language Models via Causal Inference
Language: Python - Size: 443 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 1

yueyu1030/AttrPrompt
[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.
Language: Python - Size: 705 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 94 - Forks: 5

bigai-nlco/CDBert Fork of patrick-tssn/CDBert
[ACL2023-Findings] Shuo Wen Jie Zi is a new learning paradigm that enhances the semantics understanding ability of the Chinese PLMs with dictionary knowledge and structure of Chinese characters
Language: Python - Size: 5.06 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

yzhan238/PIEClass
The source code used for paper "PIEClass: Weakly-Supervised Text Classification with Prompting and Noise-Robust Iterative Ensemble Training", published in EMNLP 2023.
Language: Python - Size: 16.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

yumeng5/SuperGen
[NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding
Language: Python - Size: 47.4 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 54 - Forks: 10

youlandasu/Choice-Fusion
Choice Fusion as Knowledge for Zero-Shot Dialogue State Tracking (ICASSP 2023)
Language: Python - Size: 98.6 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

Strong-AI-Lab/Logical-Reasoning-Reading-Comprehension-ReClor
The source code for #5 in the Logical Reasoning Reading Comprehension Leaderboard `ReClor`.
Language: Python - Size: 11.9 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 1

alexandra-chron/hierarchical-domain-adaptation
Code of NAACL 2022 "Efficient Hierarchical Domain Adaptation for Pretrained Language Models" paper.
Language: Python - Size: 5.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 23 - Forks: 3

irenepisani/Key_Point_Analysis
Key Point Analysis: implementation of two-component system for performing Key Point Matching and Key Point Generation task with multiple PLMs.
Language: Jupyter Notebook - Size: 387 KB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 1

patrick-tssn/CDBert
[ACL2023] Shuo Wen Jie Zi is a new learning paradigm that enhances the semantics understanding ability of the Chinese PLMs with dictionary knowledge and structure of Chinese characters
Language: Python - Size: 5.05 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

juletx/gpt2-eus
Pretraining GPT2 model on Basque language
Language: Python - Size: 12.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

etetteh/bio-electra
BioMedical Language Processing with ELECTRA
Language: Python - Size: 2.64 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

cheneydon/hrkd
This repository contains the code for the paper in EMNLP 2021: "HRKD: Hierarchical Relational Knowledge Distillation for Cross-domain Language Model Compression".
Language: Python - Size: 37.1 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 2

cheneydon/efficient-bert
This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge Distillation".
Language: Python - Size: 120 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 31 - Forks: 4

WeSeewy/Chinese-Clickbait
[CSCWD'23] Detecting Clickbait in Chinese Social Media by Prompt Learning
Language: Python - Size: 238 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

DAMO-NLP-SG/PeerDA
Source code of "PeerDA: Data Augmentation via Modeling Peer Relation for Span Identification Tasks" (ACL23)
Language: Python - Size: 6.39 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 1

dmhyun/MSRP
Official repository of Generating Multiple-Length Summaries via Reinforcement Learning for Unsupervised Sentence Summarization [EMNLP'22 Findings]
Language: Python - Size: 74.1 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

jeffhj/S-TEST
The implementation for "Can Language Models Be Specific? How?"
Language: Python - Size: 540 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

AndyCheang/TempoSum
TempoSum: Evaluating the Temporal Generalization of Abstractive Summarization
Size: 6.84 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

umanlp/Multi2WOZ Fork of chiachienhung/Multi2WOZ
Multi2WOZ: A Robust Multilingual Dataset and Conversational Pretraining for Task-Oriented Dialog
Language: Python - Size: 135 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 5 - Forks: 0

chiachienhung/Multi2WOZ
Multi2WOZ: A Robust Multilingual Dataset and Conversational Pretraining for Task-Oriented Dialog
Language: Python - Size: 135 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 1
