An open API service providing repository metadata for many open source software ecosystems.

Topic: "pretrained-language-model"

wenge-research/YAYI2 📦

YAYI 2 是中科闻歌研发的新一代开源大语言模型,采用了超过 2 万亿 Tokens 的高质量、多语言语料进行预训练。(Repo for YaYi 2 Chinese LLMs)

Language: Python - Size: 1.3 MB - Last synced at: 1 day ago - Pushed at: about 1 year ago - Stars: 3,624 - Forks: 19

microsoft/torchscale

Foundation Architecture for (M)LLMs

Language: Python - Size: 361 KB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 3,074 - Forks: 217

Separius/awesome-sentence-embedding 📦

A curated list of pretrained sentence and word embedding models

Language: Python - Size: 282 KB - Last synced at: 4 days ago - Pushed at: about 4 years ago - Stars: 2,257 - Forks: 262

THUDM/P-tuning-v2

An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

Language: Python - Size: 1.41 MB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 1,974 - Forks: 201

thunlp/OpenDelta

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

Language: Python - Size: 42 MB - Last synced at: 2 days ago - Pushed at: 8 months ago - Stars: 1,027 - Forks: 83

xcfcode/Summarization-Papers 📦

Summarization Papers

Language: TeX - Size: 40.1 MB - Last synced at: 6 days ago - Pushed at: almost 2 years ago - Stars: 1,010 - Forks: 144

AndrewZhe/lawyer-llama

中文法律LLaMA (LLaMA for Chinese legel domain)

Language: Python - Size: 6.85 MB - Last synced at: about 12 hours ago - Pushed at: 9 months ago - Stars: 938 - Forks: 128

gaoisbest/NLP-Projects

word2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, information extraction (i.e., entity, relation and event extraction), knowledge graph, text generation, network embedding

Language: OpenEdge ABL - Size: 384 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 544 - Forks: 151

allenai/dont-stop-pretraining

Code associated with the Don't Stop Pretraining ACL 2020 paper

Language: Python - Size: 554 KB - Last synced at: 12 days ago - Pushed at: over 3 years ago - Stars: 529 - Forks: 73

OpenBMB/CPM-Live

Live Training for Open-source Big Models

Language: Python - Size: 1.11 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 511 - Forks: 40

RenzeLou/awesome-instruction-learning

Papers and Datasets on Instruction Tuning and Following. ✨✨✨

Language: Python - Size: 6.25 MB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 493 - Forks: 24

Hzfinfdu/Diffusion-BERT

ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models

Language: Python - Size: 1.69 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 255 - Forks: 15

LYH-YF/MWPToolkit

MWPToolkit is an open-source framework for math word problem(MWP) solvers.

Language: Python - Size: 59.8 MB - Last synced at: 25 days ago - Pushed at: over 2 years ago - Stars: 163 - Forks: 37

hyintell/awesome-refreshing-llms

EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining.

Size: 2.31 MB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 133 - Forks: 10

ZhengZixiang/ATPapers

Worth-reading papers and related resources on attention mechanism, Transformer and pretrained language model (PLM) such as BERT. 值得一读的注意力机制、Transformer和预训练语言模型论文与相关资源集合

Size: 648 KB - Last synced at: 6 days ago - Pushed at: about 4 years ago - Stars: 133 - Forks: 13

microsoft/COCO-LM

[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining

Language: Python - Size: 4.1 MB - Last synced at: 4 days ago - Pushed at: almost 2 years ago - Stars: 118 - Forks: 13

DC-research/TEMPO

The official code for "TEMPO: Prompt-based Generative Pre-trained Transformer for Time Series Forecasting (ICLR 2024)". TEMPO is one of the very first open source Time Series Foundation Models for forecasting task v1.0 version.

Language: Python - Size: 1.82 MB - Last synced at: 15 days ago - Pushed at: 3 months ago - Stars: 108 - Forks: 14

thunlp/Prompt-Transferability

On Transferability of Prompt Tuning for Natural Language Processing

Language: Python - Size: 629 MB - Last synced at: 16 days ago - Pushed at: about 1 year ago - Stars: 99 - Forks: 11

SuperBruceJia/Awesome-LLM-Self-Consistency

Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models

Size: 149 KB - Last synced at: 13 days ago - Pushed at: 9 months ago - Stars: 96 - Forks: 7

git-disl/BERT4ETH

BERT4ETH: A Pre-trained Transformer for Ethereum Fraud Detection (WWW23)

Language: Python - Size: 6.88 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 95 - Forks: 15

yueyu1030/AttrPrompt

[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.

Language: Python - Size: 705 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 94 - Forks: 5

SJTU-IPADS/Bamboo

Bamboo-7B Large Language Model

Size: 223 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 92 - Forks: 1

zzz47zzz/awesome-lifelong-learning-methods-for-llm

This repository collects awesome survey, resource, and paper for Lifelong Learning with Large Language Models. (Updated Regularly)

Size: 286 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 84 - Forks: 4

RUCAIBox/UniCRS Fork of wxl1999/UniCRS

[KDD22] Official PyTorch implementation for "Towards Unified Conversational Recommender Systems via Knowledge-Enhanced Prompt Learning".

Language: Python - Size: 13 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 83 - Forks: 14

GanjinZero/CODER

CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]

Language: Python - Size: 5.63 MB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 78 - Forks: 5

EagleW/Scientific-Inspiration-Machines-Optimized-for-Novelty

Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty

Language: Python - Size: 340 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 71 - Forks: 11

yumeng5/TopClus

[WWW 2022] Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations

Language: Python - Size: 108 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 65 - Forks: 4

azminewasi/Awesome-LLMs-ICLR-24

It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) in 2024.

Size: 821 KB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 61 - Forks: 3

FranxYao/PoincareProbe

Implementation of ICLR 21 paper: Probing BERT in Hyperbolic Spaces

Language: Jupyter Notebook - Size: 5.94 MB - Last synced at: 11 days ago - Pushed at: about 4 years ago - Stars: 58 - Forks: 5

yumeng5/SuperGen

[NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding

Language: Python - Size: 47.4 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 54 - Forks: 10

GanjinZero/BioBART

BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model [ACL-BioNLP 2022]

Language: Python - Size: 117 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 52 - Forks: 4

heraclex12/NLP2SPARQL

Translate Natural Language Processing to SPARQL Query and vice versa

Language: Python - Size: 223 KB - Last synced at: 22 days ago - Pushed at: almost 2 years ago - Stars: 51 - Forks: 12

OpenMatch/COCO-DR

[EMNLP 2022] This is the code repo for our EMNLP‘22 paper "COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning".

Language: Python - Size: 2.2 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 45 - Forks: 4

SKplanet/Dialog-KoELECTRA

ELECTRA기반 한국어 대화체 언어모델

Language: Python - Size: 42 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 44 - Forks: 6

ChangwenXu98/TransPolymer

Implementation of "TransPolymer: a Transformer-based language model for polymer property predictions" in PyTorch

Language: Python - Size: 1.65 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 43 - Forks: 12

qianlima-lab/awesome-lifelong-learning-methods-for-llm Fork of zzz47zzz/awesome-lifelong-learning-methods-for-llm

This repository collects awesome survey, resource, and paper for Lifelong Learning for Large Language Models. (Updated Regularly)

Size: 268 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 37 - Forks: 1

cumc-dbmi/cehrbert

CEHR-BERT: Incorporating temporal information from structured EHR data to improve prediction tasks

Language: Python - Size: 17.5 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 35 - Forks: 11

zjukg/DUET

[Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning

Language: Python - Size: 7.63 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 35 - Forks: 8

thunlp/CokeBERT

CokeBERT: Contextual Knowledge Selection and Embedding towards Enhanced Pre-Trained Language Models

Language: Python - Size: 101 MB - Last synced at: 16 days ago - Pushed at: almost 2 years ago - Stars: 31 - Forks: 9

cheneydon/efficient-bert

This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge Distillation".

Language: Python - Size: 120 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 31 - Forks: 4

yzhan238/CGExpan

The source code used for paper "Empower Entity Set Expansion via Language Model Probing", published in ACL 2020.

Language: Python - Size: 11.7 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 31 - Forks: 2

zzz47zzz/codebase-for-incremental-learning-with-llm

[ACL2024] A Codebase for Incremental Learning with Large Language Models; Official released code for "Learn or Recall? Revisiting Incremental Learning with Pre-trained Language Models (ACL 2024)", "Incremental Sequence Labeling: A Tale of Two Shifts (ACL 2024 Findings)", and "Concept-1K: A Novel Benchmark for Instance Incremental Learning (arxiv)"

Language: Python - Size: 2.88 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 30 - Forks: 7

theblackcat102/unify-learning-paradigms

data collator for UL2 and U-PaLM

Language: Python - Size: 158 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 29 - Forks: 1

EngineeringSoftware/CoditT5

CoditT5: Pretraining for Source Code and Natural Language Editing

Language: Python - Size: 91.9 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 28 - Forks: 3

wxl1999/UniCRS

[KDD22] Official PyTorch implementation for "Towards Unified Conversational Recommender Systems via Knowledge-Enhanced Prompt Learning".

Language: Python - Size: 13 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 27 - Forks: 19

lgalke/text-clf-baselines

WideMLP for Text Classification

Language: Python - Size: 78.1 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 27 - Forks: 3

yingyuankai/AiSpace

AiSpace: Better practices for deep learning model development and deployment For Tensorflow 2.0

Language: Python - Size: 806 KB - Last synced at: 7 months ago - Pushed at: about 2 years ago - Stars: 26 - Forks: 4

txsun1997/Metric-Fairness

EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation

Language: Jupyter Notebook - Size: 23.5 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 26 - Forks: 2

microsoft/AMOS

[ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators

Language: Python - Size: 3.93 MB - Last synced at: 4 days ago - Pushed at: almost 2 years ago - Stars: 24 - Forks: 2

RUCAIBox/ELMER

This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text Generation

Language: Python - Size: 6.24 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 24 - Forks: 3

alexandra-chron/hierarchical-domain-adaptation

Code of NAACL 2022 "Efficient Hierarchical Domain Adaptation for Pretrained Language Models" paper.

Language: Python - Size: 5.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 23 - Forks: 3

megagonlabs/cocosum

:coconut: Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)

Language: Python - Size: 864 KB - Last synced at: 6 days ago - Pushed at: 2 months ago - Stars: 21 - Forks: 2

EagleW/Stage-wise-Fine-tuning

Code for Stage-wise Fine-tuning for Graph-to-Text Generation

Language: Lex - Size: 211 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 21 - Forks: 5

BH-So/unsupervised-paraphrase-generation

"Unsupervised Paraphrase Generation using Pre-trained Language Model."

Language: Python - Size: 40 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 21 - Forks: 9

Clarifai/examples

Examples for Clarifai Python SDK and Integrations. Give the repo a star ⭐

Language: Jupyter Notebook - Size: 169 MB - Last synced at: 28 days ago - Pushed at: about 1 month ago - Stars: 20 - Forks: 3

jeffhj/LM_PersonalInfoLeak

The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)

Language: Python - Size: 2.6 MB - Last synced at: 9 months ago - Pushed at: over 2 years ago - Stars: 18 - Forks: 4

arianhosseini/negation-learning

code for our paper "Understanding by Understanding Not: Modeling Negation in Language Models"

Language: Python - Size: 33 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 16 - Forks: 2

imSanko/Image_Caption_Generator_With_Transformers 📦

This repository contains code for generating captions for images using a Transformer-based model. The model used is the `VisionEncoderDecoderModel` from the Hugging Face Transformers library, specifically the `nlpconnect/vit-gpt2-image-captioning` model.

Language: Jupyter Notebook - Size: 233 KB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 12 - Forks: 1

HySonLab/Protein_Pretrain

Multimodal Pretraining for Unsupervised Protein Representation Learning

Language: Python - Size: 241 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 12 - Forks: 2

ImKeTT/ReSee

[EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue PyTorch Implementation

Language: Python - Size: 464 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 0

RUCAIBox/CFCRS Fork of wxl1999/CFCRS

[KDD23] Official PyTorch implementation for "Improving Conversational Recommendation Systems via Counterfactual Data Simulation".

Language: Python - Size: 10.5 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 11 - Forks: 0

juyongjiang/Awesome-ANCE

Implementation of paper "Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval"

Language: Python - Size: 997 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 11 - Forks: 0

EagleW/Multimedia-Generative-Script-Learning

Official implementation of the ACL Findings 2023 paper: Multimedia Generative Script Learning for Task Planning

Language: Python - Size: 45.9 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 8 - Forks: 0

zzz47zzz/CET

[ACL2023] Preserving Commonsense Knowledge from Pre-trained Language Models via Causal Inference

Language: Python - Size: 443 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 1

wxl1999/CFCRS

[KDD23] Official PyTorch implementation for "Improving Conversational Recommendation Systems via Counterfactual Data Simulation".

Language: Python - Size: 10.5 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 8 - Forks: 3

jeffhj/VER

The official repo for "VER: Unifying Verbalizing Entities and Relations" (Findings of EMNLP '23)

Language: Python - Size: 3.07 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

xiaoyuxin1002/UQ-PLM

Uncertainty Quantification with Pre-trained Language Models: An Empirical Analysis

Language: Python - Size: 12.7 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 1

cliang1453/CAMERO

CAMERO: Consistency Regularized Ensemble of Perturbed Language Models with Weight Sharing (ACL 2022)

Language: Python - Size: 230 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 6 - Forks: 0

xiangyue9607/C-MORE

Code for the ACL2022 paper "C-MORE: Pretraining to Answer Open-Domain Questions by Consulting Millions of References"

Size: 9.77 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 6 - Forks: 0

danjohnvelasco/Filipino-ULMFiT

Pre-trained AWD-LSTM language model trained on Filipino text corpus using fastai v2. Instructions included.

Language: Jupyter Notebook - Size: 40 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 3

eric11eca/reckoning-metakg

RECKONING is a bi-level learning algorithm that improves language models' reasoning ability by folding contextual knowledge into parametric knowledge through back-propagation.

Language: Python - Size: 21.4 MB - Last synced at: 20 days ago - Pushed at: 5 months ago - Stars: 5 - Forks: 1

umanlp/Multi2WOZ Fork of chiachienhung/Multi2WOZ

Multi2WOZ: A Robust Multilingual Dataset and Conversational Pretraining for Task-Oriented Dialog

Language: Python - Size: 135 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 5 - Forks: 0

KimDaeUng/PLM-Implementation

NLP Pretrained Language Models Implementation Study

Language: Jupyter Notebook - Size: 181 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 5 - Forks: 2

zerohd4869/CIFM

The official repository for ACL 2024 paper "Representation Learning with Conditional Information Flow Maximization"

Language: Python - Size: 5.42 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 4 - Forks: 1

yzhan238/PIEClass

The source code used for paper "PIEClass: Weakly-Supervised Text Classification with Prompting and Noise-Robust Iterative Ensemble Training", published in EMNLP 2023.

Language: Python - Size: 16.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

DAMO-NLP-SG/PeerDA

Source code of "PeerDA: Data Augmentation via Modeling Peer Relation for Span Identification Tasks" (ACL23)

Language: Python - Size: 6.39 MB - Last synced at: 8 days ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 1

cheneydon/hrkd

This repository contains the code for the paper in EMNLP 2021: "HRKD: Hierarchical Relational Knowledge Distillation for Cross-domain Language Model Compression".

Language: Python - Size: 37.1 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 2

dmhyun/MSRP

Official repository of Generating Multiple-Length Summaries via Reinforcement Learning for Unsupervised Sentence Summarization [EMNLP'22 Findings]

Language: Python - Size: 74.1 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 0

martin-wey/ast-probe

Code and data of the paper https://arxiv.org/abs/2206.11719 (ASE 22')

Language: Python - Size: 1000 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 3

Strong-AI-Lab/Logical-Reasoning-Reading-Comprehension-ReClor

The source code for #5 in the Logical Reasoning Reading Comprehension Leaderboard `ReClor`.

Language: Python - Size: 11.9 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 1

ZhaohanM/FusionGDA

we propose a novel FusionGDA model, which utilises a pre-training phase with a fusion module to enrich the gene and disease semantic representations encoded by pre-trained language models.

Language: Jupyter Notebook - Size: 1.61 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 3 - Forks: 2

Evfidiw/LMs_NLU

Exploring different language models on text classification tasks.

Language: Python - Size: 21.1 MB - Last synced at: about 13 hours ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

XCollab/HuggingFace

This repository provides an overview of Hugging Face's Transformers library, a powerful tool for natural language processing (NLP) and machine learning tasks.

Language: Jupyter Notebook - Size: 1.57 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 3 - Forks: 1

SreeEswaran/Train-your-LLM

This repository contains code and resources for training, fine-tuning, and deploying large language models using Hugging Face's Transformers library.

Language: Python - Size: 33.2 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 3 - Forks: 2

patrick-tssn/CDBert

[ACL2023] Shuo Wen Jie Zi is a new learning paradigm that enhances the semantics understanding ability of the Chinese PLMs with dictionary knowledge and structure of Chinese characters

Language: Python - Size: 5.05 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

DEROOCE/Awesome-Pretrained-Language-Model

Worth-reading papers and related resources on pretrained-language models(PLMs). On the Shoulder of Giants!

Size: 12.7 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

irenepisani/Key_Point_Analysis

Key Point Analysis: implementation of two-component system for performing Key Point Matching and Key Point Generation task with multiple PLMs.

Language: Jupyter Notebook - Size: 387 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

umanlp/DS-TOD Fork of chiachienhung/DS-TOD

DS-TOD: Efficient Domain Specialization for Task Oriented Dialog

Language: Python - Size: 312 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

phanxuanphucnd/CoBERTa

CoBERTa is a pre-trained models are the pre-trained language models for Comment/ Social Vietnamese datasets.

Language: Python - Size: 1.15 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

janmejaybhoi/Sequential-Sentence-Classification-in-Medical-Abstracts

Implemented a Research paper "PubMed 200k RCT: a Dataset for Sequential Sentence Classification in Medical Abstracts. Use Hybrid Embedding (char + token + positional).

Language: Jupyter Notebook - Size: 739 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 1

aws-samples/aws-lex-retrieval-extraction-lm-pt

Examples for pre-training retrieval-extraction based language model

Language: Python - Size: 33.2 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 2

apsinghAnalytics/FinRAGify_App

An LLM app leveraging RAG with LangChain and GPT-4 mini to analyze earnings call transcripts, assess company performance, using natural language queries (NLP), FAISS (vector database), and Hugging Face re-ranking models.

Language: Jupyter Notebook - Size: 4.85 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

AndyCheang/TempoSum

TempoSum: Evaluating the Temporal Generalization of Abstractive Summarization

Size: 6.84 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

chiachienhung/Multi2WOZ

Multi2WOZ: A Robust Multilingual Dataset and Conversational Pretraining for Task-Oriented Dialog

Language: Python - Size: 135 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 1

iwan-rg/Arabic-Humor

The Arabic humor dataset was collected using Twint and Sketch Engine and it consists of 10k tweets.

Size: 729 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

jeffhj/S-TEST

The implementation for "Can Language Models Be Specific? How?"

Language: Python - Size: 540 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

JINHXu/PCL-Detection-SemEval2022-task4

Code repository for the paper Xu at SemEval2022 Task 4: pre-BERT Neural Network Methods vs post-BERT RoBERTa Approach for Patronizing and Condescending Language Detection.

Language: Jupyter Notebook - Size: 173 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

CAI991108/Machine-Learning-and-Language-Model

This project explores GPT-2 and Llama models through pre-training, fine-tuning, and Chain-of-Thought (CoT) prompting. It includes memory-efficient optimizations (SGD, LoRA, BAdam) and evaluations on math datasets (GSM8K, NumGLUE, StimulEq, SVAMP).

Language: Python - Size: 54.5 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

vgaraujov/Seq2Seq-Spanish-PLMs

Sequence-to-Sequence Spanish Pre-trained Language Models

Language: Python - Size: 115 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

bigai-nlco/CDBert Fork of patrick-tssn/CDBert

[ACL2023-Findings] Shuo Wen Jie Zi is a new learning paradigm that enhances the semantics understanding ability of the Chinese PLMs with dictionary knowledge and structure of Chinese characters

Language: Python - Size: 5.06 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Related Topics
natural-language-processing 29 nlp 24 bert 18 pretrained-models 17 deep-learning 16 language-model 14 large-language-models 13 transformers 12 pytorch 12 text-generation 11 llm 11 dialogue-systems 10 transformer 10 dialog 9 natural-language-understanding 9 text-classification 8 machine-learning 8 pretraining 7 llms 5 contrastive-learning 5 prompt 5 fine-tuning 5 parameter-efficient-learning 5 python 5 natural-language-generation 5 transfer-learning 5 artificial-intelligence 5 gpt 5 dialogue 4 conversational-bots 4 knowledge-distillation 4 large-language-model 4 conversational-ai 4 conversation 4 datasets 4 prompt-tuning 4 recommendation 4 information-retrieval 4 recommender-system 4 retrieval-augmented-generation 4 llama 4 continual-learning 4 bert-model 4 representation-learning 3 zero-shot-learning 3 data-augmentation 3 task-oriented-dialogue 3 domain-adaptation 3 tensorflow 3 question-answering 3 incremental-learning 3 zero-shot 3 recommendation-system 3 prompts 3 awesome-list 3 knowledge-graph 2 awesome 2 semantics 2 llm-inference 2 gpt4 2 temporal-data 2 non-autoregressive-translation 2 masked-language-models 2 lifelong-learning 2 ai 2 machine-reading-comprehension 2 language-adaptation 2 multilingual 2 xlmroberta 2 huggingface-transformers 2 sentiment-analysis 2 chinese 2 lstm 2 tensorflow2 2 embeddings 2 domain-specific-models 2 named-entity-recognition 2 llm-training 2 bert-embeddings 2 survey 2 roberta 2 transformers-models 2 gpt2 2 computer-vision 2 acl2023 2 unsupervised-learning 2 prompt-learning 2 model-compression 2 chatgpt 2 clustering 2 data-augmentation-strategies 2 conversational-recommender-system 2 math-word-problem 2 conversational-recommendation 2 pretrained-embedding 2 ehr-data 1 sequence-labeling 1 sentence2vec 1 information-extraction 1 network-embedding 1