pretrained-language-model | Topic

Topic: "pretrained-language-model"

wenge-research/YAYI2 📦

YAYI 2 是中科闻歌研发的新一代开源大语言模型，采用了超过 2 万亿 Tokens 的高质量、多语言语料进行预训练。(Repo for YaYi 2 Chinese LLMs)

Language: Python - Size: 1.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 3,624 - Forks: 19

microsoft/torchscale

Foundation Architecture for (M)LLMs

Language: Python - Size: 361 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 3,084 - Forks: 220

Separius/awesome-sentence-embedding 📦

A curated list of pretrained sentence and word embedding models

Language: Python - Size: 282 KB - Last synced at: 9 days ago - Pushed at: about 4 years ago - Stars: 2,258 - Forks: 262

THUDM/P-tuning-v2

An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

Language: Python - Size: 1.41 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 2,037 - Forks: 203

thunlp/OpenDelta

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

Language: Python - Size: 42 MB - Last synced at: 24 days ago - Pushed at: 9 months ago - Stars: 1,028 - Forks: 83

xcfcode/Summarization-Papers 📦

Summarization Papers

Language: TeX - Size: 40.1 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 1,010 - Forks: 144

AndrewZhe/lawyer-llama

中文法律LLaMA (LLaMA for Chinese legel domain)

Language: Python - Size: 6.85 MB - Last synced at: 10 days ago - Pushed at: 10 months ago - Stars: 946 - Forks: 130

word2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, information extraction (i.e., entity, relation and event extraction), knowledge graph, text generation, network embedding

Language: OpenEdge ABL - Size: 384 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 544 - Forks: 151

allenai/dont-stop-pretraining

Code associated with the Don't Stop Pretraining ACL 2020 paper

Language: Python - Size: 554 KB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 529 - Forks: 73

OpenBMB/CPM-Live

Live Training for Open-source Big Models

Language: Python - Size: 1.11 MB - Last synced at: 14 days ago - Pushed at: about 2 years ago - Stars: 506 - Forks: 39

RenzeLou/awesome-instruction-learning

Papers and Datasets on Instruction Tuning and Following. ✨✨✨

Language: Python - Size: 6.25 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 493 - Forks: 24

Hzfinfdu/Diffusion-BERT

ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models

Language: Python - Size: 1.69 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 309 - Forks: 25

LYH-YF/MWPToolkit

MWPToolkit is an open-source framework for math word problem(MWP) solvers.

Language: Python - Size: 59.8 MB - Last synced at: 23 days ago - Pushed at: over 2 years ago - Stars: 163 - Forks: 37

hyintell/awesome-refreshing-llms

EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining.

Size: 2.31 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 133 - Forks: 10

ZhengZixiang/ATPapers

Worth-reading papers and related resources on attention mechanism, Transformer and pretrained language model (PLM) such as BERT. 值得一读的注意力机制、Transformer和预训练语言模型论文与相关资源集合

Size: 648 KB - Last synced at: 4 days ago - Pushed at: about 4 years ago - Stars: 133 - Forks: 13

zzz47zzz/awesome-lifelong-learning-methods-for-llm

[ACM Computing Surveys 2025] This repository collects awesome survey, resource, and paper for Lifelong Learning with Large Language Models. (Updated Regularly)

Size: 428 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 129 - Forks: 6

microsoft/COCO-LM

[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining

Language: Python - Size: 4.1 MB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 117 - Forks: 13

DC-research/TEMPO

The official code for "TEMPO: Prompt-based Generative Pre-trained Transformer for Time Series Forecasting (ICLR 2024)". TEMPO is one of the very first open source Time Series Foundation Models for forecasting task v1.0 version.

Language: Python - Size: 1.82 MB - Last synced at: 22 days ago - Pushed at: 4 months ago - Stars: 109 - Forks: 15

SuperBruceJia/Awesome-LLM-Self-Consistency

Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models

Size: 149 KB - Last synced at: 11 days ago - Pushed at: 11 months ago - Stars: 99 - Forks: 8

thunlp/Prompt-Transferability

On Transferability of Prompt Tuning for Natural Language Processing

Language: Python - Size: 629 MB - Last synced at: about 17 hours ago - Pushed at: about 1 year ago - Stars: 99 - Forks: 11

git-disl/BERT4ETH

BERT4ETH: A Pre-trained Transformer for Ethereum Fraud Detection (WWW23)

Language: Python - Size: 6.88 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 95 - Forks: 15

yueyu1030/AttrPrompt

[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.

Language: Python - Size: 705 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 94 - Forks: 5

SJTU-IPADS/Bamboo

Bamboo-7B Large Language Model

Size: 223 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 92 - Forks: 1

RUCAIBox/UniCRS Fork of wxl1999/UniCRS

[KDD22] Official PyTorch implementation for "Towards Unified Conversational Recommender Systems via Knowledge-Enhanced Prompt Learning".

Language: Python - Size: 13 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 83 - Forks: 14

GanjinZero/CODER

CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]

Language: Python - Size: 5.63 MB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 78 - Forks: 5

EagleW/Scientific-Inspiration-Machines-Optimized-for-Novelty

Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty

Language: Python - Size: 340 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 71 - Forks: 11

yumeng5/TopClus

[WWW 2022] Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations

Language: Python - Size: 108 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 65 - Forks: 4

azminewasi/Awesome-LLMs-ICLR-24

It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) in 2024.

Size: 821 KB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 62 - Forks: 3

FranxYao/PoincareProbe

Implementation of ICLR 21 paper: Probing BERT in Hyperbolic Spaces

Language: Jupyter Notebook - Size: 5.94 MB - Last synced at: about 2 months ago - Pushed at: about 4 years ago - Stars: 58 - Forks: 5

yumeng5/SuperGen

[NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding

Language: Python - Size: 47.4 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 54 - Forks: 10

qianlima-lab/awesome-lifelong-learning-methods-for-llm Fork of zzz47zzz/awesome-lifelong-learning-methods-for-llm

This repository collects awesome survey, resource, and paper for Lifelong Learning for Large Language Models. (Updated Regularly)

Size: 428 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 52 - Forks: 1

GanjinZero/BioBART

BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model [ACL-BioNLP 2022]

Language: Python - Size: 117 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 52 - Forks: 4

heraclex12/NLP2SPARQL

Translate Natural Language Processing to SPARQL Query and vice versa

Language: Python - Size: 223 KB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 51 - Forks: 12

OpenMatch/COCO-DR

[EMNLP 2022] This is the code repo for our EMNLP‘22 paper "COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning".

Language: Python - Size: 2.2 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 45 - Forks: 4

SKplanet/Dialog-KoELECTRA

ELECTRA기반 한국어 대화체 언어모델

Language: Python - Size: 42 MB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 44 - Forks: 6

ChangwenXu98/TransPolymer

Implementation of "TransPolymer: a Transformer-based language model for polymer property predictions" in PyTorch

Language: Python - Size: 1.65 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 43 - Forks: 12

cumc-dbmi/cehrbert

CEHR-BERT: Incorporating temporal information from structured EHR data to improve prediction tasks

Language: Python - Size: 17.5 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 35 - Forks: 11

zjukg/DUET

[Paper][AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning

Language: Python - Size: 7.63 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 35 - Forks: 8

thunlp/CokeBERT

CokeBERT: Contextual Knowledge Selection and Embedding towards Enhanced Pre-Trained Language Models

Language: Python - Size: 101 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 31 - Forks: 9

cheneydon/efficient-bert

This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge Distillation".

Language: Python - Size: 120 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 31 - Forks: 4

yzhan238/CGExpan

The source code used for paper "Empower Entity Set Expansion via Language Model Probing", published in ACL 2020.

Language: Python - Size: 11.7 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 31 - Forks: 2

zzz47zzz/codebase-for-incremental-learning-with-llm

[ACL2024] A Codebase for Incremental Learning with Large Language Models; Official released code for "Learn or Recall? Revisiting Incremental Learning with Pre-trained Language Models (ACL 2024)", "Incremental Sequence Labeling: A Tale of Two Shifts (ACL 2024 Findings)", and "Concept-1K: A Novel Benchmark for Instance Incremental Learning (arxiv)"

Language: Python - Size: 2.88 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 30 - Forks: 7

theblackcat102/unify-learning-paradigms

data collator for UL2 and U-PaLM

Language: Python - Size: 158 KB - Last synced at: 4 months ago - Pushed at: almost 2 years ago - Stars: 29 - Forks: 1

EngineeringSoftware/CoditT5

CoditT5: Pretraining for Source Code and Natural Language Editing

Language: Python - Size: 91.9 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 28 - Forks: 3

wxl1999/UniCRS

[KDD22] Official PyTorch implementation for "Towards Unified Conversational Recommender Systems via Knowledge-Enhanced Prompt Learning".

Language: Python - Size: 13 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 27 - Forks: 19

lgalke/text-clf-baselines

WideMLP for Text Classification

Language: Python - Size: 78.1 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 27 - Forks: 3

yingyuankai/AiSpace

AiSpace: Better practices for deep learning model development and deployment For Tensorflow 2.0

Language: Python - Size: 806 KB - Last synced at: 8 months ago - Pushed at: over 2 years ago - Stars: 26 - Forks: 4

txsun1997/Metric-Fairness

EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation

Language: Jupyter Notebook - Size: 23.5 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 26 - Forks: 2

microsoft/AMOS

[ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators

Language: Python - Size: 3.93 MB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 24 - Forks: 2

RUCAIBox/ELMER

This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text Generation

Language: Python - Size: 6.24 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 24 - Forks: 3

alexandra-chron/hierarchical-domain-adaptation

Code of NAACL 2022 "Efficient Hierarchical Domain Adaptation for Pretrained Language Models" paper.

Language: Python - Size: 5.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 23 - Forks: 3

megagonlabs/cocosum

:coconut: Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)

Language: Python - Size: 864 KB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 21 - Forks: 2

EagleW/Stage-wise-Fine-tuning

Code for Stage-wise Fine-tuning for Graph-to-Text Generation

Language: Lex - Size: 211 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 21 - Forks: 5

BH-So/unsupervised-paraphrase-generation

"Unsupervised Paraphrase Generation using Pre-trained Language Model."

Language: Python - Size: 40 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 21 - Forks: 9

Clarifai/examples

Examples for Clarifai Python SDK and Integrations. Give the repo a star ⭐

Language: Jupyter Notebook - Size: 168 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 20 - Forks: 3

jeffhj/LM_PersonalInfoLeak

The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)

Language: Python - Size: 2.6 MB - Last synced at: 11 months ago - Pushed at: over 2 years ago - Stars: 18 - Forks: 4

arianhosseini/negation-learning

code for our paper "Understanding by Understanding Not: Modeling Negation in Language Models"

Language: Python - Size: 33 MB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 16 - Forks: 2

imSanko/Image_Caption_Generator_With_Transformers 📦

This repository contains code for generating captions for images using a Transformer-based model. The model used is the `VisionEncoderDecoderModel` from the Hugging Face Transformers library, specifically the `nlpconnect/vit-gpt2-image-captioning` model.

Language: Jupyter Notebook - Size: 233 KB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 12 - Forks: 1

HySonLab/Protein_Pretrain

Multimodal Pretraining for Unsupervised Protein Representation Learning

Language: Python - Size: 241 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 12 - Forks: 2

ImKeTT/ReSee

[EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue PyTorch Implementation

Language: Python - Size: 464 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 0

RUCAIBox/CFCRS Fork of wxl1999/CFCRS

[KDD23] Official PyTorch implementation for "Improving Conversational Recommendation Systems via Counterfactual Data Simulation".

Language: Python - Size: 10.5 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 11 - Forks: 0

juyongjiang/Awesome-ANCE

Implementation of paper "Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval"

Language: Python - Size: 997 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 11 - Forks: 0

EagleW/Multimedia-Generative-Script-Learning

Official implementation of the ACL Findings 2023 paper: Multimedia Generative Script Learning for Task Planning

Language: Python - Size: 45.9 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 8 - Forks: 0

zzz47zzz/CET

[ACL2023] Preserving Commonsense Knowledge from Pre-trained Language Models via Causal Inference

Language: Python - Size: 443 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 1

wxl1999/CFCRS

[KDD23] Official PyTorch implementation for "Improving Conversational Recommendation Systems via Counterfactual Data Simulation".

Language: Python - Size: 10.5 MB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 8 - Forks: 3

jeffhj/VER

The official repo for "VER: Unifying Verbalizing Entities and Relations" (Findings of EMNLP '23)

Language: Python - Size: 3.07 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

xiaoyuxin1002/UQ-PLM

Uncertainty Quantification with Pre-trained Language Models: An Empirical Analysis

Language: Python - Size: 12.7 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 1

cliang1453/CAMERO

CAMERO: Consistency Regularized Ensemble of Perturbed Language Models with Weight Sharing (ACL 2022)

Language: Python - Size: 230 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 6 - Forks: 0

xiangyue9607/C-MORE

Code for the ACL2022 paper "C-MORE: Pretraining to Answer Open-Domain Questions by Consulting Millions of References"

Size: 9.77 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 0

danjohnvelasco/Filipino-ULMFiT

Pre-trained AWD-LSTM language model trained on Filipino text corpus using fastai v2. Instructions included.

Language: Jupyter Notebook - Size: 40 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 3

eric11eca/reckoning-metakg

RECKONING is a bi-level learning algorithm that improves language models' reasoning ability by folding contextual knowledge into parametric knowledge through back-propagation.

Language: Python - Size: 21.4 MB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 5 - Forks: 1

umanlp/Multi2WOZ Fork of chiachienhung/Multi2WOZ

Multi2WOZ: A Robust Multilingual Dataset and Conversational Pretraining for Task-Oriented Dialog

Language: Python - Size: 135 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 5 - Forks: 0

KimDaeUng/PLM-Implementation

NLP Pretrained Language Models Implementation Study

Language: Jupyter Notebook - Size: 181 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 5 - Forks: 2

Kitsunp/Small-lenguaje-Model-Hybrid-Norm-Furier-Formers

A compact language model implementing HybridNorm and Fourier-based attention. Combines CoLA (low-rank projections), FANformer, and hybrid normalization to create an efficient decoder-only transformer. Leverages periodicity modeling and gated residuals to enhance performance while maintaining a small parameter footprint.

Language: Python - Size: 4.6 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 4 - Forks: 0

zerohd4869/CIFM

The official repository for ACL 2024 paper "Representation Learning with Conditional Information Flow Maximization"

Language: Python - Size: 5.42 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 4 - Forks: 1

yzhan238/PIEClass

The source code used for paper "PIEClass: Weakly-Supervised Text Classification with Prompting and Noise-Robust Iterative Ensemble Training", published in EMNLP 2023.

Language: Python - Size: 16.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

shreydan/masked-language-modeling

Transformers Pre-Training with MLM objective — implemented encoder-only model and trained from scratch on Wikipedia dataset.

Language: Jupyter Notebook - Size: 221 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 2

DAMO-NLP-SG/PeerDA

Source code of "PeerDA: Data Augmentation via Modeling Peer Relation for Span Identification Tasks" (ACL23)

Language: Python - Size: 6.39 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 1

cheneydon/hrkd

This repository contains the code for the paper in EMNLP 2021: "HRKD: Hierarchical Relational Knowledge Distillation for Cross-domain Language Model Compression".

Language: Python - Size: 37.1 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 2

dmhyun/MSRP

Official repository of Generating Multiple-Length Summaries via Reinforcement Learning for Unsupervised Sentence Summarization [EMNLP'22 Findings]

Language: Python - Size: 74.1 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

martin-wey/ast-probe

Code and data of the paper https://arxiv.org/abs/2206.11719 (ASE 22')

Language: Python - Size: 1000 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 3

Strong-AI-Lab/Logical-Reasoning-Reading-Comprehension-ReClor

The source code for #5 in the Logical Reasoning Reading Comprehension Leaderboard `ReClor`.

Language: Python - Size: 11.9 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 1

ZhaohanM/FusionGDA

we propose a novel FusionGDA model, which utilises a pre-training phase with a fusion module to enrich the gene and disease semantic representations encoded by pre-trained language models.

Language: Jupyter Notebook - Size: 1.61 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 2

Evfidiw/LMs_NLU

Exploring different language models on text classification tasks.

Language: Python - Size: 21.1 MB - Last synced at: 10 days ago - Pushed at: 5 months ago - Stars: 3 - Forks: 0

XCollab/HuggingFace

This repository provides an overview of Hugging Face's Transformers library, a powerful tool for natural language processing (NLP) and machine learning tasks.

Language: Jupyter Notebook - Size: 1.57 MB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 3 - Forks: 1

SreeEswaran/Train-your-LLM

This repository contains code and resources for training, fine-tuning, and deploying large language models using Hugging Face's Transformers library.

Language: Python - Size: 33.2 KB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 3 - Forks: 2

patrick-tssn/CDBert

[ACL2023] Shuo Wen Jie Zi is a new learning paradigm that enhances the semantics understanding ability of the Chinese PLMs with dictionary knowledge and structure of Chinese characters

Language: Python - Size: 5.05 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

DEROOCE/Awesome-Pretrained-Language-Model

Worth-reading papers and related resources on pretrained-language models(PLMs). On the Shoulder of Giants!

Size: 12.7 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

irenepisani/Key_Point_Analysis

Key Point Analysis: implementation of two-component system for performing Key Point Matching and Key Point Generation task with multiple PLMs.

Language: Jupyter Notebook - Size: 387 KB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 1

umanlp/DS-TOD Fork of chiachienhung/DS-TOD

DS-TOD: Efficient Domain Specialization for Task Oriented Dialog

Language: Python - Size: 312 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0