An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: masked-language-models

psychbruce/FMAT

😷 The Fill-Mask Association Test (FMAT): Measuring Propositions in Natural Language.

Language: R - Size: 10.6 MB - Last synced at: 15 days ago - Pushed at: 23 days ago - Stars: 13 - Forks: 0

zalkikar/mlm-bias

Measuring Biases in Masked Language Models for PyTorch Transformers. Support for multiple social biases and evaluation measures.

Language: Python - Size: 45.9 KB - Last synced at: 10 days ago - Pushed at: 4 months ago - Stars: 4 - Forks: 1

haofanwang/awesome-vision-language-modeling

Recent Advances in Vision-Language Pre-training!

Size: 18.6 KB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 29 - Forks: 2

ayoolaolafenwa/TrainNLP

Sample tutorials for training Natural Language Processing Models with Transformers

Language: Python - Size: 37.4 MB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 21 - Forks: 4

willxxy/Text-EGM

[CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations

Language: Python - Size: 50.8 KB - Last synced at: 12 days ago - Pushed at: 8 months ago - Stars: 10 - Forks: 1

AndreaCossu/continual-pretraining-nlp-vision

Code to reproduce experiments from the paper "Continual Pre-Training Mitigates Forgetting in Language and Vision" https://arxiv.org/abs/2205.09357

Language: Jupyter Notebook - Size: 872 KB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 22 - Forks: 1

ShiZhengyan/PowerfulPromptFT

[NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Learner"

Language: Python - Size: 34.2 MB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 73 - Forks: 18

sergiusechel/LLM-Masked-Robber

This script utilizes Facebook AI's RoBERTa model for masked language modeling to predict the top-k most likely words for masked tokens in a given sentence. It generates variations of the input sentence by filling in the masked tokens with the predicted words.

Language: Python - Size: 1.68 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

briesearch/token-masks

Masked language model with Positional & One-Hot encoding - built using Aurora

Language: Java - Size: 18.6 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

AKKI0511/Masked-Language-Model

Powerful Python tool for visualizing and interacting with pre-trained Masked Language Models (MLMs) like BERT. Features include self-attention visualization, masked token prediction, model fine-tuning, embedding analysis with PCA/t-SNE, and SHAP-based model interpretability.

Language: Python - Size: 122 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

musty-ess/Masked-Language-Model-Using-BERT

This project implements a Masked Language Model using BERT, a transformer-based model developed by Google, to predict masked words in text sequences.

Language: Python - Size: 124 KB - Last synced at: 15 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

AndrewDarnall/The-Code-Unmasker

SpringBoot-based microserviced web app which unmasks, using CodeBERT MLM, a code prompt

Language: Jupyter Notebook - Size: 71.3 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

AstraBert/proteins-w-esm

Predict the whole sequence and 3D structure of masked protein sequences with ESM by @evolutionaryscale

Language: Python - Size: 23.4 KB - Last synced at: 4 days ago - Pushed at: 10 months ago - Stars: 4 - Forks: 0

chiral-carbon/inverse-prompt

Inverse prompting LLMs for interpretability

Language: Jupyter Notebook - Size: 150 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

luozhouyang/transformers-keras

Transformer-based models implemented in tensorflow 2.x(using keras).

Language: Python - Size: 696 KB - Last synced at: 20 days ago - Pushed at: over 3 years ago - Stars: 75 - Forks: 13

hriaz17/ELLEN

Code for "ELLEN: Extremely Lightly Supervised Learning For Efficient Named Entity Recognition" (LREC-COLING 2024)

Size: 884 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

chanapapan/Depression-Detection

Comparing Selective Masking Methods for Depression Detection in Social Media

Language: Python - Size: 7.69 MB - Last synced at: 12 months ago - Pushed at: almost 2 years ago - Stars: 9 - Forks: 1

saagar-parikh/ASR_LLM_Rescoring Fork of jdannem6/ASR_LLM_Rescoring

Rescoring Automatic Speech Recognition using Large Language Models

Language: Jupyter Notebook - Size: 25 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

insdout/BertAttentionViz

BERT Attention Visualization is a web application powered by Streamlit, offering intuitive visualization of attention weights generated by BERT-based models.

Language: Python - Size: 524 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

mc-cat-tty/PlaceRank

Final assigment for "Gestione dell'Informazione" ("Search Engines") course @ UniMoRe

Language: Jupyter Notebook - Size: 45.3 MB - Last synced at: 21 days ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 1

rstodden/stance-detection

Code for "Using Masked Language Model Probabilities of Connectives for Stance Detection in English Discourse"

Language: Jupyter Notebook - Size: 101 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

JayaswalVivek/Masked_Language_Model

Create a masked language model for a vernacular of your choice.

Language: Jupyter Notebook - Size: 47.9 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

SharathHebbar/Transformers

Transformers Intuition

Language: Jupyter Notebook - Size: 34.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

shreydan/masked-language-modeling

Transformers Pre-Training with MLM objective — implemented encoder-only model and trained from scratch on Wikipedia dataset.

Language: Jupyter Notebook - Size: 221 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

pedroandreou/Cypriot-LLM

Training the first Cypriot Large Language Model on the Masked Language Modeling objective for predicting a given masked word token within a given context

Language: Python - Size: 2.55 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

petermchale/nucleotide-transformer

Using an LLM to discover the genetic causes of rare disease

Language: Jupyter Notebook - Size: 378 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

GU-DataLab/PoliBERTweet

A transformer-based language model trained on politics-related Twitter data. This repo is the official resource of the paper "PoliBERTweet: A Pre-trained Language Model for Analyzing Political Content on Twitter", LREC 2022

Size: 52.7 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 0

yzhan238/CGExpan

The source code used for paper "Empower Entity Set Expansion via Language Model Probing", published in ACL 2020.

Language: Python - Size: 11.7 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 31 - Forks: 2

phueb/Zorro

Grammar test suite for masked language models

Language: Python - Size: 316 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 3

phueb/UnMasked

Score masked language models on grammar test suites

Language: Python - Size: 22.4 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

naivenlp/rapidnlp-datasets

Data pipelines for both TensorFlow and PyTorch!

Language: Python - Size: 117 KB - Last synced at: 16 days ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

RishabhMaheshwary/contextattack

A Context Aware Approach for Generating Natural Language Attacks.

Language: Python - Size: 4.47 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 0

Related Keywords
masked-language-models 32 bert 10 nlp 9 transformers 8 language-model 8 huggingface 6 deep-learning 5 machine-learning 5 pytorch 4 natural-language-processing 4 tensorflow 3 huggingface-transformers 3 pretrained-models 3 ai 3 bert-model 3 large-language-models 3 embeddings 2 attention-is-all-you-need 2 causal-language-modeling 2 pretrained-language-model 2 python 2 stance-detection 2 pretraining 2 artificial-intelligence 2 simcse 2 masked-image-modeling 2 language-models 2 llm 2 transformer 2 text-classification 2 roberta-model 2 sequence-classification 2 question-answering 2 ner 2 adversarial-attacks 2 semi-supervised-learning 2 token-classification 2 masked-language-modeling 2 streamlit-webapp 1 streamlit 1 distilbert 1 airbnb 1 attention-mechanism 1 benchmarking 1 bert-embeddings 1 datasets 1 physiological-signals 1 gpt-35-turbo 1 gpt-4 1 gpt-4-turbo 1 heuristics 1 linguistics 1 named-entity-recognition 1 neuro-symbolic-ai 1 neuro-symbolic-learning 1 pos-tagging 1 self-training 1 depression-detection 1 finetuning 1 selective-masking 1 asr 1 gpt 1 rescore 1 speech-recognition 1 genomics 1 nlp-machine-learning 1 elections 1 lrec 1 lrec2022 1 political 1 politics 1 twitter 1 word-prediction 1 entity-set-expansion 1 set-expansion 1 benchmark 1 grammatical 1 test-sentences 1 grammar-learning 1 test-suite 1 dataset-loader 1 keras 1 information-retrieval 1 insideairbnb-data 1 ncurses 1 ranking-algorithm 1 search-engine 1 urwid 1 whoosh 1 autoencoder 1 huggingface-transformer 1 decoder-model 1 encoder-decoder-model 1 encoder-model 1 semantic-similarity 1 sequence-to-sequence 1 tokenization 1 cyprus 1 roberta 1 foundation-models 1