GitHub topics: masked-language-models
psychbruce/FMAT
😷 The Fill-Mask Association Test (FMAT): Measuring Propositions in Natural Language.
Language: R - Size: 10.6 MB - Last synced at: 15 days ago - Pushed at: 23 days ago - Stars: 13 - Forks: 0

zalkikar/mlm-bias
Measuring Biases in Masked Language Models for PyTorch Transformers. Support for multiple social biases and evaluation measures.
Language: Python - Size: 45.9 KB - Last synced at: 10 days ago - Pushed at: 4 months ago - Stars: 4 - Forks: 1

haofanwang/awesome-vision-language-modeling
Recent Advances in Vision-Language Pre-training!
Size: 18.6 KB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 29 - Forks: 2

ayoolaolafenwa/TrainNLP
Sample tutorials for training Natural Language Processing Models with Transformers
Language: Python - Size: 37.4 MB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 21 - Forks: 4

willxxy/Text-EGM
[CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations
Language: Python - Size: 50.8 KB - Last synced at: 12 days ago - Pushed at: 8 months ago - Stars: 10 - Forks: 1

AndreaCossu/continual-pretraining-nlp-vision
Code to reproduce experiments from the paper "Continual Pre-Training Mitigates Forgetting in Language and Vision" https://arxiv.org/abs/2205.09357
Language: Jupyter Notebook - Size: 872 KB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 22 - Forks: 1

ShiZhengyan/PowerfulPromptFT
[NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Learner"
Language: Python - Size: 34.2 MB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 73 - Forks: 18

sergiusechel/LLM-Masked-Robber
This script utilizes Facebook AI's RoBERTa model for masked language modeling to predict the top-k most likely words for masked tokens in a given sentence. It generates variations of the input sentence by filling in the masked tokens with the predicted words.
Language: Python - Size: 1.68 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

briesearch/token-masks
Masked language model with Positional & One-Hot encoding - built using Aurora
Language: Java - Size: 18.6 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

AKKI0511/Masked-Language-Model
Powerful Python tool for visualizing and interacting with pre-trained Masked Language Models (MLMs) like BERT. Features include self-attention visualization, masked token prediction, model fine-tuning, embedding analysis with PCA/t-SNE, and SHAP-based model interpretability.
Language: Python - Size: 122 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

musty-ess/Masked-Language-Model-Using-BERT
This project implements a Masked Language Model using BERT, a transformer-based model developed by Google, to predict masked words in text sequences.
Language: Python - Size: 124 KB - Last synced at: 15 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

AndrewDarnall/The-Code-Unmasker
SpringBoot-based microserviced web app which unmasks, using CodeBERT MLM, a code prompt
Language: Jupyter Notebook - Size: 71.3 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

AstraBert/proteins-w-esm
Predict the whole sequence and 3D structure of masked protein sequences with ESM by @evolutionaryscale
Language: Python - Size: 23.4 KB - Last synced at: 4 days ago - Pushed at: 10 months ago - Stars: 4 - Forks: 0

chiral-carbon/inverse-prompt
Inverse prompting LLMs for interpretability
Language: Jupyter Notebook - Size: 150 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

luozhouyang/transformers-keras
Transformer-based models implemented in tensorflow 2.x(using keras).
Language: Python - Size: 696 KB - Last synced at: 20 days ago - Pushed at: over 3 years ago - Stars: 75 - Forks: 13

hriaz17/ELLEN
Code for "ELLEN: Extremely Lightly Supervised Learning For Efficient Named Entity Recognition" (LREC-COLING 2024)
Size: 884 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

chanapapan/Depression-Detection
Comparing Selective Masking Methods for Depression Detection in Social Media
Language: Python - Size: 7.69 MB - Last synced at: 12 months ago - Pushed at: almost 2 years ago - Stars: 9 - Forks: 1

saagar-parikh/ASR_LLM_Rescoring Fork of jdannem6/ASR_LLM_Rescoring
Rescoring Automatic Speech Recognition using Large Language Models
Language: Jupyter Notebook - Size: 25 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

insdout/BertAttentionViz
BERT Attention Visualization is a web application powered by Streamlit, offering intuitive visualization of attention weights generated by BERT-based models.
Language: Python - Size: 524 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

mc-cat-tty/PlaceRank
Final assigment for "Gestione dell'Informazione" ("Search Engines") course @ UniMoRe
Language: Jupyter Notebook - Size: 45.3 MB - Last synced at: 21 days ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 1

rstodden/stance-detection
Code for "Using Masked Language Model Probabilities of Connectives for Stance Detection in English Discourse"
Language: Jupyter Notebook - Size: 101 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

JayaswalVivek/Masked_Language_Model
Create a masked language model for a vernacular of your choice.
Language: Jupyter Notebook - Size: 47.9 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

SharathHebbar/Transformers
Transformers Intuition
Language: Jupyter Notebook - Size: 34.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

shreydan/masked-language-modeling
Transformers Pre-Training with MLM objective — implemented encoder-only model and trained from scratch on Wikipedia dataset.
Language: Jupyter Notebook - Size: 221 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

pedroandreou/Cypriot-LLM
Training the first Cypriot Large Language Model on the Masked Language Modeling objective for predicting a given masked word token within a given context
Language: Python - Size: 2.55 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

petermchale/nucleotide-transformer
Using an LLM to discover the genetic causes of rare disease
Language: Jupyter Notebook - Size: 378 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

GU-DataLab/PoliBERTweet
A transformer-based language model trained on politics-related Twitter data. This repo is the official resource of the paper "PoliBERTweet: A Pre-trained Language Model for Analyzing Political Content on Twitter", LREC 2022
Size: 52.7 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 0

yzhan238/CGExpan
The source code used for paper "Empower Entity Set Expansion via Language Model Probing", published in ACL 2020.
Language: Python - Size: 11.7 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 31 - Forks: 2

phueb/Zorro
Grammar test suite for masked language models
Language: Python - Size: 316 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 3

phueb/UnMasked
Score masked language models on grammar test suites
Language: Python - Size: 22.4 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

naivenlp/rapidnlp-datasets
Data pipelines for both TensorFlow and PyTorch!
Language: Python - Size: 117 KB - Last synced at: 16 days ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

RishabhMaheshwary/contextattack
A Context Aware Approach for Generating Natural Language Attacks.
Language: Python - Size: 4.47 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 0
