GitHub topics: masked-language-models

Repositories

psychbruce/FMAT

😷 The Fill-Mask Association Test (FMAT): Measuring Propositions in Natural Language.

Language: R - Size: 10.6 MB - Last synced at: 15 days ago - Pushed at: 23 days ago - Stars: 13 - Forks: 0

zalkikar/mlm-bias

Measuring Biases in Masked Language Models for PyTorch Transformers. Support for multiple social biases and evaluation measures.

Language: Python - Size: 45.9 KB - Last synced at: 10 days ago - Pushed at: 4 months ago - Stars: 4 - Forks: 1

haofanwang/awesome-vision-language-modeling

Recent Advances in Vision-Language Pre-training!

Size: 18.6 KB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 29 - Forks: 2

ayoolaolafenwa/TrainNLP

Sample tutorials for training Natural Language Processing Models with Transformers

Language: Python - Size: 37.4 MB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 21 - Forks: 4

willxxy/Text-EGM

[CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations

Language: Python - Size: 50.8 KB - Last synced at: 12 days ago - Pushed at: 8 months ago - Stars: 10 - Forks: 1

AndreaCossu/continual-pretraining-nlp-vision

Code to reproduce experiments from the paper "Continual Pre-Training Mitigates Forgetting in Language and Vision" https://arxiv.org/abs/2205.09357

Language: Jupyter Notebook - Size: 872 KB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 22 - Forks: 1

ShiZhengyan/PowerfulPromptFT

[NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Learner"

Language: Python - Size: 34.2 MB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 73 - Forks: 18

sergiusechel/LLM-Masked-Robber

This script utilizes Facebook AI's RoBERTa model for masked language modeling to predict the top-k most likely words for masked tokens in a given sentence. It generates variations of the input sentence by filling in the masked tokens with the predicted words.

Language: Python - Size: 1.68 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

briesearch/token-masks

Masked language model with Positional & One-Hot encoding - built using Aurora

Language: Java - Size: 18.6 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

AKKI0511/Masked-Language-Model

Powerful Python tool for visualizing and interacting with pre-trained Masked Language Models (MLMs) like BERT. Features include self-attention visualization, masked token prediction, model fine-tuning, embedding analysis with PCA/t-SNE, and SHAP-based model interpretability.

Language: Python - Size: 122 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

musty-ess/Masked-Language-Model-Using-BERT

This project implements a Masked Language Model using BERT, a transformer-based model developed by Google, to predict masked words in text sequences.

Language: Python - Size: 124 KB - Last synced at: 15 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

AndrewDarnall/The-Code-Unmasker

Spring Boot-based microservice web app that unmasks a code prompt using the CodeBERT MLM

Language: Jupyter Notebook - Size: 71.3 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

AstraBert/proteins-w-esm

Predict the whole sequence and 3D structure of masked protein sequences with ESM by @evolutionaryscale

Language: Python - Size: 23.4 KB - Last synced at: 4 days ago - Pushed at: 10 months ago - Stars: 4 - Forks: 0

chiral-carbon/inverse-prompt

Inverse prompting LLMs for interpretability

Language: Jupyter Notebook - Size: 150 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

luozhouyang/transformers-keras

Transformer-based models implemented in TensorFlow 2.x (using Keras).

Language: Python - Size: 696 KB - Last synced at: 20 days ago - Pushed at: over 3 years ago - Stars: 75 - Forks: 13

hriaz17/ELLEN

Code for "ELLEN: Extremely Lightly Supervised Learning For Efficient Named Entity Recognition" (LREC-COLING 2024)

Size: 884 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

chanapapan/Depression-Detection

Comparing Selective Masking Methods for Depression Detection in Social Media

Language: Python - Size: 7.69 MB - Last synced at: 12 months ago - Pushed at: almost 2 years ago - Stars: 9 - Forks: 1

saagar-parikh/ASR_LLM_Rescoring (fork of jdannem6/ASR_LLM_Rescoring)

Rescoring Automatic Speech Recognition using Large Language Models

Language: Jupyter Notebook - Size: 25 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

insdout/BertAttentionViz

BERT Attention Visualization is a web application powered by Streamlit, offering intuitive visualization of attention weights generated by BERT-based models.

Language: Python - Size: 524 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

mc-cat-tty/PlaceRank

Final assignment for "Gestione dell'Informazione" ("Search Engines") course @ UniMoRe

Language: Jupyter Notebook - Size: 45.3 MB - Last synced at: 21 days ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 1

rstodden/stance-detection

Code for "Using Masked Language Model Probabilities of Connectives for Stance Detection in English Discourse"

Language: Jupyter Notebook - Size: 101 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

JayaswalVivek/Masked_Language_Model

Create a masked language model for a vernacular of your choice.

Language: Jupyter Notebook - Size: 47.9 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

SharathHebbar/Transformers

Transformers Intuition

Language: Jupyter Notebook - Size: 34.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

shreydan/masked-language-modeling

Transformers Pre-Training with MLM objective — implemented encoder-only model and trained from scratch on Wikipedia dataset.

Language: Jupyter Notebook - Size: 221 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

pedroandreou/Cypriot-LLM

Training the first Cypriot Large Language Model with the masked language modeling objective: predicting a masked word token from its surrounding context

Language: Python - Size: 2.55 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

petermchale/nucleotide-transformer

Using an LLM to discover the genetic causes of rare disease

Language: Jupyter Notebook - Size: 378 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

GU-DataLab/PoliBERTweet

A transformer-based language model trained on politics-related Twitter data. This repo is the official resource of the paper "PoliBERTweet: A Pre-trained Language Model for Analyzing Political Content on Twitter", LREC 2022

Size: 52.7 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 0