An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: protein-language-model

IBM/AutoPeptideML

AutoML system for building trustworthy peptide bioactivity predictors

Language: Python - Size: 48.9 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 29 - Forks: 2

tyang816/SES-Adapter

🎈 Structure-aware adapter fine-tuning PLMs, with high training speed and impressive performance (Journal of Chemical Information and Modeling 2024).

Language: Python - Size: 107 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 21 - Forks: 1

Cassie818/MsaPhylo

Learning the language of phylogeny with MSA Transformer

Language: Jupyter Notebook - Size: 40 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 2 - Forks: 0

westlake-repl/SaprotHub

Making Protein Modeling Accessible to All Biologists

Language: Jupyter Notebook - Size: 91.4 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 113 - Forks: 14

pengzhangzhi/faplm

FAPLM: A Drop-in Efficient Pytorch Implementation of Protein Language Models

Language: Python - Size: 2.49 MB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 118 - Forks: 11

LLNL/protlib-designer

Integer Linear Programming for Protein Library Design

Language: Python - Size: 1.93 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 6 - Forks: 0

HUBioDataLab/ProtHGT

A Heterogeneous Graph Transformer (HGT)-based model for protein function prediction using biological knowledge graphs and protein language models

Language: Python - Size: 11.5 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 10 - Forks: 0

westlake-repl/SaProt

Saprot: Protein Language Model with Structural Alphabet (AA+3Di)

Language: Python - Size: 2.45 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 445 - Forks: 43

OpenProteinAI/openprotein-python

Simple python interface for the OpenProtein.AI REST API.

Language: Python - Size: 14.5 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 9 - Forks: 0

KCLabMTU/LMSuccSite

Improving Protein Succinylation Sites Prediction Using Features Extracted from Protein Language Model

Language: Jupyter Notebook - Size: 105 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 1 - Forks: 3

biocentral/biotrainer Fork of sacdallago/biotrainer

Development branch of sacdallago/biotrainer

Language: Python - Size: 8.78 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

westlake-repl/ProTrek

ProTrek: Navigating the Protein Universe through Tri-Modal Contrastive Learning

Language: Python - Size: 1.19 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 101 - Forks: 8

NREL/EvoProtGrad

Directed evolution of proteins in sequence space with gradients

Language: Jupyter Notebook - Size: 9.5 MB - Last synced at: 19 days ago - Pushed at: 10 months ago - Stars: 85 - Forks: 12

bioinfodlsu/PHIStruct

Published in Bioinformatics. Phage-host interaction prediction tool that incorporates protein structure information in representing receptor-binding proteins (RBPs). It improves performance especially for phages with RBPs that have low sequence similarity to those of known phages

Language: Jupyter Notebook - Size: 15.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 0

pentalpha/protein_dimension_db

Datasets with embeddings and other representations for all proteins in Uniprot/Swiss-Prot

Language: Python - Size: 33.8 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 0

ISYSLAB-HUST/ProtFlash

ProtFlash: A lightweight protein language model

Language: Python - Size: 21.5 KB - Last synced at: 13 days ago - Pushed at: about 1 year ago - Stars: 103 - Forks: 3

zaixizhang/PocketGen

PocketGen (Nature Machine Intelligence 24): Generating Full-Atom Ligand-Binding Protein Pockets

Language: Python - Size: 15.8 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 176 - Forks: 18

THUDM/ProteinLM

Protein Language Model

Language: Python - Size: 694 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 117 - Forks: 21

HySonLab/LatentDE

LatentDE: Latent-based Directed Evolution for Protein Sequence Design

Language: Python - Size: 3.04 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 26 - Forks: 0

chaohaoyuan/PAAG

Source code for Annotation-guided Protein Design with Multi-Level Domain Alignment. (KDD 2025)

Language: Python - Size: 7.74 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 4 - Forks: 1

OpenProteinAI/PoET

Inference code for PoET: A generative model of protein families as sequences-of-sequences

Language: Python - Size: 772 KB - Last synced at: 19 days ago - Pushed at: about 1 year ago - Stars: 71 - Forks: 3

mims-harvard/SPECTRA

SPECTRA: Spectral framework for evaluation of biomedical AI models

Language: Jupyter Notebook - Size: 70.1 MB - Last synced at: 11 days ago - Pushed at: 2 months ago - Stars: 38 - Forks: 7

THUDM/MSAGPT

MSAGPT

Language: Python - Size: 1.85 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 30 - Forks: 2

yutanagano/sceptr

Highly informative TCR representation model

Language: Python - Size: 14.7 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 10 - Forks: 2

PaccMann/paccmann_proteomics

PaccMann models for protein language modeling

Language: Python - Size: 3.54 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 42 - Forks: 8

HySonLab/LANTERN

LANTERN: Leveraging Large Language Models And Transformer For Enhanced Molecular Interaction

Language: Python - Size: 492 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

dohlee/proteinbert-pytorch

Implementation of ProteinBERT, a deep language model designed for proteins.

Language: Python - Size: 87.9 KB - Last synced at: 7 days ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

dohlee/deepab-pytorch

An unofficial re-implementation of DeepAb, an interpretable deep learning model for antibody structure prediction.

Language: Python - Size: 15.8 MB - Last synced at: 22 days ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

KYQiu21/plmrepeat

Exploiting sequence representations derived from protein language models for repeat identification

Language: Python - Size: 16.8 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 3 - Forks: 0

dohlee/antiberty-pytorch

An unofficial re-implementation of AntiBERTy, an antibody-specific protein language model, in PyTorch.

Language: Jupyter Notebook - Size: 228 KB - Last synced at: 28 days ago - Pushed at: about 1 year ago - Stars: 24 - Forks: 5

Bhattacharya-Lab/EquiPNAS

pLM-informed E(3) equivariant deep graph neural networks for protein-nucleic acid binding site prediction

Language: Python - Size: 9.05 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 21 - Forks: 1

oxpig/CaLM

Protein language model trained on coding DNA

Language: Python - Size: 67.7 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 38 - Forks: 10

LucasDedieu/MHC1-Binding

Build a classifier that predicts if a given peptide will be presented by a specific MHC1 protein.

Language: Jupyter Notebook - Size: 766 KB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

ml4bio/Dense-Homolog-Retrieval

Nature Biotechnology: Ultra-fast, sensitive detection of protein remote homologs using deep dense retrieval

Language: Python - Size: 26.1 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 40 - Forks: 1

bioinfodlsu/phage-host-prediction

Published in PLOS ONE. Phage-host interaction prediction tool that uses protein language models to represent the receptor-binding proteins of phages. It presents improvements over using handcrafted sequence properties and eliminates the need to manually extract and select features from phage sequences

Language: Jupyter Notebook - Size: 10.9 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 10 - Forks: 2

ProteinEngineering-PESB2/protein-interactions-models

Exploring protein language models and machine learning algorithms for protein interactions predictive models

Language: Python - Size: 60.2 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

KCLabMTU/LMCrot

Protein Language Model (pLM) Powered Protein Crotonylation (Kcr) Modified Site Predictor

Language: Python - Size: 22.5 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 2

elliothershberg/interactive-mutation-browser

In silico protein surgery using ESM.

Language: TypeScript - Size: 309 KB - Last synced at: 13 days ago - Pushed at: about 1 year ago - Stars: 18 - Forks: 1

dohlee/abyssal-pytorch

Implementation of Abyssal, a deep neural network trained with a new "mega" dataset to predict the impact of an amino acid variant on protein stability.

Language: Jupyter Notebook - Size: 154 KB - Last synced at: 21 days ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 2

ShubhrangshuGhosh2000/mat_p2ip_prj

Contains code for the paper "MaTPIP: a deep-learning architecture with eXplainable AI for sequence-driven, feature mixed protein-protein interaction prediction"

Language: Python - Size: 184 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

titobabatunde/SPOT-1D-LM Fork of jas-preet/SPOT-1D-LM

SPOT-1D-LM-Extended

Language: Jupyter Notebook - Size: 17.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

KCLabMTU/pLMSNOSite

An ensemble-based approach for prediction of protein S-nitrosylation sites integrating supervised word embedding and embedding from protein language model

Language: Python - Size: 75.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 3

jhrcook/protein-language-models

Experimenting with protein language model predictions

Language: Python - Size: 3.91 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

biovino1/SearchEmb

Searching proteins against a database of embeddings

Language: Python - Size: 309 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

dzjxzyd/pLM4ACE

Language: Jupyter Notebook - Size: 7.18 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

elttaes/Revisiting-PLMs

Exploring Evolution-aware & free protein language models as protein function predictors

Language: Python - Size: 6.55 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 41 - Forks: 9

vam-sin/CATHe

Deep Learning tool trained on protein sequence embeddings from protein language models to accurately detect remote homologues for CATH superfamilies

Language: Python - Size: 63.5 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 7 - Forks: 1

salvatoreloguercio/cov2vec

cov2vec is a systematic effort to obtain SARS CoV-2 genome embeddings by encoding viral genomes with protein language models.

Language: Python - Size: 15.6 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

Related Keywords
protein-language-model 48 bioinformatics 10 protein 10 protein-sequences 8 deep-learning 8 machine-learning 8 protein-structure 6 transformer 6 proteins 4 transfer-learning 4 representation-learning 3 protein-embedding 3 alphafold2 3 contrastive-learning 3 pretrained-models 3 pytorch 3 biology 3 protein-protein-interaction 3 computational-biology 3 alphafold 3 ptm-prediction 2 gene-ontology 2 python 2 post-translational-modification 2 plm 2 embeddings 2 protein-sequence 2 proteomics 2 phage-host-prediction 2 genomics 2 directed-evolution 2 homology-search 2 interpretable-machine-learning 2 peptide-prediction 2 msa-construction 2 protein-annotation 2 large-language-models 2 transformers 2 msa-transformer 2 antibody-sequences 2 foundation-models 2 protein-design 2 protein-function-prediction 2 protein-structure-prediction 2 protein-engineering 2 protein-ligand-interactions 2 geometric-deep-learning 2 mhc-binding-prediction 1 dense-retrieval 1 generative-ai 1 molecule-descriptors 1 bert 1 bert-model 1 proteinbert 1 transformers-bert 1 antibody 1 antibody-structure-prediction 1 antibody-sequence 1 graph-neural-netowrks 1 protein-dna-interactions 1 protein-rna-interactions 1 codons 1 dna 1 language-models 1 llms 1 llm 1 bioactive-pepeptide 1 bioactivity 1 peptide 1 peptide-features 1 benchmark 1 contact-prediction 1 esm1b 1 fitness-prediction 1 function-prediction 1 metal-ion-binding 1 multiple-sequence-alignment 1 protein-classification 1 remote-homology-detection 1 stability 1 zero-shot-prediction 1 neural-networks 1 covid-19 1 manifold-learning 1 crotonylation 1 natural-language-processing 1 protein-stability 1 reproduction 1 reproduction-code 1 deep-learning-architecture 1 explainable-ai 1 protein-protein-interaction-prediction 1 protein-sequence-features 1 11-785 1 end-to-end-sequence 1 plmsnosite 1 prot-t5 1 ptm 1 s-nitrosylation 1 data-analysis 1