An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: protein-sequence

Dy1365/smiles2dta-demo

A Streamlit app for predicting drug-target binding affinity using a trained CNN model. Input SMILES strings and protein sequences for fast and accurate predictions.

Size: 1.95 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

hasanulmukit/smiles2dta-demo

A Streamlit app for predicting drug-target binding affinity using a trained CNN model. Input SMILES strings and protein sequences for fast and accurate predictions.

Language: Jupyter Notebook - Size: 18.1 MB - Last synced at: 4 days ago - Pushed at: 16 days ago - Stars: 3 - Forks: 0

instadeepai/protein-sequence-bfn

Supporting code for our paper "Protein Sequence Modelling with Bayesian Flow Networks"

Language: Python - Size: 1.02 MB - Last synced at: 15 days ago - Pushed at: about 2 months ago - Stars: 32 - Forks: 2

chao1224/ProteinDT

A Text-guided Protein Design Framework, Nat Mach Intell 2025 (https://www.nature.com/articles/s42256-025-01011-z)

Language: Python - Size: 19.4 MB - Last synced at: 19 days ago - Pushed at: 3 months ago - Stars: 66 - Forks: 6

johnnytam100/awesome-protein-design

A curated list of awesome protein design research, software and resources.

Size: 29.3 KB - Last synced at: 6 days ago - Pushed at: almost 3 years ago - Stars: 14 - Forks: 1

tbepler/protein-sequence-embedding-iclr2019

Source code for "Learning protein sequence embeddings using information from structure" - ICLR 2019

Language: Python - Size: 50.8 KB - Last synced at: 11 days ago - Pushed at: almost 4 years ago - Stars: 260 - Forks: 75

VarunUllanat/mint

Learning the language of protein-protein interactions

Language: Python - Size: 3.59 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 56 - Forks: 4

ostrokach/proteinsolver

Graph neural network for generating novel amino acid sequences that fold into proteins with predetermined topologies.

Language: Jupyter Notebook - Size: 271 MB - Last synced at: 14 days ago - Pushed at: about 4 years ago - Stars: 59 - Forks: 8

aqlaboratory/proteinnet

Standardized data set for machine learning of protein structure

Language: Python - Size: 223 KB - Last synced at: 16 days ago - Pushed at: over 4 years ago - Stars: 886 - Forks: 130

ISDementyev/pmUE

pmUE (Protein Modelling Unreal Engine) - a repo for constructing a molecule visualizer plugin in Unreal

Language: C++ - Size: 37.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 3

aziele/pairwise-sequence-alignment

A Python module to calculate alignment between two sequences using EMBOSS' needle, stretcher, and water

Language: Python - Size: 44.9 KB - Last synced at: 25 days ago - Pushed at: about 2 months ago - Stars: 12 - Forks: 3

MartinThoma/propy3

A Python 3 version of the protein descriptor package propy

Language: Python - Size: 960 KB - Last synced at: 6 days ago - Pushed at: about 2 years ago - Stars: 41 - Forks: 13

aziele/fastapy

A lightweight Python module to read and write FASTA sequence records

Language: Python - Size: 58.6 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 6 - Forks: 2

ISYSLAB-HUST/ProtFlash

ProtFlash: A lightweight protein language model

Language: Python - Size: 21.5 KB - Last synced at: 16 days ago - Pushed at: about 1 year ago - Stars: 104 - Forks: 3

wudangt/awesome-molecular-modeling-and-drug-discovery

A curated list of awesome Molecular Modeling And Drug Discovery 🔥

Size: 51.8 KB - Last synced at: 5 days ago - Pushed at: almost 3 years ago - Stars: 11 - Forks: 0

dohlee/pifold-pytorch

An unofficial re-implementation of PiFold, a fast inverse-folding algorithm for protein sequence design, in PyTorch.

Language: Jupyter Notebook - Size: 152 KB - Last synced at: 15 days ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 1

cusbg/MolArt

MOLeculAR structure annoTator

Language: JavaScript - Size: 58.5 MB - Last synced at: 7 days ago - Pushed at: 7 months ago - Stars: 34 - Forks: 8

westlake-repl/SaProt

[ICLR'24 spotlight] Saprot: Protein Language Model with Structural Alphabet

Language: Python - Size: 2.45 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 332 - Forks: 32

PeptoneLtd/pepkalc

Robust simulation software for the comprehensive evaluation of protein electrostatics in unfolded state.

Language: Python - Size: 24.4 KB - Last synced at: about 5 hours ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

michalbukowski/pfam-genomes

Snakemake pipeline for searching genomic sequences for those that encode proteins containing domains of choice

Language: Python - Size: 216 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

codecreatede/panache-extract

maf to panache and extracting all snps and pangenome specific information.

Language: Ruby - Size: 8.79 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

PNNL-Comp-Mass-Spec/protein-coverage-summarizer

Computes the percent of the residues in each protein sequence that have been identified, based on a list of identified peptides. A graphical user interface (GUI) is provided to allow the user to select the input files, set the options, and browse the coverage results.

Language: C# - Size: 35.1 MB - Last synced at: 3 days ago - Pushed at: 9 months ago - Stars: 6 - Forks: 3

RG-10/RG-10.github.io

🌱🌟 My Personal Portfolio 🌟🌱

Language: JavaScript - Size: 8.54 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

jaumebonet/RosettaSilentToolbox

Python Toolbox For Rosetta Silent Files Processing

Language: Python - Size: 389 MB - Last synced at: 14 days ago - Pushed at: over 3 years ago - Stars: 10 - Forks: 9

psipred/DMPfold

De novo protein structure prediction using iteratively predicted structural constraints

Language: C - Size: 195 MB - Last synced at: 5 months ago - Pushed at: about 3 years ago - Stars: 55 - Forks: 16

Shen-Lab/Fold2Seq-icml2021 Fork of IBM/fold2seq

[ICML 2021] "Fold2Seq: A Joint Sequence(1D)-Fold(3D) Embedding-based Generative Model for Protein Design" by Yue Cao, Payel Das, Vijil Chenthamarakshan, Pin-Yu Chen, Igor Melnyk, Yang Shen

Language: Python - Size: 4.85 MB - Last synced at: 12 months ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 1

IbrahimTanyalcin/I-PV

Interactive Protein Sequence VIsualization/Viewer - Interactive Circos

Language: HTML - Size: 66.6 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 10 - Forks: 1

kris96tian/translate_dna_app

Code for DNA translation to proteins through a web interface. It uses Flask for the web aspects and defines the logic to translate codons based on genetic code rules. Users can interactively enter DNA-seq on a web form and see the protein output.

Language: Python - Size: 3.91 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

sinc-lab/Comparison-of-Protein-learning

Comparison of protein learning

Language: Jupyter Notebook - Size: 17.7 MB - Last synced at: 9 months ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 1

jendelel/PrankWebApp

Web application for protein-ligand binding sites analysis and visualization

Language: JavaScript - Size: 13.8 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 12 - Forks: 6

LPDI-EPFL/rstoolbox Fork of jaumebonet/RosettaSilentToolbox

Python Toolbox For Rosetta Silent Files Processing

Language: Python - Size: 389 MB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 14 - Forks: 4

TommyGiak/HP_model

Implementation of the HP protein folding model with commands line interface

Language: Python - Size: 10.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

haichengyi/bioseq2vec

BioSeq2vec: learning deep representation of biological sequences using LSTM Encoder-Decoder

Language: Python - Size: 80.3 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 4

bhi-kimlab/DeepFam

Deep learning based alignment-free method for protein family modeling and prediction

Language: Python - Size: 3.04 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 16 - Forks: 8

kalarimonk/Probhujina

Visualizing protein structures with their sequences!!!

Language: C++ - Size: 48.8 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

dissipative/ribosome

The Ribosome package is a Go library designed for efficient transcription and translation of DNA and RNA sequences, inspired by the real processes in living cells.

Language: Go - Size: 71.3 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Shen-Lab/gcWGAN

Guided Conditional Wasserstein GAN for De Novo Protein Design

Language: Roff - Size: 334 MB - Last synced at: 12 months ago - Pushed at: about 4 years ago - Stars: 36 - Forks: 7

kn-bibs/dotplot

Simple visualisation tool for sequences' similarity in bioinformatics

Language: Python - Size: 210 KB - Last synced at: 10 days ago - Pushed at: about 7 years ago - Stars: 13 - Forks: 2

lukaszsobala/spike-annotation 📦

Annotation of SARS-CoV-2 Spike glycoprotein

Size: 88.9 KB - Last synced at: 2 days ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

0xpranjal/COVID-Genome-Computational-Analysis

Computational predictions of protein attributes associated with COVID-19 using Data Science techniques

Language: Jupyter Notebook - Size: 1.96 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 5

kaledhoshme123/Using-Deep-Learning-to-Annotate-the-Protein-Universe

Understanding the relationship between amino acid sequence and protein function is a long-standing problem in molecular biology with far-reaching scientific implications.

Language: Jupyter Notebook - Size: 595 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

zmmason/BINF

Computational programs and algorithms used to convert information from biochemical experiments (DNA/RNA/Protein/DNA chip/NGS) into useful information and data.

Language: Python - Size: 86.9 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 5 - Forks: 1

DSIMB/MEDUSA

A Deep Learning based protein flexibility prediction tool.

Language: Perl - Size: 121 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 2

johnnytam100/FPredX

FPredX

Language: Python - Size: 36.1 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 1

changlabtw/MS2CNN

Language: Python - Size: 96.2 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 3

mtp-usz/IsoAligner Fork of JacobHanimann/IsoAligner

IsoAligner: dynamic mapping of amino acid positions across protein isoforms

Language: Python - Size: 215 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

liponan/structure-generator

A machine learning model that builds amino acids into a protein model.

Language: Python - Size: 50.8 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 2

neelotpal-d/DNA_Protein_Translation

Program for Translation from DNA sequence to protein sequence

Language: C - Size: 752 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

YoannPa/Threading-PU

Alignement structurale d'une séquence protéique sur une structure protéique 3D.

Language: Python - Size: 1.36 MB - Last synced at: almost 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

ratvec/results

Results from application of RatVec to protein sequences

Language: Jupyter Notebook - Size: 161 KB - Last synced at: 12 months ago - Pushed at: about 6 years ago - Stars: 2 - Forks: 0

acgtun/hsearch

HSEARCH: fast and accurate protein sequence motif search and clustering

Language: C++ - Size: 3.66 MB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 1 - Forks: 0

acgtun/s3

S3: Sequence Similarity Search (Protein Sequence)

Language: C++ - Size: 120 KB - Last synced at: about 2 years ago - Pushed at: almost 9 years ago - Stars: 0 - Forks: 0

Related Keywords
protein-sequence 52 protein-structure 19 protein 15 bioinformatics 14 deep-learning 8 protein-design 8 machine-learning 6 proteins 4 structural-biology 4 pdb 4 protein-structure-prediction 4 computational-biology 3 protein-sequences 3 dna 3 sequence-alignment 3 alphafold2 3 visualization 3 drug-design 3 drug-discovery 3 representation-learning 3 science 2 data-visualization 2 molecular-modeling 2 data-analysis 2 protein-function-prediction 2 rna-sequence 2 uniprot 2 sars-cov-2 2 convolutional-neural-networks 2 amino-acid-sequence 2 mass-spectrometry 2 fasta 2 prediction 2 protein-language-model 2 protein-domains 2 dna-sequences 2 fasta-sequences 2 rna 2 genome-analysis 2 python3 2 python 2 protein-engineering 2 protein-folding 2 alphafold 2 streamlit 2 smiles 2 drug-target-interactions 2 drug-target-affinity 2 cnn 2 protein-modeling 2 protein-representation-learning 2 lstm 2 generative-model 2 sequence 2 protein-homology 1 generative-adversarial-networks 1 sequence-analysis 1 dotplot 1 gene-similarity 1 visualisation 1 ligand 1 pdb-files 1 covid-19 1 covid19 1 demogorgon 1 protein-sequence-analysis 1 19 1 complete-genome-analysis 1 ncrna-protein-interactions 1 mirna 1 lncrna 1 encoder-decoder 1 circrnas 1 sequence-to-sequence 1 alignment-free 1 biological-sequences 1 computer-graphics 1 graphics 1 pyhton 1 opengl 1 hp-model 1 annealing 1 webapp 1 gc-content 1 web-application 1 ncbi 1 web 1 protein-ligand-interactions 1 alternative-splicing 1 aminoacid-position 1 ensemble 1 exon-mapping 1 protein-ids 1 protein-isoform 1 refseq 1 rest-api 1 splice-variants 1 ucsc 1 webtool 1 gcn 1