GitHub topics: sequence-embeddings
sacdallago/bio_embeddings
Get protein embeddings from protein sequences
Language: HTML - Size: 68.3 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 493 - Forks: 71

michaelscutari/protclust
protclust is a Python library for protein sequence analysis that integrates MMseqs2 for fast clustering and provides tools for creating robust machine learning datasets. It offers cluster-aware data splitting to prevent sequence similarity bias in model evaluation, along with comprehensive protein embedding capabilities for feature generation.
Language: Python - Size: 354 KB - Last synced at: 2 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0
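A minimal sketch of the cluster-aware splitting idea mentioned in the protclust description, written in plain Python. It does not use protclust's actual API. The function name `cluster_aware_split` and its parameters are hypothetical, and the cluster labels are assumed to come from an external clustering step such as MMseqs2.

```python
import random
from collections import defaultdict


def cluster_aware_split(cluster_ids, test_fraction=0.2, seed=0):
    """Split item indices so that no cluster appears in both train and test.

    cluster_ids: one cluster label per sequence (assumed precomputed).
    Hypothetical helper for illustration; not part of protclust.
    """
    # Group sequence indices by their cluster label.
    clusters = defaultdict(list)
    for idx, cid in enumerate(cluster_ids):
        clusters[cid].append(idx)

    # Shuffle whole clusters rather than individual sequences.
    order = sorted(clusters)
    random.Random(seed).shuffle(order)

    # Assign clusters to the test set until it reaches the target size;
    # every remaining cluster goes to the training set.
    target = test_fraction * len(cluster_ids)
    train, test = [], []
    for cid in order:
        if len(test) < target:
            test.extend(clusters[cid])
        else:
            train.extend(clusters[cid])
    return sorted(train), sorted(test)


if __name__ == "__main__":
    labels = ["a", "a", "b", "c", "c", "c", "d", "e"]
    train_idx, test_idx = cluster_aware_split(labels, test_fraction=0.25)
    print("train:", train_idx)
    print("test:", test_idx)
```

Because whole clusters are assigned to one side, similar sequences cannot leak between train and test. The trade-off is that the test set size only approximates `test_fraction`.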

joanagoncalveslab/ELISL
ELISL: Early-Late Synthetic Lethality Prediction in Cancer by Tepeli YI, Seale C, Gonçalves JP (bioRxiv 2022, Bioinformatics 2023)
Language: Python - Size: 609 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0
