An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: sequence-embeddings

sacdallago/bio_embeddings

Get protein embeddings from protein sequences

Language: HTML - Size: 68.3 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 493 - Forks: 71

michaelscutari/protclust

protclust is a Python library for protein sequence analysis that integrates MMseqs2 for fast clustering and provides tools for creating robust machine learning datasets. It offers cluster-aware data splitting to prevent sequence similarity bias in model evaluation, along with comprehensive protein embedding capabilities for feature generation.

Language: Python - Size: 354 KB - Last synced at: 2 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0
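
The cluster-aware splitting mentioned in the protclust description can be illustrated with a minimal sketch. This is not protclust's API: the function name `cluster_aware_split`, its parameters, and the `cluster_of` mapping are hypothetical, and the cluster assignments are assumed to come from a tool such as MMseqs2. The idea shown is only that whole clusters, rather than individual sequences, are assigned to train or test, so similar sequences never land on both sides.

```python
import random
from collections import defaultdict


def cluster_aware_split(cluster_of, test_fraction=0.2, seed=0):
    """Split sequence IDs so that no cluster spans both train and test.

    cluster_of: dict mapping sequence ID -> cluster ID (hypothetical input,
    e.g. parsed from a clustering tool's output).
    """
    # Group sequence IDs by their cluster.
    members = defaultdict(list)
    for seq_id, cluster_id in cluster_of.items():
        members[cluster_id].append(seq_id)

    # Shuffle clusters (not individual sequences) reproducibly.
    clusters = sorted(members)
    random.Random(seed).shuffle(clusters)

    # Assign whole clusters to the test set until it reaches the target size;
    # every remaining cluster goes to the training set.
    target = test_fraction * len(cluster_of)
    train, test = [], []
    for cluster_id in clusters:
        bucket = test if len(test) < target else train
        bucket.extend(members[cluster_id])
    return train, test


# Illustrative data only: six sequences in four clusters.
example = {"s1": "c1", "s2": "c1", "s3": "c2", "s4": "c3", "s5": "c3", "s6": "c4"}
train_ids, test_ids = cluster_aware_split(example, test_fraction=0.2, seed=0)
```

Because clusters are kept intact, the test set may end up slightly larger than the requested fraction; that trade-off is the price of keeping similar sequences on one side of the split.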

joanagoncalveslab/ELISL

ELISL: Early-Late Synthetic Lethality Prediction in Cancer by Tepeli YI, Seale C, Gonçalves JP (bioRxiv 2022, Bioinformatics 2023)

Language: Python - Size: 609 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0