An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: mmseqs2

michaelscutari/protclust

protclust is a Python library for protein sequence analysis that integrates MMseqs2 for fast clustering and provides tools for creating robust machine learning datasets. It offers cluster-aware data splitting to prevent sequence similarity bias in model evaluation, along with comprehensive protein embedding capabilities for feature generation.

Language: Python - Size: 354 KB - Last synced at: 9 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

Anand-Research-Group/complex-I

This repository contains a set of scripts and workflows designed to search for Respiratory Complex I (NADH Ubiquinone Oxidoreductase) subunits in prokaryotic genomes and proteomes.

Language: Jupyter Notebook - Size: 5.67 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

hds-sandbox/AlphaFold_Workshop

Predict protein folding structures using ColabFold. Gain a deeper understanding of protein folding prediction with AlphaFold2 and MMseqs2. Run the Jupyter notebook on UCloud, learn to interpret results, predict protein structures of interest. Technical requirements provided. Enhance your knowledge of protein folding and AlphaFold2's principles. Fam

Language: Jupyter Notebook - Size: 567 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mrzResearchArena/protein-clustering

Protein Clustering

Language: Jupyter Notebook - Size: 23.7 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0