An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: fasta-files

BioJulia/FASTX.jl

Parse and process FASTA and FASTQ formatted files of biological sequences.

Language: Julia - Size: 1.09 MB - Last synced at: 9 days ago - Pushed at: 7 months ago - Stars: 62 - Forks: 20

PNNL-Comp-Mass-Spec/Validate-Fasta-File

Parses a FASTA file (with protein name and sequence information) to check for valid text. Also returns protein and residue stats.

Language: C# - Size: 3.97 MB - Last synced at: 29 days ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 2

PNNL-Comp-Mass-Spec/Protein-Digestion-Simulator

Performs validation, transformation, and in-silico digestion of text files containing protein or peptide sequences (FASTA format or delimited text)

Language: C# - Size: 7.92 MB - Last synced at: 29 days ago - Pushed at: 9 months ago - Stars: 7 - Forks: 4

GiatrasKon/sandbox.bio-Solutions

Bash scripts replicating the commands from sandbox.bio's interactive bioinformatics tutorials, organized by categories such as Data Exploration, File Formats, Quality Control, and Data Analysis.

Language: Shell - Size: 11.7 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Shogun486/Custom-DNA-mRNA-Sequence-Mapper

🧬 Automating the functionality of the open-source program Minimap2

Language: Perl - Size: 5.86 KB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

AntoineHo/CircosAlignmentPlotter

Converts part of an alignment (.PAF, and possibly other formats) to a Circos image using BED and FASTA files.

Language: Python - Size: 22.5 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 3 - Forks: 1

darwinsorchid/GC-Content-Calculator

[Python] Tool for calculating GC content of nucleotide sequences with optional sliding window analysis. Sequence input options include strings and the following file formats: FASTA, FASTA Nucleic Acid, GenBank, Aligned FASTA and ClustalW.

Language: Python - Size: 75.2 KB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

haschka/sequence_in_sequence_finder

A tool that finds a nucleic sub-sequence string (from a FASTA file) in a FASTA file using the Fourier transform.

Language: C - Size: 15.6 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

haschka/pdb2fasta

Extracts the FASTA sequence from a protein stored in a PDB file.

Language: C - Size: 26.4 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

cartoonist/kseqpp

Fast FASTA/Q parser and writer (C++ re-implementation of kseq library)

Language: C++ - Size: 90.8 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 41 - Forks: 7

DarisCappelletti/Gff3-tools

C# web tool for reading GFF3 files, filtering, ordering, extracting information into Excel, GFF3 files, CDS files, and comparing GFF3 data with FASTA and CDS files.

Language: ASP.NET - Size: 1.95 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

raymondkiu/bioinformatics-tools

Small and simple scripts useful for various bioinformatics purposes, e.g. extracting sequences from FASTA files.

Language: Shell - Size: 78.1 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 14 - Forks: 5

PriyaLakr/NGS_DataAnalysis

Some scripts to make your bioinformatics analyses reproducible and a bit easier 🤓

Language: Shell - Size: 58.6 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 1

Marie-Schmit/Java-fasta-gtf-reader

Java project for MSc Applied Bioinformatics at Cranfield University. Java program for gene model visualisation using gene structures from GTF annotation and FASTA files. The user can choose a file, display its information, calculate basic statistics, and display and highlight exons (with text and graphically).

Language: Java - Size: 4.01 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Sarah-Hesham-2022/BioPython-Getopt-Biological-Command-Line-Interpreter

Biopython project using the getopt library in Python to interact with the command-line prompt.

Language: Python - Size: 1.21 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

exTerEX/pdb2fasta 📦

A simple C library to extract the amino acid sequence from a file in PDB (Protein Data Bank) format and output it to a FASTA-format file.

Language: C - Size: 725 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

PNNL-Comp-Mass-Spec/Fasta-File-Splitter

The FASTA File Splitter program can be used to split apart a protein FASTA file into a number of sections. Although the splitting is random, each section will have a nearly identical number of residues.

Language: C# - Size: 144 KB - Last synced at: 29 days ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 2

DiegoELT/sequenceVisor

A basic TkInter-based interface to visualize DNA sequences from .fasta files, along with base saturation and consensus.

Language: Python - Size: 421 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 0

angelovangel/fastjac

k-mer similarity metrics for two fastx files

Language: Rust - Size: 5.86 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

aaiezza/FLiCK

FLiCK - Format LeveragIng Compression frameworK

Language: Java - Size: 1.36 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 0

abhijeetsingh1704/CONCAT

Converts the sequences in a multi-FASTA file into a single sequence with a new FASTA header.

Language: Shell - Size: 21.5 KB - Last synced at: about 2 months ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

bfosso/ITSoneDB-population-pipeline

Scripts used for the population of ITSoneDB

Language: Python - Size: 470 KB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 1