Topic: "next-generation-sequencing"
ablab/spades
SPAdes Genome Assembler
Language: C++ - Size: 97.6 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 820 - Forks: 149

lindenb/jvarkit
Java utilities for Bioinformatics
Language: Java - Size: 60.6 MB - Last synced at: 13 minutes ago - Pushed at: about 1 hour ago - Stars: 498 - Forks: 133

openbiox/awosome-bioinformatics
A curated list of resources for learning bioinformatics.
Size: 360 KB - Last synced at: 8 days ago - Pushed at: about 3 years ago - Stars: 475 - Forks: 125

nf-core/sarek
Analysis pipeline to detect germline or somatic variants (pre-processing, variant calling and annotation) from WGS / targeted sequencing
Language: Nextflow - Size: 102 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 442 - Forks: 443

sanger-pathogens/Roary
Rapid large-scale prokaryote pan genome analysis
Language: Perl - Size: 17.7 MB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 340 - Forks: 190

philres/ngmlr
NGMLR is a long-read mapper designed to align PacBio or Oxford Nanopore (standard and ultra-long) to a reference genome with a focus on reads that span structural variations
Language: C++ - Size: 35.4 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 297 - Forks: 40

bioinformatics-centre/kaiju
Fast taxonomic classification of metagenomic sequencing reads using a protein reference database
Language: C - Size: 906 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 278 - Forks: 67

sanger-pathogens/snp-sites
Finds SNP sites from a multi-FASTA alignment file
Language: C - Size: 14.7 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 257 - Forks: 49

sanger-pathogens/circlator
A tool to circularize genome assemblies
Language: Python - Size: 25.7 MB - Last synced at: 23 days ago - Pushed at: about 1 year ago - Stars: 240 - Forks: 57

nextstrain/nextclade
Viral genome alignment, mutation calling, clade assignment, quality checks and phylogenetic placement
Language: Rust - Size: 59.2 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 230 - Forks: 61

sanger-pathogens/Artemis
Artemis is a free genome viewer and annotation tool that allows visualization of sequence features and the results of analyses within the context of the sequence, and its six-frame translation
Language: Java - Size: 153 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 208 - Forks: 73

nickjcroucher/gubbins
Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins
Language: Python - Size: 9.69 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 184 - Forks: 53

sanger-pathogens/ariba
Antimicrobial Resistance Identification By Assembly
Language: Python - Size: 12 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 173 - Forks: 55

ngless-toolkit/ngless
NGLess: NGS with less work
Language: Haskell - Size: 14.1 MB - Last synced at: about 4 hours ago - Pushed at: about 2 months ago - Stars: 146 - Forks: 25

SciLifeLab/Sarek Fork of nf-core/sarek 📦
Detect germline or somatic variants from normal or tumour/normal whole-genome or targeted sequencing
Language: Nextflow - Size: 43.8 MB - Last synced at: about 2 months ago - Pushed at: over 5 years ago - Stars: 133 - Forks: 7

alastair-droop/fqtools
An efficient FASTQ manipulation suite
Language: C - Size: 2.28 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 127 - Forks: 16

zhanxw/rvtests
Rare variant test software for next generation sequencing data
Language: C++ - Size: 32.7 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 120 - Forks: 41

chasewnelson/SNPGenie
Program for estimating πN/πS, dN/dS, and other diversity measures from next-generation sequencing data
Language: Perl - Size: 876 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 111 - Forks: 37

xjtu-omics/msisensor-pro
Microsatellite Instability (MSI) detection using high-throughput sequencing data.
Language: C++ - Size: 233 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 107 - Forks: 23

sanger-pathogens/assembly-stats
Get assembly statistics from FASTA and FASTQ files
Language: C++ - Size: 1.02 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 95 - Forks: 22

Cibiv/NextGenMap
NextGenMap is a flexible highly sensitive short read mapping tool that handles much higher mismatch rates than comparable algorithms while still outperforming them in terms of runtime. This allows analysing large scale datasets even with increased SNP rates or higher error rates (e.g. caused by specialized experimental protocols) and avoids biases caused by highly variable regions in the genome.
Language: C++ - Size: 113 MB - Last synced at: 25 days ago - Pushed at: almost 6 years ago - Stars: 87 - Forks: 8

chanzuckerberg/czid-web
Infectious Disease Sequencing Platform
Language: TypeScript - Size: 466 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 84 - Forks: 25

KHP-Informatics/ngseasy
Dockerised Next Generation Sequencing Pipeline (QC, Align, Calling, Annotation)
Language: Shell - Size: 44 MB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 84 - Forks: 40

stjude-rust-labs/fq
Command line utility for manipulating Illumina-generated FASTQ files.
Language: Rust - Size: 563 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 82 - Forks: 5

tobiasrausch/ATACseq
Analysis Workflow for Assay for Transposase-Accessible Chromatin using sequencing (ATAC-Seq)
Language: Shell - Size: 330 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 75 - Forks: 36

MicrobeLab/DeepMicrobes
DeepMicrobes: taxonomic classification for metagenomics with deep learning
Language: Python - Size: 1.25 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 74 - Forks: 19

sanger-pathogens/Fastaq
Python3 scripts to manipulate FASTA and FASTQ files
Language: Python - Size: 371 KB - Last synced at: 18 days ago - Pushed at: about 2 months ago - Stars: 70 - Forks: 21

sanger-pathogens/pathogen-informatics-training
Language: Jupyter Notebook - Size: 359 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 58 - Forks: 34

sanger-pathogens/iva
de novo virus assembler of Illumina paired reads
Language: Python - Size: 8.46 MB - Last synced at: 25 days ago - Pushed at: almost 4 years ago - Stars: 55 - Forks: 18

sanger-pathogens/assembly_improvement
Improve the quality of a denovo assembly by scaffolding and gap filling
Language: Perl - Size: 3.03 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 52 - Forks: 21

stjude-rust-labs/wdl
Rust crates for working with Workflow Description Language (WDL) documents.
Language: Rust - Size: 3.83 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 49 - Forks: 34

epigen/MrBiomics
MrBiomics: Modules & Recipes augment Bioinformatics for Multi-Omics Analyses
Language: Python - Size: 10.1 MB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 49 - Forks: 1

s4hts/HTStream
A high throughput sequence read toolset using a streaming approach facilitated by Linux pipes
Language: C++ - Size: 20.3 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 49 - Forks: 9

YaqiangCao/cLoops2
Enhanced and elegant flexible peak/loop/domain -calling and analysis tool for 1D/3D genomic data.
Language: Python - Size: 251 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 46 - Forks: 8

mehrdadbakhtiari/adVNTR
A tool for genotyping Variable Number Tandem Repeats (VNTR) from sequence data
Language: Python - Size: 1.42 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 41 - Forks: 15

stjudecloud/oliver 📦
An opinionated Cromwell orchestration manager.
Language: Python - Size: 2.61 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 40 - Forks: 3

KHP-Informatics/DNAscan
DNAscan is a fast and efficient bioinformatics pipeline that allows for the analysis of DNA Next Generation sequencing data, requiring very little computational effort and memory usage.
Language: HTML - Size: 57.2 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 39 - Forks: 22

nextgenusfs/amptk
AMPtk: Amplicon ToolKit for NGS data (formally UFITS)
Language: Python - Size: 18.3 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 38 - Forks: 14

stjudecloud/workflows
Bioinformatics workflows developed for and used on the St. Jude Cloud project.
Language: WDL - Size: 74.6 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 36 - Forks: 12

stjude-rust-labs/ngs
Command line utility for working with next-generation sequencing files.
Language: Rust - Size: 893 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 34 - Forks: 2

zhanxw/seqminer
Query sequence data (VCF/BCF1/BCF2, Tabix, BGEN, PLINK) in R
Language: C - Size: 7.99 MB - Last synced at: 23 days ago - Pushed at: 7 months ago - Stars: 30 - Forks: 12

kstawiski/OmicSelector
OmicSelector - Environment, docker-based application and R package for biomarker signiture selection (feature selection) & deep learning diagnostic tool development from high-throughput high-throughput omics experiments and other multidimensional datasets. Initially developed for miRNA-seq, RNA-seq and qPCR.
Language: R - Size: 54.5 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 30 - Forks: 3

ncbi/pm4ngs
Project Manager for NGS data analysis
Language: Python - Size: 14 MB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 30 - Forks: 10

sanger-pathogens/mlst_check
Multilocus sequence typing by blast using the schemes from PubMLST
Language: Perl - Size: 5.23 MB - Last synced at: about 18 hours ago - Pushed at: almost 3 years ago - Stars: 30 - Forks: 15

sanger-pathogens/gff3toembl
Converts Prokka GFF3 files to EMBL files for uploading annotated assemblies to EBI
Language: Python - Size: 2.92 MB - Last synced at: about 17 hours ago - Pushed at: over 6 years ago - Stars: 29 - Forks: 12

sanger-pathogens/plasmidtron
Assembling the cause of phenotypes and genotypes from NGS data
Language: Python - Size: 825 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 28 - Forks: 8

menghaowei/ngstools
My own tools code for NGS data analysis (Next Generation Sequencing)
Language: Python - Size: 2.53 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 28 - Forks: 9

stjudecloud/wdldoc
Create WDL documentation using Markdown.
Language: Python - Size: 157 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 27 - Forks: 5

baku4/sigalign
A Similarity-Guided Alignment Algorithm
Language: Rust - Size: 11.7 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 26 - Forks: 4

edgardomortiz/Captus
Assembly of Phylogenomic Datasets from High-Throughput Sequencing data
Language: Python - Size: 77.7 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 25 - Forks: 6

sanger-pathogens/saffrontree
SaffronTree: Reference free rapid phylogenetic tree construction from raw read data
Language: Python - Size: 1.32 MB - Last synced at: about 18 hours ago - Pushed at: almost 5 years ago - Stars: 25 - Forks: 3

genid/Yleaf
Yleaf software for human Y-chromosomal haplogroup inference from next generation sequencing data
Language: Python - Size: 41.1 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 24 - Forks: 10

TimoLassmann/samstat
SAMStat displays various properties of next-generation sequencing reads stored in SAM/BAM format.
Language: C - Size: 6.67 MB - Last synced at: 10 days ago - Pushed at: almost 2 years ago - Stars: 24 - Forks: 6

amberbiology/py4lifesci
Code repository for the book "Python for the Life Sciences" by Lancaster & Webster
Language: Python - Size: 8.64 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 24 - Forks: 15

GarrettJenkinson/informME
An information-theoretic pipeline for methylation analysis of WGBS data
Language: MATLAB - Size: 7.78 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 24 - Forks: 9

rhshah/iCallSV
A Framework to call Structural Variants from NGS based datasets
Language: Python - Size: 3.12 MB - Last synced at: 10 days ago - Pushed at: over 7 years ago - Stars: 22 - Forks: 10

sigven/cacao
Callable Cancer Loci - assessment of sequencing coverage for actionable and pathogenic loci in cancer
Language: R - Size: 20.4 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 21 - Forks: 4

Genometric/MSPC
Using combined evidence from replicates to evaluate ChIP-seq peaks
Language: C# - Size: 22.1 MB - Last synced at: 1 day ago - Pushed at: 16 days ago - Stars: 20 - Forks: 10

sanger-pathogens/Bio-Tradis
A set of tools to analyse the output from TraDIS analyses
Language: Perl - Size: 192 MB - Last synced at: 11 months ago - Pushed at: almost 2 years ago - Stars: 20 - Forks: 29

MorrellLAB/sequence_handling Fork of pmorrell/sequence_handling
A series of scripts to automate sequence workflows
Language: Jupyter Notebook - Size: 6.94 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 18 - Forks: 9

sanger-pathogens/seroba
k-mer based Pipeline to identify the Serotype from Illumina NGS reads
Language: Python - Size: 19.4 MB - Last synced at: 9 months ago - Pushed at: almost 2 years ago - Stars: 18 - Forks: 16

chgibb/PHAT
Pathogen-Host Analysis Tool - A modern Next-Generation Sequencing (NGS) analysis platform
Language: TypeScript - Size: 649 MB - Last synced at: 10 days ago - Pushed at: over 2 years ago - Stars: 17 - Forks: 2

galantelab/sandy
A straightforward and complete next-generation sequencing read simulator
Language: Perl - Size: 25.9 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 15 - Forks: 2

PacificBiosciences/minorseq 📦
Minor Variant Calling and Phasing Tools
Size: 1.96 MB - Last synced at: 10 months ago - Pushed at: over 3 years ago - Stars: 15 - Forks: 6

futianfan/GenoCraft
GenoCraft: A Comprehensive, User-Friendly Web Platform for High-Throughput Omics Data Analysis and Visualization (https://arxiv.org/pdf/2312.14249)
Language: SCSS - Size: 69.9 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 14 - Forks: 0

merlyescalona/ngsphy
NGSphy: phylogenomic simulation of next-generation sequencing data
Language: Python - Size: 16.3 MB - Last synced at: 7 months ago - Pushed at: about 7 years ago - Stars: 14 - Forks: 6

ANGSD-wrapper/angsd-wrapper
Utilities for analyzing next generation sequencing data.
Language: Shell - Size: 13.4 MB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 13 - Forks: 4

Sentieon/sentieon-dnascope-ml
Sentieon DNAscope + Machine Learning Model
Language: Shell - Size: 111 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 12 - Forks: 4

stjudecloud/ngsderive
Forensic analysis tool useful in backwards computing information from next-generation sequencing data.
Language: Python - Size: 1.07 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 11 - Forks: 0

Pathogen-Genomics-Cymru/lodestone
Mycobacterial pipeline
Language: Python - Size: 572 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 11 - Forks: 4

maxemil/PhyloMagnet
screening metagenomes for arbitrary lineages, using gene-centric assembly methods and phylogenetics
Language: Nextflow - Size: 2.34 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 11 - Forks: 4

epigen/fetch_ngs
Workflow to Fetch Public Sequencing Data and Metadata Using iSeq and MrBiomics Module.
Language: Python - Size: 46.9 KB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 10 - Forks: 0

blachlylab/fade
Fragmentase Artifact Detection and Elimination
Language: D - Size: 721 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 10 - Forks: 3

MultiQC/example-plugin
A miniature example of a MultiQC plugin.
Language: Python - Size: 43.9 KB - Last synced at: 10 days ago - Pushed at: 3 months ago - Stars: 9 - Forks: 6

galantelab/sideRETRO
A pipeline for detecting Somatic Insertion of DE novo RETROcopies
Language: C - Size: 4.76 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 9 - Forks: 5

HudsonAlpha/rust-msbwt
place for building up a msbwt resource in Rust
Language: Rust - Size: 156 KB - Last synced at: 15 days ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 2

quadram-institute-bioscience/albatradis
Comparative TraDIS analysis
Language: Gnuplot - Size: 19.6 MB - Last synced at: 10 days ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 4

bihealth/cnvetti
:tada: CNVetti – robust, efficient, and versatile clinical CNV calling from HTS data
Language: Rust - Size: 547 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 8 - Forks: 0

sepro/LSTrAP 📦
Language: Python - Size: 33.4 MB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 8 - Forks: 4

kaizhang/Taiji 📦
This project has been moved to:
Language: Haskell - Size: 3.67 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 8 - Forks: 3

yanlinlin82/seqpipe
A framework for SEQuencing data analysis PIPElines.
Language: Perl - Size: 5.33 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 4

kohyamat/massgenotyping
Python package for microsatellite genotyping from highly multiplexed amplicon sequencing data
Language: Python - Size: 420 KB - Last synced at: 9 months ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 1

linnil1/HLA_collections
A script to run HLA typing tools from next generation sequencing data
Language: Python - Size: 292 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 1

stjudecloud/docs-archive 📦
Documentation on St. Jude Cloud.
Language: Shell - Size: 413 MB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 7 - Forks: 6

stjude/Episomizer
Constuct Extrachromosomal Circular DNA using Whole Genome Sequencing Data
Language: Python - Size: 73.6 MB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 7 - Forks: 5

BBCG/epialleleR
Fast, epiallele-aware methylation caller and reporter — an R/Bioconductor package
Language: R - Size: 82.7 MB - Last synced at: 16 days ago - Pushed at: 2 months ago - Stars: 6 - Forks: 1

Roco-scientist/NGS-Barcode-Count
Counts barcodes within next generation sequencing data. DNA encoded libraries (DEL), CRISPR, and barcode-seq
Language: Rust - Size: 346 KB - Last synced at: 13 days ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 4

ucladx/pancan-panel-design
A pan-cancer DNA-seq panel
Language: Python - Size: 3.31 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 0

blachlylab/dhtslib
D bindings and OOP wrappers for htslib
Language: D - Size: 1.21 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 1

xjtu-omics/Mako
A graph-based pattern growth approach for CSV discovery
Language: Java - Size: 18 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 6 - Forks: 1

rcs333/CLOMP
:horse: CLinically Okay Metagenomic Pipeline :horse:
Language: Python - Size: 311 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 6 - Forks: 3

WCSCourses/NGS_Bioinformatics_2024 Fork of WCSCourses/NGS_Bioinformatics 📦
Learn to analyse next generation sequence data to address a wide range of biological questions
Language: TeX - Size: 137 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 5 - Forks: 2

daormar/geno-debasher
Geno-DeBasher package for detection of germline and somatic variants
Language: Shell - Size: 3.36 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 1

ewimberley/peakPass
Create ChIP-Seq blacklists automaticaly and validate them.
Language: R - Size: 20 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 5 - Forks: 1

alok123t/HyINDEL
Hybrid approach for detection of Insertions and Deletions from NGS data
Language: C++ - Size: 4.6 MB - Last synced at: 4 months ago - Pushed at: about 5 years ago - Stars: 5 - Forks: 0

NCBI-Hackathons/MetadataTable
Work on and around an interactive metadata table for project identification
Language: Python - Size: 113 KB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 5 - Forks: 2

swyder/Genomic_Utilities
Size: 105 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 5 - Forks: 2

RRafiee/XHMM
eXome Hidden Markov model (XHMM) - Copy Number Variants (CNVs) from Whole Exome Sequencing Data
Language: Shell - Size: 13 MB - Last synced at: over 1 year ago - Pushed at: about 8 years ago - Stars: 5 - Forks: 3

jiankaiwang/NextGenerationSequencing
Construct a flow to analyze NGS data based on R, Perl, and Python.
Language: R - Size: 84 KB - Last synced at: over 1 year ago - Pushed at: over 8 years ago - Stars: 5 - Forks: 4

nf-core/sammyseq
Pipeline for Sequential Analysis of MacroMolecules accessibilitY sequencing (SAMMY-seq) data, to analyze chromatin state.
Language: Nextflow - Size: 2.94 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 4 - Forks: 15
