Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: variant-calling

tanghaibao/jcvi

Python library to facilitate genome assembly, annotation, and comparative genomics

Language: Python - Size: 17.8 MB - Last synced: 4 days ago - Pushed: 6 days ago - Stars: 701 - Forks: 180

vcflib/vcflib

C++ library and cmdline tools for parsing and manipulating VCF files with python and zig bindings

Language: C++ - Size: 32.9 MB - Last synced: about 21 hours ago - Pushed: 22 days ago - Stars: 604 - Forks: 219

tseemann/snippy

:scissors: :zap: Rapid haploid variant calling and core genome alignment

Language: Perl - Size: 124 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 408 - Forks: 111

nf-core/sarek

Analysis pipeline to detect germline or somatic variants (pre-processing, variant calling and annotation) from WGS / targeted sequencing

Language: Nextflow - Size: 82.4 MB - Last synced: 27 days ago - Pushed: 27 days ago - Stars: 333 - Forks: 370

luntergroup/octopus

Bayesian haplotype-based mutation calling

Language: C++ - Size: 137 MB - Last synced: 11 days ago - Pushed: 9 months ago - Stars: 296 - Forks: 37

kishwarshafin/pepper

PEPPER-Margin-DeepVariant

Language: Python - Size: 36.1 MB - Last synced: 16 days ago - Pushed: 4 months ago - Stars: 231 - Forks: 41

HKU-BAL/Clair3

Clair3 - Symphonizing pileup and full-alignment for high-performance long-read variant calling

Language: Python - Size: 3.11 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 220 - Forks: 26

suhrig/arriba

Fast and accurate gene fusion detection from RNA-Seq data

Language: C++ - Size: 24.1 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 206 - Forks: 49

atks/vt

A tool set for short variant discovery in genetic sequence data.

Language: C - Size: 30.1 MB - Last synced: 7 months ago - Pushed: about 3 years ago - Stars: 190 - Forks: 3

nanoporetech/megalodon

Megalodon is a research command line tool to extract high accuracy modified base and sequence variant calls from raw nanopore reads by anchoring the information rich basecalling neural network output to a reference genome/transriptome.

Language: Python - Size: 24.9 MB - Last synced: 6 days ago - Pushed: about 1 year ago - Stars: 190 - Forks: 29

broadinstitute/viral-ngs

Viral genomics analysis pipelines

Language: Python - Size: 64.5 MB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 180 - Forks: 66

aquaskyline/Clairvoyante

Clairvoyante: a multi-task convolutional deep neural network for variant calling in Single Molecule Sequencing

Language: Python - Size: 894 MB - Last synced: 11 days ago - Pushed: 5 months ago - Stars: 172 - Forks: 27

Illumina/paragraph

Graph realignment tools for structural variants

Language: C++ - Size: 30.8 MB - Last synced: about 2 months ago - Pushed: over 1 year ago - Stars: 143 - Forks: 28

sequana/sequana

Sequana: a set of Snakemake NGS pipelines

Language: Jupyter Notebook - Size: 35.5 MB - Last synced: about 2 hours ago - Pushed: 2 months ago - Stars: 141 - Forks: 27

CRG-CNAG/CalliNGS-NF

GATK RNA-Seq Variant Calling in Nextflow

Language: Nextflow - Size: 29 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 129 - Forks: 52

tseemann/nullarbor

:floppy_disk: :page_with_curl: "Reads to report" for public health and clinical microbiology

Language: Perl - Size: 11.3 MB - Last synced: 7 months ago - Pushed: almost 4 years ago - Stars: 123 - Forks: 35

andersen-lab/ivar Fork of gkarthik/ivar

iVar is a computational package that contains functions broadly useful for viral amplicon-based sequencing.

Language: C++ - Size: 16 MB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 115 - Forks: 38

broadinstitute/long-read-pipelines

Long read production pipelines

Language: Jupyter Notebook - Size: 179 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 111 - Forks: 22

nf-core/viralrecon

Assembly and intrahost/low-frequency variant calling for viral samples

Language: Nextflow - Size: 9.67 MB - Last synced: 9 days ago - Pushed: 10 days ago - Stars: 108 - Forks: 97

lh3/fermikit

De novo assembly based variant calling pipeline for Illumina short reads

Language: TeX - Size: 7.45 MB - Last synced: 11 days ago - Pushed: over 3 years ago - Stars: 108 - Forks: 23

mcveanlab/mccortex

De novo genome assembly and multisample variant calling

Language: C - Size: 9.86 MB - Last synced: 7 months ago - Pushed: about 5 years ago - Stars: 107 - Forks: 24

ShujiaHuang/ilus

A handy variant calling pipeline generator for whole-genome sequencing (WGS) and whole exom sequencing data (WES) analysis. 一个简易且全面的 WGS/WES 分析流程生成器.

Language: Python - Size: 780 KB - Last synced: 17 days ago - Pushed: 4 months ago - Stars: 107 - Forks: 35

PacificBiosciences/ccs

CCS: Generate Highly Accurate Single-Molecule Consensus Reads (HiFi Reads)

Size: 34.1 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 105 - Forks: 31

HKU-BAL/Clair

Clair: Exploring the limit of using deep neural network on pileup data for germline variant calling

Language: Python - Size: 1.01 MB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 102 - Forks: 12

gear-genomics/tracy

Basecalling, alignment, assembly and deconvolution of Sanger Chromatogram trace files

Language: C++ - Size: 3.17 MB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 94 - Forks: 19

humanlongevity/HLA

xHLA: Fast and accurate HLA typing from short read sequence data

Language: Jupyter Notebook - Size: 91.3 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 92 - Forks: 51

GenomicsDB/GenomicsDB

High performance data storage for importing, querying and transforming variants.

Language: C++ - Size: 65.9 MB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 89 - Forks: 18

iqbal-lab-org/gramtools

Genome inference from a population reference graph

Language: C++ - Size: 14.7 MB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 89 - Forks: 14

kcleal/dysgu

Toolkit for calling structural variants using short or long reads

Language: Cython - Size: 46.3 MB - Last synced: 23 days ago - Pushed: 23 days ago - Stars: 85 - Forks: 10

bioinformatics-centre/BayesTyper

A method for variant graph genotyping based on exact alignment of k-mers

Language: C++ - Size: 1.75 MB - Last synced: 5 months ago - Pushed: about 5 years ago - Stars: 83 - Forks: 8

TileDB-Inc/TileDB-VCF

Efficient variant-call data storage and retrieval library using the TileDB storage library.

Language: C++ - Size: 33 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 79 - Forks: 13

moiexpositoalonsolab/grenepipe

A flexible, scalable, and reproducible pipeline to automate variant calling from sequence reads.

Language: Python - Size: 93.2 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 78 - Forks: 18

lh3/minipileup

Simple pileup-based variant caller

Language: C - Size: 93.8 KB - Last synced: 11 days ago - Pushed: about 2 months ago - Stars: 76 - Forks: 3

nf-core/raredisease

Call and score variants from WGS/WES of rare disease patients.

Language: Nextflow - Size: 11.2 MB - Last synced: about 7 hours ago - Pushed: about 8 hours ago - Stars: 72 - Forks: 34

TimD1/vcfdist

vcfdist: Accurately benchmarking phased variant calls

Language: C++ - Size: 61.4 MB - Last synced: 11 days ago - Pushed: 11 days ago - Stars: 67 - Forks: 5

czbiohub-sf/cerebra 📦

A tool for fast and accurate summarizing of variant calling format (VCF) files

Language: Python - Size: 164 MB - Last synced: 2 days ago - Pushed: over 1 year ago - Stars: 59 - Forks: 9

lh3/CHM-eval

Language: TeX - Size: 524 KB - Last synced: 11 days ago - Pushed: almost 4 years ago - Stars: 49 - Forks: 8

karel-brinda/ococo

Ococo: the first online variant and consensus caller. Call genomic consensus directly from an unsorted SAM/BAM stream.

Language: C++ - Size: 3.49 MB - Last synced: 7 months ago - Pushed: over 5 years ago - Stars: 47 - Forks: 4

Clinical-Genomics/BALSAMIC

Bioinformatic Analysis pipeLine for SomAtic Mutations In Cancer

Language: Python - Size: 228 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 43 - Forks: 16

NVIDIA-Genomics-Research/VariantWorks

Deep Learning based variant calling toolkit - https://clara-parabricks.github.io/VariantWorks/

Language: Python - Size: 10.1 MB - Last synced: 15 days ago - Pushed: over 1 year ago - Stars: 43 - Forks: 11

gencorefacility/variant-calling-pipeline-gatk4

Variant Calling Pipeline Using GATK4 and Nextflow

Language: Nextflow - Size: 23.4 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 40 - Forks: 40

kevlar-dev/kevlar

Reference-free variant discovery in large eukaryotic genomes

Language: Python - Size: 71.5 MB - Last synced: about 1 month ago - Pushed: almost 3 years ago - Stars: 40 - Forks: 9

nf-core/deepvariant Fork of lifebit-ai/DeepVariant 📦

Please consider using/contributing to https://github.com/nf-core/sarek

Language: Nextflow - Size: 7.18 MB - Last synced: 5 months ago - Pushed: almost 3 years ago - Stars: 40 - Forks: 18

brentp/indelope

find large indels (in the blind spot between GATK/freebayes and SV callers)

Language: Nim - Size: 133 KB - Last synced: 18 days ago - Pushed: over 6 years ago - Stars: 39 - Forks: 1

kehrlab/PopDel

Population-wide Deletion Calling

Language: C++ - Size: 20.4 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 33 - Forks: 2

gear-genomics/indigo

Indigo: SNV and InDel Discovery in Chromatogram traces obtained from Sanger sequencing of PCR products

Language: JavaScript - Size: 2.72 MB - Last synced: about 1 month ago - Pushed: 7 months ago - Stars: 30 - Forks: 6

oliverSI/GATK4_Best_Practice

GATK4 Best Practice Nextflow Pipeline

Size: 9.77 KB - Last synced: 8 months ago - Pushed: over 6 years ago - Stars: 29 - Forks: 28

HKU-BAL/ClairS-TO

ClairS-TO - a deep-learning method for tumor-only somatic variant calling

Language: Python - Size: 1.93 MB - Last synced: 23 days ago - Pushed: 24 days ago - Stars: 29 - Forks: 2

nf-core/rnavar

gatk4 RNA variant calling pipeline

Language: Nextflow - Size: 3.64 MB - Last synced: 10 days ago - Pushed: 10 days ago - Stars: 28 - Forks: 29

sib-swiss/NGS-variants-training

GitHub for the SIB courses NGS - Genome variant analysis

Language: Shell - Size: 68.1 MB - Last synced: about 1 month ago - Pushed: 3 months ago - Stars: 24 - Forks: 9

aryarm/varCA

Use an ensemble of variant callers to call variants from ATAC-seq data

Language: Python - Size: 354 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 21 - Forks: 7

SUwonglab/arcsv

Complex structural variant detection from WGS data

Language: Python - Size: 5.1 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 20 - Forks: 6

IKIM-Essen/uncovar

Transparent and robust SARS-CoV-2 variant calling and lineage assignment with comprehensive reporting.

Language: Python - Size: 3.83 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 20 - Forks: 4

Sentieon/sentieon-dnaseq 📦

Sentieon DNAseq

Language: Shell - Size: 60.5 KB - Last synced: 5 months ago - Pushed: over 3 years ago - Stars: 19 - Forks: 12

sanjaynagi/rna-seq-pop

Snakemake workflow for Illumina RNA-sequencing experiments - extract population genomic signals from RNA-Seq data

Language: Jupyter Notebook - Size: 99.6 MB - Last synced: 19 days ago - Pushed: 23 days ago - Stars: 17 - Forks: 6

adrianodemarino/Imputation_beagle_tutorial

Imputation-beagle-tutorial

Size: 12.7 KB - Last synced: 7 months ago - Pushed: almost 2 years ago - Stars: 16 - Forks: 2

tjiangHIT/rMETL

rMETL - realignment-based Mobile Element insertion detection Tool for Long read

Language: Python - Size: 1.02 MB - Last synced: 24 days ago - Pushed: 9 months ago - Stars: 15 - Forks: 3

nf-core/exoseq 📦

Please consider using/contributing to https://github.com/nf-core/sarek

Language: Nextflow - Size: 2.77 MB - Last synced: 5 months ago - Pushed: over 5 years ago - Stars: 14 - Forks: 25

nf-core/vipr 📦

Assembly and intrahost / low-frequency variant calling for viral samples

Language: Nextflow - Size: 217 KB - Last synced: 4 months ago - Pushed: over 4 years ago - Stars: 14 - Forks: 10

NBISweden/workshop-ngsintro

Workshop • Intro to Bioinformatics using NGS data • 5 days

Language: HTML - Size: 695 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 12 - Forks: 5

TimD1/nPoRe

nPoRe: n-Polymer Realigner for improved pileup-based variant calling

Language: Python - Size: 493 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 12 - Forks: 0

HKU-BAL/Clair3-Trio

Clair3-Trio: variant calling in trio using Nanopore long-reads

Language: Python - Size: 2.74 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 12 - Forks: 1

collaborativebioinformatics/Sniphles

Sniphles is a read-based phasing approach for phased variant calling of structural variants.

Language: Python - Size: 2.44 MB - Last synced: about 1 month ago - Pushed: over 3 years ago - Stars: 12 - Forks: 2

koesterlab/microphaser

Language: Rust - Size: 1.91 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 11 - Forks: 2

CMU-SAFARI/Genome-on-Diet

Genome-on-Diet is a fast and memory-frugal framework for exemplifying sparsified genomics for read mapping, containment search, and metagenomic profiling. It is much faster & more memory-efficient than minimap2 for Illumina, HiFi, and ONT reads. Described by Alser et al. (preliminary version: https://arxiv.org/abs/2211.08157).

Language: Roff - Size: 63 MB - Last synced: 23 days ago - Pushed: 8 months ago - Stars: 11 - Forks: 4

tobiasrausch/nRex

nRex: Germline and somatic single-nucleotide, short indel and structural variant calling

Language: Shell - Size: 1.85 MB - Last synced: 19 days ago - Pushed: 20 days ago - Stars: 11 - Forks: 1

mbhall88/tbpore

Mycobacterium tuberculosis genomic analysis from Nanopore sequencing data

Language: Python - Size: 3.3 MB - Last synced: 17 days ago - Pushed: 7 months ago - Stars: 11 - Forks: 2

KChen-lab/MonoVar

Single nucleotide variant (SNV) detection and genotyping algorithm for single-cell DNA sequencing data

Language: Python - Size: 13.5 MB - Last synced: 7 months ago - Pushed: about 4 years ago - Stars: 11 - Forks: 6

Sydney-Informatics-Hub/Bioinformatics

A suite of bioinformatics data processing and analysis pipelines, software, and training resources for common DNA and RNA sequence analysis methods.

Size: 137 KB - Last synced: about 1 month ago - Pushed: about 2 months ago - Stars: 11 - Forks: 0

abs-tudelft/variant-calling-at-scale

Scalable and High Performance Variant Calling on Cluster Environments

Language: Python - Size: 818 KB - Last synced: 10 months ago - Pushed: over 2 years ago - Stars: 10 - Forks: 0

yjx1217/Varathon

A scalable variant calling and benchmarking framework supporting both short and long reads.

Language: Perl - Size: 934 KB - Last synced: 12 months ago - Pushed: over 3 years ago - Stars: 10 - Forks: 3

BioJulia/SequenceVariation.jl

Biological sequence variation

Language: Julia - Size: 371 KB - Last synced: about 1 month ago - Pushed: about 1 year ago - Stars: 9 - Forks: 3

pdimens/harpy

Process raw haplotagging data, from raw sequences to phased haplotypes, batteries included.

Language: Python - Size: 386 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 9 - Forks: 1

Sentieon/sentieon-dnascope-ml

Sentieon DNAscope + Machine Learning Model

Language: Shell - Size: 111 KB - Last synced: 7 months ago - Pushed: over 2 years ago - Stars: 9 - Forks: 3

bgm-cwg/novoCaller

Language: C++ - Size: 62.5 KB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 8 - Forks: 1

dmnfarrell/snipgenie

command line and desktop tool for microbial variant calling

Language: Jupyter Notebook - Size: 122 MB - Last synced: 18 days ago - Pushed: about 1 month ago - Stars: 8 - Forks: 0

pblaney/mgp1000

Nextflow bioinformatics pipeline for large-scale analysis of Multiple Myeloma genomes

Language: Nextflow - Size: 324 MB - Last synced: 11 days ago - Pushed: 11 days ago - Stars: 8 - Forks: 4

VCCRI/dv-trio

dv-trio provides a pipeline to call variants for a trio (father-mother-child) using DeepVariants [1]. Genomic Variant Calling Files (gVCFs) created by DeepVariants are then co_called together using GATK[2]. The resultant trio VCF is then post-processing with FamSeq[3] to eliminate mendelian errors.

Language: Shell - Size: 165 KB - Last synced: about 2 months ago - Pushed: over 3 years ago - Stars: 8 - Forks: 1

CMU-SAFARI/Molecules2Variations

The first work to provide a comprehensive survey of a prominent set of algorithmic improvement and hardware acceleration efforts for the entire genome analysis pipeline used for the three most prominent sequencing data, short reads (Illumina), ultra-long reads (ONT), and accurate long reads (HiFi). Described in arXiv (2022) by Alser et al. https://arxiv.org/abs/2205.07957

Size: 4.31 MB - Last synced: 23 days ago - Pushed: almost 2 years ago - Stars: 8 - Forks: 0

tkonopka/GeneticThesaurus

Thesaurus for genetic variants

Language: Java - Size: 1.2 MB - Last synced: 3 months ago - Pushed: about 6 years ago - Stars: 8 - Forks: 1

kaist-ina/RUN-DVC

Generalizing deep learning-based variant callers via domain adaptation and semi-supervised learning

Language: Python - Size: 3.5 MB - Last synced: about 1 month ago - Pushed: 4 months ago - Stars: 8 - Forks: 0

INMEGEN/Pipelines_INMEGEN

Flujos de trabajos desarrollados y automatizados en el INMEGEN para el procesamiento de datos genómicos y transcriptómicos.

Language: Nextflow - Size: 3.06 MB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 7 - Forks: 0

fedarko/strainFlye

Pipeline for analyzing (rare) mutations in metagenome-assembled genomes

Language: Python - Size: 14.4 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 7 - Forks: 1

DKFZ-ODCF/SNVCallingWorkflow

The DKFZ-ODCF, formerly DKFZ/eilslabs SNV-Calling Workflow

Language: Python - Size: 925 KB - Last synced: 7 months ago - Pushed: 11 months ago - Stars: 7 - Forks: 4

lindechun/VirVarDP

Virus variant detection pipeline

Language: Python - Size: 575 KB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 7 - Forks: 0

bioinf/RMAhunter

Language: HTML - Size: 116 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 7 - Forks: 0

HKU-BAL/Clair3-RNA

Clair3-RNA - a long-read small variant caller for RNA sequencing data

Language: Python - Size: 327 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 7 - Forks: 0

mcveanlab/Corticall

Germline DNM caller based on LdBG assembly of pathogens

Language: Java - Size: 128 MB - Last synced: 8 months ago - Pushed: about 4 years ago - Stars: 7 - Forks: 3

hitbc/rMETL Fork of tjiangHIT/rMETL

rMETL - realignment-based Mobile Element insertion detection Tool for Long read

Language: Python - Size: 1.04 MB - Last synced: 10 months ago - Pushed: over 4 years ago - Stars: 6 - Forks: 0

Cristianetaniguti/Reads2Map

WDL workflows to build genetic maps from sequencing reads

Language: WDL - Size: 167 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 6 - Forks: 0

ucladx/pancan-panel-design

A pan-cancer DNA-seq panel

Language: Python - Size: 3.31 MB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 6 - Forks: 0

tobiasrausch/covid19

SARS-CoV-2 analysis pipeline for short-read, paired-end illumina sequencing

Language: Python - Size: 324 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 6 - Forks: 1

elowy01/vcf_filtering

Filter a VCF to discard false positive variants

Language: Python - Size: 72.3 KB - Last synced: about 1 year ago - Pushed: over 2 years ago - Stars: 6 - Forks: 0

aprilweilab/picovcf

Single-header C++ library for fast/low-memory VCF (Variant Call Format) parsing.

Language: C++ - Size: 151 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 6 - Forks: 0

brwnj/freebayes-nf

a better freebayes-parallel

Language: Nextflow - Size: 21.5 KB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 6 - Forks: 1

CRG-CNAG/ngs2017ws-docs

NGS 2017 Post-Conference Workshop Documentation

Language: CSS - Size: 13 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 5 - Forks: 1

allytrope/variant-analysis

A Snakemake pipeline for variant calling of genomic FASTQ data using GATK

Language: Python - Size: 2.1 MB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 5 - Forks: 0

NCI-CGR/GEMSCAN

Joint variant calling with GATK4 HaplotypeCaller, Google DeepVariant 1.0.0 and Strelka2, coordinated via Snakemake.

Language: Python - Size: 17.4 MB - Last synced: 18 days ago - Pushed: 18 days ago - Stars: 5 - Forks: 7

tobiasrausch/rdxon

Reference-free FASTQ filter for rare germline and somatic variants

Language: C++ - Size: 89.8 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 5 - Forks: 0

rice-systems/lofreq-fpga

FPGA Acceleration for the LoFreq variant caller

Language: C - Size: 20.5 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 5 - Forks: 0