An open API service providing repository metadata for many open source software ecosystems.

Topic: "genbank"

kblin/ncbi-genome-download

Scripts to download genomes from the NCBI FTP servers

Language: Python - Size: 347 KB - Last synced at: 3 days ago - Pushed at: 12 months ago - Stars: 1,023 - Forks: 177

bebop/poly

A Go package for engineering organisms.

Language: Go - Size: 11 MB - Last synced at: 3 days ago - Pushed at: 9 months ago - Stars: 709 - Forks: 71

Edinburgh-Genome-Foundry/DnaFeaturesViewer

👁 Python library to plot DNA sequence features (e.g. from Genbank files)

Language: Python - Size: 15.6 MB - Last synced at: 21 days ago - Pushed at: 23 days ago - Stars: 640 - Forks: 100

moshi4/pyGenomeViz

A genome visualization python package for comparative genomics

Language: Python - Size: 74.9 MB - Last synced at: 19 days ago - Pushed at: 2 months ago - Stars: 341 - Forks: 21

pydna-group/pydna

Clone with Python! Data structures for double stranded DNA & simulation of homologous recombination, Gibson assembly, cut & paste cloning.

Language: Python - Size: 57.4 MB - Last synced at: about 24 hours ago - Pushed at: 1 day ago - Stars: 182 - Forks: 47

pirovc/genome_updater

Bash script to download and update snapshots of files from the NCBI genomes repository (refseq/genbank), tracking changes and avoiding redundancy

Language: Shell - Size: 1.29 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 154 - Forks: 15

eead-csic-compbio/get_homologues

GET_HOMOLOGUES: a versatile software package for pan-genome analysis

Language: Perl - Size: 79.8 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 116 - Forks: 27

pirovc/ganon

ganon2 classifies genomic sequences against large sets of references efficiently, with integrated download and update of databases (refseq/genbank), taxonomic profiling (ncbi/gtdb), binning and hierarchical classification, customized reporting and more

Language: Python - Size: 24.2 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 96 - Forks: 12

mtisza1/Cenote-Taker2

Cenote-Taker2: Discover and Annotate Divergent Viral Contigs (Please use Cenote-Taker 3 instead)

Language: Shell - Size: 62.3 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 55 - Forks: 7

rcs333/VAPiD

VAPiD: Viral Annotation and Identification Pipeline

Language: Shell - Size: 18 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 52 - Forks: 15

nextstrain/mpox

Nextstrain build for mpox virus

Language: Python - Size: 33.3 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 49 - Forks: 22

nextstrain/ncov-ingest

A pipeline that ingests SARS-CoV-2 (i.e. nCoV) data from GISAID and Genbank, transforms it, stores it on S3, and triggers Nextstrain nCoV rebuilds.

Language: Python - Size: 402 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 37 - Forks: 20

CDCgov/seqsender

Automated Pipeline to Generate FTP Files and Manage Submission of Sequence Data to Public Repositories

Language: Python - Size: 261 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 35 - Forks: 14

maximilianh/multiSub

Prepares a SARS-CoV-2 submission for GISAID, NCBI or ENA. Can read GISAID or NCBI files, or plain fasta+tsv/csv/xls. Finds files in input directory and merges everything into a single output directory. Auto-detects input file formats. Can submit the results to multiple repositories from the command line.

Language: Python - Size: 1020 KB - Last synced at: 12 months ago - Pushed at: about 3 years ago - Stars: 35 - Forks: 2

jperkel/gb_read

An example GenBank file reader in Rust

Language: Rust - Size: 85 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 34 - Forks: 7

Koeng101/dnadesign

A Go package for designing DNA.

Language: Go - Size: 37.1 MB - Last synced at: 30 days ago - Pushed at: 30 days ago - Stars: 32 - Forks: 1

CDCgov/tostadas

🧬 💻 TOSTADAS → Toolkit for Open Sequence Triage, Annotation and DAtabase Submission

Language: Python - Size: 49.4 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 28 - Forks: 14

ropensci/restez

😴 📂 Create and Query a Local Copy of GenBank in R

Language: R - Size: 10 MB - Last synced at: 2 days ago - Pushed at: 5 months ago - Stars: 27 - Forks: 5

Edinburgh-Genome-Foundry/SnapGeneReader Fork of IsaacLuo/SnapGeneFileReader

👓 Python library to parse Snapgene *.dna files to dict or biopython seqrecord.

Language: Python - Size: 644 KB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 25 - Forks: 7

wpwupingwp/OGU

A toolbox for utilizing organelle genomic data

Language: Python - Size: 1.78 MB - Last synced at: 20 days ago - Pushed at: 28 days ago - Stars: 24 - Forks: 1

fsprojects/BioProviders

F# library for accessing and manipulating bioinformatic datasets.

Language: F# - Size: 251 MB - Last synced at: 17 days ago - Pushed at: over 1 year ago - Stars: 24 - Forks: 0

bebop/ark

Go REST API to replace Genbank, Uniprot, Rhea, and CHEMBL

Language: Go - Size: 17 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 23 - Forks: 6

ropensci/phylotaR

An automated pipeline for retrieving orthologous DNA sequences from GenBank in R

Language: R - Size: 15.3 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 23 - Forks: 8

moshi4/GBKviz 📦

Easy-to-use web application for visualization and comparison of genomes in Genbank file

Language: Python - Size: 5.25 MB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 21 - Forks: 4

dlesl/gb-io

A Rust library for parsing, writing and manipulating Genbank sequence files

Language: Rust - Size: 3.67 MB - Last synced at: 26 days ago - Pushed at: 3 months ago - Stars: 20 - Forks: 5

Lattice-Automation/seqparse

Parse sequence files (GenBank, FASTA, SnapGene, SBOL) and accession IDs (NCBI, iGEM) to a common format

Language: TypeScript - Size: 4.89 MB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 17 - Forks: 4

BioJulia/GenomicAnnotations.jl

Language: Julia - Size: 5.03 MB - Last synced at: 7 days ago - Pushed at: 5 months ago - Stars: 16 - Forks: 4

HobnobMancer/cazy_webscraper

Web scraper to retrieve protein data catalogued by the CAZy, UniProt, NCBI, GTDB and PDB websites/databases.

Language: Python - Size: 46.7 MB - Last synced at: 18 days ago - Pushed at: 7 months ago - Stars: 14 - Forks: 2

mtisza1/Cenote-Taker

DEPRECATED: Use Cenote-Taker 3 instead

Language: Shell - Size: 147 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 2

biosustain/goodbye-genbank

A Python package for Biopython that gives feature annotations from GenBank records a new and better life

Language: Python - Size: 284 KB - Last synced at: 6 days ago - Pushed at: about 9 years ago - Stars: 14 - Forks: 3

karubiotools/getSequenceInfo Fork of dcouvin/getSequenceInfo

Perl and Python scripts for retrieving sequence information from the GenBank, RefSeq or ENA sequence repositories

Language: Perl - Size: 14.7 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 12 - Forks: 4

TimothyStiles/worst-genbank-ever

The most awful genbank file you'll ever need to parse.

Size: 14.6 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 0

michellejlin/lava

LAVA: Longitudinal Analysis of Viral Alleles

Language: Python - Size: 53.5 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 8 - Forks: 7

Changwanseo/GenMine

GenBank Record downloader for taxonomists

Language: Python - Size: 446 KB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 7 - Forks: 0

bfosso/MetaShot

MetaShot (Metagenomics Shotgun) is a complete pipeline designed for the taxonomic classification of human microbiota members. In MetaShot, third-party tools and newly developed Python and Bash scripts are integrated to analyze paired-end (PE) Illumina sequences, offering an automated procedure covering all the analysis steps from raw data management to taxonomic profiling. It is designed to analyze both DNA-Seq and RNA-Seq data.

Language: Python - Size: 1.54 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 7 - Forks: 5

j-i-l/GenBankParser

Parser (unofficial) for ncbi GenBank data

Language: Python - Size: 208 KB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 7 - Forks: 3

Robaina/GenBankpy

Tools to download, parse and filter GenBank files

Language: Python - Size: 25 MB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

terrimporter/COI_NCBI_2018

This repository contains the scripts used to retrieve and analyze the data reported in Porter & Hajibabaei 2018 bioRxiv doi: https://doi.org/10.1101/353904

Language: Perl - Size: 46.9 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 7

moltinginstar/addgene-api

An unofficial API for Addgene, the open-source plasmid repository.

Language: Python - Size: 26.4 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 1

santiagosnchez/gb2fasta

Perl script to convert GenBank records to FASTA format

Language: Perl - Size: 17.6 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

SMRUCC/GCModeller.Core

GCModeller Individual Components: GCModeller base core assembly library on common biological database read and write I/O

Language: Visual Basic .NET - Size: 22.5 MB - Last synced at: 8 days ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 2

hansiu/MatPhylobi

A tool for automatic construction of a molecular data matrix for phylogenetic inference based on GenBank records

Language: Python - Size: 789 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 0

dmyersturnbull/bioio Fork of PharmGKB/genome-sequence-io

Micro-libraries for reading and writing genomic sequence data in various formats.

Language: Java - Size: 690 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

lehwark/GBSON

A new annotation file format based on JSON, containing all information stored in the GenBank format but with advantageous parsing and information structure properties.

Language: TypeScript - Size: 41 KB - Last synced at: 25 days ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

vbaliga/genbank_downloadR

🔬 Batch downloading of DNA or protein sequences from GenBank

Language: R - Size: 104 KB - Last synced at: about 2 months ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 0

lucaspalmeira/bioinfo

Guide to programs and tools for bioinformatics and computational chemistry

Size: 16.6 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

RowanDurrant/BankIt_Checker

R function that checks .fasta files are suitable for GenBank submission

Language: R - Size: 9.77 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

dnanto/ffbio

flat-file sequence/database utils

Language: Python - Size: 606 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

oleon12/alignTools

Easily download and manage GenBank data and alignments for phylogenetics

Language: R - Size: 747 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

blackrim/phlawd_db_maker

This just builds the NCBI database from GenBank

Language: C++ - Size: 844 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 2

Elliot-Chan-120/NCBI__NtDb_GenBank_Parser

Parses GenBank files from the NCBI nucleotide database using accession number and email address associated with NCBI account. Is capable of outputting a .txt file containing basic sequence information, source, CDS, and gene feature dictionaries as well as generate a linear gene map visualizing all of the aforementioned information.

Language: Python - Size: 25.4 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

mdcjansen/DBA

DNA barcoding analysis pipeline

Language: Python - Size: 16.7 MB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 1

gregyjames/GBtoTiny

Genbank to TinyDB.

Language: Python - Size: 498 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

brinkmanlab/MicrobeDB

Curated mirror of RefSeq Microbial Genomes. Available via CVMFS repository.

Language: Shell - Size: 71.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

abhijeetsingh1704/PROTEINcleaner

a python utility to clean PROTEIN sequences and headers

Language: Python - Size: 6.84 KB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

loalon/gbcrawler

GenBank complete parser

Language: Python - Size: 1.9 MB - Last synced at: about 1 month ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

ShawHahnLab/genbank-sub-20181109-dloop

GenBank Submission 2018/11/09

Language: Python - Size: 97.7 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

ainefairbrother/GenBank-parser

This is a parser written to extract data from a GenBank file and insert it into an SQL database. This is part of a project that I did for my MSc in Bioinformatics.

Language: Python - Size: 12.7 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

toddknutson/genbank_scrapper

GenBank Metadata Extraction Tool

Language: Python - Size: 1.43 MB - Last synced at: almost 2 years ago - Pushed at: about 8 years ago - Stars: 1 - Forks: 0

pseudogene/uniprime 📦

A workflow-based platform for improved Universal Primer design

Language: PHP - Size: 183 KB - Last synced at: 4 months ago - Pushed at: over 9 years ago - Stars: 1 - Forks: 1

spyisgooddawg/Genome

Genome is a platform for exploring and analyzing genetic data. Join us in advancing genomic research and collaboration! 🧬🌐

Size: 3.91 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

engkinandatama/NCBI-Sequence-Fetcher

NCBI Sequence Fetcher is a Python desktop app for downloading nucleotide sequences and extracting metadata from NCBI. It features an easy-to-use GUI, supports FASTA and GenBank formats, and helps researchers, students, and bioinformaticians efficiently collect DNA sequences and store metadata in Excel files.

Language: Python - Size: 26.4 KB - Last synced at: 2 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

ankushgpta2/tostadas Fork of CDCgov/tostadas

🧬 💻 TOSTADAS → Toolkit for Open Sequence Triage, Annotation and DAtabase Submission

Language: Python - Size: 1.36 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

masikol/con-hi

The program annotates low-coverage and high-coverage regions of sequences in fasta format

Language: Python - Size: 209 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

boopsboops/ancistr

Automatically make an Ancistrus phylogeny and identify the common bristlenose catfish

Language: R - Size: 143 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

tomek7667/biotech-js

Package developed at A&A Biotechnology for reading all kinds of biotechnology related files

Language: Gnuplot - Size: 46 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mansikath/Fasta-to-Genbank-Converter

A simple Python script using Biopython to convert FASTA files to GenBank format. The script prompts the user for input and output filenames, along with the molecule type (default is DNA). Ensure accurate and annotated GenBank files for your biological sequence data.

Language: Python - Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

bhagesh-codebeast/Bioinformatics

Language: Jupyter Notebook - Size: 1000 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

michalbukowski/fetch-genomes

Download genomes from NCBI GenBank FTP site

Language: Python - Size: 72.3 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

rainbowmycelium/ConSequences

R script for renaming GenBank sequences, filling in missing molecular marker data, and concatenating sequences

Language: R - Size: 24.4 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

FABallemand/ProjetAlgorithmesDuTexte

GenBank DNA files parser with graphic user interface

Language: Python - Size: 22.2 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 3

hunglin59638/makura

NCBI Genome downloader

Language: Python - Size: 143 KB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

victoria-r/BioPython

A collection of various biopython scripts.

Language: Python - Size: 3.76 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

plasmid-designer/genereader

A library to read, manipulate and write various genetic sequencing formats.

Language: Rust - Size: 32.2 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Aurorabili/BITools

some bioinformatics tools

Language: Python - Size: 3.91 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ZooPhy/zoophy-genbankfactory

GenBankFactory for GenBank Data Dumps/Normalization

Language: Java - Size: 446 KB - Last synced at: 10 days ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 1

abhijeetsingh1704/SubsetSeq

a python utility to subset multisequence file based on identifiers from external text file

Language: Python - Size: 34.2 KB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

ajodeh-juma/bixcop-2021-python

Simple to moderate python programming tasks with a focus in bioinformatics

Language: HTML - Size: 1.84 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

bielasilva/mock_ncbi_download

Randomly download genomes from NCBI RefSeq and Genbank

Language: Python - Size: 7.81 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

eparayno/SNPAnalyzer

Takes GenBank data (csv and fasta), aggregates set using common length, and identifies SNPs. Buffered nucleotide location with most frequent mutations will be BLASTed (Basic Local Alignment Search Tool) to return top organism match.

Language: Python - Size: 522 KB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

AntonelliLab/cavvy-tree

🐹 Building a tree of some small fluffy animals

Language: R - Size: 1.7 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 1

kblin/fungal-ui

A web UI for the fungal version of antiSMASH.

Language: JavaScript - Size: 540 KB - Last synced at: 4 months ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

ChristopherAyling/ScalaPromoterPrediciton

Predicting the promoters of genes that share common ancestors with E. coli

Language: Java - Size: 20.5 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

gibarsin/bioinformatics-tp1

Practical assignment for Introduction to Bioinformatics at ITBA

Language: Java - Size: 11.4 MB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

gregyjames/genemap

A tool to visualize the genomes of phages.

Language: HTML - Size: 1.95 KB - Last synced at: 17 days ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

glickmac/GRAB

Retrieve and create a custom BLAST database by taxonomic search

Language: Python - Size: 48.5 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

das2000sidd/All-Code-for-B-KUL-I0U30A

Python and Linux code for the class mentioned in the header at KU Leuven. It demonstrates some of my Python and Linux work relevant to bioinformatics.

Language: Python - Size: 3.34 MB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

drozdovapb/myBedGtfGffVcfTools

home-made scripts to manipulate sequence annotation file formats (gff / vcf / genbank)

Language: Python - Size: 2.89 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 1

VinLau/perlScripts

Some Perl scripts, some of them related to bioinformatics (see the README)

Language: Perl - Size: 55.7 KB - Last synced at: about 2 years ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 0

larralde/gb-io.py

A Python interface to gb-io, a fast GenBank parser and serializer written in Rust.

Last synced at: about 1 year ago - Stars: 0 - Forks: 0