An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: genbank

yashpandey007/csv-everything

🖼️ Convert images of tables or charts into downloadable CSV files effortlessly with the CSV Everything Chrome Extension using the OpenRouter API.

Language: HTML - Size: 1.3 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

CDCgov/tostadas

🧬 💻 TOSTADAS → Toolkit for Open Sequence Triage, Annotation and DAtabase Submission

Language: Python - Size: 48.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 28 - Forks: 15

nextstrain/mpox

Nextstrain build for mpox virus

Language: Python - Size: 30.6 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 50 - Forks: 23

moshi4/pyGenomeViz

A genome visualization python package for comparative genomics

Language: Python - Size: 75.1 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 357 - Forks: 21

CDCgov/seqsender

Automated Pipeline to Generate FTP Files and Manage Submission of Sequence Data to Public Repositories

Language: Python - Size: 261 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 38 - Forks: 14

pydna-group/pydna

Clone with Python! Data structures for double stranded DNA & simulation of homologous recombination, Gibson assembly, cut & paste cloning.

Language: Python - Size: 58.1 MB - Last synced at: 6 days ago - Pushed at: 12 days ago - Stars: 194 - Forks: 48

kblin/ncbi-genome-download

Scripts to download genomes from the NCBI FTP servers

Language: Python - Size: 359 KB - Last synced at: 12 days ago - Pushed at: about 1 month ago - Stars: 1,026 - Forks: 179

nextstrain/ncov-ingest

A pipeline that ingests SARS-CoV-2 (i.e. nCoV) data from GISAID and Genbank, transforms it, stores it on S3, and triggers Nextstrain nCoV rebuilds.

Language: Python - Size: 402 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 37 - Forks: 20

bebop/poly

A Go package for engineering organisms.

Language: Go - Size: 11 MB - Last synced at: 2 days ago - Pushed at: 11 months ago - Stars: 711 - Forks: 71

Edinburgh-Genome-Foundry/DnaFeaturesViewer

:eye: Python library to plot DNA sequence features (e.g. from Genbank files)

Language: Python - Size: 15.6 MB - Last synced at: 22 days ago - Pushed at: 2 months ago - Stars: 647 - Forks: 99

pirovc/genome_updater

Bash script to download/update snapshots of files from NCBI genomes repository (refseq/genbank) with track of changes and without redundancy

Language: Shell - Size: 1.29 MB - Last synced at: 2 days ago - Pushed at: 9 months ago - Stars: 160 - Forks: 15

wpwupingwp/OGU

a toolbox for utilize organelle genomic data

Language: Python - Size: 1.78 MB - Last synced at: 12 days ago - Pushed at: about 1 month ago - Stars: 24 - Forks: 1

Changwanseo/GenMine

GenBank Record downloader for taxonomists

Language: Python - Size: 451 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 7 - Forks: 0

Edinburgh-Genome-Foundry/SnapGeneReader Fork of IsaacLuo/SnapGeneFileReader

👓 Python library to parse Snapgene *.dna files to dict or biopython seqrecord.

Language: Python - Size: 644 KB - Last synced at: 19 days ago - Pushed at: 4 months ago - Stars: 26 - Forks: 7

ropensci/phylotaR

An automated pipeline for retrieving orthologous DNA sequences from GenBank in R

Language: R - Size: 15.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 24 - Forks: 9

Robaina/GenBankpy

Tools to download, parse and filter GenBank files

Language: Python - Size: 25 MB - Last synced at: 8 days ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 1

dlesl/gb-io

A Rust library for parsing, writing and manipulating Genbank sequence files

Language: Rust - Size: 3.67 MB - Last synced at: 7 days ago - Pushed at: 5 months ago - Stars: 21 - Forks: 5

eead-csic-compbio/get_homologues

GET_HOMOLOGUES: a versatile software package for pan-genome analysis

Language: Perl - Size: 79.8 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 116 - Forks: 27

Lattice-Automation/seqparse

Parse sequence files (GenBank, FASTA, SnapGene, SBOL) and accession IDs (NCBI, iGEM) to a common format

Language: TypeScript - Size: 4.89 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 17 - Forks: 5

Koeng101/dnadesign

A Go package for designing DNA.

Language: Go - Size: 37.1 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 32 - Forks: 1

moltinginstar/addgene-api

An unofficial API for Addgene, the open-source plasmid repository.

Language: Python - Size: 26.4 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 1

engkinandatama/NCBI-Sequence-Fetcher

NCBI Sequence Fetcher is a Python desktop app for downloading nucleotide sequences and extracting metadata from NCBI. It features an easy-to-use GUI, supports FASTA and GenBank formats, and helps researchers students and bioinformaticians efficiently collect DNA sequences and store metadata in Excel files.

Language: Python - Size: 26.4 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

ropensci/restez

:sleeping: :open_file_folder: Create and Query a Local Copy of GenBank in R

Language: R - Size: 10 MB - Last synced at: 3 days ago - Pushed at: 6 months ago - Stars: 27 - Forks: 5

moshi4/GBKviz 📦

Easy-to-use web application for visualization and comparison of genomes in Genbank file

Language: Python - Size: 5.25 MB - Last synced at: 14 days ago - Pushed at: over 3 years ago - Stars: 21 - Forks: 4

pirovc/ganon

ganon2 classifies genomic sequences against large sets of references efficiently, with integrated download and update of databases (refseq/genbank), taxonomic profiling (ncbi/gtdb), binning and hierarchical classification, customized reporting and more

Language: Python - Size: 24.2 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 96 - Forks: 12

karubiotools/getSequenceInfo Fork of dcouvin/getSequenceInfo

Perl and Python scripts allowing to get sequence information from GenBank, RefSeq or ENA sequence repositories

Language: Perl - Size: 14.7 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 12 - Forks: 4

Elliot-Chan-120/NCBI__NtDb_GenBank_Parser

Parses GenBank files from the NCBI nucleotide database using accession number and email address associated with NCBI account. Is capable of outputting a .txt file containing basic sequence information, source, CDS, and gene feature dictionaries as well as generate a linear gene map visualizing all of the aforementioned information.

Language: Python - Size: 25.4 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

rcs333/VAPiD

VAPiD: Viral Annotation and Identification Pipeline

Language: Shell - Size: 18 MB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 52 - Forks: 15

BioJulia/GenomicAnnotations.jl

Language: Julia - Size: 5.03 MB - Last synced at: 13 days ago - Pushed at: 6 months ago - Stars: 16 - Forks: 4

lehwark/GBSON

A new annotation file format based on JSON, containing all information stored in the GenBank format but with advantageoius parsing and information structure properties.

Language: TypeScript - Size: 41 KB - Last synced at: 6 days ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 0

HobnobMancer/cazy_webscraper

Web scraper to retrieve protein data catalogued by the CAZy, UniProt, NCBI, GTDB and PDB websites/databases.

Language: Python - Size: 46.8 MB - Last synced at: 2 days ago - Pushed at: 12 days ago - Stars: 14 - Forks: 2

fsprojects/BioProviders

F# library for accessing and manipulating bioinformatic datasets.

Language: F# - Size: 251 MB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 24 - Forks: 0

lucaspalmeira/bioinfo

Guia de programas e ferramentas de bioinformática e química computacional

Size: 16.6 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

ankushgpta2/tostadas Fork of CDCgov/tostadas

🧬 💻 TOSTADAS → Toolkit for Open Sequence Triage, Annotation and DAtabase Submission

Language: Python - Size: 1.36 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

masikol/con-hi

The program annotates low-coverage and high-coverage regions of sequences in fasta format

Language: Python - Size: 209 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

mtisza1/Cenote-Taker2

Cenote-Taker2: Discover and Annotate Divergent Viral Contigs (Please use Cenote-Taker 3 instead)

Language: Shell - Size: 62.3 MB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 55 - Forks: 7

pseudogene/uniprime 📦

A workflow-based platform for improved Universal Primer design

Language: PHP - Size: 183 KB - Last synced at: 26 days ago - Pushed at: over 9 years ago - Stars: 1 - Forks: 1

bebop/ark

Go REST API to replace Genbank, Uniprot, Rhea, and CHEMBL

Language: Go - Size: 17 MB - Last synced at: about 2 hours ago - Pushed at: over 1 year ago - Stars: 23 - Forks: 6

mdcjansen/DBA

DNA barcoding analysis pipeline

Language: Python - Size: 16.7 MB - Last synced at: 6 months ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 1

boopsboops/ancistr

Automatically make an Ancistrus phylogeny and identify the common bristlenose catfish

Language: R - Size: 143 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

j-i-l/GenBankParser

Parser (unofficial) for ncbi GenBank data

Language: Python - Size: 208 KB - Last synced at: 16 days ago - Pushed at: almost 6 years ago - Stars: 7 - Forks: 3

maximilianh/multiSub

Prepares a SARS-CoV-2 submission for GISAID, NCBI or ENA. Can read GISAID or NCBI files, or plain fasta+tsv/csv/xls. Finds files in input directory and merges everything into a single output directory. Auto-detects input file formats. Can submit the results to multiple repositories from the command line.

Language: Python - Size: 1020 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 35 - Forks: 2

plasmid-designer/genereader

A library to read, manipulate and write various genetic sequencing formats.

Language: Rust - Size: 32.2 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

AntonelliLab/cavvy-tree

:hamster: Building a tree of some small fluffy animals

Language: R - Size: 1.7 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 1

FABallemand/ProjetAlgorithmesDuTexte

GenBank DNA files parser with graphic user interface

Language: Python - Size: 22.2 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 3

dmyersturnbull/bioio Fork of PharmGKB/genome-sequence-io

Micro-libraries for reading and writing genomic sequence data in various formats.

Language: Java - Size: 690 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

mtisza1/Cenote-Taker

DEPRECATED: Use Cenote-Taker 3 instead

Language: Shell - Size: 147 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 2

TimothyStiles/worst-genbank-ever

The most awful genbank file you'll ever need to parse.

Size: 14.6 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 0

mansikath/Fasta-to-Genbank-Converter

A simple Python script using Biopython to convert FASTA files to GenBank format. The script prompts the user for input and output filenames, along with the molecule type (default is DNA). Ensure accurate and annotated GenBank files for your biological sequence data.

Language: Python - Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

drozdovapb/myBedGtfGffVcfTools

home-made scripts to manipulate sequence annotation file formats (gff / vcf / genbank)

Language: Python - Size: 2.89 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 1

bhagesh-codebeast/Bioinformatics

Language: Jupyter Notebook - Size: 1000 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

tomek7667/biotech-js

Package developed at A&A Biotechnology for reading all kinds of biotechnology related files

Language: Gnuplot - Size: 46 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

hansiu/MatPhylobi

a tool for automatic construction of molecular data matrix for phylogenetic inference based on GenBank records

Language: Python - Size: 789 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 0

brinkmanlab/MicrobeDB

Curated mirror of RefSeq Microbial Genomes. Available via CVMFS repository.

Language: Shell - Size: 71.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

bfosso/MetaShot

MetaShot (Metagenomics Shotgun) is a complete pipeline designed for the taxonomic classification of the human microbiota members. In MetaShot, third party tools and new developed Python and Bash scripts are integrated to analyze paired-end (PE) Illumina sequences, offering an automated procedure covering all the analysis steps from raw data management to taxonomic profiling. It is designed to analyze both DNA-Seq and RNA-Seq data.

Language: Python - Size: 1.54 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 7 - Forks: 5

santiagosnchez/gb2fasta

Perl script to convert GenBank records to FASTA format

Language: Perl - Size: 17.6 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 0

eparayno/SNPAnalyzer

Takes GenBank data (csv and fasta), aggregates set using common length, and identifies SNPs. Buffered nucleotide location with most frequent mutations will be BLASTed (Basic Local Alignment Search Tool) to return top organism match.

Language: Python - Size: 522 KB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

toddknutson/genbank_scrapper

GenBank Metadata Extraction Tool

Language: Python - Size: 1.43 MB - Last synced at: about 2 years ago - Pushed at: over 8 years ago - Stars: 1 - Forks: 0

das2000sidd/All-Code-for-B-KUL-I0U30A

These are python and linux codes for the class mentioned in the header at KU Leuven. It demonstrates some of my python and linux work relevant to bioinformatics.

Language: Python - Size: 3.34 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

dnanto/ffbio

flat-file sequence/database utils

Language: Python - Size: 606 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

rainbowmycelium/ConSequences

R script for GenBank sequences names changing, filling-in missing molecular markers data and sequences concatenation

Language: R - Size: 24.4 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

RowanDurrant/BankIt_Checker

R function that checks .fasta files are suitable for GenBank submission

Language: R - Size: 9.77 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

victoria-r/BioPython

A collection of various biopython scripts.

Language: Python - Size: 3.76 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

jperkel/gb_read

An example GenBank file reader in Rust

Language: Rust - Size: 85 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 34 - Forks: 7

michalbukowski/fetch-genomes

Download genomes from NCBI GenBank FTP site

Language: Python - Size: 72.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

hunglin59638/makura

NCBI Genome downloader

Language: Python - Size: 143 KB - Last synced at: 16 days ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Aurorabili/BITools

some bioinformatics tools

Language: Python - Size: 3.91 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

oleon12/alignTools

Easy download and manage GenBank data, and alignments for phylogenetics

Language: R - Size: 747 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

abhijeetsingh1704/PROTEINcleaner

a python utility to clean PROTEIN sequences and headers

Language: Python - Size: 6.84 KB - Last synced at: 6 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

ShawHahnLab/genbank-sub-20181109-dloop

GenBank Submission 2018/11/09

Language: Python - Size: 97.7 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

michellejlin/lava

LAVA: Longitudinal Analysis of Viral Alleles

Language: Python - Size: 53.5 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 8 - Forks: 7

abhijeetsingh1704/SubsetSeq

a python utility to subset multisequence file based on identifiers from external text file

Language: Python - Size: 34.2 KB - Last synced at: 6 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

ZooPhy/zoophy-genbankfactory

GenBankFactory for GenBank Data Dumps/Normalization

Language: Java - Size: 446 KB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 1

biosustain/goodbye-genbank

A Python package for Biopython that gives feature annotations from GenBank records a new and better life

Language: Python - Size: 284 KB - Last synced at: 15 days ago - Pushed at: over 9 years ago - Stars: 14 - Forks: 3

gregyjames/GBtoTiny

Genbank to TinyDB.

Language: Python - Size: 498 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

ajodeh-juma/bixcop-2021-python

Simple to moderate python programming tasks with a focus in bioinformatics

Language: HTML - Size: 1.84 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

terrimporter/COI_NCBI_2018

This repository contains the scripts used to retrieve and analyze the data reported in Porter & Hajibabaei 2018 bioRxiv doi: https://doi.org/10.1101/353904

Language: Perl - Size: 46.9 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 7

blackrim/phlawd_db_maker

this will just get the ncbi db from genbank made

Language: C++ - Size: 844 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 2

vbaliga/genbank_downloadR

🔬 Batch downloading of DNA or protein sequences from GenBank

Language: R - Size: 104 KB - Last synced at: 3 months ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 0

bielasilva/mock_ncbi_download

Randomly download genomes from NCBI RefSeq and Genbank

Language: Python - Size: 7.81 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

SMRUCC/GCModeller.Core

GCModeller Individual Components: GCModeller base core assembly library on common biological database read and write I/O

Language: Visual Basic .NET - Size: 22.5 MB - Last synced at: 13 days ago - Pushed at: about 5 years ago - Stars: 4 - Forks: 2

ainefairbrother/GenBank-parser

This is a parser written to extract data from a GenBank file and insert it into an SQL database. This is part of a project that I did for my MSc in Bioinformatics.

Language: Python - Size: 12.7 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

loalon/gbcrawler

GenBank complete parser

Language: Python - Size: 1.9 MB - Last synced at: 6 days ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

gibarsin/bioinformatics-tp1

Trabajo Práctico para Introducción a Bioinformática en ITBA

Language: Java - Size: 11.4 MB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

kblin/fungal-ui

A web UI for the fungal version of antiSMASH.

Language: JavaScript - Size: 540 KB - Last synced at: 5 months ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

ChristopherAyling/ScalaPromoterPrediciton

Predicting the promoters of Genes which share common ancestors with E. Coli

Language: Java - Size: 20.5 MB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

gregyjames/genemap

A tool to visualize the genomes of phages.

Language: HTML - Size: 1.95 KB - Last synced at: 2 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

glickmac/GRAB

Retrieve and create a custom BLAST database by taxonomic search

Language: Python - Size: 48.5 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

VinLau/perlScripts

Some Perl scripts (with some related to bioinformatics. see READ ME)

Language: Perl - Size: 55.7 KB - Last synced at: over 2 years ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 0