GitHub topics: genbank
yashpandey007/csv-everything
🖼️ Convert images of tables or charts into downloadable CSV files effortlessly with the CSV Everything Chrome Extension using the OpenRouter API.
Language: HTML - Size: 1.3 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

CDCgov/tostadas
🧬 💻 TOSTADAS → Toolkit for Open Sequence Triage, Annotation and DAtabase Submission
Language: Python - Size: 48.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 28 - Forks: 15

nextstrain/mpox
Nextstrain build for mpox virus
Language: Python - Size: 30.6 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 50 - Forks: 23

moshi4/pyGenomeViz
A genome visualization python package for comparative genomics
Language: Python - Size: 75.1 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 357 - Forks: 21

CDCgov/seqsender
Automated Pipeline to Generate FTP Files and Manage Submission of Sequence Data to Public Repositories
Language: Python - Size: 261 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 38 - Forks: 14

pydna-group/pydna
Clone with Python! Data structures for double stranded DNA & simulation of homologous recombination, Gibson assembly, cut & paste cloning.
Language: Python - Size: 58.1 MB - Last synced at: 6 days ago - Pushed at: 12 days ago - Stars: 194 - Forks: 48

kblin/ncbi-genome-download
Scripts to download genomes from the NCBI FTP servers
Language: Python - Size: 359 KB - Last synced at: 12 days ago - Pushed at: about 1 month ago - Stars: 1,026 - Forks: 179

nextstrain/ncov-ingest
A pipeline that ingests SARS-CoV-2 (i.e. nCoV) data from GISAID and Genbank, transforms it, stores it on S3, and triggers Nextstrain nCoV rebuilds.
Language: Python - Size: 402 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 37 - Forks: 20

bebop/poly
A Go package for engineering organisms.
Language: Go - Size: 11 MB - Last synced at: 2 days ago - Pushed at: 11 months ago - Stars: 711 - Forks: 71

Edinburgh-Genome-Foundry/DnaFeaturesViewer
👁️ Python library to plot DNA sequence features (e.g. from Genbank files)
Language: Python - Size: 15.6 MB - Last synced at: 22 days ago - Pushed at: 2 months ago - Stars: 647 - Forks: 99

pirovc/genome_updater
Bash script to download/update snapshots of files from NCBI genomes repository (refseq/genbank) with track of changes and without redundancy
Language: Shell - Size: 1.29 MB - Last synced at: 2 days ago - Pushed at: 9 months ago - Stars: 160 - Forks: 15

wpwupingwp/OGU
A toolbox for utilizing organelle genomic data
Language: Python - Size: 1.78 MB - Last synced at: 12 days ago - Pushed at: about 1 month ago - Stars: 24 - Forks: 1

Changwanseo/GenMine
GenBank Record downloader for taxonomists
Language: Python - Size: 451 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 7 - Forks: 0

Edinburgh-Genome-Foundry/SnapGeneReader Fork of IsaacLuo/SnapGeneFileReader
👓 Python library to parse Snapgene *.dna files to dict or biopython seqrecord.
Language: Python - Size: 644 KB - Last synced at: 19 days ago - Pushed at: 4 months ago - Stars: 26 - Forks: 7

ropensci/phylotaR
An automated pipeline for retrieving orthologous DNA sequences from GenBank in R
Language: R - Size: 15.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 24 - Forks: 9

Robaina/GenBankpy
Tools to download, parse and filter GenBank files
Language: Python - Size: 25 MB - Last synced at: 8 days ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 1

dlesl/gb-io
A Rust library for parsing, writing and manipulating Genbank sequence files
Language: Rust - Size: 3.67 MB - Last synced at: 7 days ago - Pushed at: 5 months ago - Stars: 21 - Forks: 5

eead-csic-compbio/get_homologues
GET_HOMOLOGUES: a versatile software package for pan-genome analysis
Language: Perl - Size: 79.8 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 116 - Forks: 27

Lattice-Automation/seqparse
Parse sequence files (GenBank, FASTA, SnapGene, SBOL) and accession IDs (NCBI, iGEM) to a common format
Language: TypeScript - Size: 4.89 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 17 - Forks: 5

Koeng101/dnadesign
A Go package for designing DNA.
Language: Go - Size: 37.1 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 32 - Forks: 1

moltinginstar/addgene-api
An unofficial API for Addgene, the open-source plasmid repository.
Language: Python - Size: 26.4 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 1

engkinandatama/NCBI-Sequence-Fetcher
NCBI Sequence Fetcher is a Python desktop app for downloading nucleotide sequences and extracting metadata from NCBI. It features an easy-to-use GUI, supports FASTA and GenBank formats, and helps researchers, students, and bioinformaticians efficiently collect DNA sequences and store metadata in Excel files.
Language: Python - Size: 26.4 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

ropensci/restez
😴 📂 Create and Query a Local Copy of GenBank in R
Language: R - Size: 10 MB - Last synced at: 3 days ago - Pushed at: 6 months ago - Stars: 27 - Forks: 5

moshi4/GBKviz 📦
Easy-to-use web application for visualization and comparison of genomes in Genbank file
Language: Python - Size: 5.25 MB - Last synced at: 14 days ago - Pushed at: over 3 years ago - Stars: 21 - Forks: 4

pirovc/ganon
ganon2 classifies genomic sequences against large sets of references efficiently, with integrated download and update of databases (refseq/genbank), taxonomic profiling (ncbi/gtdb), binning and hierarchical classification, customized reporting and more
Language: Python - Size: 24.2 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 96 - Forks: 12

karubiotools/getSequenceInfo Fork of dcouvin/getSequenceInfo
Perl and Python scripts for getting sequence information from GenBank, RefSeq or ENA sequence repositories
Language: Perl - Size: 14.7 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 12 - Forks: 4

Elliot-Chan-120/NCBI__NtDb_GenBank_Parser
Parses GenBank files from the NCBI nucleotide database using an accession number and the email address associated with an NCBI account. It can output a .txt file containing basic sequence information and source, CDS, and gene feature dictionaries, and can generate a linear gene map visualizing all of that information.
Language: Python - Size: 25.4 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

rcs333/VAPiD
VAPiD: Viral Annotation and Identification Pipeline
Language: Shell - Size: 18 MB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 52 - Forks: 15

BioJulia/GenomicAnnotations.jl
Language: Julia - Size: 5.03 MB - Last synced at: 13 days ago - Pushed at: 6 months ago - Stars: 16 - Forks: 4

lehwark/GBSON
A new annotation file format based on JSON, containing all information stored in the GenBank format but with advantageous parsing and information structure properties.
Language: TypeScript - Size: 41 KB - Last synced at: 6 days ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 0

HobnobMancer/cazy_webscraper
Web scraper to retrieve protein data catalogued by the CAZy, UniProt, NCBI, GTDB and PDB websites/databases.
Language: Python - Size: 46.8 MB - Last synced at: 2 days ago - Pushed at: 12 days ago - Stars: 14 - Forks: 2

fsprojects/BioProviders
F# library for accessing and manipulating bioinformatic datasets.
Language: F# - Size: 251 MB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 24 - Forks: 0

lucaspalmeira/bioinfo
Guide to bioinformatics and computational chemistry programs and tools
Size: 16.6 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

ankushgpta2/tostadas Fork of CDCgov/tostadas
🧬 💻 TOSTADAS → Toolkit for Open Sequence Triage, Annotation and DAtabase Submission
Language: Python - Size: 1.36 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

masikol/con-hi
The program annotates low-coverage and high-coverage regions of sequences in fasta format
Language: Python - Size: 209 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

mtisza1/Cenote-Taker2
Cenote-Taker2: Discover and Annotate Divergent Viral Contigs (Please use Cenote-Taker 3 instead)
Language: Shell - Size: 62.3 MB - Last synced at: 6 months ago - Pushed at: over 1 year ago - Stars: 55 - Forks: 7

pseudogene/uniprime 📦
A workflow-based platform for improved Universal Primer design
Language: PHP - Size: 183 KB - Last synced at: 26 days ago - Pushed at: over 9 years ago - Stars: 1 - Forks: 1

bebop/ark
Go REST API to replace Genbank, Uniprot, Rhea, and CHEMBL
Language: Go - Size: 17 MB - Last synced at: about 2 hours ago - Pushed at: over 1 year ago - Stars: 23 - Forks: 6

mdcjansen/DBA
DNA barcoding analysis pipeline
Language: Python - Size: 16.7 MB - Last synced at: 6 months ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 1

boopsboops/ancistr
Automatically make an Ancistrus phylogeny and identify the common bristlenose catfish
Language: R - Size: 143 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

j-i-l/GenBankParser
Parser (unofficial) for ncbi GenBank data
Language: Python - Size: 208 KB - Last synced at: 16 days ago - Pushed at: almost 6 years ago - Stars: 7 - Forks: 3

maximilianh/multiSub
Prepares a SARS-CoV-2 submission for GISAID, NCBI or ENA. Can read GISAID or NCBI files, or plain fasta+tsv/csv/xls. Finds files in input directory and merges everything into a single output directory. Auto-detects input file formats. Can submit the results to multiple repositories from the command line.
Language: Python - Size: 1020 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 35 - Forks: 2

plasmid-designer/genereader
A library to read, manipulate and write various genetic sequencing formats.
Language: Rust - Size: 32.2 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

AntonelliLab/cavvy-tree
🐹 Building a tree of some small fluffy animals
Language: R - Size: 1.7 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 1

FABallemand/ProjetAlgorithmesDuTexte
GenBank DNA files parser with graphic user interface
Language: Python - Size: 22.2 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 3

dmyersturnbull/bioio Fork of PharmGKB/genome-sequence-io
Micro-libraries for reading and writing genomic sequence data in various formats.
Language: Java - Size: 690 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

mtisza1/Cenote-Taker
DEPRECATED: Use Cenote-Taker 3 instead
Language: Shell - Size: 147 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 2

TimothyStiles/worst-genbank-ever
The most awful genbank file you'll ever need to parse.
Size: 14.6 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 0

mansikath/Fasta-to-Genbank-Converter
A simple Python script using Biopython to convert FASTA files to GenBank format. The script prompts the user for input and output filenames, along with the molecule type (default is DNA). Ensure accurate and annotated GenBank files for your biological sequence data.
Language: Python - Size: 5.86 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

drozdovapb/myBedGtfGffVcfTools
home-made scripts to manipulate sequence annotation file formats (gff / vcf / genbank)
Language: Python - Size: 2.89 MB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 1

bhagesh-codebeast/Bioinformatics
Language: Jupyter Notebook - Size: 1000 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

tomek7667/biotech-js
Package developed at A&A Biotechnology for reading all kinds of biotechnology related files
Language: Gnuplot - Size: 46 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

hansiu/MatPhylobi
a tool for automatic construction of molecular data matrix for phylogenetic inference based on GenBank records
Language: Python - Size: 789 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 0

brinkmanlab/MicrobeDB
Curated mirror of RefSeq Microbial Genomes. Available via CVMFS repository.
Language: Shell - Size: 71.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

bfosso/MetaShot
MetaShot (Metagenomics Shotgun) is a complete pipeline designed for the taxonomic classification of the human microbiota members. In MetaShot, third-party tools and newly developed Python and Bash scripts are integrated to analyze paired-end (PE) Illumina sequences, offering an automated procedure covering all the analysis steps from raw data management to taxonomic profiling. It is designed to analyze both DNA-Seq and RNA-Seq data.
Language: Python - Size: 1.54 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 7 - Forks: 5

santiagosnchez/gb2fasta
Perl script to convert GenBank records to FASTA format
Language: Perl - Size: 17.6 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 0

eparayno/SNPAnalyzer
Takes GenBank data (csv and fasta), aggregates set using common length, and identifies SNPs. Buffered nucleotide location with most frequent mutations will be BLASTed (Basic Local Alignment Search Tool) to return top organism match.
Language: Python - Size: 522 KB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

toddknutson/genbank_scrapper
GenBank Metadata Extraction Tool
Language: Python - Size: 1.43 MB - Last synced at: about 2 years ago - Pushed at: over 8 years ago - Stars: 1 - Forks: 0

das2000sidd/All-Code-for-B-KUL-I0U30A
These are Python and Linux scripts for the class mentioned in the header at KU Leuven. They demonstrate some of my Python and Linux work relevant to bioinformatics.
Language: Python - Size: 3.34 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

dnanto/ffbio
flat-file sequence/database utils
Language: Python - Size: 606 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

rainbowmycelium/ConSequences
R script for renaming GenBank sequences, filling in missing molecular marker data, and concatenating sequences
Language: R - Size: 24.4 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

RowanDurrant/BankIt_Checker
R function that checks .fasta files are suitable for GenBank submission
Language: R - Size: 9.77 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

victoria-r/BioPython
A collection of various biopython scripts.
Language: Python - Size: 3.76 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

jperkel/gb_read
An example GenBank file reader in Rust
Language: Rust - Size: 85 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 34 - Forks: 7

michalbukowski/fetch-genomes
Download genomes from NCBI GenBank FTP site
Language: Python - Size: 72.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

hunglin59638/makura
NCBI Genome downloader
Language: Python - Size: 143 KB - Last synced at: 16 days ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Aurorabili/BITools
some bioinformatics tools
Language: Python - Size: 3.91 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

oleon12/alignTools
Easily download and manage GenBank data and alignments for phylogenetics
Language: R - Size: 747 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

abhijeetsingh1704/PROTEINcleaner
a python utility to clean PROTEIN sequences and headers
Language: Python - Size: 6.84 KB - Last synced at: 6 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

ShawHahnLab/genbank-sub-20181109-dloop
GenBank Submission 2018/11/09
Language: Python - Size: 97.7 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

michellejlin/lava
LAVA: Longitudinal Analysis of Viral Alleles
Language: Python - Size: 53.5 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 8 - Forks: 7

abhijeetsingh1704/SubsetSeq
A Python utility to subset a multisequence file based on identifiers from an external text file
Language: Python - Size: 34.2 KB - Last synced at: 6 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

ZooPhy/zoophy-genbankfactory
GenBankFactory for GenBank Data Dumps/Normalization
Language: Java - Size: 446 KB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 1

biosustain/goodbye-genbank
A Python package for Biopython that gives feature annotations from GenBank records a new and better life
Language: Python - Size: 284 KB - Last synced at: 15 days ago - Pushed at: over 9 years ago - Stars: 14 - Forks: 3

gregyjames/GBtoTiny
Genbank to TinyDB.
Language: Python - Size: 498 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

ajodeh-juma/bixcop-2021-python
Simple to moderate python programming tasks with a focus in bioinformatics
Language: HTML - Size: 1.84 MB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

terrimporter/COI_NCBI_2018
This repository contains the scripts used to retrieve and analyze the data reported in Porter & Hajibabaei 2018 bioRxiv doi: https://doi.org/10.1101/353904
Language: Perl - Size: 46.9 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 7

blackrim/phlawd_db_maker
This just builds the NCBI database from GenBank
Language: C++ - Size: 844 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 2

vbaliga/genbank_downloadR
🔬 Batch downloading of DNA or protein sequences from GenBank
Language: R - Size: 104 KB - Last synced at: 3 months ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 0

bielasilva/mock_ncbi_download
Randomly download genomes from NCBI RefSeq and Genbank
Language: Python - Size: 7.81 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

SMRUCC/GCModeller.Core
GCModeller Individual Components: GCModeller base core assembly library on common biological database read and write I/O
Language: Visual Basic .NET - Size: 22.5 MB - Last synced at: 13 days ago - Pushed at: about 5 years ago - Stars: 4 - Forks: 2

ainefairbrother/GenBank-parser
This is a parser written to extract data from a GenBank file and insert it into an SQL database. This is part of a project that I did for my MSc in Bioinformatics.
Language: Python - Size: 12.7 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

loalon/gbcrawler
GenBank complete parser
Language: Python - Size: 1.9 MB - Last synced at: 6 days ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

gibarsin/bioinformatics-tp1
Practical assignment for Introduction to Bioinformatics at ITBA
Language: Java - Size: 11.4 MB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

kblin/fungal-ui
A web UI for the fungal version of antiSMASH.
Language: JavaScript - Size: 540 KB - Last synced at: 5 months ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

ChristopherAyling/ScalaPromoterPrediciton
Predicting the promoters of genes that share common ancestors with E. coli
Language: Java - Size: 20.5 MB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

gregyjames/genemap
A tool to visualize the genomes of phages.
Language: HTML - Size: 1.95 KB - Last synced at: 2 months ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

glickmac/GRAB
Retrieve and create a custom BLAST database by taxonomic search
Language: Python - Size: 48.5 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

VinLau/perlScripts
Some Perl scripts (some related to bioinformatics; see README)
Language: Perl - Size: 55.7 KB - Last synced at: over 2 years ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 0
