GitHub topics: binning
nf-core/mag
Assembly and binning of metagenomes
Language: Nextflow - Size: 34.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 237 - Forks: 126

seqan/raptor
A fast and space-efficient pre-filter for querying very large collections of nucleotide sequences.
Language: C++ - Size: 4.31 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 52 - Forks: 18

rhysnewell/aviary
A hybrid assembly and MAG recovery pipeline (and more!)
Language: Python - Size: 39 MB - Last synced at: about 17 hours ago - Pushed at: 1 day ago - Stars: 95 - Forks: 14

ShichenXie/scorecardpy
Scorecard Development in python, 评分卡
Language: Python - Size: 195 KB - Last synced at: 3 days ago - Pushed at: 9 months ago - Stars: 747 - Forks: 306

guillermo-navas-palencia/optbinning
Optimal binning: monotonic binning with constraints. Support batch & stream optimal binning. Scorecard modelling and counterfactual explanations.
Language: Python - Size: 10.4 MB - Last synced at: 11 days ago - Pushed at: 2 months ago - Stars: 471 - Forks: 105

ShichenXie/scorecard
Scorecard Development in R, 评分卡
Language: R - Size: 15.8 MB - Last synced at: 13 days ago - Pushed at: about 1 year ago - Stars: 164 - Forks: 62

mariuzka/simplebins
Simplebins makes it easy to bin numeric values into intervals.
Language: Python - Size: 21.5 KB - Last synced at: 1 day ago - Pushed at: 26 days ago - Stars: 0 - Forks: 0

metagentools/GraphBin2
☯️🧬 Refined and Overlapped Binning of Metagenomic Contigs Using Assembly Graphs
Language: Python - Size: 89.4 MB - Last synced at: 1 day ago - Pushed at: 2 months ago - Stars: 34 - Forks: 3

uel3/nf-UnO
metagenomics pipeline supporting The UnO Project for identifying pathogens in common across outbreak samples
Language: Nextflow - Size: 62.8 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

waikato-datamining/wai-bynning
Python library for binning data, generating cross-validation fold pairs and random splits from data.
Language: Python - Size: 77.1 KB - Last synced at: 22 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

carstenbauer/BinningAnalysis.jl
Statistical standard error estimation tools for correlated data
Language: Julia - Size: 7.22 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 30 - Forks: 8

CAMI-challenge/AMBER
AMBER: Assessment of Metagenome BinnERs
Language: Python - Size: 13.5 MB - Last synced at: 24 days ago - Pushed at: 4 months ago - Stars: 30 - Forks: 7

abeusher/timehash
An algorithm for creating user configurable, variable-precision sliding windows of time. Useful for binning time values in large collections of data.
Language: C# - Size: 1.23 MB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 42 - Forks: 14

oguzeroglu/Nearby
Find nearby 3D objects in constant time O(1).
Language: JavaScript - Size: 306 KB - Last synced at: 16 days ago - Pushed at: over 2 years ago - Stars: 107 - Forks: 4

KwanLab/Autometa
Autometa: Automated Extraction of Genomes from Shotgun Metagenomes
Language: Python - Size: 78.9 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 41 - Forks: 15

metagentools/GraphBin
✨🧬 Refined binning of metagenomic contigs using assembly graphs
Language: Python - Size: 54.3 MB - Last synced at: 24 days ago - Pushed at: 2 months ago - Stars: 91 - Forks: 7

Safwan2003/RandomForest_Heart_Disease_Prediction
A machine learning project using Random Forest Classifier to predict heart disease. Includes data preprocessing (with binning), feature selection, and model evaluation.
Language: Jupyter Notebook - Size: 4.86 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

UW-Madison-Bacteriology-Bioinformatics/binning_wf
Pipeline to bin metagenomes into high-quality metagenomic-assembled genomes using DAGman and HTCondor.
Language: Shell - Size: 136 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1 - Forks: 0

arpitnarechania/binguru
BinGuru is an open-source Typescript package to bin/classify data using 18 established binning methods, including a new method, resiliency.
Language: TypeScript - Size: 66.4 KB - Last synced at: 13 days ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 0

erikhuizinga/honeycomb
Tools for hexagonal binning (honeycomb plot) and visualisation.
Language: MATLAB - Size: 32.2 KB - Last synced at: about 1 month ago - Pushed at: about 5 years ago - Stars: 6 - Forks: 0

RobWiederstein/lifeExpectancy
Quarto dashboard on binning and coloring strategies for choropleth maps
Language: JavaScript - Size: 4.64 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Khushi130404/Binning-Binarization
This project demonstrates binning and binarization on the Titanic dataset, comparing results with and without numeric encoding. Visualizations highlight the transformations and their impact on survival analysis.
Language: Jupyter Notebook - Size: 108 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

ast0815/remu
ReMU - Response Matrix Utilities
Language: Python - Size: 114 MB - Last synced at: 17 days ago - Pushed at: about 2 months ago - Stars: 5 - Forks: 1

FelipeSE98/optimal-monotonic-woe-binning
Language: Jupyter Notebook - Size: 794 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

natashabatalha/PandExo
A Community Tool for Transiting Exoplanet Science with the JWST & HST
Language: Jupyter Notebook - Size: 494 MB - Last synced at: 13 days ago - Pushed at: 8 months ago - Stars: 38 - Forks: 40

anuradhawick/MetaBCC-LR
Reference-free Binning of Metagenomics Long Reads using Coverage and Composition
Language: Python - Size: 441 KB - Last synced at: 28 days ago - Pushed at: 8 months ago - Stars: 20 - Forks: 0

Yash22222/Data-Analysis-With-Python
This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.
Language: Jupyter Notebook - Size: 2.67 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

laxeye/pyYAMB
Express metagenome binning
Language: Python - Size: 65.4 KB - Last synced at: 15 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

wanghui5801/usmerge
A tool package for one-dimensional data clustering.
Language: Python - Size: 243 KB - Last synced at: 14 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

ECheynet/binAveraging
Averaging noisy data into bins
Language: MATLAB - Size: 151 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

dionhaefner/bayesian-histograms
Bayesian histograms for estimation of binary rare event rates, with fully automated bin pruning :bar_chart:
Language: Jupyter Notebook - Size: 256 KB - Last synced at: 24 days ago - Pushed at: over 3 years ago - Stars: 28 - Forks: 2

francescoalemanno/BayesHistogram.jl
pure Julia package for optimal histogram binning, based on piecewise constant model.
Language: Julia - Size: 670 KB - Last synced at: 1 day ago - Pushed at: over 2 years ago - Stars: 41 - Forks: 3

rsquaredacademy/rbin
Tools for binning data
Language: R - Size: 4.8 MB - Last synced at: 19 days ago - Pushed at: 6 months ago - Stars: 13 - Forks: 3

nunofonseca/msi
Language: Shell - Size: 5.82 MB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 5 - Forks: 2

jackieblaum/eclipsebin
EclipseBin is a Python package designed for binning light curves of eclipsing binary stars using a non-uniform binning scheme. The package focuses on better capturing the details of eclipses by allocating more bins within the eclipse regions, making it ideal for light curves with narrow eclipses.
Language: Python - Size: 182 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

qiyunlab/binarena
BinaRena: Interactive Visualization and Binning of Metagenomic Contigs
Language: JavaScript - Size: 9.23 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 27 - Forks: 6

ekenes/binning-experiments
Various apps exploring client-side binning in the ArcGIS JS API.
Language: HTML - Size: 658 KB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 5 - Forks: 1

aarryasutar/Credit_EDA
This project focuses on cleaning and analyzing a loan application dataset to gain insights into the factors influencing loan defaults. Through systematic data cleaning, visualization, and merging with previous application data, it provides a robust foundation for further predictive modeling.
Language: Jupyter Notebook - Size: 1.42 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

AmbreenMahhoor/What-Is-Binning-And-Binarization
Language: Jupyter Notebook - Size: 46.9 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

riddhigupta1110/DataWarehousingAndMining-V-MU-CSE
Codes for Practical experiments of Data Warehousing and Mining (Semester V - Computer Engineering - Mumbai University)
Language: Python - Size: 24.4 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

metagentools/MetaCoAG
🚦🧬 Binning Metagenomic Contigs via Composition, Coverage and Assembly Graphs
Language: Python - Size: 156 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 42 - Forks: 3

dyxstat/ImputeCC
ImputeCC enhances integrative Hi-C-based metagenomic binning through constrained random-walk-based imputation
Language: Python - Size: 4.9 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 1

LeonDlugosch/MetaSeq-Toolkit
This pipeline is intended to be a convenient way to work though large sets of metagenomic or metatranscriptiomic datasets while also retaining high analytical flexibility due to retained intermediate results that might be useful outside of the intended purpose.
Language: Shell - Size: 102 MB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 2

PMassicotte/l3bin
Support the NASA / GlobColour / CCI ISIN grid used for MODIS L3BIN satellite products.
Language: Rust - Size: 12.7 KB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

songweizhi/Binning_refiner
Improving genome bins through the combination of different binning programs
Language: Python - Size: 87.6 MB - Last synced at: about 12 hours ago - Pushed at: over 2 years ago - Stars: 31 - Forks: 4

BKR7E/DataBinner
Data auto-binning algorithm- takes x,y,z data and organizes it in x and y user prescribed bins, auto-interpolates data gaps, then plots z
Language: Python - Size: 112 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

mayankmtg/MassSpecPeakDetection
Peak Detection algorithm refine for testing the number of peaks based on different approaches
Language: Python - Size: 25.7 MB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

1feres1/pynmranalysis
A Python Toolbox for preprocessing and analysing NMR data
Language: Jupyter Notebook - Size: 7.18 MB - Last synced at: 6 days ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 1

marbl/binnacle
Binnacle: Using Scaffolds to Improve the Contiguity and Quality of Metagenomic Bins
Language: Python - Size: 3.26 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 9 - Forks: 2

Mateko/ScoreWise
Scorecard tools
Language: Python - Size: 7.81 KB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

DivyaKrishnani/Data-Preprocessing-with-Python
Implementation of Data Preprocessing techniques such as handling missing values, noise smoothing, PCA, etc.
Language: Jupyter Notebook - Size: 1.64 MB - Last synced at: 5 months ago - Pushed at: over 6 years ago - Stars: 8 - Forks: 11

pirovc/metameta
Language: Python - Size: 14.2 MB - Last synced at: 12 months ago - Pushed at: over 5 years ago - Stars: 23 - Forks: 10

claczny/VizBin
Repository of our application for human-augmented binning
Language: Java - Size: 214 MB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 27 - Forks: 14

Evan-Mucciolo/pandas-challenge
Mock exercise as Chief Data Scientist analyzing student standardized testing data for school board.
Language: Jupyter Notebook - Size: 471 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

fjebaker/Buckets.jl
🪣 Fast, parallel, low-allocation algorithms for binning numbers.
Language: Julia - Size: 38.1 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

Yash22222/MY-TP-PROJECTS
I am Uploading My Short simple projects on Python, Java, C++ & C Language in this library.
Language: Jupyter Notebook - Size: 7.93 MB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

danillo-alvarenga/zeuss
recogniZing gEnome seqUences in metagenomic aSSemblies
Language: Python - Size: 19.5 KB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 2 - Forks: 0

nidhikargathra/HeartCare
An intelligent system to predict probability of heart diseases by using classification algorithms. View README for details.
Language: Java - Size: 12.4 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 1

muhammadravi251001/credit-scoring
Code for classifying whether someone can repay their loan to a banking institution using a supervised learning approach: Binning and Logistic Regression.
Language: Python - Size: 9.77 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ellietoulabi/DataMiningCourseHomeworks
Solutions to data mining course homeworks.
Language: Jupyter Notebook - Size: 20.2 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

Emmanuel-R8/SMBinning
Scoring Modeling and Optimal Binning
Language: R - Size: 346 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

Amshra267/Cassandra-Udyam
Contains our Approach for the competition organized at Udyam'21
Language: Jupyter Notebook - Size: 3.83 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 12 - Forks: 2

darenr/optbinning Fork of guillermo-navas-palencia/optbinning
Optimal binning: monotonic binning with constraints
Size: 1.09 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

gipert/bayesian-blocks
Julia and C++ Implementations of the bayesian blocks algorithm https://arxiv.org/abs/1207.5578
Language: C++ - Size: 2.47 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 4

Hamim-Hussain/Analysing-School-District-Data-with-Pandas
My evaluation of the school district's performance included the examination of standardised test results in math and reading, school budgets, and grade point averages. The information was organised by school size and type, with an emphasis on district and charter schools, using pandas.
Language: Jupyter Notebook - Size: 864 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

guhjy/rbin Fork of rsquaredacademy/rbin
Tools for binning data
Language: R - Size: 1.15 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

Tscholten1/Map-of-U.S-Electric-Power-Generation
Map of U.S. Electric Power Generation by Fuel Source
Language: CSS - Size: 218 KB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

k-palffy/cytobins
An R package for simple, customizable binning of flow cytometric data
Language: R - Size: 4.29 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Arkadiy-Garber/SprayNPray
Rapid and simple taxonomic profiling of genome and metagenome contigs
Language: Python - Size: 205 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 25 - Forks: 4

martinnff/Binning
Binning method to allow performing point cloud classification tasks on low resources machines.
Language: C++ - Size: 671 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

envmetagen/metabinkit
Set of programs to perform taxonomic binning.
Language: R - Size: 6.38 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 1

tomas-o-dev/FeatureFilter
Quick Layered Correlation-based Feature Filtering
Language: Python - Size: 7.02 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

mhyfritz/bin-data
Partition data into given number of chunks and pick a representative value for each chunk.
Language: JavaScript - Size: 396 KB - Last synced at: 24 days ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

podondra/data-preprocessing
data preprocessing examples
Language: Jupyter Notebook - Size: 4.64 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 4 - Forks: 4

janukobytsch/big-data-notebook
Language: Jupyter Notebook - Size: 64.5 KB - Last synced at: about 2 years ago - Pushed at: over 9 years ago - Stars: 0 - Forks: 0

Keris/yasc
Yet Another Score Card
Language: Python - Size: 71.3 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 0

ggirelli/bioTrackBinner 📦
Binning biological data tracks and producing RDS containing data.tables
Language: R - Size: 45.9 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

lingumd/School_District_Analysis
Updated test score data and school district analysis using Python.
Language: Jupyter Notebook - Size: 784 KB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

bochen0909/hsds-upcxx Fork of Lizhen0909/SpaRC-MPI
hierarchical clustering of DNA sequence using upcxx
Language: C++ - Size: 1.73 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

SujalXplores/Mega-Calculator
All type of calculators like Cuboid (4D), Binning, Chi-square test, Red-black tree, Binary search tree, Longest Common Sub Sequence, Master Theorm, Heap Sort, Decision Theory at one place ✨
Language: HTML - Size: 592 KB - Last synced at: 6 days ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 0

Keris/creditscoring
Credit scoring toolkit with python
Language: Python - Size: 18.6 KB - Last synced at: 5 days ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 0

miwermi/school-district-analysis
School district performance analysis using Python, Pandas, +Jupyter Notebook.
Language: Jupyter Notebook - Size: 2.31 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

fungs/thesis-phd
PhD thesis: Computational Methods for Taxonomic Annotation and Genome Reconstruction in Metagenomics
Language: TeX - Size: 26.1 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

nani757/Binning-Discretization-_-Quantile-Binning-_-KMeans-Binning
these concepts are useful for converting numerical data to categorical
Language: Jupyter Notebook - Size: 73.2 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

MengChiehLiu/Entropy-Based-Binning
The function here is designed for binning continuous independent variables, in the way minimizing total entropy of corresponding response. Also this function can plot the change of the entropy in the process.
Language: Jupyter Notebook - Size: 184 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

brycehenson/fast_sorted_mask
fast masking of ordered vectors based on binary search
Language: MATLAB - Size: 1.26 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

Navadeeppasala/Data-Analysis-with-Python
Why data analysis? , How to understand the problem, what to do for data analysis, and how clean the data for building Machine Learning models
Language: Jupyter Notebook - Size: 201 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

QuentinLetourneur/Let-it-bin
Optimize workflow for binning metagenomic short reads from multiple samples
Language: Nextflow - Size: 193 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 5 - Forks: 2

Towhid1/Feature-Engineering-Basics
Feature engineering is the process of transforming raw data into features. Here are some basic ideas about feature engineering.
Language: Jupyter Notebook - Size: 66.4 MB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

ToniWestbrook/mitobin
Taxonomic classification and read binning of mitochondrial DNA
Language: Python - Size: 25.4 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

ScratchyCode/Photographic-data-binning
Photographic binning
Language: C - Size: 2.1 MB - Last synced at: 10 months ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

karanmitroo/stunning-chainsaw
A simplified algorithm to cluster mixed-type data(numerical and categorical).
Language: Python - Size: 521 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

claczny/busybee_web
Repository for BusyBee Web - Web-based deconvolution of metagenomic data by bootstrapped supervised binning
Size: 1.03 MB - Last synced at: about 2 months ago - Pushed at: about 8 years ago - Stars: 2 - Forks: 0
