An open API service providing repository metadata for many open source software ecosystems.

Topic: "preprocessing"

Unstructured-IO/unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Language: HTML - Size: 192 MB - Last synced at: 2 days ago - Pushed at: 13 days ago - Stars: 10,915 - Forks: 907

dongrixinyu/JioNLP

A Chinese NLP preprocessing and parsing toolkit: accurate, efficient, and easy to use. www.jionlp.com

Language: Python - Size: 158 MB - Last synced at: 13 days ago - Pushed at: 20 days ago - Stars: 3,560 - Forks: 423

nidhaloff/igel

a delightful machine learning tool that allows you to train, test, and use models without writing code

Language: Python - Size: 18.8 MB - Last synced at: 7 days ago - Pushed at: about 2 years ago - Stars: 3,112 - Forks: 179

OpenGene/fastp

An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)

Language: C++ - Size: 730 KB - Last synced at: 11 days ago - Pushed at: 5 months ago - Stars: 2,049 - Forks: 337

AxeldeRomblay/MLBox

MLBox is a powerful Automated Machine Learning python library.

Language: Python - Size: 50 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 1,512 - Forks: 275

winedarksea/AutoTS

Automated Time Series Forecasting

Language: Python - Size: 46.8 MB - Last synced at: 11 days ago - Pushed at: 18 days ago - Stars: 1,263 - Forks: 110

sunlabuiuc/PyHealth

A Deep Learning Python Toolkit for Healthcare Applications.

Language: Python - Size: 120 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 1,098 - Forks: 240

NVIDIA-Merlin/NVTabular

NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.

Language: Python - Size: 98.4 MB - Last synced at: 11 days ago - Pushed at: 8 months ago - Stars: 1,078 - Forks: 146

KinWaiCheuk/nnAudio

Audio processing by using pytorch 1D convolution network

Language: Python - Size: 94.7 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 1,058 - Forks: 91

TheAlgorithms/R

Collection of various algorithms implemented in R.

Language: R - Size: 1.02 MB - Last synced at: 8 days ago - Pushed at: 5 months ago - Stars: 937 - Forks: 311

pytorch/torcharrow 📦

High performance model preprocessing library on PyTorch

Language: Python - Size: 11.3 MB - Last synced at: 12 days ago - Pushed at: about 1 year ago - Stars: 649 - Forks: 80

MinishLab/semhash

Fast Semantic Text Deduplication

Language: Python - Size: 1.61 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 626 - Forks: 28

R1j1t/contextualSpellCheck

✔️Contextual word checker for better suggestions (not actively maintained)

Language: Python - Size: 2.45 MB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 414 - Forks: 64

qd-cae/awesome-CAE

A curated list of awesome CAE frameworks, libraries and software.

Size: 57.6 KB - Last synced at: 11 days ago - Pushed at: 8 months ago - Stars: 405 - Forks: 108

msamogh/nonechucks

Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!

Language: Python - Size: 25.4 KB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 377 - Forks: 27

MaxHalford/xam

🎯 Personal data science and machine learning toolbox

Language: Python - Size: 1.12 MB - Last synced at: 14 days ago - Pushed at: about 5 years ago - Stars: 365 - Forks: 76

DataCanvasIO/HyperGBM

A full pipeline AutoML tool for tabular data

Language: Python - Size: 11 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 347 - Forks: 47

ikegami-yukino/jaconv

Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku, and Zenkaku

Language: Python - Size: 537 KB - Last synced at: 9 days ago - Pushed at: 3 months ago - Stars: 325 - Forks: 31

advaitsave/Introduction-to-Time-Series-forecasting-Python

Introduction to time series preprocessing and forecasting in Python using AR, MA, ARMA, ARIMA, SARIMA and Prophet model with forecast evaluation.

Language: Jupyter Notebook - Size: 2.02 MB - Last synced at: 26 days ago - Pushed at: over 6 years ago - Stars: 323 - Forks: 138

cylondata/cylon

Cylon is a fast, scalable, distributed-memory, parallel runtime with a Pandas-like DataFrame.

Language: C++ - Size: 10.7 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 293 - Forks: 44

nlpcl-lab/ace2005-preprocessing

ACE 2005 corpus preprocessing for Event Extraction task

Language: Python - Size: 45.9 KB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 280 - Forks: 71

ikegami-yukino/neologdn

Japanese text normalizer for mecab-neologd

Language: Cython - Size: 593 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 278 - Forks: 20

dunky11/voicesmith

[WIP] VoiceSmith makes training text to speech models easy.

Language: Python - Size: 57 MB - Last synced at: 21 days ago - Pushed at: over 2 years ago - Stars: 224 - Forks: 32

Deffro/text-preprocessing-techniques

16 Text Preprocessing Techniques in Python for Twitter Sentiment Analysis.

Language: Python - Size: 2.36 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 217 - Forks: 82

basf/mamba-tabular

Mambular is a Python package that simplifies tabular deep learning by providing a suite of models for regression, classification, and distributional regression tasks. It includes models such as Mambular, TabM, FT-Transformer, TabulaRNN, TabTransformer, and tabular ResNets.

Language: Python - Size: 8.99 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 202 - Forks: 14

jbusecke/xMIP

Analysis ready CMIP6 data in python the easy way with pangeo tools.

Language: Jupyter Notebook - Size: 20.4 MB - Last synced at: 2 days ago - Pushed at: 14 days ago - Stars: 199 - Forks: 44

google/tensorflow-recorder 📦

TFRecorder makes it easy to create TensorFlow records (TFRecords) from Pandas DataFrames and CSV files containing images or structured data.

Language: Python - Size: 6.54 MB - Last synced at: 1 day ago - Pushed at: almost 3 years ago - Stars: 183 - Forks: 32

quqixun/BrainPrep 📦

Preprocessing pipeline on Brain MR Images through FSL and ANTs, including registration, skull-stripping, bias field correction, enhancement and segmentation.

Language: Python - Size: 43.7 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 172 - Forks: 51

free-astro/siril

The Siril image processing software for amateur astronomy

Last synced at: 10 days ago - Stars: 168 - Forks: 99

ropensci/MODIStsp

An "R" package for automatic download and preprocessing of MODIS Land Products Time Series

Language: R - Size: 180 MB - Last synced at: 4 days ago - Pushed at: 9 months ago - Stars: 156 - Forks: 51

Razor12911/xtool 📦

Just some tool repackers like to use...

Language: Pascal - Size: 22.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 152 - Forks: 11

githubharald/DeslantImg

The deslanting algorithm sets text upright in images. Python, C++ and OpenCL implementations provided.

Language: C++ - Size: 591 KB - Last synced at: 19 days ago - Pushed at: over 3 years ago - Stars: 149 - Forks: 38

sappelhoff/pyprep

PyPREP: A Python implementation of the Preprocessing Pipeline (PREP) for EEG data

Language: Python - Size: 25.9 MB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 148 - Forks: 35

mlr-org/mlr3pipelines

Dataflow Programming for Machine Learning in R

Language: R - Size: 22.5 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 142 - Forks: 26

autoreject/autoreject

Automated rejection and repair of bad trials/sensors in M/EEG

Language: Python - Size: 697 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 142 - Forks: 58

jaeho3690/LIDC-IDRI-Preprocessing

This is the preprocessing step of the LIDC-IDRI dataset

Language: Jupyter Notebook - Size: 1.84 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 140 - Forks: 39

chakki-works/chariot

Deliver the ready-to-train data to your NLP model.

Language: Jupyter Notebook - Size: 5.61 MB - Last synced at: 3 days ago - Pushed at: almost 3 years ago - Stars: 122 - Forks: 9

KananVyas/BoxDetection

A Box detection algorithm for any image containing boxes.

Language: Jupyter Notebook - Size: 411 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 118 - Forks: 53

lozuwa/impy

Impy is a Python3 library with features that help you in your computer vision tasks.

Language: Python - Size: 91.4 MB - Last synced at: 19 days ago - Pushed at: about 6 years ago - Stars: 116 - Forks: 32

chrise96/3D_Ground_Segmentation

A ground segmentation algorithm for 3D point clouds based on the work described in “Fast segmentation of 3D point clouds: a paradigm on LIDAR data for Autonomous Vehicle Applications”, D. Zermas, I. Izzat and N. Papanikolopoulos, 2017. Distinguish between road and non-road points. Road surface extraction. Plane fit ground filter

Language: C++ - Size: 2.91 MB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 108 - Forks: 14

acroucher/PyTOUGH

A Python library for automating TOUGH2 simulations of subsurface fluid and heat flow

Language: Python - Size: 41.6 MB - Last synced at: 26 days ago - Pushed at: about 2 months ago - Stars: 102 - Forks: 37

methlabUZH/automagic

Automagic

Language: MATLAB - Size: 414 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 98 - Forks: 32

madyankin/postcss-each 📦

PostCSS plugin to iterate through values

Language: JavaScript - Size: 581 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 94 - Forks: 20

VisLab/EEG-Clean-Tools

Contains tools for EEG standardized preprocessing

Language: MATLAB - Size: 4.32 MB - Last synced at: 10 days ago - Pushed at: 21 days ago - Stars: 92 - Forks: 30

kharchenkolab/dropEst

Pipeline for initial analysis of droplet-based single-cell RNA-seq data

Language: C++ - Size: 47.1 MB - Last synced at: 11 months ago - Pushed at: almost 3 years ago - Stars: 84 - Forks: 43

MLD3/FIDDLE

FlexIble Data-Driven pipeLinE – a preprocessing pipeline that transforms structured EHR data into feature vectors to be used with ML algorithms. https://doi.org/10.1093/jamia/ocaa139

Language: Jupyter Notebook - Size: 6.4 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 80 - Forks: 16

damianhorna/multi-imbalance

Python package for tackling multi-class imbalance problems. http://www.cs.put.poznan.pl/mlango/publications/multiimbalance/

Language: Python - Size: 66 MB - Last synced at: 7 days ago - Pushed at: 11 months ago - Stars: 77 - Forks: 11

Yu-Group/veridical-flow

Making it easier to build stable, trustworthy data-science pipelines based on the PCS framework.

Language: Jupyter Notebook - Size: 13.4 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 70 - Forks: 7

nipreps/dmriprep

dMRIPrep is a robust and easy-to-use pipeline for preprocessing of diverse dMRI data. The transparent workflow dispenses with manual intervention, thereby ensuring the reproducibility of the results.

Language: Python - Size: 115 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 67 - Forks: 25

elcorto/pwtools

pwtools is a Python package for pre- and postprocessing of atomistic calculations, mostly targeted to Quantum Espresso, CPMD, CP2K and LAMMPS. It is almost, but not quite, entirely unlike ASE, with some tools extending numpy/scipy. It has a set of powerful parsers and data types for storing calculation data.

Language: Python - Size: 21.9 MB - Last synced at: 14 days ago - Pushed at: 9 months ago - Stars: 66 - Forks: 15

ildoonet/remote-dataloader

PyTorch DataLoader processed on multiple remote computation machines for heavy data processing

Language: Python - Size: 10.7 KB - Last synced at: 10 months ago - Pushed at: over 5 years ago - Stars: 66 - Forks: 2

hirofumi0810/asr_preprocessing

Python implementation of pre-processing for End-to-End speech recognition

Language: Python - Size: 1.67 MB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 66 - Forks: 22

ALebrun-108/BoxSERS

Python package that provides a full range of functionality to process and analyze vibrational spectra (Raman, SERS, FTIR, etc.).

Language: Jupyter Notebook - Size: 20 MB - Last synced at: 9 days ago - Pushed at: 7 months ago - Stars: 64 - Forks: 15

gregversteeg/gaussianize

Transforms univariate data into normally distributed data

Language: Python - Size: 121 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 63 - Forks: 24

wajuqi/Sentinel-1-preprocessing-using-Snappy

Sentinel-1 image pre-processing using snappy.

Language: Python - Size: 17.6 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 63 - Forks: 22

AlessioZanga/PyEEGLab 📦

Analyze and manipulate EEG data using PyEEGLab.

Language: Python - Size: 1.04 GB - Last synced at: 8 days ago - Pushed at: over 4 years ago - Stars: 61 - Forks: 23

keurfonluu/toughio

Pre- and post-processing Python library for TOUGH

Language: Python - Size: 18.1 MB - Last synced at: 7 days ago - Pushed at: 12 days ago - Stars: 60 - Forks: 9

TakeLab/podium

Podium: a framework agnostic Python NLP library for data loading and preprocessing

Language: Python - Size: 2.19 MB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 60 - Forks: 2

YuxinZhaozyx/pytorch-VideoDataset

Tools for loading video dataset and transforms on video in pytorch. You can directly load video files without preprocessing.

Language: Python - Size: 7.81 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 58 - Forks: 16

taknev83/pywedge

Makes Interactive Chart Widget, Cleans raw data, Runs baseline models, Interactive hyperparameter tuning & tracking

Language: Jupyter Notebook - Size: 9.62 MB - Last synced at: 12 months ago - Pushed at: over 3 years ago - Stars: 54 - Forks: 10

lucasrla/wsi-preprocessing

Simple library for preprocessing histopathological whole-slide images (WSI) into tiles (a.k.a. patches) towards deep learning

Language: Python - Size: 18.6 KB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 52 - Forks: 14

olivierhagolle/Start_maja

To process a Sentinel-2 time series with MAJA cloud detection and atmospheric correction processor

Language: Python - Size: 483 MB - Last synced at: 10 months ago - Pushed at: about 5 years ago - Stars: 51 - Forks: 15

VincentStimper/mclahe

NumPy and Tensorflow implementation of the Multidimensional Contrast Limited Adaptive Histogram Equalization (MCLAHE) procedure

Language: Python - Size: 16.8 MB - Last synced at: about 6 hours ago - Pushed at: over 2 years ago - Stars: 48 - Forks: 6

nlgranger/SeqTools

A python library to manipulate and transform indexable data (lists, arrays, ...)

Language: Python - Size: 1.56 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 46 - Forks: 4

SilentFlame/Named-Entity-Recognition

Corpus and a baseline neural network system for Named Entity Recognition in Hindi-English Code-Mixed social media text.

Language: Python - Size: 29.2 MB - Last synced at: 14 days ago - Pushed at: over 4 years ago - Stars: 45 - Forks: 16

0xferit/ITU-Turkish-NLP-Pipeline-Caller 📦

A Python3 wrapper tool to help use the ITU Turkish NLP Pipeline API -- UNMAINTAINED --

Language: Python - Size: 131 KB - Last synced at: 4 days ago - Pushed at: almost 7 years ago - Stars: 45 - Forks: 9

paulross/cpip

CPIP - a C/C++ preprocessor implemented in Python.

Language: Python - Size: 36.7 MB - Last synced at: 14 days ago - Pushed at: 19 days ago - Stars: 44 - Forks: 4

l-ramirez-lopez/prospectr

R package: Misc. Functions for Processing and Sample Selection of Spectroscopic Data

Language: R - Size: 17.3 MB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 42 - Forks: 21

preprocessy/preprocessy

Python package for Customizable Data Preprocessing Pipelines

Language: Jupyter Notebook - Size: 992 KB - Last synced at: 19 days ago - Pushed at: about 2 months ago - Stars: 42 - Forks: 14

data-science-lab-amsterdam/skippa

SciKIt-learn Pipeline in PAndas

Language: Python - Size: 423 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 42 - Forks: 1

bids-apps/freesurfer

BIDS app wrapping recon-all from FreeSurfer

Language: Python - Size: 221 KB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 41 - Forks: 35

OanaIgnat/I3D_Keras

I3D implementation in Keras + video preprocessing + visualization of results

Language: Python - Size: 83 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 41 - Forks: 10

MASILab/PreQual

An automated pipeline for integrated preprocessing and quality assurance of diffusion weighted MRI images

Language: Python - Size: 396 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 40 - Forks: 8

Aura-healthcare/ecg_qc

A library to compute ECG signal quality indicators

Language: Jupyter Notebook - Size: 50.4 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 40 - Forks: 10

TextDatasetCleaner/TextDatasetCleaner

🔬 Cleaning datasets of junk (normalization, preprocessing)

Language: Python - Size: 72.3 KB - Last synced at: 27 days ago - Pushed at: about 4 years ago - Stars: 40 - Forks: 10

ParkerICI/premessa

R package for pre-processing of mass and flow cytometry data

Language: R - Size: 247 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 39 - Forks: 23

Puneet2000/In-Depth-ML

In depth machine learning resources

Language: Jupyter Notebook - Size: 130 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 38 - Forks: 16

ag-ds-bubble/swtloc

Python package for Stroke Width Transform - Localizing the Text (Letters & Words) in a Natural Image

Language: Python - Size: 126 MB - Last synced at: 30 days ago - Pushed at: over 1 year ago - Stars: 37 - Forks: 4

Clearailhc/ACE2005-toolkit

Focusing on ACE 2005 data preprocessing, we provide doc-level, sentence-level and BIO-style golden data preprocessing; the only thing you need is the ACE05 raw data. Hope you enjoy!😎

Language: Python - Size: 46.6 MB - Last synced at: 6 months ago - Pushed at: almost 4 years ago - Stars: 37 - Forks: 6

SIMEXP/load_confounds 📦

Load fMRIprep confounds in python

Language: Python - Size: 3.15 MB - Last synced at: 6 days ago - Pushed at: about 3 years ago - Stars: 36 - Forks: 12

rachellea/ct-volume-preprocessing

End-to-end Python CT volume preprocessing pipeline to convert raw DICOMs into clean 3D numpy arrays for ML. From paper Draelos et al. "Machine-Learning-Based Multiple Abnormality Prediction with Large-Scale Chest Computed Tomography Volumes."

Language: Python - Size: 25.4 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 36 - Forks: 15

huseinzol05/Machine-Learning-Data-Science-Reuse 📦

Gathers machine learning and data science techniques for problem solving.

Language: Jupyter Notebook - Size: 38.1 MB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 35 - Forks: 32

FareedKhan-dev/Most-powerful-NLP-library

Gemini, as capable as GPT-4, provides a free API with limited access. I tested it with the help of prompt engineering and found that it can solve almost any NLP task you want to tackle.

Language: Jupyter Notebook - Size: 107 KB - Last synced at: 17 days ago - Pushed at: over 1 year ago - Stars: 34 - Forks: 9

fitushar/Brain-Tissue-Segmentation-Using-Deep-Learning-Pipeline-NeuroNet

This repository is for the MISA course final project, which was brain tissue segmentation. We adopt NeuroNet, a comprehensive brain image segmentation tool based on a novel multi-output CNN architecture, which has been trained and tuned using the IBSR18 dataset.

Language: Jupyter Notebook - Size: 5.16 MB - Last synced at: 4 days ago - Pushed at: almost 5 years ago - Stars: 34 - Forks: 9

bids-apps/HCPPipelines

A BIDS App for minimal preprocessing using the HCP Pipelines

Language: Python - Size: 148 KB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 33 - Forks: 30

fkie-cad/Logprep

Log data preprocessing, generation and shipping in Python

Language: Python - Size: 9.34 MB - Last synced at: 2 days ago - Pushed at: 4 days ago - Stars: 32 - Forks: 8

daniellwdb/roka

🤖 Rise of Kingdoms bot to manage kingdom titles and DKP through Discord.

Language: TypeScript - Size: 35.6 MB - Last synced at: 6 days ago - Pushed at: 2 months ago - Stars: 32 - Forks: 18

hellosunking/Ktrim

Ktrim: an extra-fast and accurate adapter- and quality-trimmer for sequencing data

Language: C++ - Size: 336 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 32 - Forks: 7

maruedt/chemometrics

Python library for chemometric data analysis

Language: Python - Size: 37.9 MB - Last synced at: 3 days ago - Pushed at: over 2 years ago - Stars: 32 - Forks: 5

raj-sutariya/indic-num2words

Python library for converting numbers to words for all Indian Languages.

Language: Python - Size: 57.6 KB - Last synced at: 11 months ago - Pushed at: 12 months ago - Stars: 31 - Forks: 10

JuliaML/MLLabelUtils.jl

Utility package for working with classification targets and label-encodings

Language: Julia - Size: 170 KB - Last synced at: about 10 hours ago - Pushed at: over 3 years ago - Stars: 31 - Forks: 13

allenai/smashed

SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batching, and more. Supports datasets from Huggingface, torchdata iterables, or simple lists of dictionaries.

Language: Python - Size: 4.55 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 30 - Forks: 3

hscspring/pnlp

NLP pre-/post-processing tools.

Language: Python - Size: 106 KB - Last synced at: 13 days ago - Pushed at: 21 days ago - Stars: 29 - Forks: 6

karakurai/visual_inspection

An application for visual inspection written in Python, running on Windows, Linux, and macOS. This software enables high-performance visual inspection even with an inexpensive web camera. No GPU machine required. It is possible to automate the inspection in a factory.

Language: Python - Size: 9.17 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 29 - Forks: 7

intuition-dev/INTUITION

Intuition v1. CLI for Pug, CRUD and docs/blogs as staticGen, and much more.

Size: 197 MB - Last synced at: 11 days ago - Pushed at: over 2 years ago - Stars: 29 - Forks: 3

SudhakarKuma/Machine_Learning

A repository of resources for understanding the concepts of machine learning/deep learning. 

Language: Jupyter Notebook - Size: 615 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 29 - Forks: 26

prat96/FLIR_to_Yolo

This script converts FLIR thermal dataset annotations to YOLO format

Language: Python - Size: 16.6 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 29 - Forks: 4

propi/rdfrules

RDFRules: Analytical Tool for Rule Mining from RDF Knowledge Graphs

Language: Scala - Size: 271 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 28 - Forks: 2

xushige/HAR-Dataset-Preprocess

Language: Python - Size: 53.7 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 28 - Forks: 8

vasisouv/tweets-preprocessor

Repo containing the Twitter preprocessor module, developed by the AUTH OSWinds team

Language: Python - Size: 162 KB - Last synced at: 19 days ago - Pushed at: over 4 years ago - Stars: 28 - Forks: 2

Related Topics
machine-learning 393 python 377 data-science 159 nlp 145 pandas 120 deep-learning 98 classification 96 numpy 71 data-visualization 68 data 67 python3 67 sklearn 63 data-analysis 61 eda 57 natural-language-processing 54 tensorflow 54 logistic-regression 54 dataset 53 random-forest 49 visualization 49 linear-regression 49 exploratory-data-analysis 48 feature-engineering 47 matplotlib 46 machine-learning-algorithms 46 clustering 44 data-cleaning 42 regression 42 scikit-learn 41 jupyter-notebook 40 seaborn 39 data-mining 38 sentiment-analysis 37 keras 37 neural-network 35 image-processing 34 pipeline 33 pytorch 33 nltk 32 r 31 svm 30 neural-networks 27 preprocessor 27 computer-vision 27 feature-extraction 27 analysis 26 ml 25 cnn 25 svm-classifier 23 xgboost 22 decision-trees 22 supervised-learning 21 pca 21 feature-selection 21 artificial-intelligence 20 predictive-modeling 20 nlp-machine-learning 20 datascience 20 statistics 19 eeg 19 prediction 19 naive-bayes-classifier 19 time-series 19 ai 18 knn 18 normalization 18 streamlit 18 knn-classification 18 text 17 text-classification 17 kaggle 17 pca-analysis 17 tf-idf 17 kmeans-clustering 16 opencv 16 text-processing 16 confusion-matrix 16 postprocessing 16 preprocessing-data 16 css 15 word2vec 15 data-preprocessing 15 tokenizer 15 outlier-detection 15 regression-models 15 tokenization 14 random-forest-classifier 14 text-mining 14 neuroimaging 14 mri 14 datacleaning 14 lemmatization 14 dimensionality-reduction 14 hyperparameter-tuning 13 java 13 information-retrieval 13 pandas-dataframe 13 html 12 twitter 12 kmeans 12