An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: preprocessing

Gersha2024/Alzheimer-MRI-Preprocessing-FreeSurfer-SliceSelection-DeepLearning-TransferLearning-EnsembleLearning

🧠 Detect Alzheimer's disease using MRI scans with transfer learning, deep learning, and ensemble methods for accurate stage classification and progression prediction.

Language: Python - Size: 1.34 MB - Last synced at: about 1 hour ago - Pushed at: about 5 hours ago - Stars: 1 - Forks: 0

jbusecke/xMIP

Analysis ready CMIP6 data in python the easy way with pangeo tools.

Language: Jupyter Notebook - Size: 20.4 MB - Last synced at: about 7 hours ago - Pushed at: about 2 months ago - Stars: 203 - Forks: 44

Arwa-Abbas/NexoOps--Intelligent-Network-Management-System

NexoOps is an Intelligent Network Management System which summarizes log files, classify alerts and uses a chatbot to show real time network traffic through commands

Language: Python - Size: 1.19 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 1

songyz2019/hsi-preprocessing-toolkit

A Hyperspectral Image Preprocessing Toolkit from HSI Camera to Machine Learning Dataset

Language: Python - Size: 18.2 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2 - Forks: 0

feqq1/Air-Aware-smart-Air-Quality-prediction-system

🌍 Monitor and forecast air quality efficiently with AI-driven analytics and interactive dashboards for informed decision-making.

Language: Jupyter Notebook - Size: 4.87 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

oggtgt/AI-Powered-Loan-Eligibility-Risk-Scoring-System

🤖 Build an AI-driven loan eligibility and risk scoring system to facilitate smarter loan decisions with advanced machine learning techniques.

Language: Jupyter Notebook - Size: 5.22 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 1

Renen343/ai-flavor-remover

🌟 Enhance your text by removing AI-generated flavors, making it more natural and engaging while preserving the original meaning.

Size: 7.81 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

Chau2873/UIU-DataMining-Lab

📊 Explore data mining concepts and hands-on Python examples with exercises for the UIU Data Mining Course. Enhance your skills in ML and data visualization.

Language: Jupyter Notebook - Size: 396 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

EttoreRocchi/combatlearn

The ComBat algorithm for a learning framework (scikit-learn compatible)

Language: Python - Size: 2.49 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 6 - Forks: 0

fkie-cad/Logprep

log data pre processing, generation and shipping in python

Language: Python - Size: 9.76 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 34 - Forks: 10

ivanchetvergov/neuroRap

Automated pipeline for building a GPT-2 fine-tuning dataset by collecting lyrics from Genius and extracting numerical audio features via yt-dlp/FFmpeg.

Language: Jupyter Notebook - Size: 2.55 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

DaveBNU/cortexai

Language: JavaScript - Size: 1.54 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

yassineahmed/preq

preq is the community-driven problem detector for Common Reliability Enumerations (CREs).

Language: Go - Size: 79.1 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

nipreps/nifreeze

A flexible framework for volume-wise artifact estimation and correction across multiple 4D neuroimaging modalities (diffusion MRI, functional MRI, and PET)

Language: Python - Size: 130 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 5 - Forks: 5

winedarksea/AutoTS

Automated Time Series Forecasting

Language: Python - Size: 48.7 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,330 - Forks: 117

MoMo790-m/Startup-Profit-Prediction

Machine learning project to predict profits of new startups based on R&D, Admin, Marketing, and State data

Language: Jupyter Notebook - Size: 104 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

geometric-intelligence/polpo

A Geometric Intelligence Lab's collection of weakly-related tools.

Language: Python - Size: 171 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 1

Mohid-Water-Modelling-System/MOHID_Jupyter-Notebooks

Jupyter Notebooks for the MOHID Water Modelling System

Language: Fortran - Size: 82.1 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 1

sunlabuiuc/PyHealth

A Deep Learning Python Toolkit for Healthcare Applications.

Language: Python - Size: 124 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1,315 - Forks: 463

UnicoLab/keras-data-processor

Data Preprocessing model based on Keras preprocessing layers that can be used as a standalone model or incorporated to Keras model as first layers.

Language: Python - Size: 10.8 MB - Last synced at: 7 days ago - Pushed at: 13 days ago - Stars: 7 - Forks: 5

OpenTabular/DeepTab

DeepTabular is a Python package that simplifies tabular deep learning by providing a suite of models for regression, classification, and distributional regression tasks. It includes models such as Mambular, TabM, FT-Transformer, TabulaRNN, TabTransformer, and tabular ResNets.

Language: Python - Size: 8.98 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 275 - Forks: 17

pedroalexleite/NNs-Stock-Prediction

Master’s thesis in Computer Science at FCUP, exploring Neural Networks for stock price prediction on S&P 500 data. Includes extensive data, preprocessing, models, architectures, hyperparameters and regularization optimization with analysis of real-world practicality.

Language: Jupyter Notebook - Size: 13.2 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

pcm-ds19/TradeAhead-Clustering-Analysis

Unsupervised Learning project analyzing TradeAhead stock data using K-Means, Hierarchical Clustering, and PCA. Includes full EDA, preprocessing, cluster evaluation (Silhouette Score), interpretation, and actionable business insights.

Language: Jupyter Notebook - Size: 3.17 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

mlr-org/mlr3pipelines

Dataflow Programming for Machine Learning in R

Language: R - Size: 25.8 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 147 - Forks: 28

FillipusAditya/cerviscan-cervical-cancer-detection

CerviScan is a machine learning project for early detection of cervical pre-cancer using color moments and texture features extracted from post-VIA colposcopy images. This research evaluates multiple color spaces, feature fusion strategies, and traditional ML classifiers such as XGBoost and AdaBoost to improve diagnostic performance.

Language: Python - Size: 251 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

qwkdev/css

Nested CSS Flattener

Language: JavaScript - Size: 439 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

Unstructured-IO/unstructured

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.

Language: HTML - Size: 194 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 13,204 - Forks: 1,082

AAlwajeeh/ArabicSF

ArabicSF: is a C# application that is consists of a novel Arabic Stylometric Features Tool and other preproccessing tools which are inspired by other works such as Khoja stemmer, Light Stemmer, etc. - ***Note: This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. - If you find this code useful, please cite the following paper: - Mahmoud Al-Ayyoub, Ahmed Alwajeeh and Ismail Hmeidi. An extensive study of authorship authentication of Arabic articles. International Journal of Web Information Systems (IJWIS) 13(1), 2017. doi: 10.1108/IJWIS-03-2016-0011

Language: C# - Size: 1.58 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 2 - Forks: 2

hitchhicker/tweet_nlp_toolkit

Tweet NLP toolkit

Language: Python - Size: 62.5 KB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

seareport/meteo

Python package for automating meteocean data retrieval, preprocessing, and organization for hydrodynnamic model atmospherical forcing.

Language: Python - Size: 115 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 1 - Forks: 0

raydac/jcp-ai

Connectors for Java Comment Preprocessor (JCP) to work with LLM clients

Language: Java - Size: 1.08 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 4 - Forks: 0

moosmann/matlab

Data reconstruction and analysis tools for tomography data acquired at the P05 Imaging Beamline (IBL) and the P07 High-Energy Material Science (HEMS) beamline at PETRA III at DESY, both operated by Helmholtz-Zentrum Hereon.

Language: MATLAB - Size: 20.6 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 9 - Forks: 7

davidpfister/fortiche

Fortran interfaces, classes, headers and extensions.

Language: Fortran - Size: 325 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 2 - Forks: 0

yosina-lib/yosina

Yosina is a transliteration library deals with the letters and symbols used in Japanese writing.

Language: Rust - Size: 1.96 MB - Last synced at: about 14 hours ago - Pushed at: 2 months ago - Stars: 20 - Forks: 1

AlexanderFrotscher/UKB-MRI-Preprocessing

This is a custom UKB-MRI-Preprocessing pipeline. It can only run on the BIDS format and provides minimal QC.

Language: Shell - Size: 119 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 1 - Forks: 0

pawlyk/dsml-tools

set of Data Science and Machine Learning tools

Language: Python - Size: 269 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 1 - Forks: 0

anlijun/awesome-CAE-software

A curated list of awesome CAE frameworks, libraries, and software from a full CAE workflow perspective, including the integration of AI technologies.

Size: 345 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 19 - Forks: 1

raj-sutariya/indic-num2words

Python library for converting numbers to words for all Indian Languages.

Language: Python - Size: 117 KB - Last synced at: about 5 hours ago - Pushed at: 6 months ago - Stars: 37 - Forks: 13

sappelhoff/pyprep

PyPREP: A Python implementation of the Preprocessing Pipeline (PREP) for EEG data

Language: Python - Size: 26 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 163 - Forks: 35

nznltn/Python-Final-Project-Wine-Quality-Analysis

This project is based on the final assessment from the UCD - Advanced Center (STAT40800) course. It analyzes Portuguese red and white wine data using Python.

Language: Jupyter Notebook - Size: 7.5 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

jbferet/preprocs2

preprocS2 is an R package dedicated to basic preprocessing of Sentinel-2 Level-2A reflectance images.

Language: R - Size: 22.9 MB - Last synced at: 16 days ago - Pushed at: 17 days ago - Stars: 3 - Forks: 0

Mecanik/Modern-Text-Tokenizer

Modern UTF-8 aware C++ tokenizer with vocabulary support, ideal for NLP and transformer models. Header-only and zero-dependency.

Language: C++ - Size: 42 KB - Last synced at: 11 days ago - Pushed at: 4 months ago - Stars: 4 - Forks: 0

constancestreitman/package-delay-analysis

Language: HTML - Size: 435 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

itsatefe/RoboShiraz-AI-Basics

An educational tutorial for beginners to learn the fundamentals of Artificial Intelligence and Machine Learning using Python.

Language: Jupyter Notebook - Size: 13.9 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 2 - Forks: 0

speg03/jiren

jinja2 template renderer

Language: Python - Size: 321 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 2 - Forks: 0

twocaretcat/mips-variable-replacer

A command-line tool to simplify development in MIPS assembly. Use easy to remember variable names in MIPS and map them to actual registers before assembling

Language: Python - Size: 21.5 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 1 - Forks: 0

dongrixinyu/JioNLP

中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com

Language: Python - Size: 159 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 3,746 - Forks: 441

fantaskiss/ComfyUI-node-img88

一组ComfyUI的图片预处理相关节点。因为模型对于图像的两边长有被除数要求,为避免一般缩放产生形变而制作。包括扩图前对图片边长自动扩展的节点2个,生图前载入图片大小预设的节点一个。

Language: Python - Size: 83 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 0

PavelGrigoryevDS/olist-deep-dive

🌊 Deep Sales Analysis of Olist E-Commerce: EDA | Time Series| Viz | RFM | NLP | Geospatial | Segmentation & Actionable Business Recommendations.

Language: Jupyter Notebook - Size: 116 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 3 - Forks: 0

pratham-ak2004/sms-spam-classifier

This repository is deployed in the web with the help of streamlit web host service

Language: Jupyter Notebook - Size: 801 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

nipreps/dmriprep

dMRIPrep is a robust and easy-to-use pipeline for preprocessing of diverse dMRI data. The transparent workflow dispenses of manual intervention, thereby ensuring the reproducibility of the results.

Language: Python - Size: 115 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 71 - Forks: 25

Davide011/ML_project_South_African_Heart_Disease

Public Repository: Machine Learning & Data Mining project using the South African Heart Disease dataset. Applied PCA, Regularized Linear Regression, ANN, Logistic Regression, and Decision Trees with cross-validation for regression and classification. Includes feature scaling, EDA, and statistical tests.

Size: 1.32 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

MinishLab/semhash

Fast Semantic Text Deduplication & Filtering

Language: Python - Size: 6.18 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 825 - Forks: 51

zamirmehdi/Data-and-Information-Analysis-Course

Implementation of data analysis algorithms — normalization, outlier detection, and K-Means — from scratch in Python

Language: Jupyter Notebook - Size: 1.85 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0

jamal919/pycaz

Collection of functions for data analysis, model input preparation, post-processing, analysis.

Language: Python - Size: 1.12 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 5 - Forks: 2

autoreject/autoreject

Automated rejection and repair of bad trials/sensors in M/EEG

Language: Python - Size: 704 KB - Last synced at: 18 days ago - Pushed at: 3 months ago - Stars: 147 - Forks: 59

KeeVeeGames/Shady.gml

GameMaker shader preprocessor for code reuse! Import and inline directives, generating shader variants.

Language: C# - Size: 41.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 24 - Forks: 1

qd-cae/awesome-CAE

A curated list of awesome CAE frameworks, libraries and software.

Size: 57.6 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 443 - Forks: 109

Awais-Asghar/SkinSense-Multi-Model-Skin-Cancer-Classifier

A machine learning project for binary classification of skin cancer as malignant or benign, utilizing models like XGBoost, LGBM Classifier, Adaboost, SVM, and Logistic Regression. Features comprehensive data preprocessing, model training, and evaluation for accurate diagnosis.

Language: Jupyter Notebook - Size: 8.56 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

calvinmccarter/kditransform

Kernel density integral transformation: feature preprocessing and univariate clustering (TMLR, 2023)

Language: Python - Size: 15.4 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 9 - Forks: 0

TheAlgorithms/R

Collection of various algorithms implemented in R.

Language: R - Size: 1.37 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1,036 - Forks: 342

OpenGene/fastp

An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)

Language: C++ - Size: 691 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2,204 - Forks: 354

SaadatMilad1792/PhysioPrep

A preprocessing pipeline for physiological waveform datasets.

Language: Python - Size: 6.13 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

EttoreRocchi/MaldiAMRKit

Comprehensive toolkit for MALDI-TOF mass spectrometry data preprocessing for antimicrobial resistance (AMR) prediction purposes

Language: Python - Size: 7.35 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

labex-labs/scikit-learn-for-beginners

This comprehensive course covers the fundamental concepts and practical techniques of Scikit-learn, the essential machine learning library in Python. Learn to build, train, and evaluate machine learning models using various algorithms and preprocessing techniques.

Size: 37.1 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

l-ramirez-lopez/prospectr

R package: Misc. Functions for Processing and Sample Selection of Spectroscopic Data

Language: R - Size: 17.4 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 44 - Forks: 21

veldhub/veld_chain__demo_nlp_generic_preprocessing

Demo of encapsulation of several commonly used NLP preprocessing workflows

Size: 300 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

veldhub/veld_code__nlp_generic_preprocessing

Encapsulation of several commonly used NLP preprocessing workflows

Language: Python - Size: 122 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

ikegami-yukino/jaconv

Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku, and Zenkaku

Language: Python - Size: 369 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 335 - Forks: 32

4t2de/disease-classification-ml

Prototype-Based Classifier for Disease Classification

Language: Jupyter Notebook - Size: 6.46 MB - Last synced at: 27 days ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

elcorto/pwtools

pwtools is a Python package for pre- and postprocessing of atomistic calculations, mostly targeted to Quantum Espresso, CPMD, CP2K and LAMMPS. It is almost, but not quite, entirely unlike ASE, with some tools extending numpy/scipy. It has a set of powerful parsers and data types for storing calculation data.

Language: Python - Size: 21.3 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 71 - Forks: 17

allenai/smashed

SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batching, and more. Supports datasets from Huggingface, torchdata iterables, or simple lists of dictionaries.

Language: Python - Size: 4.56 MB - Last synced at: about 3 hours ago - Pushed at: over 1 year ago - Stars: 35 - Forks: 5

AnaAquiles/MiniscopePipeLine

Preprocessing functions for Inscopix movies

Language: Jupyter Notebook - Size: 9.77 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

shawntz/eyeris

Fully featured R package for reproducible pupillometry preprocessing | Interactive reports, BIDS-compliant, High-throughput database tooling out-of-the-box | Developed by neuroscientists at Stanford

Language: R - Size: 106 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 5 - Forks: 4

k-vashpanova/rsl-slp

Модель перевода с русского языка на русский жестовый язык

Language: Python - Size: 33.2 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

xga0/contraction_fix

A fast and efficient library for fixing contractions in text

Language: Python - Size: 87.9 KB - Last synced at: about 8 hours ago - Pushed at: 4 months ago - Stars: 6 - Forks: 1

Chandrashekar0123/Students_Passout_Predictions

This Repository consists of Students pass out or fail using Machine Learning Techniques.

Language: Jupyter Notebook - Size: 948 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

LaurentDardenne/Template

Code generation by using text templates

Language: PowerShell - Size: 170 KB - Last synced at: about 2 months ago - Pushed at: over 8 years ago - Stars: 2 - Forks: 0

DmitryRyumin/OpenAV

An open-source library for recognition of speech commands in the user dictionary using audiovisual data of the speaker

Language: Python - Size: 113 MB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 6 - Forks: 3

pytorch/torcharrow 📦

High performance model preprocessing library on PyTorch

Language: Python - Size: 11.3 MB - Last synced at: 29 days ago - Pushed at: over 1 year ago - Stars: 644 - Forks: 81

Hyland/DocumentFilters

Document Filters is an SDK for applications like content indexing, e-discovery, data migration, and feeding data into AI/ML models by extracting data from unstructured sources. It gives the ability to perform deep inspection, data extraction, output manipulation, and conversion for virtually any type of document, in any programming language.

Language: C++ - Size: 62.7 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 23 - Forks: 2

Aura-healthcare/ecg_qc

A library to compute ECG signal quality indicators

Language: Jupyter Notebook - Size: 50.4 MB - Last synced at: about 5 hours ago - Pushed at: about 3 years ago - Stars: 42 - Forks: 10

hscspring/pnlp

NLP预/后处理工具。

Language: Python - Size: 106 KB - Last synced at: about 4 hours ago - Pushed at: 8 months ago - Stars: 30 - Forks: 6

pharo-ai/data-preprocessing

Project including data pre-processing algo. We aim to include scaling, centering, normalization, binarization methods.

Language: Smalltalk - Size: 32.2 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 1

keurfonluu/toughio

Pre- and post-processing Python library for TOUGH

Language: Python - Size: 18.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 66 - Forks: 9

PylarBear/pybear

pybear is a Python computing library that augments data analytics functionality found in popular packages that use the scikit-learn API, such as scikit-learn and xgboost.

Language: Python - Size: 50.7 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

AlessioZanga/PyEEGLab 📦

Analyze and manipulate EEG data using PyEEGLab.

Language: Python - Size: 1.04 GB - Last synced at: about 8 hours ago - Pushed at: almost 5 years ago - Stars: 62 - Forks: 23

AlwaysDhruv/Image-Classification-CPP

Hi their my self Dhruv. So this repository or project are developed on C++ and Python for image recognize. C++ are main engine and python are work preprocessing only. more information are in README file.

Language: C++ - Size: 1.06 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

oshinrathor/ML-NLP-Projects

This repository contains a collection of Machine Learning and NLP projects, including sentiment analysis with NLTK, text preprocessing, and deep learning models. It covers techniques like tokenization, stopword removal, lemmatization, rule-based analysis, and transformer models like BERT for practical NLP applications.

Language: Jupyter Notebook - Size: 2.85 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

OpenTabular/PreTab

pretab is a flexible and extensible preprocessing library for tabular data, built on top of scikit-learn. It provides advanced transformations, spline and neural feature expansions, and seamless integration with embeddings – all designed for modern tabular ML workflows.

Language: Python - Size: 113 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 11 - Forks: 1

obtic-sorbonne/Toolbox-site

Pandore offers a set of tools that facilitate the most common corpus processing tasks for digital humanities research. Automatic pipelines for a set of tasks are also available

Language: HTML - Size: 168 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 4 - Forks: 1

0xferit/ITU-Turkish-NLP-Pipeline-Caller 📦

A Python3 wrapper tool to help using ITU Turkish NLP Pipeline API -- UNMAINTAINED --

Language: Python - Size: 131 KB - Last synced at: 20 days ago - Pushed at: over 7 years ago - Stars: 45 - Forks: 9

veldhub/veld_code__wordembeddings_preprocessing

Code velds encapsulating preprocessing for training of wordembeddings.

Language: Python - Size: 51.8 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

NirLab-TAU/sleepeegpy

Language: Jupyter Notebook - Size: 166 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 33 - Forks: 12

Ashly1991/rnn-text-classification-tf2

IMDB sentiment analysis with a from-scratch RNN in low-level TensorFlow 2 (no Keras RNN layers). Padding/truncation, vocab limits, and BPTT training.

Language: Jupyter Notebook - Size: 9.77 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

exponentialR/QUB-HRI

Preprocessing Repository of QUB-Perception of Human Enagagement in Assembly Operations Dataset

Language: Python - Size: 91 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

lennymalard/melpy-project

A NumPy-based deep learning library for building neural networks. It features an automatic differentiation engine and supports training models like LSTM, CNN, and FNN.

Language: Python - Size: 159 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

kashinathbiradar/Bangalore-Housing-Price-Prediction

The objective of the project is to create a machine learning model. We are doing a supervised learning and our aim is to do predictive analysis to predict housing price.

Language: HTML - Size: 84 KB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

AxeldeRomblay/MLBox

MLBox is a powerful Automated Machine Learning python library.

Language: Python - Size: 50 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 1,518 - Forks: 273

Hyedryn/elikopy

ElikoPy is Python library aiming at easing the processing of diffusion imaging for microstructural analysis.

Language: Python - Size: 4.52 MB - Last synced at: 14 days ago - Pushed at: 4 months ago - Stars: 19 - Forks: 5

Related Keywords
machine-learning 296 python 276 data-science 119 nlp 113 deep-learning 78 pandas 76 classification 71 data 57 data-analysis 53 data-visualization 49 numpy 46 python3 44 sklearn 44 tensorflow 43 feature-engineering 43 natural-language-processing 41 dataset 40 logistic-regression 40 eda 36 random-forest 34 visualization 34 linear-regression 34 data-mining 33 regression 32 exploratory-data-analysis 30 clustering 29 scikit-learn 28 neural-network 28 machine-learning-algorithms 28 keras 28 pytorch 27 r 27 image-processing 27 pipeline 26 matplotlib 26 jupyter-notebook 25 nltk 25 seaborn 25 feature-extraction 24 data-cleaning 24 sentiment-analysis 22 artificial-intelligence 22 preprocessor 21 neural-networks 21 computer-vision 20 ml 20 svm 19 supervised-learning 17 ai 17 svm-classifier 17 eeg 17 nlp-machine-learning 17 analysis 17 normalization 16 java 16 xgboost 16 cnn 16 text-processing 16 statistics 16 prediction 15 decision-trees 15 opencv 15 datascience 15 postprocessing 14 knn-classification 14 knn 14 text 14 predictive-modeling 14 feature-selection 14 naive-bayes-classifier 13 kaggle 13 tokenizer 13 mri 13 time-series 13 neuroimaging 12 streamlit 12 preprocessing-data 12 text-classification 12 text-mining 12 pca 12 ensemble-learning 12 c 11 regression-models 11 datacleaning 11 tf-idf 11 css 11 lemmatization 11 word2vec 10 matlab 10 fmri 10 data-preprocessing 10 random-forest-classifier 10 twitter 10 unsupervised-learning 10 weka 10 datamining 10 pandas-dataframe 10 spacy 9 bert 9 flask 9