GitHub topics: preprocessing
Gersha2024/Alzheimer-MRI-Preprocessing-FreeSurfer-SliceSelection-DeepLearning-TransferLearning-EnsembleLearning
🧠 Detect Alzheimer's disease using MRI scans with transfer learning, deep learning, and ensemble methods for accurate stage classification and progression prediction.
Language: Python - Size: 1.34 MB - Last synced at: about 1 hour ago - Pushed at: about 5 hours ago - Stars: 1 - Forks: 0
jbusecke/xMIP
Analysis ready CMIP6 data in python the easy way with pangeo tools.
Language: Jupyter Notebook - Size: 20.4 MB - Last synced at: about 7 hours ago - Pushed at: about 2 months ago - Stars: 203 - Forks: 44
Arwa-Abbas/NexoOps--Intelligent-Network-Management-System
NexoOps is an Intelligent Network Management System which summarizes log files, classify alerts and uses a chatbot to show real time network traffic through commands
Language: Python - Size: 1.19 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 1
songyz2019/hsi-preprocessing-toolkit
A Hyperspectral Image Preprocessing Toolkit from HSI Camera to Machine Learning Dataset
Language: Python - Size: 18.2 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2 - Forks: 0
feqq1/Air-Aware-smart-Air-Quality-prediction-system
🌍 Monitor and forecast air quality efficiently with AI-driven analytics and interactive dashboards for informed decision-making.
Language: Jupyter Notebook - Size: 4.87 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0
oggtgt/AI-Powered-Loan-Eligibility-Risk-Scoring-System
🤖 Build an AI-driven loan eligibility and risk scoring system to facilitate smarter loan decisions with advanced machine learning techniques.
Language: Jupyter Notebook - Size: 5.22 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 1
Renen343/ai-flavor-remover
🌟 Enhance your text by removing AI-generated flavors, making it more natural and engaging while preserving the original meaning.
Size: 7.81 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0
Chau2873/UIU-DataMining-Lab
📊 Explore data mining concepts and hands-on Python examples with exercises for the UIU Data Mining Course. Enhance your skills in ML and data visualization.
Language: Jupyter Notebook - Size: 396 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0
EttoreRocchi/combatlearn
The ComBat algorithm for a learning framework (scikit-learn compatible)
Language: Python - Size: 2.49 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 6 - Forks: 0
fkie-cad/Logprep
log data pre processing, generation and shipping in python
Language: Python - Size: 9.76 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 34 - Forks: 10
ivanchetvergov/neuroRap
Automated pipeline for building a GPT-2 fine-tuning dataset by collecting lyrics from Genius and extracting numerical audio features via yt-dlp/FFmpeg.
Language: Jupyter Notebook - Size: 2.55 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0
DaveBNU/cortexai
Language: JavaScript - Size: 1.54 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0
yassineahmed/preq
preq is the community-driven problem detector for Common Reliability Enumerations (CREs).
Language: Go - Size: 79.1 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0
nipreps/nifreeze
A flexible framework for volume-wise artifact estimation and correction across multiple 4D neuroimaging modalities (diffusion MRI, functional MRI, and PET)
Language: Python - Size: 130 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 5 - Forks: 5
winedarksea/AutoTS
Automated Time Series Forecasting
Language: Python - Size: 48.7 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,330 - Forks: 117
MoMo790-m/Startup-Profit-Prediction
Machine learning project to predict profits of new startups based on R&D, Admin, Marketing, and State data
Language: Jupyter Notebook - Size: 104 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0
geometric-intelligence/polpo
A Geometric Intelligence Lab's collection of weakly-related tools.
Language: Python - Size: 171 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 1
Mohid-Water-Modelling-System/MOHID_Jupyter-Notebooks
Jupyter Notebooks for the MOHID Water Modelling System
Language: Fortran - Size: 82.1 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 1
sunlabuiuc/PyHealth
A Deep Learning Python Toolkit for Healthcare Applications.
Language: Python - Size: 124 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1,315 - Forks: 463
UnicoLab/keras-data-processor
Data Preprocessing model based on Keras preprocessing layers that can be used as a standalone model or incorporated to Keras model as first layers.
Language: Python - Size: 10.8 MB - Last synced at: 7 days ago - Pushed at: 13 days ago - Stars: 7 - Forks: 5
OpenTabular/DeepTab
DeepTabular is a Python package that simplifies tabular deep learning by providing a suite of models for regression, classification, and distributional regression tasks. It includes models such as Mambular, TabM, FT-Transformer, TabulaRNN, TabTransformer, and tabular ResNets.
Language: Python - Size: 8.98 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 275 - Forks: 17
pedroalexleite/NNs-Stock-Prediction
Master’s thesis in Computer Science at FCUP, exploring Neural Networks for stock price prediction on S&P 500 data. Includes extensive data, preprocessing, models, architectures, hyperparameters and regularization optimization with analysis of real-world practicality.
Language: Jupyter Notebook - Size: 13.2 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0
pcm-ds19/TradeAhead-Clustering-Analysis
Unsupervised Learning project analyzing TradeAhead stock data using K-Means, Hierarchical Clustering, and PCA. Includes full EDA, preprocessing, cluster evaluation (Silhouette Score), interpretation, and actionable business insights.
Language: Jupyter Notebook - Size: 3.17 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0
mlr-org/mlr3pipelines
Dataflow Programming for Machine Learning in R
Language: R - Size: 25.8 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 147 - Forks: 28
FillipusAditya/cerviscan-cervical-cancer-detection
CerviScan is a machine learning project for early detection of cervical pre-cancer using color moments and texture features extracted from post-VIA colposcopy images. This research evaluates multiple color spaces, feature fusion strategies, and traditional ML classifiers such as XGBoost and AdaBoost to improve diagnostic performance.
Language: Python - Size: 251 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0
qwkdev/css
Nested CSS Flattener
Language: JavaScript - Size: 439 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0
Unstructured-IO/unstructured
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.
Language: HTML - Size: 194 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 13,204 - Forks: 1,082
AAlwajeeh/ArabicSF
ArabicSF: is a C# application that is consists of a novel Arabic Stylometric Features Tool and other preproccessing tools which are inspired by other works such as Khoja stemmer, Light Stemmer, etc. - ***Note: This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. - If you find this code useful, please cite the following paper: - Mahmoud Al-Ayyoub, Ahmed Alwajeeh and Ismail Hmeidi. An extensive study of authorship authentication of Arabic articles. International Journal of Web Information Systems (IJWIS) 13(1), 2017. doi: 10.1108/IJWIS-03-2016-0011
Language: C# - Size: 1.58 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 2 - Forks: 2
hitchhicker/tweet_nlp_toolkit
Tweet NLP toolkit
Language: Python - Size: 62.5 KB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0
seareport/meteo
Python package for automating meteocean data retrieval, preprocessing, and organization for hydrodynnamic model atmospherical forcing.
Language: Python - Size: 115 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 1 - Forks: 0
raydac/jcp-ai
Connectors for Java Comment Preprocessor (JCP) to work with LLM clients
Language: Java - Size: 1.08 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 4 - Forks: 0
moosmann/matlab
Data reconstruction and analysis tools for tomography data acquired at the P05 Imaging Beamline (IBL) and the P07 High-Energy Material Science (HEMS) beamline at PETRA III at DESY, both operated by Helmholtz-Zentrum Hereon.
Language: MATLAB - Size: 20.6 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 9 - Forks: 7
davidpfister/fortiche
Fortran interfaces, classes, headers and extensions.
Language: Fortran - Size: 325 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 2 - Forks: 0
yosina-lib/yosina
Yosina is a transliteration library deals with the letters and symbols used in Japanese writing.
Language: Rust - Size: 1.96 MB - Last synced at: about 14 hours ago - Pushed at: 2 months ago - Stars: 20 - Forks: 1
AlexanderFrotscher/UKB-MRI-Preprocessing
This is a custom UKB-MRI-Preprocessing pipeline. It can only run on the BIDS format and provides minimal QC.
Language: Shell - Size: 119 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 1 - Forks: 0
pawlyk/dsml-tools
set of Data Science and Machine Learning tools
Language: Python - Size: 269 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 1 - Forks: 0
anlijun/awesome-CAE-software
A curated list of awesome CAE frameworks, libraries, and software from a full CAE workflow perspective, including the integration of AI technologies.
Size: 345 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 19 - Forks: 1
raj-sutariya/indic-num2words
Python library for converting numbers to words for all Indian Languages.
Language: Python - Size: 117 KB - Last synced at: about 5 hours ago - Pushed at: 6 months ago - Stars: 37 - Forks: 13
sappelhoff/pyprep
PyPREP: A Python implementation of the Preprocessing Pipeline (PREP) for EEG data
Language: Python - Size: 26 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 163 - Forks: 35
nznltn/Python-Final-Project-Wine-Quality-Analysis
This project is based on the final assessment from the UCD - Advanced Center (STAT40800) course. It analyzes Portuguese red and white wine data using Python.
Language: Jupyter Notebook - Size: 7.5 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0
jbferet/preprocs2
preprocS2 is an R package dedicated to basic preprocessing of Sentinel-2 Level-2A reflectance images.
Language: R - Size: 22.9 MB - Last synced at: 16 days ago - Pushed at: 17 days ago - Stars: 3 - Forks: 0
Mecanik/Modern-Text-Tokenizer
Modern UTF-8 aware C++ tokenizer with vocabulary support, ideal for NLP and transformer models. Header-only and zero-dependency.
Language: C++ - Size: 42 KB - Last synced at: 11 days ago - Pushed at: 4 months ago - Stars: 4 - Forks: 0
constancestreitman/package-delay-analysis
Language: HTML - Size: 435 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0
itsatefe/RoboShiraz-AI-Basics
An educational tutorial for beginners to learn the fundamentals of Artificial Intelligence and Machine Learning using Python.
Language: Jupyter Notebook - Size: 13.9 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 2 - Forks: 0
speg03/jiren
jinja2 template renderer
Language: Python - Size: 321 KB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 2 - Forks: 0
twocaretcat/mips-variable-replacer
A command-line tool to simplify development in MIPS assembly. Use easy to remember variable names in MIPS and map them to actual registers before assembling
Language: Python - Size: 21.5 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 1 - Forks: 0
dongrixinyu/JioNLP
中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
Language: Python - Size: 159 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 3,746 - Forks: 441
fantaskiss/ComfyUI-node-img88
一组ComfyUI的图片预处理相关节点。因为模型对于图像的两边长有被除数要求,为避免一般缩放产生形变而制作。包括扩图前对图片边长自动扩展的节点2个,生图前载入图片大小预设的节点一个。
Language: Python - Size: 83 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 0
PavelGrigoryevDS/olist-deep-dive
🌊 Deep Sales Analysis of Olist E-Commerce: EDA | Time Series| Viz | RFM | NLP | Geospatial | Segmentation & Actionable Business Recommendations.
Language: Jupyter Notebook - Size: 116 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 3 - Forks: 0
pratham-ak2004/sms-spam-classifier
This repository is deployed in the web with the help of streamlit web host service
Language: Jupyter Notebook - Size: 801 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0
nipreps/dmriprep
dMRIPrep is a robust and easy-to-use pipeline for preprocessing of diverse dMRI data. The transparent workflow dispenses of manual intervention, thereby ensuring the reproducibility of the results.
Language: Python - Size: 115 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 71 - Forks: 25
Davide011/ML_project_South_African_Heart_Disease
Public Repository: Machine Learning & Data Mining project using the South African Heart Disease dataset. Applied PCA, Regularized Linear Regression, ANN, Logistic Regression, and Decision Trees with cross-validation for regression and classification. Includes feature scaling, EDA, and statistical tests.
Size: 1.32 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0
MinishLab/semhash
Fast Semantic Text Deduplication & Filtering
Language: Python - Size: 6.18 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 825 - Forks: 51
zamirmehdi/Data-and-Information-Analysis-Course
Implementation of data analysis algorithms — normalization, outlier detection, and K-Means — from scratch in Python
Language: Jupyter Notebook - Size: 1.85 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0
jamal919/pycaz
Collection of functions for data analysis, model input preparation, post-processing, analysis.
Language: Python - Size: 1.12 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 5 - Forks: 2
autoreject/autoreject
Automated rejection and repair of bad trials/sensors in M/EEG
Language: Python - Size: 704 KB - Last synced at: 18 days ago - Pushed at: 3 months ago - Stars: 147 - Forks: 59
KeeVeeGames/Shady.gml
GameMaker shader preprocessor for code reuse! Import and inline directives, generating shader variants.
Language: C# - Size: 41.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 24 - Forks: 1
qd-cae/awesome-CAE
A curated list of awesome CAE frameworks, libraries and software.
Size: 57.6 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 443 - Forks: 109
Awais-Asghar/SkinSense-Multi-Model-Skin-Cancer-Classifier
A machine learning project for binary classification of skin cancer as malignant or benign, utilizing models like XGBoost, LGBM Classifier, Adaboost, SVM, and Logistic Regression. Features comprehensive data preprocessing, model training, and evaluation for accurate diagnosis.
Language: Jupyter Notebook - Size: 8.56 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0
calvinmccarter/kditransform
Kernel density integral transformation: feature preprocessing and univariate clustering (TMLR, 2023)
Language: Python - Size: 15.4 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 9 - Forks: 0
TheAlgorithms/R
Collection of various algorithms implemented in R.
Language: R - Size: 1.37 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1,036 - Forks: 342
OpenGene/fastp
An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)
Language: C++ - Size: 691 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2,204 - Forks: 354
SaadatMilad1792/PhysioPrep
A preprocessing pipeline for physiological waveform datasets.
Language: Python - Size: 6.13 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0
EttoreRocchi/MaldiAMRKit
Comprehensive toolkit for MALDI-TOF mass spectrometry data preprocessing for antimicrobial resistance (AMR) prediction purposes
Language: Python - Size: 7.35 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0
labex-labs/scikit-learn-for-beginners
This comprehensive course covers the fundamental concepts and practical techniques of Scikit-learn, the essential machine learning library in Python. Learn to build, train, and evaluate machine learning models using various algorithms and preprocessing techniques.
Size: 37.1 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0
l-ramirez-lopez/prospectr
R package: Misc. Functions for Processing and Sample Selection of Spectroscopic Data
Language: R - Size: 17.4 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 44 - Forks: 21
veldhub/veld_chain__demo_nlp_generic_preprocessing
Demo of encapsulation of several commonly used NLP preprocessing workflows
Size: 300 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0
veldhub/veld_code__nlp_generic_preprocessing
Encapsulation of several commonly used NLP preprocessing workflows
Language: Python - Size: 122 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0
ikegami-yukino/jaconv
Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku, and Zenkaku
Language: Python - Size: 369 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 335 - Forks: 32
4t2de/disease-classification-ml
Prototype-Based Classifier for Disease Classification
Language: Jupyter Notebook - Size: 6.46 MB - Last synced at: 27 days ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0
elcorto/pwtools
pwtools is a Python package for pre- and postprocessing of atomistic calculations, mostly targeted to Quantum Espresso, CPMD, CP2K and LAMMPS. It is almost, but not quite, entirely unlike ASE, with some tools extending numpy/scipy. It has a set of powerful parsers and data types for storing calculation data.
Language: Python - Size: 21.3 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 71 - Forks: 17
allenai/smashed
SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batching, and more. Supports datasets from Huggingface, torchdata iterables, or simple lists of dictionaries.
Language: Python - Size: 4.56 MB - Last synced at: about 3 hours ago - Pushed at: over 1 year ago - Stars: 35 - Forks: 5
AnaAquiles/MiniscopePipeLine
Preprocessing functions for Inscopix movies
Language: Jupyter Notebook - Size: 9.77 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0
shawntz/eyeris
Fully featured R package for reproducible pupillometry preprocessing | Interactive reports, BIDS-compliant, High-throughput database tooling out-of-the-box | Developed by neuroscientists at Stanford
Language: R - Size: 106 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 5 - Forks: 4
k-vashpanova/rsl-slp
Модель перевода с русского языка на русский жестовый язык
Language: Python - Size: 33.2 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0
xga0/contraction_fix
A fast and efficient library for fixing contractions in text
Language: Python - Size: 87.9 KB - Last synced at: about 8 hours ago - Pushed at: 4 months ago - Stars: 6 - Forks: 1
Chandrashekar0123/Students_Passout_Predictions
This Repository consists of Students pass out or fail using Machine Learning Techniques.
Language: Jupyter Notebook - Size: 948 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0
LaurentDardenne/Template
Code generation by using text templates
Language: PowerShell - Size: 170 KB - Last synced at: about 2 months ago - Pushed at: over 8 years ago - Stars: 2 - Forks: 0
DmitryRyumin/OpenAV
An open-source library for recognition of speech commands in the user dictionary using audiovisual data of the speaker
Language: Python - Size: 113 MB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 6 - Forks: 3
pytorch/torcharrow 📦
High performance model preprocessing library on PyTorch
Language: Python - Size: 11.3 MB - Last synced at: 29 days ago - Pushed at: over 1 year ago - Stars: 644 - Forks: 81
Hyland/DocumentFilters
Document Filters is an SDK for applications like content indexing, e-discovery, data migration, and feeding data into AI/ML models by extracting data from unstructured sources. It gives the ability to perform deep inspection, data extraction, output manipulation, and conversion for virtually any type of document, in any programming language.
Language: C++ - Size: 62.7 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 23 - Forks: 2
Aura-healthcare/ecg_qc
A library to compute ECG signal quality indicators
Language: Jupyter Notebook - Size: 50.4 MB - Last synced at: about 5 hours ago - Pushed at: about 3 years ago - Stars: 42 - Forks: 10
hscspring/pnlp
NLP预/后处理工具。
Language: Python - Size: 106 KB - Last synced at: about 4 hours ago - Pushed at: 8 months ago - Stars: 30 - Forks: 6
pharo-ai/data-preprocessing
Project including data pre-processing algo. We aim to include scaling, centering, normalization, binarization methods.
Language: Smalltalk - Size: 32.2 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 1
keurfonluu/toughio
Pre- and post-processing Python library for TOUGH
Language: Python - Size: 18.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 66 - Forks: 9
PylarBear/pybear
pybear is a Python computing library that augments data analytics functionality found in popular packages that use the scikit-learn API, such as scikit-learn and xgboost.
Language: Python - Size: 50.7 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0
AlessioZanga/PyEEGLab 📦
Analyze and manipulate EEG data using PyEEGLab.
Language: Python - Size: 1.04 GB - Last synced at: about 8 hours ago - Pushed at: almost 5 years ago - Stars: 62 - Forks: 23
AlwaysDhruv/Image-Classification-CPP
Hi their my self Dhruv. So this repository or project are developed on C++ and Python for image recognize. C++ are main engine and python are work preprocessing only. more information are in README file.
Language: C++ - Size: 1.06 MB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0
oshinrathor/ML-NLP-Projects
This repository contains a collection of Machine Learning and NLP projects, including sentiment analysis with NLTK, text preprocessing, and deep learning models. It covers techniques like tokenization, stopword removal, lemmatization, rule-based analysis, and transformer models like BERT for practical NLP applications.
Language: Jupyter Notebook - Size: 2.85 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0
OpenTabular/PreTab
pretab is a flexible and extensible preprocessing library for tabular data, built on top of scikit-learn. It provides advanced transformations, spline and neural feature expansions, and seamless integration with embeddings – all designed for modern tabular ML workflows.
Language: Python - Size: 113 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 11 - Forks: 1
obtic-sorbonne/Toolbox-site
Pandore offers a set of tools that facilitate the most common corpus processing tasks for digital humanities research. Automatic pipelines for a set of tasks are also available
Language: HTML - Size: 168 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 4 - Forks: 1
0xferit/ITU-Turkish-NLP-Pipeline-Caller 📦
A Python3 wrapper tool to help using ITU Turkish NLP Pipeline API -- UNMAINTAINED --
Language: Python - Size: 131 KB - Last synced at: 20 days ago - Pushed at: over 7 years ago - Stars: 45 - Forks: 9
veldhub/veld_code__wordembeddings_preprocessing
Code velds encapsulating preprocessing for training of wordembeddings.
Language: Python - Size: 51.8 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0
NirLab-TAU/sleepeegpy
Language: Jupyter Notebook - Size: 166 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 33 - Forks: 12
Ashly1991/rnn-text-classification-tf2
IMDB sentiment analysis with a from-scratch RNN in low-level TensorFlow 2 (no Keras RNN layers). Padding/truncation, vocab limits, and BPTT training.
Language: Jupyter Notebook - Size: 9.77 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0
exponentialR/QUB-HRI
Preprocessing Repository of QUB-Perception of Human Enagagement in Assembly Operations Dataset
Language: Python - Size: 91 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0
lennymalard/melpy-project
A NumPy-based deep learning library for building neural networks. It features an automatic differentiation engine and supports training models like LSTM, CNN, and FNN.
Language: Python - Size: 159 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0
kashinathbiradar/Bangalore-Housing-Price-Prediction
The objective of the project is to create a machine learning model. We are doing a supervised learning and our aim is to do predictive analysis to predict housing price.
Language: HTML - Size: 84 KB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0
AxeldeRomblay/MLBox
MLBox is a powerful Automated Machine Learning python library.
Language: Python - Size: 50 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 1,518 - Forks: 273
Hyedryn/elikopy
ElikoPy is Python library aiming at easing the processing of diffusion imaging for microstructural analysis.
Language: Python - Size: 4.52 MB - Last synced at: 14 days ago - Pushed at: 4 months ago - Stars: 19 - Forks: 5