An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: tabular-data

BONDO2K-cloud/acupoftea

[PHP] ACUPOFTEA WEBSHELL BYPASS SERV 403 404

Language: PHP - Size: 19.5 KB - Last synced at: 40 minutes ago - Pushed at: about 2 hours ago - Stars: 1 - Forks: 1

jrzaurin/pytorch-widedeep

A flexible package for multimodal deep learning to combine tabular data with text and images using Wide and Deep models in PyTorch

Language: Python - Size: 99.6 MB - Last synced at: about 5 hours ago - Pushed at: 4 months ago - Stars: 1,356 - Forks: 193

tabularis-ai/be_great

A novel approach for synthesizing tabular data using pretrained large language models

Language: Python - Size: 4.29 MB - Last synced at: about 15 hours ago - Pushed at: about 16 hours ago - Stars: 310 - Forks: 52

bvaughn/react-virtualized

React components for efficiently rendering large lists and tabular data

Language: JavaScript - Size: 46.5 MB - Last synced at: about 15 hours ago - Pushed at: 5 months ago - Stars: 26,839 - Forks: 3,065

youngfish42/Awesome-FL

Comprehensive and timely academic information on federated learning (papers, frameworks, datasets, tutorials, workshops)

Language: Python - Size: 5.42 MB - Last synced at: 1 day ago - Pushed at: 27 days ago - Stars: 1,756 - Forks: 197

PriorLabs/tabpfn-client

⚡ Easy API access to the tabular foundation model TabPFN ⚡

Language: Python - Size: 354 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 174 - Forks: 18

TabArena/tabarena_benchmarking_examples

Examples for using TabArena for benchmarking machine learning models (on SLURM)

Language: Python - Size: 96.7 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 5 - Forks: 0

TabArena/tabarena_dataset_curation

TabArena's dataset curation repository.

Language: Python - Size: 205 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 2 - Forks: 0

Diyago/Tabular-data-generation

GANs are well known for their success in realistic image generation. However, they can also be applied to tabular data generation. We will review and examine some recent papers about tabular GANs in action.

Language: Python - Size: 52.8 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 552 - Forks: 82

NVIDIA-Merlin/Transformers4Rec

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.

Language: Python - Size: 211 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,189 - Forks: 150

LAMDA-Tabular/TALENT

A comprehensive toolkit and benchmark for tabular data learning, featuring 30+ deep methods, more than 10 classical methods, and 300 diverse tabular datasets.

Language: Python - Size: 95.3 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 651 - Forks: 30

autogluon/autogluon

Fast and Accurate ML in 3 Lines of Code

Language: Python - Size: 22.1 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 8,970 - Forks: 1,028

ImJaeSung/Synthesizers

Implementations of various synthesizers with PyTorch.

Language: Python - Size: 14.7 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

dholzmueller/pytabkit

ML models + benchmark for tabular data classification and regression

Language: Python - Size: 1.15 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 185 - Forks: 18

synthesized-io/datasets

Language: Python - Size: 134 MB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1

Clearbox-AI/preprocessor

A fast and flexible data preprocessor based on Polars.

Language: Python - Size: 1.88 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 6 - Forks: 0

martinjurkovic/syntherela

A package for benchmarking synthetic relational data generation methods

Language: Python - Size: 1.25 MB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 53 - Forks: 1

yandex-research/rtdl

Research on Tabular Deep Learning: Papers & Packages

Language: Python - Size: 24.6 MB - Last synced at: 4 days ago - Pushed at: 8 months ago - Stars: 995 - Forks: 110

nhn/tui.grid

🍞🔡 The Powerful Component to Display and Edit Data. Experience the Ultimate Data Transformer!

Language: TypeScript - Size: 64.8 MB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 2,471 - Forks: 400

posit-dev/pointblank

Data validation made beautiful and powerful

Language: Python - Size: 98.6 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 230 - Forks: 13

Xatta-Trone/awesome-tdl

A curated collection of TDL (Tabular Deep Learning) resources—libraries, projects, tutorials, papers, and more—for researchers and developers in the field.

Language: JavaScript - Size: 140 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2 - Forks: 0

johnkerl/miller

Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON

Language: Go - Size: 201 MB - Last synced at: 5 days ago - Pushed at: 14 days ago - Stars: 9,329 - Forks: 224

RyanWangZf/transtab

NeurIPS'22 | TransTab: Learning Transferable Tabular Transformers Across Tables

Language: Python - Size: 1.31 MB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 194 - Forks: 26

PhoebusSi/Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, P-tuning) for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM-related technologies as possible. We built this fine-tuning platform to make it easy for researchers to get started with and use large models, and we welcome open-source enthusiasts to submit any meaningful PR!

Language: Jupyter Notebook - Size: 137 MB - Last synced at: 3 days ago - Pushed at: over 1 year ago - Stars: 2,751 - Forks: 254

PriorLabs/TabPFN

⚡ TabPFN: Foundation Model for Tabular Data ⚡

Language: Python - Size: 253 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 3,857 - Forks: 353

vaexio/vaex

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

Language: Python - Size: 133 MB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 8,393 - Forks: 599

sdv-dev/Copulas

A library to model multivariate data using copulas.

Language: Python - Size: 31.7 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 595 - Forks: 116

dmitryglhf/autodask

AutoML Library Based on Dask with Bee Colony Optimization

Language: Python - Size: 1.66 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 3 - Forks: 0

PriorLabs/tabpfn-extensions

Community extensions for TabPFN - the foundation model for tabular data. Built with TabPFN! 🤗

Language: Python - Size: 761 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 139 - Forks: 26

JuliaData/DataFramesMeta.jl

Metaprogramming tools for DataFrames

Language: Julia - Size: 1.48 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 487 - Forks: 56

JuliaData/DataFrames.jl

In-memory tabular data in Julia

Language: Julia - Size: 28.8 MB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 1,774 - Forks: 372

MigoXLab/awesome-data-quality

A comprehensive collection of data quality resources, tools, papers, and projects across various data types including traditional data, LLM pretraining/fine-tuning data, multimodal data, and more. Essential reference for researchers and practitioners in data-centric AI.

Size: 22.5 KB - Last synced at: 8 days ago - Pushed at: 15 days ago - Stars: 6 - Forks: 0

supernifty/csvtools

Command line processing of tabular data (CSVs, TSVs)

Language: Python - Size: 159 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 3 - Forks: 2

Lightning-Universe/lightning-flash 📦

Your PyTorch AI Factory - Flash enables you to easily configure and run complex AI recipes for over 15 tasks across 7 data domains

Language: Python - Size: 12.8 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 1,744 - Forks: 211

manujosephv/pytorch_tabular

A standard framework for modelling Deep Learning Models for tabular data

Language: Python - Size: 39.6 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,536 - Forks: 153

capitalone/DataProfiler

What's in your data? Extract schema, statistics and entities from datasets

Language: Python - Size: 35.7 MB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 1,494 - Forks: 172

johnnyhwu/Awesome-LLM-Tabular

Awesome-LLM-Tabular: a curated list of Large Language Models applied to Tabular Data

Size: 1.2 MB - Last synced at: 2 days ago - Pushed at: 6 months ago - Stars: 398 - Forks: 30

wwweiwei/awesome-self-supervised-learning-for-tabular-data

A collection of research materials on SSL for non-sequential tabular data (SSL4NSTD)

Size: 72.3 KB - Last synced at: 2 days ago - Pushed at: 4 months ago - Stars: 194 - Forks: 12

sdv-dev/CTGAN

Conditional GAN for generating synthetic tabular data.

Language: Python - Size: 1.83 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 1,410 - Forks: 314

openml/openml-python

OpenML's Python API for a World of Data and More 💫

Language: Python - Size: 194 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 298 - Forks: 152

yangfa-zhang/lunax

Lunax is a machine learning framework specifically designed for the processing and analysis of tabular data.

Language: Python - Size: 25.5 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 1 - Forks: 0

aerosol/Tabula

🈸 Pretty printer for maps/structs collections (Elixir)

Language: Elixir - Size: 40 KB - Last synced at: about 9 hours ago - Pushed at: over 3 years ago - Stars: 95 - Forks: 3

automl/Auto-PyTorch

Automatic architecture search and hyperparameter optimization for PyTorch

Language: Python - Size: 19.4 MB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 2,449 - Forks: 299

scottrhoyt/SwiftyTextTable

A lightweight library for generating text tables.

Language: Swift - Size: 368 KB - Last synced at: about 9 hours ago - Pushed at: over 2 years ago - Stars: 324 - Forks: 29

Bilpapster/QualiTab

🔎 Investigating the performance of tabular foundation models in the wild of imperfect data

Language: Python - Size: 213 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 1 - Forks: 0

romanmicuda/predict-engineering-salaries

Your challenge in this competition is to predict whether a job's salary falls into one of three categories: High, Medium, or Low, using the provided job-related data. Can you build a model that accurately classifies salaries based on factors like job title, description, and required qualifications? Let’s find out!

Language: Jupyter Notebook - Size: 6.16 MB - Last synced at: 3 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

turicas/rows

A common, beautiful interface to tabular data, no matter the format

Language: Python - Size: 7.72 MB - Last synced at: 2 days ago - Pushed at: 13 days ago - Stars: 876 - Forks: 135

teuben/nemo

A Stellar Dynamics Toolbox (Not Everybody Must Observe)

Language: C - Size: 46 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 68 - Forks: 51

vanderschaarlab/synthcity

A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.

Language: Python - Size: 6.77 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 555 - Forks: 76

alexhallam/tv

📺(tv) Tidy Viewer is a cross-platform CLI csv pretty printer that uses column styling to maximize viewer enjoyment.

Language: Rust - Size: 33.2 MB - Last synced at: 6 days ago - Pushed at: 6 months ago - Stars: 2,097 - Forks: 40

tdspora/syngen

Open-source version of the TDspora synthetic data generation algorithm.

Language: Jupyter Notebook - Size: 18.2 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 17 - Forks: 9

DataCanvasIO/HyperGBM

A full pipeline AutoML tool for tabular data

Language: Python - Size: 11 MB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 351 - Forks: 47

sentinel-energy/friendly_data

Data format to interoperate between models and frameworks

Language: Python - Size: 4.65 MB - Last synced at: 7 days ago - Pushed at: almost 3 years ago - Stars: 12 - Forks: 2

saulpw/visidata

A terminal spreadsheet multitool for discovering and arranging data

Language: Python - Size: 52.5 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 8,262 - Forks: 296

paulhorton/perltab

Extension of Perl autosplit mode for command-line processing of tabular data

Language: Perl - Size: 395 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 5 - Forks: 0

Desbordante/desbordante-core

Desbordante is a high-performance data profiler that can discover many different patterns in data using various algorithms. It also supports running data-cleaning scenarios with these algorithms. Desbordante has a console version and an easy-to-use web application.

Language: C++ - Size: 146 MB - Last synced at: 9 days ago - Pushed at: 22 days ago - Stars: 406 - Forks: 76

sdv-dev/SDGym

Benchmarking synthetic data generation methods.

Language: Python - Size: 3.06 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 274 - Forks: 63

daq-tools/skeem

Infer SQL DDL statements from tabular data.

Language: Python - Size: 260 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 3 - Forks: 1

PASTAplus/dex

Explore and subset CSV tables using associated EML metadata

Language: Python - Size: 925 KB - Last synced at: 7 days ago - Pushed at: 15 days ago - Stars: 3 - Forks: 0

sorenfyhn/ridge-regression

Framework for a ridge regression model that predicts heating load.

Language: Python - Size: 940 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

ncss-tech/stats_for_soil_survey

S4SS: Statistics for Soil Survey

Language: HTML - Size: 720 MB - Last synced at: 4 days ago - Pushed at: 17 days ago - Stars: 35 - Forks: 9

TLINDEN/tablizer

Manipulate tabular output of other programs

Language: Go - Size: 2.45 MB - Last synced at: 2 days ago - Pushed at: 17 days ago - Stars: 6 - Forks: 0

reubano/meza

A Python toolkit for processing tabular data

Language: Python - Size: 4.5 MB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 416 - Forks: 29

lyrasis/csv-data-tools

Tools for working with CSV data (or other basic tabular data formats)

Language: Ruby - Size: 80.1 KB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

feedzai/fairgbm

Train Gradient Boosting models that are both high-performance *and* Fair!

Language: C++ - Size: 43 MB - Last synced at: about 23 hours ago - Pushed at: about 1 year ago - Stars: 105 - Forks: 7

openalloc/SwiftTabler

A multi-platform SwiftUI component for tabular data

Language: Swift - Size: 1.41 MB - Last synced at: 13 days ago - Pushed at: about 2 years ago - Stars: 142 - Forks: 16

inokawa/virtua

A zero-config, fast and small (~3kB) virtual list (and grid) component for React, Vue, Solid and Svelte.

Language: TypeScript - Size: 105 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 2,815 - Forks: 73

firmai/deltapy

DeltaPy - Tabular Data Augmentation (by @firmai)

Language: Jupyter Notebook - Size: 1.47 MB - Last synced at: 8 days ago - Pushed at: almost 2 years ago - Stars: 547 - Forks: 56

trl-lab/tabular-robustness

The benchmark code for the paper "How well do LLMs reason over tabular data, really?"

Language: Python - Size: 11.5 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

sigpwned/tabular4j

A library for reading and writing pan-format tabular data, especially spreadsheets, in Java 11+.

Language: Java - Size: 370 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 0

ExtractTable/ExtractTable-py

Python library to extract tabular data from images and scanned PDFs

Language: Python - Size: 3.39 MB - Last synced at: 18 days ago - Pushed at: 11 months ago - Stars: 279 - Forks: 34

maxwellt23/SwiftFrames

A Swift-native DataFrame library inspired by pandas — load, view, transform, and export tabular data with ease.

Language: Swift - Size: 25.4 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

4D-STAR/opat-core

Core libraries and modules for OPAT file format

Language: C++ - Size: 12.3 MB - Last synced at: 22 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 2

SeldonIO/alibi-detect

Algorithms for outlier, adversarial and drift detection

Language: Jupyter Notebook - Size: 35.3 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 2,382 - Forks: 232

fatbobman/TabularBuilder

Declarative TabularData creation for Swift - Convert objects to DataFrames with type-safe, SwiftUI-like syntax

Language: Swift - Size: 21.5 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 3 - Forks: 0

GhentCDH/taulu

Taulu is a Python package designed to segment tabular data in scanned or photographed documents.

Language: Python - Size: 11.4 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 2 - Forks: 0

ValentinMargraf/ActiveLearningPipelines

Specify, execute, and monitor the performance of active learning pipelines.

Language: Python - Size: 1.89 MB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 23 - Forks: 1

approximatelabs/sketch

AI code-writing assistant that understands data content

Language: Python - Size: 8.98 MB - Last synced at: 22 days ago - Pushed at: over 1 year ago - Stars: 2,269 - Forks: 119

eZWALT/eZAutoML

A democratized, lightweight, and modern framework for Python Automated Machine Learning

Language: Python - Size: 1.83 MB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 53 - Forks: 1

sassoftware/dpmm

dpmm: a library for synthetic tabular data generation with rich functionality and end-to-end Differential Privacy guarantees

Language: Python - Size: 661 KB - Last synced at: 23 days ago - Pushed at: 24 days ago - Stars: 3 - Forks: 0

yandex-research/tabred

(ICLR 2025 Spotlight) TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks

Language: Python - Size: 4.05 MB - Last synced at: 22 days ago - Pushed at: 23 days ago - Stars: 73 - Forks: 4

LennartPurucker/finetune_tabpfn_v2

Code for finetuning TabPFN on one downstream tabular dataset.

Language: Python - Size: 62.5 KB - Last synced at: 20 days ago - Pushed at: about 1 month ago - Stars: 59 - Forks: 11

hitsz-ids/synthetic-data-generator

SDG is a specialized framework designed to generate high-quality structured tabular data.

Language: Python - Size: 4.19 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 2,359 - Forks: 384

AhmedYousriSobhi/aCupOfTea

Let's settle down, rest our minds, and spill the tea of our experience in multiple AI fields [Data Science, Machine Learning, Deep Learning], including many other aspects, from programming and clean code to design patterns & business interference. Enjoy the drink, and if you find something interesting here, offer us a cup of tea.

Language: Jupyter Notebook - Size: 6.04 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 5 - Forks: 0

soda-inria/tabicl

Repository for TabICL: A Tabular Foundation Model for In-Context Learning on Large Data

Language: Python - Size: 1.88 MB - Last synced at: 25 days ago - Pushed at: about 1 month ago - Stars: 73 - Forks: 10

worldbank/REaLTabFormer

A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.

Language: Jupyter Notebook - Size: 12.3 MB - Last synced at: 22 days ago - Pushed at: 26 days ago - Stars: 228 - Forks: 28

Yu-Group/imodels-experiments

Experiments with experimental rule-based models to go along with imodels.

Language: Jupyter Notebook - Size: 223 MB - Last synced at: 26 days ago - Pushed at: 27 days ago - Stars: 16 - Forks: 5

infinite-table/infinite-react

The modern React DataGrid for building apps — faster

Language: JavaScript - Size: 92.7 MB - Last synced at: 26 days ago - Pushed at: 27 days ago - Stars: 81 - Forks: 5

sdv-dev/TGAN

Generative adversarial training for generating synthetic tabular data.

Language: Python - Size: 7.84 MB - Last synced at: 22 days ago - Pushed at: over 2 years ago - Stars: 290 - Forks: 91

aws-samples/aws-machine-learning-university-dte

Machine Learning University: Decision Trees and Ensemble Methods

Language: Jupyter Notebook - Size: 26.4 MB - Last synced at: 22 days ago - Pushed at: 9 months ago - Stars: 252 - Forks: 89

TheDataStation/pneuma

LLM-Powered Data Discovery System for Tabular Data

Language: Python - Size: 59.8 MB - Last synced at: 25 days ago - Pushed at: about 2 months ago - Stars: 10 - Forks: 4

Enselic/git-repo-language-trends

Analyze programming language usage over time in a git repository and produce a graphical or textual representation of the result.

Language: Python - Size: 217 KB - Last synced at: 14 days ago - Pushed at: 8 months ago - Stars: 3 - Forks: 0

ds-modules/core-resources

Short examples and templates for common ds-module tasks

Language: Jupyter Notebook - Size: 9.47 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 12 - Forks: 22

microsoft/FLAML

A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.

Language: Jupyter Notebook - Size: 206 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 4,138 - Forks: 535

IBM/TabFormer

Code & Data for "Tabular Transformers for Modeling Multivariate Time Series" (ICASSP, 2021)

Language: Python - Size: 460 KB - Last synced at: 18 days ago - Pushed at: almost 2 years ago - Stars: 336 - Forks: 89

spapicchio/QATCH

Official implementation of QATCH: Benchmarking SQL-centric tasks with Table Representation Learning Models on Your Data

Language: Python - Size: 5.07 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 28 - Forks: 0

schneiderkamplab/syntheval

Software for evaluating the quality of synthetic data compared with real data.

Language: Python - Size: 2.68 MB - Last synced at: 28 days ago - Pushed at: 3 months ago - Stars: 22 - Forks: 7

rainalexotl/ml-economics-animal-crossing

A neural network in PyTorch that predicts in-game clothing prices in ACNH. Used extensive EDA and categorical embeddings, achieving RMSE of 0.15 on scaled prices. Fun, game-based use case with real-world ML pipeline structure.

Language: Jupyter Notebook - Size: 1.71 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

amaiya/ktrain

ktrain is a Python library that makes deep learning and AI more accessible and easier to apply

Language: Jupyter Notebook - Size: 108 MB - Last synced at: 6 days ago - Pushed at: 5 months ago - Stars: 1,260 - Forks: 266

zadid6pretam/TabSeq

TabSeq: A Framework for Deep Learning on Tabular Data via Sequential Ordering

Language: Python - Size: 855 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 0