An open API service providing repository metadata for many open source software ecosystems.

Topic: "tabular-data"

bvaughn/react-virtualized

React components for efficiently rendering large lists and tabular data

Language: JavaScript - Size: 46.5 MB - Last synced at: 7 days ago - Pushed at: 11 months ago - Stars: 27,049 - Forks: 3,059

autogluon/autogluon

Fast and Accurate ML in 3 Lines of Code

Language: Python - Size: 23.3 MB - Last synced at: 3 days ago - Pushed at: 5 days ago - Stars: 9,716 - Forks: 1,101

johnkerl/miller

Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON

Language: Go - Size: 201 MB - Last synced at: 11 days ago - Pushed at: 13 days ago - Stars: 9,562 - Forks: 230

saulpw/visidata

A terminal spreadsheet multitool for discovering and arranging data

Language: Python - Size: 52.9 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 8,678 - Forks: 321

vaexio/vaex

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

Language: Python - Size: 133 MB - Last synced at: 4 days ago - Pushed at: about 2 months ago - Stars: 8,463 - Forks: 603

PriorLabs/TabPFN

⚡ TabPFN: Foundation Model for Tabular Data ⚡

Language: Jupyter Notebook - Size: 268 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 5,314 - Forks: 522

microsoft/FLAML

A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.

Language: Jupyter Notebook - Size: 209 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 4,206 - Forks: 544

inokawa/virtua

A zero-config, fast and small (~3kB) virtual list (and grid) component for React, Vue, Solid and Svelte.

Language: TypeScript - Size: 156 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 3,290 - Forks: 94

antonycourtney/tad

A desktop application for viewing and analyzing tabular data

Language: TypeScript - Size: 27.9 MB - Last synced at: 9 months ago - Pushed at: 10 months ago - Stars: 3,272 - Forks: 120

dreamquark-ai/tabnet

PyTorch implementation of TabNet paper : https://arxiv.org/pdf/1908.07442.pdf

Language: Python - Size: 6.61 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 2,865 - Forks: 512

PhoebusSi/Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台,我们欢迎开源爱好者发起任何有意义的pr!

Language: Jupyter Notebook - Size: 137 MB - Last synced at: 18 days ago - Pushed at: about 2 years ago - Stars: 2,786 - Forks: 252

automl/Auto-PyTorch

Automatic architecture search and hyperparameter optimization for PyTorch

Language: Python - Size: 19.4 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 2,497 - Forks: 301

nhn/tui.grid

🍞🔡 The Powerful Component to Display and Edit Data. Experience the Ultimate Data Transformer!

Language: TypeScript - Size: 64.8 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 2,476 - Forks: 397

SeldonIO/alibi-detect

Algorithms for outlier, adversarial and drift detection

Language: Jupyter Notebook - Size: 35.4 MB - Last synced at: 3 days ago - Pushed at: 13 days ago - Stars: 2,469 - Forks: 241

hitsz-ids/synthetic-data-generator

SDG is a specialized framework designed to generate high-quality structured tabular data.

Language: Python - Size: 4.19 MB - Last synced at: 21 days ago - Pushed at: 23 days ago - Stars: 2,398 - Forks: 386

approximatelabs/sketch

AI code-writing assistant that understands data content

Language: Python - Size: 8.98 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 2,287 - Forks: 119

alexhallam/tv

📺(tv) Tidy Viewer is a cross-platform CLI csv pretty printer that uses column styling to maximize viewer enjoyment.

Language: Rust - Size: 40.3 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 2,109 - Forks: 40

youngfish42/Awesome-FL

Comprehensive and timely academic information on federated learning (papers, frameworks, datasets, tutorials, workshops)

Language: Python - Size: 3.45 MB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 1,901 - Forks: 211

JuliaData/DataFrames.jl

In-memory tabular data in Julia

Language: Julia - Size: 29.9 MB - Last synced at: 18 days ago - Pushed at: 23 days ago - Stars: 1,802 - Forks: 374

Lightning-Universe/lightning-flash 📦

Your PyTorch AI Factory - Flash enables you to easily configure and run complex AI recipes for over 15 tasks across 7 data domains

Language: Python - Size: 12.8 MB - Last synced at: 3 days ago - Pushed at: about 2 years ago - Stars: 1,736 - Forks: 210

manujosephv/pytorch_tabular

A standard framework for modelling Deep Learning Models for tabular data

Language: Python - Size: 39.6 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1,588 - Forks: 160

capitalone/DataProfiler

What's in your data? Extract schema, statistics and entities from datasets

Language: Python - Size: 35.7 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 1,523 - Forks: 178

sdv-dev/CTGAN

Conditional GAN for generating synthetic tabular data.

Language: Python - Size: 1.86 MB - Last synced at: 18 days ago - Pushed at: about 1 month ago - Stars: 1,494 - Forks: 323

eBay/tsv-utils

eBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.

Language: D - Size: 2.77 MB - Last synced at: 14 days ago - Pushed at: over 3 years ago - Stars: 1,462 - Forks: 82

jrzaurin/pytorch-widedeep

A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in Pytorch

Language: Python - Size: 100 MB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 1,393 - Forks: 196

amaiya/ktrain

ktrain is a Python library that makes deep learning and AI more accessible and easier to apply

Language: Jupyter Notebook - Size: 108 MB - Last synced at: 4 months ago - Pushed at: 11 months ago - Stars: 1,260 - Forks: 265

NVIDIA-Merlin/Transformers4Rec

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.

Language: Python - Size: 211 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1,220 - Forks: 154

yandex-research/rtdl

Research on Tabular Deep Learning: Papers & Packages

Language: Python - Size: 24.6 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 1,052 - Forks: 113

lucidrains/tab-transformer-pytorch

Implementation of TabTransformer, attention network for tabular data, in Pytorch

Language: Python - Size: 281 KB - Last synced at: 3 days ago - Pushed at: 6 days ago - Stars: 1,047 - Forks: 126

aws-samples/aws-machine-learning-university-accelerated-tab

Machine Learning University: Accelerated Tabular Data Class

Language: Jupyter Notebook - Size: 38.9 MB - Last synced at: 7 months ago - Pushed at: about 1 year ago - Stars: 1,018 - Forks: 314

turicas/rows

A common, beautiful interface to tabular data, no matter the format

Language: Python - Size: 7.81 MB - Last synced at: 23 days ago - Pushed at: 24 days ago - Stars: 886 - Forks: 136

LAMDA-Tabular/TALENT

A comprehensive toolkit and benchmark for tabular data learning, featuring 35+ deep methods, more than 10 classical methods, and 300 diverse tabular datasets.

Language: Python - Size: 106 MB - Last synced at: 3 days ago - Pushed at: 7 days ago - Stars: 795 - Forks: 45

DataCanvasIO/DeepTables

DeepTables: Deep-learning Toolkit for Tabular data

Language: Python - Size: 5.75 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 694 - Forks: 120

adrienjoly/npm-pdfreader

🚜 Parse text and tables from PDF files.

Language: HTML - Size: 1.9 MB - Last synced at: 17 days ago - Pushed at: about 1 month ago - Stars: 693 - Forks: 87

sdv-dev/Copulas

A library to model multivariate data using copulas.

Language: Python - Size: 30.5 MB - Last synced at: 18 days ago - Pushed at: about 1 month ago - Stars: 625 - Forks: 117

vanderschaarlab/synthcity

A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.

Language: Python - Size: 6.8 MB - Last synced at: 27 days ago - Pushed at: 6 months ago - Stars: 622 - Forks: 82

georgian-io/Multimodal-Toolkit

Multimodal model for text and tabular data with HuggingFace transformers as building block for text data

Language: Python - Size: 72.1 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 607 - Forks: 91

ropensci/tabulapdf

Bindings for Tabula PDF Table Extractor Library

Language: R - Size: 32.4 MB - Last synced at: 5 days ago - Pushed at: 12 months ago - Stars: 563 - Forks: 72

firmai/deltapy

DeltaPy - Tabular Data Augmentation (by @firmai)

Language: Jupyter Notebook - Size: 1.47 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 554 - Forks: 57

Diyago/Tabular-data-generation

We well know GANs for success in the realistic image generation. However, they can be applied in tabular data generation. We will review and examine some recent papers about tabular GANs in action.

Language: Python - Size: 52.8 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 552 - Forks: 82

JuliaData/DataFramesMeta.jl

Metaprogramming tools for DataFrames

Language: Julia - Size: 1.73 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 493 - Forks: 57

shanmukh05/Machine-Learning-Roadmap

A roadmap for getting started with Machine Learning

Size: 266 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 462 - Forks: 77

Desbordante/desbordante-core

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

Language: C++ - Size: 152 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 453 - Forks: 87

pavankataria/SwiftDataTables

A Swift Data Table package, display grid-like data sets in a nicely formatted table for iOS. Subclassing UICollectionView that allows ordering, and searching with extensible options.

Language: Swift - Size: 11.4 MB - Last synced at: 3 months ago - Pushed at: almost 4 years ago - Stars: 451 - Forks: 70

reubano/meza

A Python toolkit for processing tabular data

Language: Python - Size: 4.5 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 420 - Forks: 29

johnnyhwu/Awesome-LLM-Tabular

Awesome-LLM-Tabular: a curated list of Large Language Model applied to Tabular Data

Size: 1.2 MB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 418 - Forks: 33

carefree0910/carefree-learn

Deep Learning ❤️ PyTorch

Language: Python - Size: 6.42 MB - Last synced at: 8 days ago - Pushed at: almost 2 years ago - Stars: 412 - Forks: 39

somepago/saint

The official PyTorch implementation of recent paper - SAINT: Improved Neural Networks for Tabular Data via Row Attention and Contrastive Pre-Training

Language: Python - Size: 229 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 382 - Forks: 61

Meteor-Community-Packages/meteor-tabular

Reactive datatables for large or small datasets

Language: JavaScript - Size: 598 KB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 360 - Forks: 136

DataCanvasIO/HyperGBM

A full pipeline AutoML tool for tabular data

Language: Python - Size: 11 MB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 355 - Forks: 47

IBM/TabFormer

Code & Data for "Tabular Transformers for Modeling Multivariate Time Series" (ICASSP, 2021)

Language: Python - Size: 436 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 350 - Forks: 89

scottrhoyt/SwiftyTextTable

A lightweight library for generating text tables.

Language: Swift - Size: 368 KB - Last synced at: 12 days ago - Pushed at: almost 3 years ago - Stars: 331 - Forks: 28

tabularis-ai/be_great

A novel approach for synthesizing tabular data using pretrained large language models

Language: Python - Size: 4.31 MB - Last synced at: 23 days ago - Pushed at: 25 days ago - Stars: 330 - Forks: 55

continuum/active_importer 📦

Define importers that load tabular data from spreadsheets or CSV files into any ActiveRecord-like ORM.

Language: Ruby - Size: 160 KB - Last synced at: 9 days ago - Pushed at: about 4 years ago - Stars: 329 - Forks: 19

openml/openml-python

OpenML's Python API for a World of Data and More 💫

Language: Python - Size: 203 MB - Last synced at: about 20 hours ago - Pushed at: 2 days ago - Stars: 319 - Forks: 205

abhijithneilabraham/tableQA

AI Tool for querying natural language on tabular data.

Language: Python - Size: 28.2 MB - Last synced at: 10 days ago - Pushed at: about 2 years ago - Stars: 316 - Forks: 46

dholzmueller/pytabkit

ML models + benchmark for tabular data classification and regression

Language: Python - Size: 1.37 MB - Last synced at: 4 days ago - Pushed at: 8 days ago - Stars: 311 - Forks: 31

yandex-research/tabular-dl-tabr

The implementation of "TabR: Unlocking the Power of Retrieval-Augmented Tabular Deep Learning"

Language: Python - Size: 19.5 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 310 - Forks: 34

posit-dev/pointblank

Data validation toolkit for assessing and monitoring data quality.

Language: Python - Size: 293 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 303 - Forks: 22

keithknott26/datadash

Visualize and graph data in the terminal

Language: Go - Size: 96.8 MB - Last synced at: 4 months ago - Pushed at: almost 3 years ago - Stars: 295 - Forks: 14

sdv-dev/TGAN

Generative adversarial training for generating synthetic tabular data.

Language: Python - Size: 7.84 MB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 294 - Forks: 91

sdv-dev/SDGym

Benchmarking synthetic data generation methods.

Language: Python - Size: 3.59 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 289 - Forks: 65

ExtractTable/ExtractTable-py

Python library to extract tabular data from images and scanned PDFs

Language: Python - Size: 3.39 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 283 - Forks: 35

healthylaife/MIMIC-IV-Data-Pipeline

A customizable pipeline for data extraction from MIMIC-IV; now multimodal!

Language: Jupyter Notebook - Size: 229 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 276 - Forks: 96

aws-samples/aws-machine-learning-university-dte

Machine Learning University: Decision Trees and Ensemble Methods

Language: Jupyter Notebook - Size: 26.4 MB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 252 - Forks: 89

worldbank/REaLTabFormer

A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.

Language: Jupyter Notebook - Size: 13.4 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 234 - Forks: 29

soda-inria/tabicl

Repository for TabICL: A Tabular Foundation Model for In-Context Learning on Large Data

Language: Python - Size: 1.85 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 232 - Forks: 43

PriorLabs/tabpfn-extensions

Community extensions for TabPFN - the foundation model for tabular data. Built with TabPFN! 🤗

Language: Python - Size: 1.33 MB - Last synced at: 7 days ago - Pushed at: 9 days ago - Stars: 228 - Forks: 46

SpursGoZmy/Tabular-LLM

本项目旨在收集开源的表格智能任务数据集(比如表格问答、表格-文本生成等),将原始数据整理为指令微调格式的数据并微调LLM,进而增强LLM对于表格数据的理解,最终构建出专门面向表格智能任务的大型语言模型。

Size: 6.48 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 228 - Forks: 17

PriorLabs/tabpfn-client

⚡ Easy API access to the tabular foundation model TabPFN ⚡

Language: Python - Size: 425 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 215 - Forks: 20

YGZWQZD/LAMDA-SSL

30 Semi-Supervised Learning Algorithms

Language: Python - Size: 10.5 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 212 - Forks: 16

wwweiwei/awesome-self-supervised-learning-for-tabular-data

A collection of research materials on SSL for non-sequential tabular data (SSL4NSTD)

Size: 77.1 KB - Last synced at: 10 days ago - Pushed at: about 2 months ago - Stars: 205 - Forks: 13

BdR76/CSVLint

CSV Lint plug-in for Notepad++ for syntax highlighting, csv validation, automatic column and datatype detecting, fixed width datasets, change datetime format, decimal separator, sort data, count unique values, convert to xml, json, sql etc. A plugin for data cleaning and working with messy data files.

Language: C# - Size: 13.3 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 205 - Forks: 18

RyanWangZf/transtab

NeurIPS'22 | TransTab: Learning Transferable Tabular Transformers Across Tables

Language: Python - Size: 1.31 MB - Last synced at: 23 days ago - Pushed at: 10 months ago - Stars: 202 - Forks: 29

mirador/mirador

Tool for visual exploration of complex data.

Language: Java - Size: 29.2 MB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 191 - Forks: 24

rubycocos/csvreader

csvreader library / gem - read tabular data in the comma-separated values (csv) format the right way (uses best practices out-of-the-box with zero-configuration)

Language: Ruby - Size: 459 KB - Last synced at: 4 days ago - Pushed at: 8 months ago - Stars: 177 - Forks: 9

nirum/tableprint

Pretty console printing :clipboard: of tabular data in python :snake:

Language: Python - Size: 478 KB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 171 - Forks: 17

openalloc/SwiftTabler

A multi-platform SwiftUI component for tabular data

Language: Swift - Size: 1.41 MB - Last synced at: 11 days ago - Pushed at: over 2 years ago - Stars: 145 - Forks: 17

AstraZeneca/SubTab

The official implementation of the paper, "SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation Learning"

Language: Python - Size: 42.6 MB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 145 - Forks: 21

szmikler/progress-table

Display progress as a pretty table in the command line.

Language: Python - Size: 23.3 MB - Last synced at: 16 days ago - Pushed at: 19 days ago - Stars: 144 - Forks: 3

sl-solution/InMemoryDatasets.jl

Multithreaded package for working with tabular data in Julia

Language: Julia - Size: 7.76 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 130 - Forks: 19

ajayarunachalam/msda

Library for multi-dimensional, multi-sensor, uni/multivariate time series data analysis, unsupervised feature selection, unsupervised deep anomaly detection, and prototype of explainable AI for anomaly detector

Language: Jupyter Notebook - Size: 11.5 MB - Last synced at: 3 months ago - Pushed at: about 4 years ago - Stars: 129 - Forks: 30

yubowenok/visflow

Web-based Dataflow Framework for Visual Data Exploration

Language: TypeScript - Size: 11.9 MB - Last synced at: 9 months ago - Pushed at: almost 3 years ago - Stars: 123 - Forks: 16

feedzai/fairgbm

Train Gradient Boosting models that are both high-performance *and* Fair!

Language: C++ - Size: 43 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 105 - Forks: 7

olehmberg/winter

WInte.r is a Java framework for end-to-end data integration. The WInte.r framework implements well-known methods for data pre-processing, schema matching, identity resolution, data fusion, and result evaluation.

Language: Java - Size: 18.6 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 105 - Forks: 32

juancarlospaco/faster-than-csv

Faster CSV for Python

Language: Python - Size: 15.5 MB - Last synced at: 9 months ago - Pushed at: almost 4 years ago - Stars: 102 - Forks: 8

radi-cho/GatedTabTransformer

A deep learning tabular classification architecture inspired by TabTransformer with integrated gated multilayer perceptron.

Language: Jupyter Notebook - Size: 60.4 MB - Last synced at: 6 months ago - Pushed at: almost 3 years ago - Stars: 99 - Forks: 7

s-marton/GRANDE

(ICLR 2024) GRANDE: Gradient-Based Decision Tree Ensembles

Language: Jupyter Notebook - Size: 9.89 MB - Last synced at: 5 days ago - Pushed at: 8 days ago - Stars: 98 - Forks: 11

aerosol/Tabula

:u7533: Pretty printer for maps/structs collections (Elixir)

Language: Elixir - Size: 40 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 94 - Forks: 3

naity/image_tabular

Integrate image and tabular data for deep learning

Language: Jupyter Notebook - Size: 216 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 89 - Forks: 40

LAMDA-Tabular/Tabular-Survey

Awesome Tabular Deep Learning for "Representation Learning for Tabular Data: A Comprehensive Survey"

Size: 1.48 MB - Last synced at: about 2 hours ago - Pushed at: 1 day ago - Stars: 88 - Forks: 9

infinite-table/infinite-react

The modern React DataGrid for building apps — faster

Language: JavaScript - Size: 104 MB - Last synced at: 4 days ago - Pushed at: 7 days ago - Stars: 87 - Forks: 5

decodingai-magazine/tabular-semantic-search-tutorial

📚 Tutorial on building a modern search app for Amazon e-commerce products leveraging tabular semantic search and natural language queries.

Language: Jupyter Notebook - Size: 1.01 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 85 - Forks: 20

machinelearningnuremberg/WellTunedSimpleNets

[NeurIPS 2021] Well-tuned Simple Nets Excel on Tabular Datasets

Language: Python - Size: 95.7 KB - Last synced at: 7 months ago - Pushed at: almost 3 years ago - Stars: 85 - Forks: 15

r-rudra/tidycells

Automatic transformation of untidy spreadsheet-like data into tidy form

Language: R - Size: 2.82 MB - Last synced at: 5 months ago - Pushed at: 10 months ago - Stars: 83 - Forks: 10

nusdbsystem/ARM-Net

A ready-to-use framework of the state-of-the-art models for structured (tabular) data learning with PyTorch. Applications include recommendation, CRT prediction, healthcare analytics, anomaly detection, and etc.

Language: Python - Size: 24.1 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 80 - Forks: 13

keizerzilla/telegram-chat-parser

Python script to parse a Telegram chat history backup (JSON) into tabular format (CSV). No extra packages required, only Python 3.x!

Language: Python - Size: 42 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 80 - Forks: 27

jrieke/fastapi-csv

🏗️ Create APIs from CSV files within seconds, using fastapi

Language: Python - Size: 230 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 79 - Forks: 16

PolyMathOrg/DataFrame

DataFrame in Pharo - tabular data structures for data analysis

Language: Smalltalk - Size: 2.13 MB - Last synced at: 7 months ago - Pushed at: 10 months ago - Stars: 77 - Forks: 28

LennartPurucker/finetune_tabpfn_v2

Code for finetuning TabPFN on one downstream tabular dataset.

Language: Python - Size: 67.4 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 76 - Forks: 14

Related Topics
machine-learning 186 python 119 deep-learning 104 data-science 80 csv 63 pytorch 57 pandas 35 synthetic-data 35 classification 34 data-analysis 31 data-visualization 30 table 29 scikit-learn 28 automl 25 tabular 25 data 24 regression 23 xgboost 22 kaggle 21 nlp 20 json 20 feature-engineering 19 excel 18 transformer 17 tsv 16 time-series 16 artificial-intelligence 16 generative-ai 15 explainable-ai 15 python3 14 ai 14 lightgbm 14 statistics 13 generative-model 13 gan 12 jupyter-notebook 12 ml 12 datasets 12 javascript 12 deep-neural-networks 12 spreadsheet 12 neural-network 12 database 11 ast 11 tdast 11 unist 11 data-cleaning 11 generative-adversarial-network 11 natural-language-processing 11 computer-vision 11 numpy 11 data-augmentation 11 pytorch-lightning 10 llm 10 xai 10 dataset 10 self-supervised-learning 10 benchmark 10 sql 10 csv-files 10 synthetic-dataset-generation 10 visualization 10 hyperparameter-optimization 10 grid 10 tables 9 catboost 9 util 9 julia 9 kaggle-competition 9 typescript 9 semi-supervised-learning 9 eda 8 research 8 healthcare 8 tensorflow 8 neural-networks 8 ocr 8 knowledge-graph 8 graph-neural-networks 8 tdast-util 8 tabpfn 8 machine-learning-algorithms 8 dataframe 8 gradient-boosting 7 diffusion-models 7 synthetic-data-generation 7 structured-data 7 data-generation 7 data-quality 7 tabular-methods 7 tabular-data-formatter 7 preprocessing 7 terminal 7 command-line 7 open-source 7 ensemble-learning 7 fastapi 7 decision-trees 7 keras 7 data-preprocessing 7