GitHub topics: tabular-data
Xatta-Trone/awesome-tdl
A curated collection of TDL (Tabular Deep Learning) resources—libraries, projects, tutorials, papers, and more—for researchers and developers in the field.
Language: JavaScript - Size: 140 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 2 - Forks: 0

johnkerl/miller
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Language: Go - Size: 201 MB - Last synced at: about 4 hours ago - Pushed at: 8 days ago - Stars: 9,329 - Forks: 224

RyanWangZf/transtab
NeurIPS'22 | TransTab: Learning Transferable Tabular Transformers Across Tables
Language: Python - Size: 1.31 MB - Last synced at: about 19 hours ago - Pushed at: 3 months ago - Stars: 194 - Forks: 26

BONDO2K-cloud/acupoftea
[PHP] ACUPOFTEA WEBSHELL BYPASS SERV 403 404
Language: PHP - Size: 19.5 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1 - Forks: 1

PriorLabs/TabPFN
⚡ TabPFN: Foundation Model for Tabular Data ⚡
Language: Python - Size: 253 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 3,857 - Forks: 353

sdv-dev/Copulas
A library to model multivariate data using copulas.
Language: Python - Size: 31.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 595 - Forks: 116

dmitryglhf/autodask
AutoML Library Based on Dask with Bee Colony Optimization
Language: Python - Size: 1.66 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 3 - Forks: 0

PriorLabs/tabpfn-extensions
Community extensions for TabPFN - the foundation model for tabular data. Built with TabPFN! 🤗
Language: Python - Size: 761 KB - Last synced at: about 13 hours ago - Pushed at: about 14 hours ago - Stars: 139 - Forks: 26

JuliaData/DataFramesMeta.jl
Metaprogramming tools for DataFrames
Language: Julia - Size: 1.48 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 487 - Forks: 56

MigoXLab/awesome-data-quality
A comprehensive collection of data quality resources, tools, papers, and projects across various data types including traditional data, LLM pretraining/fine-tuning data, multimodal data, and more. Essential reference for researchers and practitioners in data-centric AI.
Size: 22.5 KB - Last synced at: 3 days ago - Pushed at: 10 days ago - Stars: 6 - Forks: 0

supernifty/csvtools
Command line processing of tabular data (CSVs, TSVs)
Language: Python - Size: 159 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3 - Forks: 2

youngfish42/Awesome-FL
Comprehensive and timely academic information on federated learning (papers, frameworks, datasets, tutorials, workshops)
Language: Python - Size: 5.42 MB - Last synced at: 3 days ago - Pushed at: 22 days ago - Stars: 1,753 - Forks: 197

Lightning-Universe/lightning-flash 📦
Your PyTorch AI Factory - Flash enables you to easily configure and run complex AI recipes for over 15 tasks across 7 data domains
Language: Python - Size: 12.8 MB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 1,744 - Forks: 211

LAMDA-Tabular/TALENT
A comprehensive toolkit and benchmark for tabular data learning, featuring 30+ deep methods, more than 10 classical methods, and 300 diverse tabular datasets.
Language: Python - Size: 95.3 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 644 - Forks: 29

bvaughn/react-virtualized
React components for efficiently rendering large lists and tabular data
Language: JavaScript - Size: 46.5 MB - Last synced at: 4 days ago - Pushed at: 5 months ago - Stars: 26,831 - Forks: 3,065

sdv-dev/CTGAN
Conditional GAN for generating synthetic tabular data.
Language: Python - Size: 1.83 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1,410 - Forks: 314

posit-dev/pointblank
Data validation made beautiful and powerful
Language: Python - Size: 82 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 208 - Forks: 13

openml/openml-python
OpenML's Python API for a World of Data and More 💫
Language: Python - Size: 194 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 298 - Forks: 152

yangfa-zhang/lunax
Lunax is a machine learning framework specifically designed for the processing and analysis of tabular data.
Language: Python - Size: 25.5 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

aerosol/Tabula
:u7533: Pretty printer for maps/structs collections (Elixir)
Language: Elixir - Size: 40 KB - Last synced at: 3 days ago - Pushed at: over 3 years ago - Stars: 95 - Forks: 3

ImJaeSung/Synthesizers
Implementations of various synthesizers with pytorch.
Language: Python - Size: 14.7 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

Bilpapster/QualiTab
🔎 Investigating the performance of tabular foundation models in the wild of imperfect data
Language: Python - Size: 213 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

autogluon/autogluon
Fast and Accurate ML in 3 Lines of Code
Language: Python - Size: 22.1 MB - Last synced at: 7 days ago - Pushed at: 9 days ago - Stars: 8,934 - Forks: 1,025

romanmicuda/predict-engineering-salaries
Your challenge in this competition is to predict whether a job's salary falls into one of three categories: High, Medium, or Low, using the provided job-related data. Can you build a model that accurately classifies salaries based on factors like job title, description, and required qualifications? Let’s find out!
Language: Jupyter Notebook - Size: 6.16 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

turicas/rows
A common, beautiful interface to tabular data, no matter the format
Language: Python - Size: 7.72 MB - Last synced at: 5 days ago - Pushed at: 8 days ago - Stars: 876 - Forks: 135

teuben/nemo
a Stellar Dynamics Toolbox (Not Everybody Must Observe)
Language: C - Size: 46 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 68 - Forks: 51

vanderschaarlab/synthcity
A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.
Language: Python - Size: 6.77 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 555 - Forks: 76

alexhallam/tv
📺(tv) Tidy Viewer is a cross-platform CLI csv pretty printer that uses column styling to maximize viewer enjoyment.
Language: Rust - Size: 33.2 MB - Last synced at: 1 day ago - Pushed at: 6 months ago - Stars: 2,097 - Forks: 40

tdspora/syngen
Open-source version of the TDspora synthetic data generation algorithm.
Language: Jupyter Notebook - Size: 18.2 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 17 - Forks: 9

sentinel-energy/friendly_data
Data format to interoperate between models and frameworks
Language: Python - Size: 4.65 MB - Last synced at: 1 day ago - Pushed at: almost 3 years ago - Stars: 12 - Forks: 2

saulpw/visidata
A terminal spreadsheet multitool for discovering and arranging data
Language: Python - Size: 52.5 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 8,262 - Forks: 296

paulhorton/perltab
Extension of Perl autosplit mode for command-line processing of tabular data
Language: Perl - Size: 395 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 5 - Forks: 0

jrzaurin/pytorch-widedeep
A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in Pytorch
Language: Python - Size: 99.6 MB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 1,356 - Forks: 193

Desbordante/desbordante-core
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
Language: C++ - Size: 146 MB - Last synced at: 4 days ago - Pushed at: 17 days ago - Stars: 406 - Forks: 76

sdv-dev/SDGym
Benchmarking synthetic data generation methods.
Language: Python - Size: 3.06 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 274 - Forks: 63

daq-tools/skeem
Infer SQL DDL statements from tabular data.
Language: Python - Size: 260 KB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 3 - Forks: 1

PhoebusSi/Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台,我们欢迎开源爱好者发起任何有意义的pr!
Language: Jupyter Notebook - Size: 137 MB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 2,751 - Forks: 254

Clearbox-AI/preprocessor
A fast and felxible data preprocessor based on polars.
Language: Python - Size: 1.88 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 6 - Forks: 0

JuliaData/DataFrames.jl
In-memory tabular data in Julia
Language: Julia - Size: 28.8 MB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 1,773 - Forks: 372

PASTAplus/dex
Explore and subset CSV tables using associated EML metadata
Language: Python - Size: 925 KB - Last synced at: 2 days ago - Pushed at: 10 days ago - Stars: 3 - Forks: 0

sorenfyhn/ridge-regression
Framework for a ridge regression model that predicts heating load.
Language: Python - Size: 940 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

ncss-tech/stats_for_soil_survey
S4SS: Statistics for Soil Survey
Language: HTML - Size: 720 MB - Last synced at: 6 days ago - Pushed at: 11 days ago - Stars: 35 - Forks: 9

TabArena/tabarena_benchmarking_examples
Examples for using TabArena for benchmarking machine learning models (on SLURM)
Language: Python - Size: 73.2 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 4 - Forks: 0

dholzmueller/pytabkit
ML models + benchmark for tabular data classification and regression
Language: Python - Size: 1.12 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 168 - Forks: 17

lyrasis/csv-data-tools
Tools for working with CSV data (or other basic tabular data formats)
Language: Ruby - Size: 80.1 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

openalloc/SwiftTabler
A multi-platform SwiftUI component for tabular data
Language: Swift - Size: 1.41 MB - Last synced at: 8 days ago - Pushed at: about 2 years ago - Stars: 142 - Forks: 16

inokawa/virtua
A zero-config, fast and small (~3kB) virtual list (and grid) component for React, Vue, Solid and Svelte.
Language: TypeScript - Size: 105 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 2,815 - Forks: 73

firmai/deltapy
DeltaPy - Tabular Data Augmentation (by @firmai)
Language: Jupyter Notebook - Size: 1.47 MB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 547 - Forks: 56

trl-lab/tabular-robustness
The benchmark code to the paper "How well do LLMs reason over tabular data, really?"
Language: Python - Size: 11.5 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 1 - Forks: 0

sigpwned/tabular4j
A library for reading and writing pan-format tabular data, especially spreadsheets, in Java 11+.
Language: Java - Size: 370 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 1 - Forks: 0

nhn/tui.grid
🍞🔡 The Powerful Component to Display and Edit Data. Experience the Ultimate Data Transformer!
Language: TypeScript - Size: 64.8 MB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 2,471 - Forks: 399

automl/Auto-PyTorch
Automatic architecture search and hyperparameter optimization for PyTorch
Language: Python - Size: 19.4 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 2,450 - Forks: 299

ExtractTable/ExtractTable-py
Python library to extract tabular data from images and scanned PDFs
Language: Python - Size: 3.39 MB - Last synced at: 13 days ago - Pushed at: 11 months ago - Stars: 279 - Forks: 34

maxwellt23/SwiftFrames
A Swift-native DataFrame library inspired by pandas — load, view, transform, and export tabular data with ease.
Language: Swift - Size: 25.4 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

4D-STAR/opat-core
Core libraries and modules for OPAT file format
Language: C++ - Size: 12.3 MB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 2

SeldonIO/alibi-detect
Algorithms for outlier, adversarial and drift detection
Language: Jupyter Notebook - Size: 35.3 MB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 2,382 - Forks: 232

fatbobman/TabularBuilder
Declarative TabularData creation for Swift - Convert objects to DataFrames with type-safe, SwiftUI-like syntax
Language: Swift - Size: 21.5 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 3 - Forks: 0

GhentCDH/taulu
Taulu is a Python package designed to segment tabular data in scanned or photographed documents.
Language: Python - Size: 11.4 MB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 2 - Forks: 0

ValentinMargraf/ActiveLearningPipelines
Specificy, execute and monitor performances of active learning pipelines.
Language: Python - Size: 1.89 MB - Last synced at: about 4 hours ago - Pushed at: 9 months ago - Stars: 23 - Forks: 1

approximatelabs/sketch
AI code-writing assistant that understands data content
Language: Python - Size: 8.98 MB - Last synced at: 17 days ago - Pushed at: over 1 year ago - Stars: 2,269 - Forks: 119

eZWALT/eZAutoML
A Democratized, lightweight and modern framework for Python Automated Machine Learning
Language: Python - Size: 1.83 MB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 53 - Forks: 1

sassoftware/dpmm
dpmm: a library for synthetic tabular data generation with rich functionality and end-to-end Differential Privacy guarantees
Language: Python - Size: 661 KB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 3 - Forks: 0

tabularis-ai/be_great
A novel approach for synthesizing tabular data using pretrained large language models
Language: Python - Size: 4.29 MB - Last synced at: 9 days ago - Pushed at: about 1 month ago - Stars: 312 - Forks: 52

manujosephv/pytorch_tabular
A standard framework for modelling Deep Learning Models for tabular data
Language: Python - Size: 39.6 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 1,524 - Forks: 153

yandex-research/tabred
(ICLR 2025 Spotlight) TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks
Language: Python - Size: 4.05 MB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 73 - Forks: 4

LennartPurucker/finetune_tabpfn_v2
Code for finetuning TabPFN on one downstream tabular dataset.
Language: Python - Size: 62.5 KB - Last synced at: 14 days ago - Pushed at: 30 days ago - Stars: 59 - Forks: 11

vaexio/vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
Language: Python - Size: 133 MB - Last synced at: 18 days ago - Pushed at: 9 months ago - Stars: 8,387 - Forks: 598

AhmedYousriSobhi/aCupOfTea
Let's settle down, rest our minds, spill the tea of our experience in multiple ai fields [Data Science, Machine Learning, Deep Learning], including many other aspects starting from prorgramming and clean code till design patterns & businness interference. Enjoy the drink, and if you find something interesting here, offer us a cup of tea.
Language: Jupyter Notebook - Size: 6.04 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 5 - Forks: 0

soda-inria/tabicl
Repository for TabICL: A Tabular Foundation Model for In-Context Learning on Large Data
Language: Python - Size: 1.88 MB - Last synced at: 20 days ago - Pushed at: about 1 month ago - Stars: 73 - Forks: 10

worldbank/REaLTabFormer
A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.
Language: Jupyter Notebook - Size: 12.3 MB - Last synced at: 17 days ago - Pushed at: 21 days ago - Stars: 228 - Forks: 28

Yu-Group/imodels-experiments
Experiments with experimental rule-based models to go along with imodels.
Language: Jupyter Notebook - Size: 223 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 16 - Forks: 5

infinite-table/infinite-react
The modern React DataGrid for building apps — faster
Language: JavaScript - Size: 92.7 MB - Last synced at: 21 days ago - Pushed at: 22 days ago - Stars: 81 - Forks: 5

wwweiwei/awesome-self-supervised-learning-for-tabular-data
A collection of research materials on SSL for non-sequential tabular data (SSL4NSTD)
Size: 72.3 KB - Last synced at: 10 days ago - Pushed at: 4 months ago - Stars: 192 - Forks: 12

sdv-dev/TGAN
Generative adversarial training for generating synthetic tabular data.
Language: Python - Size: 7.84 MB - Last synced at: 16 days ago - Pushed at: over 2 years ago - Stars: 290 - Forks: 91

aws-samples/aws-machine-learning-university-dte
Machine Learning University: Decision Trees and Ensemble Methods
Language: Jupyter Notebook - Size: 26.4 MB - Last synced at: 17 days ago - Pushed at: 9 months ago - Stars: 252 - Forks: 89

TheDataStation/pneuma
LLM-Powered Data Discovery System for Tabular Data
Language: Python - Size: 59.8 MB - Last synced at: 20 days ago - Pushed at: about 2 months ago - Stars: 10 - Forks: 4

Enselic/git-repo-language-trends
Analyze programming language usage over time in a git repository and produce a graphical or textual representation of the result.
Language: Python - Size: 217 KB - Last synced at: 9 days ago - Pushed at: 7 months ago - Stars: 3 - Forks: 0

ds-modules/core-resources
Short examples and templates for common ds-module tasks
Language: Jupyter Notebook - Size: 9.47 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 12 - Forks: 22

microsoft/FLAML
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
Language: Jupyter Notebook - Size: 206 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 4,138 - Forks: 535

IBM/TabFormer
Code & Data for "Tabular Transformers for Modeling Multivariate Time Series" (ICASSP, 2021)
Language: Python - Size: 460 KB - Last synced at: 13 days ago - Pushed at: almost 2 years ago - Stars: 336 - Forks: 89

spapicchio/QATCH
Official implementation of QATCH: Benchmarking SQL-centric tasks with Table Representation Learning Models on Your Data
Language: Python - Size: 5.07 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 28 - Forks: 0

schneiderkamplab/syntheval
Software for evaluating the quality of synthetic data compared with real data.
Language: Python - Size: 2.68 MB - Last synced at: 23 days ago - Pushed at: 3 months ago - Stars: 22 - Forks: 7

TLINDEN/tablizer
Manipulate tabular output of other programs
Language: Go - Size: 2.45 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 5 - Forks: 0

rainalexotl/ml-economics-animal-crossing
A neural network in PyTorch that predicts in-game clothing prices in ACNH. Used extensive EDA and categorical embeddings, achieving RMSE of 0.15 on scaled prices. Fun, game-based use case with real-world ML pipeline structure.
Language: Jupyter Notebook - Size: 1.71 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 0 - Forks: 0

amaiya/ktrain
ktrain is a Python library that makes deep learning and AI more accessible and easier to apply
Language: Jupyter Notebook - Size: 108 MB - Last synced at: about 12 hours ago - Pushed at: 5 months ago - Stars: 1,260 - Forks: 266

zadid6pretam/TabSeq
TabSeq: A Framework for Deep Learning on Tabular Data via Sequential Ordering
Language: Python - Size: 855 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 6 - Forks: 0

sebhaan/TabPFGen
TabPFGen: Synthetic Tabular Data Generation with TabPFN
Language: Python - Size: 140 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 7 - Forks: 0

DataCanvasIO/DeepTables
DeepTables: Deep-learning Toolkit for Tabular data
Language: Python - Size: 5.75 MB - Last synced at: 24 days ago - Pushed at: 7 months ago - Stars: 682 - Forks: 120

JuliaAPlavin/QuackIO.jl
Language: Julia - Size: 18.6 KB - Last synced at: 11 days ago - Pushed at: 3 months ago - Stars: 16 - Forks: 2

bowen-upenn/GeoGrid_Bench
GeoGrid-Bench: Can Foundation Models Understand Multimodal Gridded Geo-Spatial Data?
Language: Python - Size: 196 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 3 - Forks: 0

python-tableformatter/tableformatter
Tabular data formatter allowing printing from both arbitrary Iterables of Iterables or Iterables of objects via introspection
Language: Python - Size: 257 KB - Last synced at: 12 days ago - Pushed at: about 2 years ago - Stars: 19 - Forks: 4

RohanAdwankar/share-df
Python Package to Share/Edit Pandas/Polars DF with web interface!
Language: JavaScript - Size: 364 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 8 - Forks: 0

capitalone/DataProfiler
What's in your data? Extract schema, statistics and entities from datasets
Language: Python - Size: 35.7 MB - Last synced at: 25 days ago - Pushed at: 3 months ago - Stars: 1,492 - Forks: 171

TabArena/tabarena_dataset_curation
TabArena's dataset curation repository.
Language: Python - Size: 200 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

Alcoholrithm/TabularS3L
A PyTorch Lightning-based library for self- and semi-supervised learning on tabular data.
Language: Python - Size: 449 KB - Last synced at: 18 days ago - Pushed at: 4 months ago - Stars: 35 - Forks: 3

blei-lab/treeffuser
Treeffuser is an easy-to-use package for probabilistic prediction and probabilistic regression on tabular data with tree-based diffusion models.
Language: Jupyter Notebook - Size: 80.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 44 - Forks: 4

Clearbox-AI/clearbox-synthetic-kit
Clearbox AI's all-in-one solution for generation and evaluation of synthetic tabular and time-series data.
Language: Python - Size: 5.01 MB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 43 - Forks: 1

ropensci/tabulapdf
Bindings for Tabula PDF Table Extractor Library
Language: R - Size: 32.4 MB - Last synced at: 14 days ago - Pushed at: 6 months ago - Stars: 556 - Forks: 72

Anaxagor/applybn
Multi-purpose data analysis framework based on Bayesian networks and Causal models
Language: Python - Size: 145 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 16 - Forks: 4

socialfoundations/folktexts
Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on real-world survey data!
Language: Jupyter Notebook - Size: 28.8 MB - Last synced at: 29 days ago - Pushed at: 2 months ago - Stars: 22 - Forks: 4
