An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: tabular-data

Xatta-Trone/awesome-tdl

A curated collection of TDL (Tabular Deep Learning) resources—libraries, projects, tutorials, papers, and more—for researchers and developers in the field.

Language: JavaScript - Size: 140 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 2 - Forks: 0

johnkerl/miller

Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON

Language: Go - Size: 201 MB - Last synced at: about 4 hours ago - Pushed at: 8 days ago - Stars: 9,329 - Forks: 224

RyanWangZf/transtab

NeurIPS'22 | TransTab: Learning Transferable Tabular Transformers Across Tables

Language: Python - Size: 1.31 MB - Last synced at: about 19 hours ago - Pushed at: 3 months ago - Stars: 194 - Forks: 26

BONDO2K-cloud/acupoftea

[PHP] ACUPOFTEA WEBSHELL BYPASS SERV 403 404

Language: PHP - Size: 19.5 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1 - Forks: 1

PriorLabs/TabPFN

⚡ TabPFN: Foundation Model for Tabular Data ⚡

Language: Python - Size: 253 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 3,857 - Forks: 353

sdv-dev/Copulas

A library to model multivariate data using copulas.

Language: Python - Size: 31.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 595 - Forks: 116

dmitryglhf/autodask

AutoML Library Based on Dask with Bee Colony Optimization

Language: Python - Size: 1.66 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 3 - Forks: 0

PriorLabs/tabpfn-extensions

Community extensions for TabPFN - the foundation model for tabular data. Built with TabPFN! 🤗

Language: Python - Size: 761 KB - Last synced at: about 13 hours ago - Pushed at: about 14 hours ago - Stars: 139 - Forks: 26

JuliaData/DataFramesMeta.jl

Metaprogramming tools for DataFrames

Language: Julia - Size: 1.48 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 487 - Forks: 56

MigoXLab/awesome-data-quality

A comprehensive collection of data quality resources, tools, papers, and projects across various data types including traditional data, LLM pretraining/fine-tuning data, multimodal data, and more. Essential reference for researchers and practitioners in data-centric AI.

Size: 22.5 KB - Last synced at: 3 days ago - Pushed at: 10 days ago - Stars: 6 - Forks: 0

supernifty/csvtools

Command line processing of tabular data (CSVs, TSVs)

Language: Python - Size: 159 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3 - Forks: 2

youngfish42/Awesome-FL

Comprehensive and timely academic information on federated learning (papers, frameworks, datasets, tutorials, workshops)

Language: Python - Size: 5.42 MB - Last synced at: 3 days ago - Pushed at: 22 days ago - Stars: 1,753 - Forks: 197

Lightning-Universe/lightning-flash 📦

Your PyTorch AI Factory - Flash enables you to easily configure and run complex AI recipes for over 15 tasks across 7 data domains

Language: Python - Size: 12.8 MB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 1,744 - Forks: 211

LAMDA-Tabular/TALENT

A comprehensive toolkit and benchmark for tabular data learning, featuring 30+ deep methods, more than 10 classical methods, and 300 diverse tabular datasets.

Language: Python - Size: 95.3 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 644 - Forks: 29

bvaughn/react-virtualized

React components for efficiently rendering large lists and tabular data

Language: JavaScript - Size: 46.5 MB - Last synced at: 4 days ago - Pushed at: 5 months ago - Stars: 26,831 - Forks: 3,065

sdv-dev/CTGAN

Conditional GAN for generating synthetic tabular data.

Language: Python - Size: 1.83 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1,410 - Forks: 314

posit-dev/pointblank

Data validation made beautiful and powerful

Language: Python - Size: 82 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 208 - Forks: 13

openml/openml-python

OpenML's Python API for a World of Data and More 💫

Language: Python - Size: 194 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 298 - Forks: 152

yangfa-zhang/lunax

Lunax is a machine learning framework specifically designed for the processing and analysis of tabular data.

Language: Python - Size: 25.5 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

aerosol/Tabula

🈸 Pretty printer for maps/structs collections (Elixir)

Language: Elixir - Size: 40 KB - Last synced at: 3 days ago - Pushed at: over 3 years ago - Stars: 95 - Forks: 3

ImJaeSung/Synthesizers

Implementations of various synthesizers with PyTorch.

Language: Python - Size: 14.7 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

Bilpapster/QualiTab

🔎 Investigating the performance of tabular foundation models in the wild of imperfect data

Language: Python - Size: 213 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

autogluon/autogluon

Fast and Accurate ML in 3 Lines of Code

Language: Python - Size: 22.1 MB - Last synced at: 7 days ago - Pushed at: 9 days ago - Stars: 8,934 - Forks: 1,025

romanmicuda/predict-engineering-salaries

Your challenge in this competition is to predict whether a job's salary falls into one of three categories: High, Medium, or Low, using the provided job-related data. Can you build a model that accurately classifies salaries based on factors like job title, description, and required qualifications? Let’s find out!

Language: Jupyter Notebook - Size: 6.16 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

turicas/rows

A common, beautiful interface to tabular data, no matter the format

Language: Python - Size: 7.72 MB - Last synced at: 5 days ago - Pushed at: 8 days ago - Stars: 876 - Forks: 135

teuben/nemo

a Stellar Dynamics Toolbox (Not Everybody Must Observe)

Language: C - Size: 46 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 68 - Forks: 51

vanderschaarlab/synthcity

A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.

Language: Python - Size: 6.77 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 555 - Forks: 76

alexhallam/tv

📺(tv) Tidy Viewer is a cross-platform CLI csv pretty printer that uses column styling to maximize viewer enjoyment.

Language: Rust - Size: 33.2 MB - Last synced at: 1 day ago - Pushed at: 6 months ago - Stars: 2,097 - Forks: 40

tdspora/syngen

Open-source version of the TDspora synthetic data generation algorithm.

Language: Jupyter Notebook - Size: 18.2 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 17 - Forks: 9

sentinel-energy/friendly_data

Data format to interoperate between models and frameworks

Language: Python - Size: 4.65 MB - Last synced at: 1 day ago - Pushed at: almost 3 years ago - Stars: 12 - Forks: 2

saulpw/visidata

A terminal spreadsheet multitool for discovering and arranging data

Language: Python - Size: 52.5 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 8,262 - Forks: 296

paulhorton/perltab

Extension of Perl autosplit mode for command-line processing of tabular data

Language: Perl - Size: 395 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 5 - Forks: 0

jrzaurin/pytorch-widedeep

A flexible package for multimodal deep learning to combine tabular data with text and images using Wide and Deep models in PyTorch

Language: Python - Size: 99.6 MB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 1,356 - Forks: 193

Desbordante/desbordante-core

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows running data-cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

Language: C++ - Size: 146 MB - Last synced at: 4 days ago - Pushed at: 17 days ago - Stars: 406 - Forks: 76

sdv-dev/SDGym

Benchmarking synthetic data generation methods.

Language: Python - Size: 3.06 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 274 - Forks: 63

daq-tools/skeem

Infer SQL DDL statements from tabular data.

Language: Python - Size: 260 KB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 3 - Forks: 1

PhoebusSi/Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. We have built a fine-tuning platform that makes it easy for researchers to get started with and use large models, and we welcome open-source enthusiasts to submit any meaningful PR!

Language: Jupyter Notebook - Size: 137 MB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 2,751 - Forks: 254

Clearbox-AI/preprocessor

A fast and flexible data preprocessor based on polars.

Language: Python - Size: 1.88 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 6 - Forks: 0

JuliaData/DataFrames.jl

In-memory tabular data in Julia

Language: Julia - Size: 28.8 MB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 1,773 - Forks: 372

PASTAplus/dex

Explore and subset CSV tables using associated EML metadata

Language: Python - Size: 925 KB - Last synced at: 2 days ago - Pushed at: 10 days ago - Stars: 3 - Forks: 0

sorenfyhn/ridge-regression

Framework for a ridge regression model that predicts heating load.

Language: Python - Size: 940 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

ncss-tech/stats_for_soil_survey

S4SS: Statistics for Soil Survey

Language: HTML - Size: 720 MB - Last synced at: 6 days ago - Pushed at: 11 days ago - Stars: 35 - Forks: 9

TabArena/tabarena_benchmarking_examples

Examples for using TabArena for benchmarking machine learning models (on SLURM)

Language: Python - Size: 73.2 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 4 - Forks: 0

dholzmueller/pytabkit

ML models + benchmark for tabular data classification and regression

Language: Python - Size: 1.12 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 168 - Forks: 17

lyrasis/csv-data-tools

Tools for working with CSV data (or other basic tabular data formats)

Language: Ruby - Size: 80.1 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

openalloc/SwiftTabler

A multi-platform SwiftUI component for tabular data

Language: Swift - Size: 1.41 MB - Last synced at: 8 days ago - Pushed at: about 2 years ago - Stars: 142 - Forks: 16

inokawa/virtua

A zero-config, fast and small (~3kB) virtual list (and grid) component for React, Vue, Solid and Svelte.

Language: TypeScript - Size: 105 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 2,815 - Forks: 73

firmai/deltapy

DeltaPy - Tabular Data Augmentation (by @firmai)

Language: Jupyter Notebook - Size: 1.47 MB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 547 - Forks: 56

trl-lab/tabular-robustness

The benchmark code to the paper "How well do LLMs reason over tabular data, really?"

Language: Python - Size: 11.5 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 1 - Forks: 0

sigpwned/tabular4j

A library for reading and writing pan-format tabular data, especially spreadsheets, in Java 11+.

Language: Java - Size: 370 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 1 - Forks: 0

nhn/tui.grid

🍞🔡 The Powerful Component to Display and Edit Data. Experience the Ultimate Data Transformer!

Language: TypeScript - Size: 64.8 MB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 2,471 - Forks: 399

automl/Auto-PyTorch

Automatic architecture search and hyperparameter optimization for PyTorch

Language: Python - Size: 19.4 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 2,450 - Forks: 299

ExtractTable/ExtractTable-py

Python library to extract tabular data from images and scanned PDFs

Language: Python - Size: 3.39 MB - Last synced at: 13 days ago - Pushed at: 11 months ago - Stars: 279 - Forks: 34

maxwellt23/SwiftFrames

A Swift-native DataFrame library inspired by pandas — load, view, transform, and export tabular data with ease.

Language: Swift - Size: 25.4 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

4D-STAR/opat-core

Core libraries and modules for OPAT file format

Language: C++ - Size: 12.3 MB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 2

SeldonIO/alibi-detect

Algorithms for outlier, adversarial and drift detection

Language: Jupyter Notebook - Size: 35.3 MB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 2,382 - Forks: 232

fatbobman/TabularBuilder

Declarative TabularData creation for Swift - Convert objects to DataFrames with type-safe, SwiftUI-like syntax

Language: Swift - Size: 21.5 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 3 - Forks: 0

GhentCDH/taulu

Taulu is a Python package designed to segment tabular data in scanned or photographed documents.

Language: Python - Size: 11.4 MB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 2 - Forks: 0

ValentinMargraf/ActiveLearningPipelines

Specify, execute, and monitor the performance of active learning pipelines.

Language: Python - Size: 1.89 MB - Last synced at: about 4 hours ago - Pushed at: 9 months ago - Stars: 23 - Forks: 1

approximatelabs/sketch

AI code-writing assistant that understands data content

Language: Python - Size: 8.98 MB - Last synced at: 17 days ago - Pushed at: over 1 year ago - Stars: 2,269 - Forks: 119

eZWALT/eZAutoML

A Democratized, lightweight and modern framework for Python Automated Machine Learning

Language: Python - Size: 1.83 MB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 53 - Forks: 1

sassoftware/dpmm

dpmm: a library for synthetic tabular data generation with rich functionality and end-to-end Differential Privacy guarantees

Language: Python - Size: 661 KB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 3 - Forks: 0

tabularis-ai/be_great

A novel approach for synthesizing tabular data using pretrained large language models

Language: Python - Size: 4.29 MB - Last synced at: 9 days ago - Pushed at: about 1 month ago - Stars: 312 - Forks: 52

manujosephv/pytorch_tabular

A standard framework for modelling Deep Learning Models for tabular data

Language: Python - Size: 39.6 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 1,524 - Forks: 153

yandex-research/tabred

(ICLR 2025 Spotlight) TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks

Language: Python - Size: 4.05 MB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 73 - Forks: 4

LennartPurucker/finetune_tabpfn_v2

Code for finetuning TabPFN on one downstream tabular dataset.

Language: Python - Size: 62.5 KB - Last synced at: 14 days ago - Pushed at: 30 days ago - Stars: 59 - Forks: 11

vaexio/vaex

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

Language: Python - Size: 133 MB - Last synced at: 18 days ago - Pushed at: 9 months ago - Stars: 8,387 - Forks: 598

AhmedYousriSobhi/aCupOfTea

Let's settle down, rest our minds, and spill the tea of our experience in multiple AI fields [Data Science, Machine Learning, Deep Learning], including many other aspects, from programming and clean code to design patterns & business interference. Enjoy the drink, and if you find something interesting here, offer us a cup of tea.

Language: Jupyter Notebook - Size: 6.04 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 5 - Forks: 0

soda-inria/tabicl

Repository for TabICL: A Tabular Foundation Model for In-Context Learning on Large Data

Language: Python - Size: 1.88 MB - Last synced at: 20 days ago - Pushed at: about 1 month ago - Stars: 73 - Forks: 10

worldbank/REaLTabFormer

A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.

Language: Jupyter Notebook - Size: 12.3 MB - Last synced at: 17 days ago - Pushed at: 21 days ago - Stars: 228 - Forks: 28

Yu-Group/imodels-experiments

Experiments with experimental rule-based models to go along with imodels.

Language: Jupyter Notebook - Size: 223 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 16 - Forks: 5

infinite-table/infinite-react

The modern React DataGrid for building apps — faster

Language: JavaScript - Size: 92.7 MB - Last synced at: 21 days ago - Pushed at: 22 days ago - Stars: 81 - Forks: 5

wwweiwei/awesome-self-supervised-learning-for-tabular-data

A collection of research materials on SSL for non-sequential tabular data (SSL4NSTD)

Size: 72.3 KB - Last synced at: 10 days ago - Pushed at: 4 months ago - Stars: 192 - Forks: 12

sdv-dev/TGAN

Generative adversarial training for generating synthetic tabular data.

Language: Python - Size: 7.84 MB - Last synced at: 16 days ago - Pushed at: over 2 years ago - Stars: 290 - Forks: 91

aws-samples/aws-machine-learning-university-dte

Machine Learning University: Decision Trees and Ensemble Methods

Language: Jupyter Notebook - Size: 26.4 MB - Last synced at: 17 days ago - Pushed at: 9 months ago - Stars: 252 - Forks: 89

TheDataStation/pneuma

LLM-Powered Data Discovery System for Tabular Data

Language: Python - Size: 59.8 MB - Last synced at: 20 days ago - Pushed at: about 2 months ago - Stars: 10 - Forks: 4

Enselic/git-repo-language-trends

Analyze programming language usage over time in a git repository and produce a graphical or textual representation of the result.

Language: Python - Size: 217 KB - Last synced at: 9 days ago - Pushed at: 7 months ago - Stars: 3 - Forks: 0

ds-modules/core-resources

Short examples and templates for common ds-module tasks

Language: Jupyter Notebook - Size: 9.47 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 12 - Forks: 22

microsoft/FLAML

A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.

Language: Jupyter Notebook - Size: 206 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 4,138 - Forks: 535

IBM/TabFormer

Code & Data for "Tabular Transformers for Modeling Multivariate Time Series" (ICASSP, 2021)

Language: Python - Size: 460 KB - Last synced at: 13 days ago - Pushed at: almost 2 years ago - Stars: 336 - Forks: 89

spapicchio/QATCH

Official implementation of QATCH: Benchmarking SQL-centric tasks with Table Representation Learning Models on Your Data

Language: Python - Size: 5.07 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 28 - Forks: 0

schneiderkamplab/syntheval

Software for evaluating the quality of synthetic data compared with real data.

Language: Python - Size: 2.68 MB - Last synced at: 23 days ago - Pushed at: 3 months ago - Stars: 22 - Forks: 7

TLINDEN/tablizer

Manipulate tabular output of other programs

Language: Go - Size: 2.45 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 5 - Forks: 0

rainalexotl/ml-economics-animal-crossing

A neural network in PyTorch that predicts in-game clothing prices in ACNH. Used extensive EDA and categorical embeddings, achieving RMSE of 0.15 on scaled prices. Fun, game-based use case with real-world ML pipeline structure.

Language: Jupyter Notebook - Size: 1.71 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 0 - Forks: 0

amaiya/ktrain

ktrain is a Python library that makes deep learning and AI more accessible and easier to apply

Language: Jupyter Notebook - Size: 108 MB - Last synced at: about 12 hours ago - Pushed at: 5 months ago - Stars: 1,260 - Forks: 266

zadid6pretam/TabSeq

TabSeq: A Framework for Deep Learning on Tabular Data via Sequential Ordering

Language: Python - Size: 855 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 6 - Forks: 0

sebhaan/TabPFGen

TabPFGen: Synthetic Tabular Data Generation with TabPFN

Language: Python - Size: 140 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 7 - Forks: 0

DataCanvasIO/DeepTables

DeepTables: Deep-learning Toolkit for Tabular data

Language: Python - Size: 5.75 MB - Last synced at: 24 days ago - Pushed at: 7 months ago - Stars: 682 - Forks: 120

JuliaAPlavin/QuackIO.jl

Language: Julia - Size: 18.6 KB - Last synced at: 11 days ago - Pushed at: 3 months ago - Stars: 16 - Forks: 2

bowen-upenn/GeoGrid_Bench

GeoGrid-Bench: Can Foundation Models Understand Multimodal Gridded Geo-Spatial Data?

Language: Python - Size: 196 MB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 3 - Forks: 0

python-tableformatter/tableformatter

Tabular data formatter allowing printing from both arbitrary Iterables of Iterables or Iterables of objects via introspection

Language: Python - Size: 257 KB - Last synced at: 12 days ago - Pushed at: about 2 years ago - Stars: 19 - Forks: 4

RohanAdwankar/share-df

Python Package to Share/Edit Pandas/Polars DF with web interface!

Language: JavaScript - Size: 364 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 8 - Forks: 0

capitalone/DataProfiler

What's in your data? Extract schema, statistics and entities from datasets

Language: Python - Size: 35.7 MB - Last synced at: 25 days ago - Pushed at: 3 months ago - Stars: 1,492 - Forks: 171

TabArena/tabarena_dataset_curation

TabArena's dataset curation repository.

Language: Python - Size: 200 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

Alcoholrithm/TabularS3L

A PyTorch Lightning-based library for self- and semi-supervised learning on tabular data.

Language: Python - Size: 449 KB - Last synced at: 18 days ago - Pushed at: 4 months ago - Stars: 35 - Forks: 3

blei-lab/treeffuser

Treeffuser is an easy-to-use package for probabilistic prediction and probabilistic regression on tabular data with tree-based diffusion models.

Language: Jupyter Notebook - Size: 80.3 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 44 - Forks: 4

Clearbox-AI/clearbox-synthetic-kit

Clearbox AI's all-in-one solution for generation and evaluation of synthetic tabular and time-series data.

Language: Python - Size: 5.01 MB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 43 - Forks: 1

ropensci/tabulapdf

Bindings for Tabula PDF Table Extractor Library

Language: R - Size: 32.4 MB - Last synced at: 14 days ago - Pushed at: 6 months ago - Stars: 556 - Forks: 72

Anaxagor/applybn

Multi-purpose data analysis framework based on Bayesian networks and Causal models

Language: Python - Size: 145 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 16 - Forks: 4

socialfoundations/folktexts

Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on real-world survey data!

Language: Jupyter Notebook - Size: 28.8 MB - Last synced at: 29 days ago - Pushed at: 2 months ago - Stars: 22 - Forks: 4