GitHub topics: tabular-data
an-seunghwan/DistVAE-Tabular
Official implementation of 'Distributional Learning of Variational AutoEncoder: Application to Synthetic Data Generation' (DistVAE) with pytorch (NeurIPS 2023 accepted paper).
Language: Jupyter Notebook - Size: 78.1 KB - Last synced at: 29 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

alexzwanenburg/familiar
Repository for the familiar R-package. Familiar implements an end-to-end pipeline for interpretable machine learning of tabular data.
Language: R - Size: 807 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 30 - Forks: 3

machinelearningnuremberg/DPL
[NeurIPS 2023] Multi-fidelity hyperparameter optimization with deep power laws that achieves state-of-the-art results across diverse benchmarks.
Language: Python - Size: 13.7 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 15 - Forks: 3

YRL-AIDA/CoLeM
CoLeM framework is a table model based on contrastive learning techniques for solving the problem of Column Type Annotation.
Language: Python - Size: 8.93 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 4 - Forks: 0

Minqi824/ADGym
Official Implement of "ADGym: Design Choices for Deep Anomaly Detection", NeurIPS 2023
Language: Python - Size: 153 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 29 - Forks: 6

pgdr/ph
ph — the tabular data shell tool
Language: Python - Size: 719 KB - Last synced at: 6 days ago - Pushed at: about 2 years ago - Stars: 17 - Forks: 3

RyanJJP/CHARMS
The code repository for ICML24 paper "Tabular Insights, Visual Impacts: Transferring Expertise from Tables to Images"
Language: Python - Size: 973 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 13 - Forks: 1

nimh-dsst/dataset-phenotypes
Preparatory scripts for BIDS tabular phenotypic data in large neuroimaging datasets.
Language: Python - Size: 7.64 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 9 - Forks: 2

atfortes/Repeat-Buyers-Prediction
Alibaba Cloud | Tianchi Competition: TMALL Repeat Buyers Prediction Top 0.7% Solution
Language: Jupyter Notebook - Size: 6.28 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

calvinmccarter/unmasking-trees
Tabular data imputation and generation via incremental XGBoost unmasking
Language: Jupyter Notebook - Size: 7.4 MB - Last synced at: 16 days ago - Pushed at: 4 months ago - Stars: 11 - Forks: 0

jianzhnie/AutoTabular
Automatic machine learning for tabular data. ⚡🔥⚡
Language: Python - Size: 5.91 MB - Last synced at: 15 days ago - Pushed at: over 3 years ago - Stars: 70 - Forks: 10

Meteor-Community-Packages/meteor-tabular
Reactive datatables for large or small datasets
Language: JavaScript - Size: 358 KB - Last synced at: 29 days ago - Pushed at: 3 months ago - Stars: 362 - Forks: 138

E3-JSI/StreamStoryPyClient
StreamStory python client
Language: Python - Size: 291 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

CEID-HPCLAB/Mneme
High-performance out-of-core preprocessing of large tabular datasets on single-node systems.
Language: Python - Size: 80.1 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

wizzard/tprint-rs
Rust crate to print tabular data
Language: Rust - Size: 104 KB - Last synced at: 19 days ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

dataclr/dataclr
Feature selection for tabular datasets using advanced filter and wrapper methods
Language: Python - Size: 107 KB - Last synced at: 16 days ago - Pushed at: 4 months ago - Stars: 17 - Forks: 1

MattBlue00/polimi-thesis
Research Thesis Project at Politecnico di Milano, A.Y. 2023-2024
Language: Python - Size: 9.23 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

LUKEWOLF979/hands-on-machine-learning-projects
Hands-on machine learning projects with full implementation and datasets. Includes House Price Prediction, Spam Email Detection, and Customer Segmentation.
Size: 1000 Bytes - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

kennethreitz/tablib Fork of jazzband/tablib
Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c.
Size: 1.83 MB - Last synced at: 25 days ago - Pushed at: about 3 years ago - Stars: 62 - Forks: 14

kingculme0/excel-2021
Progrаm for free download Microsoft Excel 2021 here
Size: 7.5 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

nikita631/excel-2021
Progrаm for free download Microsoft Excel 2021 here
Size: 1000 Bytes - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

miguelmoralh/feature-selection-benchmark
Comprehensive benchmark study of feature selection techniques for predictive machine learning models on tabular data. Various feature selection methods are evaluated across different data characteristics and predictive scenarios.
Language: Python - Size: 26.6 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

unnir/DeepTLF
A Novel Hybrid Deep Learning Model for Heterogeneous Tabular Data
Language: Python - Size: 411 KB - Last synced at: 17 days ago - Pushed at: 6 months ago - Stars: 20 - Forks: 1

siraug/Prostate-Cancer-Prediction-With-OmniXAI
This repository is dedicated to raising awareness about prostate cancer through the prediction of prostate cancer and the explanation of the model's prediction using OmniXAI Explainers.
Language: Jupyter Notebook - Size: 1.42 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

Sim98B/TabularDataGeneration
Synthetic Data Generation: Tabular & Medical Imaging. A comprehensive project focused on generating synthetic data for tabular datasets and medical imaging.
Language: Jupyter Notebook - Size: 214 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

aAa1928/heart-disease-ml-classifier
A PyTorch model with a 99.27% accuracy designed to predict the risk of heart disease based on a combination of symptoms, lifestyle factors, and medical history.
Language: Python - Size: 2.93 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

moshafieeha/Machine-Learning-and-Deep-Learning-Mini-Projects
Hands-on projects that address various real-world Machine Learning and Deep Learning challenges.
Language: Jupyter Notebook - Size: 128 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 4 - Forks: 0

nicomignoni/tab2img
A tool to convert tabular data into images, in order to be used by CNNs Inspired by the "DeepInsight" paper.
Language: Python - Size: 497 KB - Last synced at: 12 days ago - Pushed at: 6 months ago - Stars: 25 - Forks: 5

r-rudra/tidycells
Automatic transformation of untidy spreadsheet-like data into tidy form
Language: R - Size: 2.81 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 83 - Forks: 10

yandex-research/tabgraphs
A benchmark of meaningful graph datasets with tabular node features
Language: Python - Size: 138 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 12 - Forks: 1

continuum/active_importer 📦
Define importers that load tabular data from spreadsheets or CSV files into any ActiveRecord-like ORM.
Language: Ruby - Size: 160 KB - Last synced at: 7 days ago - Pushed at: over 3 years ago - Stars: 330 - Forks: 19

cldf/csvw
CSV on the web
Language: Python - Size: 320 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 38 - Forks: 6

PolyMathOrg/DataFrame
DataFrame in Pharo - tabular data structures for data analysis
Language: Smalltalk - Size: 2.13 MB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 77 - Forks: 28

TortueSagace/versatile_evasion_attacks
Security protocols for estimating adversarial robustness of machine learning models for both tabular and image datasets. This package implements a set of evasion attacks based on metaheuristic optimization algorithms, and complex cost functions to give reliable results for tabular problems.
Language: Jupyter Notebook - Size: 8.25 MB - Last synced at: 17 days ago - Pushed at: 5 months ago - Stars: 3 - Forks: 1

bportelalp/Beporsoft.TabularSheets
Object collections to spreadsheets in a simple way
Language: C# - Size: 369 KB - Last synced at: 24 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

tomaztk/DAX_Functions
DAX Functions with Power BI
Language: TSQL - Size: 10.6 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 27 - Forks: 11

mohalim/IFENet
IFENet, a deep learning model for tabular data
Language: Python - Size: 52.4 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

radiradev/flowmatching-bdt
Implementation of flow matching on tabular data using XGBoost
Language: Python - Size: 430 KB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 6 - Forks: 0

radlfabs/flexcv
Python package customizing nested cross validation for tabular data.
Language: Python - Size: 3.46 MB - Last synced at: 28 days ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

petercorke/ansitable
Quick, easy and pretty display of tabular data or matrices, with optional ANSI color and borders
Language: Python - Size: 7.33 MB - Last synced at: 14 days ago - Pushed at: 7 months ago - Stars: 18 - Forks: 4

bhattbhavesh91/table-question-answering-demo
Question Answering on Tabular Data with HuggingFace Transformers Pipeline & TAPAS
Language: Jupyter Notebook - Size: 18.6 KB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 25 - Forks: 17

AdrianBZG/TabMDA
Code for "TabMDA: Tabular Manifold Data Augmentation for Any Classifier using Transformers with In-context Subsetting" (ICML 2024)
Language: Python - Size: 6.52 MB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 5 - Forks: 0

vaheqelyan/react-keyview
React components to display the list, table, and grid, without scrolling, use the keyboard keys to navigate through the data
Language: TypeScript - Size: 38.1 KB - Last synced at: about 8 hours ago - Pushed at: over 6 years ago - Stars: 16 - Forks: 0

Baukebrenninkmeijer/On-the-Generation-and-Evaluation-of-Synthetic-Tabular-Data-using-GANs
Repository for the results of my master thesis, about the generation and evaluation of synthetic data using GANs
Language: Jupyter Notebook - Size: 48.1 MB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 44 - Forks: 4

aimclub/asid
AutoML tool for imbalanced and small tabular datasets
Language: Python - Size: 5.33 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 6 - Forks: 2

automl/llms_feature_engineering_bias
LLMs Feature Engineering Bias [UNDER CONSTRUCTION]
Language: Python - Size: 24.4 KB - Last synced at: 7 days ago - Pushed at: 7 months ago - Stars: 4 - Forks: 1

junayed-hasan/Life-Satisfaction-Machine-Learning
This repo contains code for predicting life satisfaction using machine learning and explainable AI, as published in Heliyon. It includes a Jupyter Notebook with data processing, model building, and result visualization using Python libraries. The analysis uses the SHILD dataset to explore factors influencing life satisfaction.
Language: Jupyter Notebook - Size: 13.6 MB - Last synced at: 16 days ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

AyberkYavuz/artificial_neural_network_automation
This repository is for automating artificial neural network model creation with tabular data using Keras framework.
Language: Python - Size: 283 KB - Last synced at: 14 days ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 1

bsantanna/iban-validator-model
A Machine Learning Artificial Neural Network Model for validating IBAN account numbers.
Language: Jupyter Notebook - Size: 1.06 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 0

Ritvik19/pyradox-tabular
State of the Art Neural Networks for Tabular Deep Learning
Language: Python - Size: 78.1 KB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 9 - Forks: 1

merenna/synthetic-circle
🟥 Comprises 10,000 two-dimensional points organized into 100 distinct circles. Designed for evaluating clustering algorithms like k-means, it presents a well-defined clustering challenge. Each point is labeled with its corresponding circle, making it suitable for both classification and clustering tasks.
Size: 218 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

merenna/micro-gas-turbine-electrical-energy
🟥Contains time-series data from a 3-kilowatt micro gas turbine. It records electrical power output in relation to input control signals. Designed for regression analysis, it aims to predict energy output based on control signals. The dataset includes eight time series with varying durations and input signal patterns.
Size: 874 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

jianzhnie/MultimodalTookit
Incorporate Image, Text and Tabular Data with HuggingFace Transformers
Language: Python - Size: 1000 KB - Last synced at: 16 days ago - Pushed at: over 3 years ago - Stars: 12 - Forks: 0

junayed-hasan/Adult-Income-Prediction-Machine-Learning
This project analyzes the Adult Income Dataset to predict individual income levels based on census data. It showcases data preprocessing, visualization, and machine learning techniques. The workflow includes data cleaning, transformation, feature engineering, and predictive modeling, providing insights into income factors.
Language: Jupyter Notebook - Size: 7.53 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

anselmeamekoe/DelayedLabelStream
Language: Jupyter Notebook - Size: 1.78 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

subhashpolisetti/Dimensionality_Reduction
This repository demonstrates various dimensionality reduction techniques on image and tabular datasets. It explores and visualizes methods like PCA, t-SNE, UMAP, ISOMAP, and Autoencoders, comparing their performance and effectiveness with interactive visualizations.
Language: Jupyter Notebook - Size: 4.16 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

chahelgupta/travel-booking-management-system-sql
SQL Travel Booking Management System: Seamlessly manage bookings, transportation options, and customer data in this comprehensive travel management solution.
Size: 3.31 MB - Last synced at: 4 months ago - Pushed at: 11 months ago - Stars: 3 - Forks: 1

Preetraj2002/Tablify
Converts a snapshot of a table (an image) into tabular data using OCR. It using image processing and enhancement techniques to help with the OCR.
Language: Python - Size: 872 KB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Wittline/wbz
A parallel implementation of the bzip2 data compressor in python, this data compression pipeline is using algorithms like Burrows–Wheeler transform (BWT) and Move to front (MTF) to improve the Huffman compression. For now, this tool only will be focused on compressing .csv files, and other files on tabular format.
Language: Python - Size: 578 KB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 13 - Forks: 3

rafaelgreca/e2e-mlops-project
The purpose of this project's design, development, and structure is to create an end-to-end Machine Learning Operations (MLOps) lifecycle to classify an individual's level of obesity based on their physical characteristics and eating habits.
Language: HTML - Size: 34 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

subhashpolisetti/EDA-Timeseries-Tabular
This repository contains two machine learning projects: Air Quality Prediction, which predicts CO(GT) levels using environmental and pollutant data with AutoML, and NYC Taxi Fare Prediction, predicts taxi fares based on trip data using automated machine learning. Both projects showcase data analysis,preprocessing, and predictive modeling techniques
Language: Jupyter Notebook - Size: 3.41 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

mcanouil/rdatabase
Slides on how to get data from files, databases or webpages into R (two-days workshop; in French).
Language: CSS - Size: 13.9 MB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

openalloc/TablerDemo
A demonstration of SwiftTabler, a multi-platform SwiftUI component for tabular data
Language: Swift - Size: 837 KB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 2

drkbluescience/UP-School-Bitexen-Datathon-2024_V2
This repository contains analyses and visualizations of gender-related statistics, awarded second place in the UP School & Bitexen Women in Datathon 2024
Language: Jupyter Notebook - Size: 1.66 MB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

drkbluescience/AutoGluon_Cameroon_Air_Quality
Finished 5th in the Cameroon Air Quality Prediction competition, later refining the model to achieve a score better than the 1st place submission using AutoGluon.
Language: Jupyter Notebook - Size: 4.16 MB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

drkbluescience/WiDS2024_Challenge2_MetastaticDiagnosisRegression
This notebook presents an exploratory data analysis (EDA) and regression modeling approach for the WiDS Datathon 2024 Challenge #2.
Language: Jupyter Notebook - Size: 20.4 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

Antoine68/parqueditor
Easly view, create and edit parquet files with the desktop application Parqueditor
Language: Java - Size: 427 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

rajoy99/keshik
Oversample class imbalanced tabular data by Denoising Diffusion Probabilistic Model (DDPM)
Language: Python - Size: 112 KB - Last synced at: 2 days ago - Pushed at: about 2 years ago - Stars: 5 - Forks: 1

microsoft/CASPR
CASPR is a deep learning framework applying transformer architecture to learn and predict from tabular data at scale.
Language: Python - Size: 2.47 MB - Last synced at: 3 days ago - Pushed at: over 2 years ago - Stars: 38 - Forks: 3

oemof/oemof-tabular
Load oemof energy systems from tabular data sources.
Language: Python - Size: 4.94 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 7 - Forks: 5

siddharth-nandagopal/billionaires-rag-query
Billionaires RAG Query uses LLMs and a RAG framework to analyze the world's billionaires list. Extracts tabular data from PDFs, converts to multiple formats, and enables precise queries about net worth, age, and more. Integrates with Poetry and asdf for easy setup and management.
Language: Python - Size: 707 KB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

namkoong-lab/LLM-Tabular-Shifts
Code for "LLM Embeddings Improve Test-time Adaptation to Tabular Y|X-Shifts"
Language: Python - Size: 149 KB - Last synced at: 8 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

nimblelearn/datapackage-m
Power Query M functions for working with Tabular Data Packages (Frictionless Data) in Power BI and Excel
Language: R - Size: 27.3 MB - Last synced at: 7 months ago - Pushed at: about 4 years ago - Stars: 28 - Forks: 4

livetocode/tabular-data-differ
A very efficient library for diffing two sorted streams of tabular data, such as CSV files.
Language: TypeScript - Size: 181 KB - Last synced at: 7 days ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

AndreaZoccatelli/light_permanova
A lightweight implementation of PERMANOVA based on Euclidean distance from centroid
Language: Jupyter Notebook - Size: 2.63 MB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

echosprint/TabularTransformer
An end-to-end deep learning framework based on Transformer model, adapted for tabular data domain, support both supervised and self-supervised learning, used for regression, classification tasks.
Language: Python - Size: 764 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

alok-ai-lab/DeepInsight3D_pkg
DeepInsight3D package to deal with multi-omics or multi-layered data
Language: MATLAB - Size: 5.8 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 6 - Forks: 2

KhoaTran3126/MCTS_Variants_Performance-Prediction-Competition
A collection of analysis and models used for the competition.
Language: Jupyter Notebook - Size: 229 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

SimonBlanke/search-data-explorer
Visualize search-data from your gradient-free-optimization run.
Language: Python - Size: 1.17 MB - Last synced at: 17 days ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 1

enzomarx/DataWrangling
This repository contains tools and tutorials for cleaning and preparing your data for analysis. It is focused on data scientists and analysts who want to improve their data processing skills. Available tools include: Pandas NumPy Matplotlib
Language: Python - Size: 1.94 MB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Adeemy/end-to-end-ml
End-to-end ML project for tabular data.
Language: Jupyter Notebook - Size: 18 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

yrodriguezmd/Synthetic_Medical_Tabular_Data
Generate synthetic medical data from a patient population dataset.
Language: Jupyter Notebook - Size: 386 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

AmishiDesai04/travel-management-system-sql
Welcome to our Travel Management System repository, a SQL-based database management project. It handles customer and employee details, destinations, transportation (trains, buses, flights, cruises, cars), payments, and bookings. Complex SQL queries including subqueries and joins are used for data extraction and analysis.
Size: 28.3 KB - Last synced at: 4 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

ofirmanor/nicetable
A clean and elegant way to print text tables in Python with minimal boilerplate code
Language: Python - Size: 127 KB - Last synced at: 14 days ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 0

mateoespinosa/tabcbm
Official Implementation of TMLR's paper: "TabCBM: Concept-based Interpretable Neural Networks for Tabular Data"
Language: Python - Size: 11.8 MB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 2

Ipsedo/TabPFN
TabPFN implementation with PyTorch
Language: Python - Size: 173 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 1

JoseMHU/Tabulation-generator-from-Pandas
This repository automates the creation of output tabulations for publication in .xlsx files from a Pandas DataFrame
Language: Python - Size: 583 KB - Last synced at: 4 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

abdullahelen/NuDIT
Transforming Numerical Data to Images for Deep Networks.
Language: MATLAB - Size: 9.99 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

SergioArroni/Synthetic-Tabular-Data-Generator
This repository contains the code and data used for a research project by Sergio Arroni. The goal of the project is to create a synthetic tabular data generator using diffusion models. Different datasets can be found in ./all_results/synthetic within their respective folders.
Language: Python - Size: 161 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Steema/TeeGrid-NET-Samples
This repository includes all the content and samples that make use of the TeeGrid NET product.
Language: C# - Size: 5.77 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

shinnapinna/data_science_portfolio
Portfolio of data science projects showing my skill range and competencies.
Language: Jupyter Notebook - Size: 13.6 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

ablacan/gmda
This is the official implementation for the Generative Modeling Density Alignment (GMDA). This work was presented in the paper "Frugal Generative Modeling for Tabular Data" at ECML 2024.
Language: Python - Size: 505 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

somayehpakdel/predicting_air_pollution_measurements
predicting_air_pollution_measurements based on three models.
Language: Jupyter Notebook - Size: 239 KB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

superctj/observatory-library
Python library for embedding inference of relational tables.
Language: Python - Size: 136 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 3 - Forks: 0

SAP-archive/security-research-differential-privacy-generative-models-framework 📦
The framework is for training and evaluating differentially private generatative models. It allows for the creation of an anonymized version of an original dataset potentially containing personal data.
Language: Jupyter Notebook - Size: 4.05 MB - Last synced at: about 2 months ago - Pushed at: 12 months ago - Stars: 6 - Forks: 3

superctj/observatory
Characterization of relational table embeddings (VLDB 2024).
Language: Python - Size: 9.23 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 22 - Forks: 0

lattice-ai/rtdl Fork of yandex-research/rtdl 📦
The `rtdl` library + The W&B implementation of the paper "Revisiting Deep Learning Models for Tabular Data"
Language: Jupyter Notebook - Size: 14.3 MB - Last synced at: 7 days ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 1

DavidGonzalezFernandez/TFM
Repositorio con el código de los experimentos de mi TFM titulado "Transformación de Datos Tabulares a Imágenes Sintéticas: Optimización y Evaluación de la Librería TINTOlib en Python"
Language: Jupyter Notebook - Size: 198 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

manikyabard/transfertab
Transfer Learning for Tabular Data
Language: Jupyter Notebook - Size: 11.3 MB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

anopsy/Equity_in_Healthcare
Predicitng a timely diagnosis in metastatic cancer patients. Data cleaning, feature engineering and hyperparams tuning of classification model ensemble
Language: Jupyter Notebook - Size: 15.2 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0
