An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: dataprofiling

capitalone/DataProfiler

What's in your data? Extract schema, statistics and entities from datasets

Language: Python - Size: 35.7 MB - Last synced at: 10 days ago - Pushed at: about 1 month ago - Stars: 1,475 - Forks: 170

DataKitchen/dataops-testgen

DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data quality test generation and execution by data profiling,  new dataset hygiene review, AI generation of data quality validation tests, ongoing testing of data refreshes, & continuous anomaly monitoring

Language: Python - Size: 5.22 MB - Last synced at: 15 days ago - Pushed at: 18 days ago - Stars: 55 - Forks: 3

SanderBos1/profilerInsight

profilerInsight is a data profiling tool designed to extract and analyze metadata from flat datafiles and different type of databases. This tool is currently under develpment

Size: 744 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

selva221724/edaSQL

edaSQL is a python library to bridge the SQL with Exploratory Data Analysis where you can connect to the Database and insert the queries. The query results can be passed to the EDA tool which can give greater insights to the user.

Language: Python - Size: 4.91 MB - Last synced at: 9 days ago - Pushed at: over 3 years ago - Stars: 10 - Forks: 1

atom071/pandas_learning

Pandas Exercises

Language: Jupyter Notebook - Size: 55.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

kumod007/Data-Profilling

DATA PROFILING is a process of examining, analyzing, and creating useful summaries of data. The process yields a high-level overview which aids in the discovery of data quality issues, risks, and overall trends.

Language: Jupyter Notebook - Size: 43 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

martandsingh/SQL-DQC

SQL based data profiling & data quality checks, which will help you to perform data profiling & data quality checks on SQL database at table & database level.

Language: TSQL - Size: 363 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 5 - Forks: 0

kanishksha/sample-data-profile

customer review on jupyter notebook

Language: Jupyter Notebook - Size: 1.51 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0