GitHub topics: dataprofiling
capitalone/DataProfiler
What's in your data? Extract schema, statistics and entities from datasets
Language: Python - Size: 35.7 MB - Last synced at: 10 days ago - Pushed at: about 1 month ago - Stars: 1,475 - Forks: 170

DataKitchen/dataops-testgen
DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data quality test generation and execution by data profiling, new dataset hygiene review, AI generation of data quality validation tests, ongoing testing of data refreshes, & continuous anomaly monitoring
Language: Python - Size: 5.22 MB - Last synced at: 15 days ago - Pushed at: 18 days ago - Stars: 55 - Forks: 3

SanderBos1/profilerInsight
profilerInsight is a data profiling tool designed to extract and analyze metadata from flat datafiles and different type of databases. This tool is currently under develpment
Size: 744 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

selva221724/edaSQL
edaSQL is a python library to bridge the SQL with Exploratory Data Analysis where you can connect to the Database and insert the queries. The query results can be passed to the EDA tool which can give greater insights to the user.
Language: Python - Size: 4.91 MB - Last synced at: 9 days ago - Pushed at: over 3 years ago - Stars: 10 - Forks: 1

atom071/pandas_learning
Pandas Exercises
Language: Jupyter Notebook - Size: 55.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

kumod007/Data-Profilling
DATA PROFILING is a process of examining, analyzing, and creating useful summaries of data. The process yields a high-level overview which aids in the discovery of data quality issues, risks, and overall trends.
Language: Jupyter Notebook - Size: 43 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

martandsingh/SQL-DQC
SQL based data profiling & data quality checks, which will help you to perform data profiling & data quality checks on SQL database at table & database level.
Language: TSQL - Size: 363 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 5 - Forks: 0

kanishksha/sample-data-profile
customer review on jupyter notebook
Language: Jupyter Notebook - Size: 1.51 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0
