An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-profiler

rstudio/pointblank

Data quality assessment and metadata reporting for data frames and database tables

Language: R - Size: 105 MB - Last synced at: 3 days ago - Pushed at: 14 days ago - Stars: 987 - Forks: 59

tsegall/fta

Metadata/data identification Java library. Identifies Semantic Type information (e.g. Gender, Age, Color, Country,...). Extensive country/language support. Extensible via user-defined plugins. Comprehensive Profiling support.

Language: Java - Size: 5.98 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 29 - Forks: 3

InfuseAI/piperider

Code review for data in dbt

Language: Python - Size: 32.6 MB - Last synced at: 12 days ago - Pushed at: 8 months ago - Stars: 489 - Forks: 24

ray310/Panda-Helper

Panda-Helper is a simple, open-source, Python data-profiling utility for Pandas' DataFrames and Series.

Language: Python - Size: 779 KB - Last synced at: 9 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 1

pflooky/data-caterer-example

Example API implementation for Data Caterer

Language: Scala - Size: 1.83 MB - Last synced at: about 4 hours ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 3