GitHub topics: data-verification
sparkdq-community/sparkdq
A declarative PySpark framework for row- and aggregate-level data quality validation.
Language: Python - Size: 724 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1 - Forks: 0

unionai-oss/pandera
A light-weight, flexible, and expressive statistical data testing library
Language: Python - Size: 3.95 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 3,760 - Forks: 330

rstudio/pointblank
Data quality assessment and metadata reporting for data frames and database tables
Language: R - Size: 105 MB - Last synced at: 14 days ago - Pushed at: 20 days ago - Stars: 947 - Forks: 58

darsan-in/Nexa-Bot
Nexa Auto automates the process of verifying the authenticity of addresses for room service eligibility and retrieving detailed specifications across multiple websites. Utilizing Selenium for web automation and GPT for handling missing data, Nexa Auto significantly reduces manual effort in data entry tasks.
Language: Python - Size: 10.6 MB - Last synced at: 4 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

Elevated-Standards/DataDemise 📦
DataDemise is an application for certifying and verifying the destruction of data stored across various cloud providers. It ensures secure and verifiable destruction of data, providing certificates as proof of destruction.
Language: Go - Size: 88.9 KB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

verifalia/verifalia-js-sdk
Verifalia REST API - Javascript SDK and helper library, for Node.js and the browser: verify email addresses in real-time and check whether they are deliverable, invalid, or otherwise risky.
Language: JavaScript - Size: 692 KB - Last synced at: 9 days ago - Pushed at: 12 months ago - Stars: 12 - Forks: 3

rrwen/recovr-infracycle
Pedalling Forward: The Evolution of Dedicated Cycling Infrastructure in Canadian Cities from 2010 to 2022
Language: R - Size: 427 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

yusangeng/io-validate
Javascript data validator.
Language: JavaScript - Size: 657 KB - Last synced at: 28 days ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

Ramy-Badr-Ahmed/Merkle-DAG-Matlab
Merkle-Directed Acyclic Graph (DAG) in MATLAB - https://doi.org/10.5281/zenodo.12808889
Language: MATLAB - Size: 22.5 KB - Last synced at: 29 days ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

CramBL/fastPASTA
Mirror of the repository on CERN's Gitlab. CLI for viewing and verifying data integrity on the raw binary data read out from the ALICE detector and its subdetectors.
Language: Rust - Size: 11.5 MB - Last synced at: 11 days ago - Pushed at: 11 months ago - Stars: 4 - Forks: 1

YannisPap/Wrangle-OpenStreetMap-Data
Chose a region and used data munging techniques to assess the quality of the data for validity, accuracy, completeness, consistency and uniformity.
Language: HTML - Size: 1.34 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

GrahamJamesKeane/SPUR-2020-Topical-Analysis-Toolkit
Deliver insights into the topical content of undergraduate degree programmes.
Language: Python - Size: 5.57 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

verifalia/verifalia-node-sdk 📦
Verifalia SDK for Node.js - OBSOLETE, please use https://github.com/verifalia/verifalia-js-sdk
Language: JavaScript - Size: 8.79 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 5

IQTLabs/VennData 📦
One of the biggest barriers to widespread machine learning adoption is the difficulty in collecting a 'good' dataset. There is an overall consensus that a 'good' dataset is a big dataset, but we believe that we can do better. As such the VennData project was created to develop tools to guide in the collection, curation, augmentation and validation of data.
Language: Jupyter Notebook - Size: 115 MB - Last synced at: 12 months ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1

markodayan/noob-ethereum
Minimalist Ethereum library for JavaScript/TypeScript developers
Language: TypeScript - Size: 1.48 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

v1k1nghawk/Signa
File Fingerprinting
Language: C++ - Size: 8.79 KB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

ajitsing/data_verifier
Ruby gem to verify data
Language: Ruby - Size: 17.6 KB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

abramisola/dangerous
It's a dangerous world out there.
Language: Go - Size: 15.6 KB - Last synced at: 4 months ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0
