An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-verification

sparkdq-community/sparkdq

A declarative PySpark framework for row- and aggregate-level data quality validation.

Language: Python - Size: 724 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1 - Forks: 0

unionai-oss/pandera

A light-weight, flexible, and expressive statistical data testing library

Language: Python - Size: 3.95 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 3,760 - Forks: 330

rstudio/pointblank

Data quality assessment and metadata reporting for data frames and database tables

Language: R - Size: 105 MB - Last synced at: 14 days ago - Pushed at: 20 days ago - Stars: 947 - Forks: 58

darsan-in/Nexa-Bot

Nexa Auto automates the process of verifying the authenticity of addresses for room service eligibility and retrieving detailed specifications across multiple websites. Utilizing Selenium for web automation and GPT for handling missing data, Nexa Auto significantly reduces manual effort in data entry tasks.

Language: Python - Size: 10.6 MB - Last synced at: 4 days ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

Elevated-Standards/DataDemise 📦

DataDemise is an application for certifying and verifying the destruction of data stored across various cloud providers. It ensures secure and verifiable destruction of data, providing certificates as proof of destruction.

Language: Go - Size: 88.9 KB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

verifalia/verifalia-js-sdk

Verifalia REST API - Javascript SDK and helper library, for Node.js and the browser: verify email addresses in real-time and check whether they are deliverable, invalid, or otherwise risky.

Language: JavaScript - Size: 692 KB - Last synced at: 9 days ago - Pushed at: 12 months ago - Stars: 12 - Forks: 3

rrwen/recovr-infracycle

Pedalling Forward: The Evolution of Dedicated Cycling Infrastructure in Canadian Cities from 2010 to 2022

Language: R - Size: 427 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

yusangeng/io-validate

Javascript data validator.

Language: JavaScript - Size: 657 KB - Last synced at: 28 days ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

Ramy-Badr-Ahmed/Merkle-DAG-Matlab

Merkle-Directed Acyclic Graph (DAG) in MATLAB - https://doi.org/10.5281/zenodo.12808889

Language: MATLAB - Size: 22.5 KB - Last synced at: 29 days ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

CramBL/fastPASTA

Mirror of the repository on CERN's Gitlab. CLI for viewing and verifying data integrity on the raw binary data read out from the ALICE detector and its subdetectors.

Language: Rust - Size: 11.5 MB - Last synced at: 11 days ago - Pushed at: 11 months ago - Stars: 4 - Forks: 1

YannisPap/Wrangle-OpenStreetMap-Data

Chose a region and used data munging techniques to assess the quality of the data for validity, accuracy, completeness, consistency and uniformity.

Language: HTML - Size: 1.34 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 0

GrahamJamesKeane/SPUR-2020-Topical-Analysis-Toolkit

Deliver insights into the topical content of undergraduate degree programmes.

Language: Python - Size: 5.57 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

verifalia/verifalia-node-sdk 📦

Verifalia SDK for Node.js - OBSOLETE, please use https://github.com/verifalia/verifalia-js-sdk

Language: JavaScript - Size: 8.79 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 5

IQTLabs/VennData 📦

One of the biggest barriers to widespread machine learning adoption is the difficulty in collecting a 'good' dataset. There is an overall consensus that a 'good' dataset is a big dataset, but we believe that we can do better. As such the VennData project was created to develop tools to guide in the collection, curation, augmentation and validation of data.

Language: Jupyter Notebook - Size: 115 MB - Last synced at: 12 months ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1

markodayan/noob-ethereum

Minimalist Ethereum library for JavaScript/TypeScript developers

Language: TypeScript - Size: 1.48 MB - Last synced at: 5 days ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

v1k1nghawk/Signa

File Fingerprinting

Language: C++ - Size: 8.79 KB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

ajitsing/data_verifier

Ruby gem to verify data

Language: Ruby - Size: 17.6 KB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

abramisola/dangerous

It's a dangerous world out there.

Language: Go - Size: 15.6 KB - Last synced at: 4 months ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

Related Keywords
data-verification 18 data-validation 4 data-verifier 3 testing-tools 3 data-cleaning 3 verification 2 data-science 2 data-integrity 2 testing 2 e-mail 2 email-validation 2 email-verification 2 pandas 2 data-check 2 typescript 2 data-assertions 2 data-quality 2 verifalia 2 directed-acyclic-graph 1 graph-algorithms 1 graph-algorithms-and-data-sturcture 1 integrity-check 1 java-security 1 matlab 1 matlab-oop 1 merkle-dag 1 cli 1 daq 1 rust 1 python 1 data-structures-and-algorithms 1 cycling 1 dag 1 cryptographic-hash-functions 1 vancouver 1 cycling-analytics 1 infrastructure 1 postdoc 1 recovr 1 uoft 1 trend-analysis 1 trend 1 toronto 1 model-verification 1 cryptography 1 ethereum 1 rlp 1 checksum-calculator 1 cryptographic-signature 1 data-security 1 data-validator 1 file-fingerprinting 1 infosec 1 md5-checksum 1 multithreading 1 data-sanity 1 ruby 1 ruby-gem 1 security 1 signature 1 academic-project 1 data-collection 1 matplotlib-pyplot 1 maynooth-university 1 natural-language-processing 1 python-3 1 seaborn 1 spur-2020 1 sqlite3-database 1 topic-analysis 1 undergraduate-research 1 visualization 1 wordcloud 1 data 1 data-curation 1 machine-learning 1 model 1 city-data 1 ai-enhanced-automation 1 address-verification 1 address-validation 1 address-lookup 1 yaml-configuration 1 schema-validation 1 reporting-tool 1 easy-to-understand 1 database-tables 1 data-profiler 1 data-management 1 data-inference 1 data-frames 1 data-dictionaries 1 data-checker 1 validation 1 schema 1 pandas-validator 1 pandas-validation 1 pandas-dataframe 1 hypothesis-testing 1 dataframes 1