An open API service providing repository metadata for many open source software ecosystems.

Topic: "data-anonymization"

microsoft/presidio

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

Language: Python - Size: 222 MB - Last synced at: about 19 hours ago - Pushed at: 2 days ago - Stars: 4,470 - Forks: 634

securitybunker/databunker

Secure Vault for Customer PII/PHI/PCI/KYC Records

Language: Go - Size: 11.1 MB - Last synced at: 11 days ago - Pushed at: 27 days ago - Stars: 1,292 - Forks: 83

arx-deidentifier/arx

ARX is a comprehensive open source data anonymization tool aiming to provide scalability and usability. It supports various anonymization techniques, methods for analyzing data quality and re-identification risks and it supports well-known privacy models, such as k-anonymity, l-diversity, t-closeness and differential privacy.

Language: Java - Size: 375 MB - Last synced at: 11 days ago - Pushed at: 4 months ago - Stars: 654 - Forks: 220

ArtLabss/open-data-anonymizer

Python Data Anonymization & Masking Library For Data Science Tasks

Language: Python - Size: 40.2 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 261 - Forks: 31

makinacorpus/DbToolsBundle

A PHP library to back up, restore and anonymize databases

Language: PHP - Size: 1.65 MB - Last synced at: 8 days ago - Pushed at: 14 days ago - Stars: 199 - Forks: 15

BMW-InnovationLab/BMW-Anonymization-API

This repository allows you to anonymize sensitive information in images/videos. The solution is fully compatible with the DL-based training/inference solutions that we already published/will publish for Object Detection and Semantic Segmentation.

Language: Python - Size: 44.7 MB - Last synced at: 19 days ago - Pushed at: about 1 year ago - Stars: 185 - Forks: 18

Mobile-IoT-Security-Lab/HideDroid

HideDroid is an Android app that allows the per-app anonymization of collected personal data according to a privacy level chosen by the user.

Language: Java - Size: 9.12 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 117 - Forks: 8

privateai/deid-examples

Examples scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask and synthesize PII in text.

Language: Jupyter Notebook - Size: 37.8 MB - Last synced at: 14 days ago - Pushed at: 15 days ago - Stars: 80 - Forks: 1

jftuga/deidentification

Deidentify people's names and gender specific pronouns

Language: Python - Size: 280 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 33 - Forks: 2

IFCA-Advanced-Computing/anjana

ANJANA is a Python library for anonymizing sensitive data

Language: Python - Size: 1.22 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 30 - Forks: 3

snaplet/docs

Snaplet Documentation

Language: HTML - Size: 13.7 MB - Last synced at: 3 days ago - Pushed at: 8 months ago - Stars: 28 - Forks: 10

thoughtworks-datakind/anonymizer

Library for identification, anonymization and de-anonymization of PII data

Language: Python - Size: 112 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 22 - Forks: 5

KI-AIM/Cinnamon

Cinnamon is a modular application designed to offer robust functionalities for data anonymization, synthetization, and evaluation.

Language: Java - Size: 40.5 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 21 - Forks: 1

nikosgalanis/bsc-thesis

🎓🔒 Creating, Analyzing and Testing Differential Privacy Protocols, aiming in Data Protection and Anonymization.

Language: Jupyter Notebook - Size: 18.3 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 15 - Forks: 1

yevh/anonymizer

Anonymize sensitive data in your datasets.

Language: Python - Size: 1.16 MB - Last synced at: 23 days ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 1

stefanrmmr/differentially_private_synthetic_data

Differentially Private Synthetic Data Generation [DP-SDG] - Experimental Setups & Knowledge Base - WORK IN PROGRESS

Language: Jupyter Notebook - Size: 5.23 MB - Last synced at: 20 days ago - Pushed at: over 2 years ago - Stars: 12 - Forks: 2

fabriziosalmi/csv-anonymizer

CSV fuzzer/anonymizer

Language: JavaScript - Size: 96.7 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 9 - Forks: 0

fgmacedo/datanonymizer

Anonymizer tool for datasets such CSV files

Language: Python - Size: 21.5 KB - Last synced at: about 6 hours ago - Pushed at: 7 months ago - Stars: 9 - Forks: 0

OsgiliathEnterprise/data-migrator

Generate anonymized test dataset from production data and configurable anonymization sequences. Execute base to base (vendor agnostic) export and import

Language: Java - Size: 2.18 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 2

eriknovak/anonipy

Data anonymization package, supporting different anonymization strategies

Language: Python - Size: 1.12 MB - Last synced at: 11 days ago - Pushed at: 2 months ago - Stars: 6 - Forks: 3

DataFog/datafog-python

Privacy Engineering for the Generative AI era

Language: Python - Size: 78.1 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 5 - Forks: 2

Aymane11/anonymize

Data anonymization made easy

Language: Python - Size: 168 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 5 - Forks: 2

ryokugyu/One-Pass-KMeans-Algorithms

Implementation of An Efficient Clustering Method for k-Anonymization in Python 2.7

Language: Python - Size: 5.1 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 2

jaimedantas/data-anonymization-diabetes

Impacts of data anonymization on model prediction for diabetes

Language: MATLAB - Size: 460 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 3

sonbachmi/NgAnonymize

Data anonymization using Angular 2+

Language: TypeScript - Size: 2.48 MB - Last synced at: 14 days ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 0

sandrociceros/DataMasker Fork of Steveiwonder/DataMasker

A free data masking and/or anonymizer library

Language: C# - Size: 46.9 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 3 - Forks: 1

0xsarwagya/site

My portfolio website built with Next.js and ShadcnUI. Displays My Projects and Work Experience

Language: TypeScript - Size: 3.52 MB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

Vishwamitra/data-anonymizer

DataAnonymizer is an open-source personal data anonymization tool designed for GDPR compliancy

Language: TypeScript - Size: 228 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

data-protection-helpers/induction-anonymization

Induction to anonymization of data

Language: Jupyter Notebook - Size: 13.9 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

henryhamon/iris-disguise

Data Anonymization tool for InterSystems IRIS

Language: ObjectScript - Size: 119 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 3

adaptant-labs/go-minimizer

Data minimization, pseudonymization, and anonymization helpers for Go

Language: Go - Size: 9.77 KB - Last synced at: 29 days ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

tonyvargese/data-anonymisation-differential-privacy

Language: Python - Size: 5.16 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 1

prathwik0/real-incognito

Runner Up AIML - HackToFuture, SJEC, Mangalore (Rs. 20,000)

Language: Svelte - Size: 6.24 MB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 2

jagumiel/Artificial-Intelligence

First steps on AI. May help for learners like me.

Language: Jupyter Notebook - Size: 31.6 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

thearchitector/stoplight 📦

Data anonymization signals for Tortoise ORM.

Language: Python - Size: 141 KB - Last synced at: 12 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

TTitcombe/PrivacyPanda

Anonymize your Pandas data. Preserve privacy.

Language: Python - Size: 77.1 KB - Last synced at: about 20 hours ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

adaptant-labs/data-minimization-service

A simple data minimization and anonymization microservice wrapped around go-minimizer

Language: Go - Size: 12.7 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

jamesstonehill/anonymous

Data anonymization for ActiveRecord

Language: Ruby - Size: 19.5 KB - Last synced at: 4 days ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 4

sandrociceros/arx Fork of arx-deidentifier/arx

ARX is a comprehensive open source data anonymization tool aiming to provide scalability and usability. It supports various anonymization techniques, methods for analyzing data quality and re-identification risks and it supports well-known privacy models, such as k-anonymity, l-diversity, t-closeness and differential privacy.

Language: Java - Size: 375 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

mitchelllisle/redacted

📛 An experimental data anonymisation library

Language: Python - Size: 59.6 KB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

ausdfrost/anonymizePy

🌱 anonymizePy helps you anonymize your data with ease

Language: Python - Size: 8.71 MB - Last synced at: 1 day ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

CogNetSys/Sonarum

Sonarum revolutionizes human-machine communication by securing real-time text, audio, and video streams while remaining fast, secure, and lightweight. It detects and controls sensitive and secure data on-the-fly, ensuring privacy and security without compromising quality.

Size: 1.95 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Lefteris-Souflas/Census-Privacy-Analysis

Exploring US Census microdata, tackling privacy issues, and anonymization. Exercise A delves into quasi-identifiers, anonymization methods, identification risks, and differential privacy. Exercise B involves data loading, k-anonymity, histograms, adding noise for privacy, computing private averages, and analyzing privacy parameter impacts.

Language: Jupyter Notebook - Size: 3.9 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Wenox/distributed-anonymisation

Distributed Anonymization Platform for SQL databases

Language: Java - Size: 110 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

VijayKrishnanSR/VijayKrishnanSR.github.io

A selected collection of my work samples

Size: 126 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

dmatsanganis/Data_Anonimization_Privacy_Threats_in_US_Census_Microdata_Analysis

This repository contains an analysis of the US Census Bureau's microdata from the 2010 census. The current analysis focuses on understanding the privacy threats associated with the non-anonymized dataset and exploring techniques to preserve privacy while analyzing the data.

Language: HTML - Size: 19 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 1

vishal-kumar-paswan/Data-Anonymization-using-Python

Anonymizing confidential data using the concept of masking.

Language: Jupyter Notebook - Size: 17.6 KB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

DatafoundationSystem/ashe

Data anonymity made simple

Language: JavaScript - Size: 105 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

F4de78/hkp-coherence

DPP - "Anonymizing Transaction Databases for Publication" - AA 2022/23

Language: TeX - Size: 14.3 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

athulck/Data-Anonymization-Tool

M.Tech final year project to create a data anonymization tool.

Language: Python - Size: 1.34 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Daniel-Hinz/Database-Security-Visualizer

A fully responsive, full stack web application with a working login system designed to demonstrate the benefits of password hashing, salting, and data anonymization.

Language: Python - Size: 847 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

gorskii/excel-hashing-notebook

Hash data stored in Excel spreadsheet using pandas and Python's hashlib library

Language: Jupyter Notebook - Size: 17.6 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

hamza1886/data-anonymization

Anonymize data using AES-128 encryption/decryption algorithm.

Language: Python - Size: 8.79 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

ngshya/anon-ae

Data anonymization

Language: Jupyter Notebook - Size: 1.11 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

markblue777/AnonDataGenerator

A project for generating data defined by a data definition file, the data would be a representation of real data that would be expected. Also enables the anonymisation of Personal identifiable information of data provided in either CSV or SQL connection.

Language: C# - Size: 823 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

sandrociceros/SnakeDumper Fork of digilist/SnakeDumper

Anonymize your database dumps.

Language: PHP - Size: 219 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

sandrociceros/DataDefender Fork of armenak/DataDefender

Sensitive Data Management: Data Discovery and Anonymization toolkit

Language: Java - Size: 136 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1

sandrociceros/neuralyzer Fork of edyan/neuralyzer

Neuralyzer is a library and a command line tool to anonymize databases (by updating existing data or populating a table with fake data)

Language: PHP - Size: 13.8 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

sandrociceros/anonymizer-1 Fork of linkorb/anonymizer

Anonymizer: scrambles your confidential production data for use in test environments

Language: PHP - Size: 16.6 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

lorenzomagazzini/meg-ctf

MATLAB code for extracting, converting and anonymising files in CTF MEG proprietary format.

Language: Matlab - Size: 4.88 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

Related Topics
anonymization 19 privacy 12 python 11 gdpr 8 data-privacy 7 data-science 7 machine-learning 6 differential-privacy 5 data-masking 5 pii 5 privacy-protection 5 de-identification 4 python3 4 data-protection 4 open-source 4 pii-detection 4 pandas 4 synthetic-data 4 security 4 k-anonymity 3 anonymize 3 pii-anonymization 3 data 3 privacy-tools 3 sensitive-data 3 privacy-preserving-machine-learning 2 deep-learning 2 data-analysis 2 anonymization-service 2 object-detection 2 masking 2 synthetic-dataset-generation 2 anonymization-api 2 quasi-identifiers 2 privacy-enhancing-technologies 2 anonymity 2 encryption 2 anonymizer 2 data-generation 2 data-anonymized 2 data-anonymity 2 ccpa 2 compliance 2 data-tokenization 2 data-minimization 2 package 2 database 2 data-analytics 2 ai 2 named-entity-recognition 2 data-security 2 jupyter-notebook 2 nlp 2 text-anonymization 2 deidentification 2 data-obfuscation 1 pseudonymization 1 cpra 1 data-loss-prevention 1 de-identify 1 dlp 1 hipaa 1 redact 1 redaction 1 t-closeness 1 synthetic-data-generator 1 tokenize 1 open-data 1 l-diversity 1 analytics 1 analytics-tool 1 databases 1 dashboard 1 deployment 1 rubygem 1 anonimized-data 1 transformers 1 spacy 1 phi 1 anonymization-technique 1 personally-identifiable-information 1 bmw 1 computer-vision 1 docker 1 image-transformation 1 inference 1 no-code 1 openvino 1 pytorch 1 semantic-segmentation 1 tensorflow 1 image-redactor 1 tensorflow-training 1 video 1 video-anonymization 1 guardrails 1 gaussian-mechanism 1 gaussian-noise 1 data-redaction 1 data-migration 1