An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-anonymization

IFCA-Advanced-Computing/anjana

ANJANA is a Python library for anonymizing sensitive data

Language: Python - Size: 1.22 MB - Last synced at: about 2 hours ago - Pushed at: about 2 hours ago - Stars: 30 - Forks: 3

microsoft/presidio

Context aware, pluggable and customizable data protection and de-identification SDK for text, images and structured data.

Language: Python - Size: 222 MB - Last synced at: 1 day ago - Pushed at: 14 days ago - Stars: 4,462 - Forks: 632

KI-AIM/Cinnamon

Cinnamon is a modular application designed to offer robust functionalities for data anonymization, synthetization, and evaluation.

Language: Java - Size: 40.5 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 21 - Forks: 1

securitybunker/databunker

Secure Vault for Customer PII/PHI/PCI/KYC Records

Language: Go - Size: 11.1 MB - Last synced at: 9 days ago - Pushed at: 26 days ago - Stars: 1,292 - Forks: 83

makinacorpus/DbToolsBundle

A PHP library to back up, restore and anonymize databases

Language: PHP - Size: 1.65 MB - Last synced at: 6 days ago - Pushed at: 12 days ago - Stars: 199 - Forks: 15

privateai/deid-examples

Examples scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask and synthesize PII in text.

Language: Jupyter Notebook - Size: 37.8 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 80 - Forks: 1

arx-deidentifier/arx

ARX is a comprehensive open source data anonymization tool aiming to provide scalability and usability. It supports various anonymization techniques, methods for analyzing data quality and re-identification risks and it supports well-known privacy models, such as k-anonymity, l-diversity, t-closeness and differential privacy.

Language: Java - Size: 375 MB - Last synced at: 9 days ago - Pushed at: 4 months ago - Stars: 654 - Forks: 220

BMW-InnovationLab/BMW-Anonymization-API

This repository allows you to anonymize sensitive information in images/videos. The solution is fully compatible with the DL-based training/inference solutions that we already published/will publish for Object Detection and Semantic Segmentation.

Language: Python - Size: 44.7 MB - Last synced at: 17 days ago - Pushed at: about 1 year ago - Stars: 185 - Forks: 18

thoughtworks-datakind/anonymizer

Library for identification, anonymization and de-anonymization of PII data

Language: Python - Size: 112 KB - Last synced at: 29 days ago - Pushed at: over 2 years ago - Stars: 22 - Forks: 5

stefanrmmr/differentially_private_synthetic_data

Differentially Private Synthetic Data Generation [DP-SDG] - Experimental Setups & Knowledge Base - WORK IN PROGRESS

Language: Jupyter Notebook - Size: 5.23 MB - Last synced at: 19 days ago - Pushed at: over 2 years ago - Stars: 12 - Forks: 2

OsgiliathEnterprise/data-migrator

Generate anonymized test dataset from production data and configurable anonymization sequences. Execute base to base (vendor agnostic) export and import

Language: Java - Size: 2.18 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 2

snaplet/docs

Snaplet Documentation

Language: HTML - Size: 13.7 MB - Last synced at: about 23 hours ago - Pushed at: 8 months ago - Stars: 28 - Forks: 10

fabriziosalmi/csv-anonymizer

CSV fuzzer/anonymizer

Language: JavaScript - Size: 96.7 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 9 - Forks: 0

ArtLabss/open-data-anonymizer

Python Data Anonymization & Masking Library For Data Science Tasks

Language: Python - Size: 40.2 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 261 - Forks: 31

jftuga/deidentification

Deidentify people's names and gender specific pronouns

Language: Python - Size: 280 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 33 - Forks: 2

eriknovak/anonipy

Data anonymization package, supporting different anonymization strategies

Language: Python - Size: 1.12 MB - Last synced at: 9 days ago - Pushed at: 2 months ago - Stars: 6 - Forks: 3

0xsarwagya/site

My portfolio website built with Next.js and ShadcnUI. Displays My Projects and Work Experience

Language: TypeScript - Size: 3.52 MB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

fgmacedo/datanonymizer

Anonymizer tool for datasets such CSV files

Language: Python - Size: 21.5 KB - Last synced at: 15 days ago - Pushed at: 7 months ago - Stars: 9 - Forks: 0

yevh/anonymizer

Anonymize sensitive data in your datasets.

Language: Python - Size: 1.16 MB - Last synced at: 21 days ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 1

DataFog/datafog-python

Privacy Engineering for the Generative AI era

Language: Python - Size: 78.1 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 5 - Forks: 2

mitchelllisle/redacted

📛 An experimental data anonymisation library

Language: Python - Size: 59.6 KB - Last synced at: 3 days ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

adaptant-labs/data-minimization-service

A simple data minimization and anonymization microservice wrapped around go-minimizer

Language: Go - Size: 12.7 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

ausdfrost/anonymizePy

🌱 anonymizePy helps you anonymize your data with ease

Language: Python - Size: 8.71 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

tonyvargese/data-anonymisation-differential-privacy

Language: Python - Size: 5.16 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 1

CogNetSys/Sonarum

Sonarum revolutionizes human-machine communication by securing real-time text, audio, and video streams while remaining fast, secure, and lightweight. It detects and controls sensitive and secure data on-the-fly, ensuring privacy and security without compromising quality.

Size: 1.95 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Aymane11/anonymize

Data anonymization made easy

Language: Python - Size: 168 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 5 - Forks: 2

Lefteris-Souflas/Census-Privacy-Analysis

Exploring US Census microdata, tackling privacy issues, and anonymization. Exercise A delves into quasi-identifiers, anonymization methods, identification risks, and differential privacy. Exercise B involves data loading, k-anonymity, histograms, adding noise for privacy, computing private averages, and analyzing privacy parameter impacts.

Language: Jupyter Notebook - Size: 3.9 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

prathwik0/real-incognito

Runner Up AIML - HackToFuture, SJEC, Mangalore (Rs. 20,000)

Language: Svelte - Size: 6.24 MB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 2

Mobile-IoT-Security-Lab/HideDroid

HideDroid is an Android app that allows the per-app anonymization of collected personal data according to a privacy level chosen by the user.

Language: Java - Size: 9.12 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 117 - Forks: 8

Wenox/distributed-anonymisation

Distributed Anonymization Platform for SQL databases

Language: Java - Size: 110 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Vishwamitra/data-anonymizer

DataAnonymizer is an open-source personal data anonymization tool designed for GDPR compliancy

Language: TypeScript - Size: 228 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

sandrociceros/DataMasker Fork of Steveiwonder/DataMasker

A free data masking and/or anonymizer library

Language: C# - Size: 46.9 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 3 - Forks: 1

thearchitector/stoplight 📦

Data anonymization signals for Tortoise ORM.

Language: Python - Size: 141 KB - Last synced at: 12 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

jaimedantas/data-anonymization-diabetes

Impacts of data anonymization on model prediction for diabetes

Language: MATLAB - Size: 460 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 3

vishal-kumar-paswan/Data-Anonymization-using-Python

Anonymizing confidential data using the concept of masking.

Language: Jupyter Notebook - Size: 17.6 KB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

nikosgalanis/bsc-thesis

🎓🔒 Creating, Analyzing and Testing Differential Privacy Protocols, aiming in Data Protection and Anonymization.

Language: Jupyter Notebook - Size: 18.3 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 15 - Forks: 1

dmatsanganis/Data_Anonimization_Privacy_Threats_in_US_Census_Microdata_Analysis

This repository contains an analysis of the US Census Bureau's microdata from the 2010 census. The current analysis focuses on understanding the privacy threats associated with the non-anonymized dataset and exploring techniques to preserve privacy while analyzing the data.

Language: HTML - Size: 19 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 1

adaptant-labs/go-minimizer

Data minimization, pseudonymization, and anonymization helpers for Go

Language: Go - Size: 9.77 KB - Last synced at: 27 days ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

F4de78/hkp-coherence

DPP - "Anonymizing Transaction Databases for Publication" - AA 2022/23

Language: TeX - Size: 14.3 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

DatafoundationSystem/ashe

Data anonymity made simple

Language: JavaScript - Size: 105 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

henryhamon/iris-disguise

Data Anonymization tool for InterSystems IRIS

Language: ObjectScript - Size: 119 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 3

ryokugyu/One-Pass-KMeans-Algorithms

Implementation of An Efficient Clustering Method for k-Anonymization in Python 2.7

Language: Python - Size: 5.1 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 2

athulck/Data-Anonymization-Tool

M.Tech final year project to create a data anonymization tool.

Language: Python - Size: 1.34 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

VijayKrishnanSR/VijayKrishnanSR.github.io

A selected collection of my work samples

Size: 126 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

gorskii/excel-hashing-notebook

Hash data stored in Excel spreadsheet using pandas and Python's hashlib library

Language: Jupyter Notebook - Size: 17.6 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

data-protection-helpers/induction-anonymization

Induction to anonymization of data

Language: Jupyter Notebook - Size: 13.9 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

jagumiel/Artificial-Intelligence

First steps on AI. May help for learners like me.

Language: Jupyter Notebook - Size: 31.6 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

Daniel-Hinz/Database-Security-Visualizer

A fully responsive, full stack web application with a working login system designed to demonstrate the benefits of password hashing, salting, and data anonymization.

Language: Python - Size: 847 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

sonbachmi/NgAnonymize

Data anonymization using Angular 2+

Language: TypeScript - Size: 2.48 MB - Last synced at: 12 days ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 0

hamza1886/data-anonymization

Anonymize data using AES-128 encryption/decryption algorithm.

Language: Python - Size: 8.79 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

jamesstonehill/anonymous

Data anonymization for ActiveRecord

Language: Ruby - Size: 19.5 KB - Last synced at: 2 days ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 4

ngshya/anon-ae

Data anonymization

Language: Jupyter Notebook - Size: 1.11 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

markblue777/AnonDataGenerator

A project for generating data defined by a data definition file, the data would be a representation of real data that would be expected. Also enables the anonymisation of Personal identifiable information of data provided in either CSV or SQL connection.

Language: C# - Size: 823 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

sandrociceros/arx Fork of arx-deidentifier/arx

ARX is a comprehensive open source data anonymization tool aiming to provide scalability and usability. It supports various anonymization techniques, methods for analyzing data quality and re-identification risks and it supports well-known privacy models, such as k-anonymity, l-diversity, t-closeness and differential privacy.

Language: Java - Size: 375 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

TTitcombe/PrivacyPanda

Anonymize your Pandas data. Preserve privacy.

Language: Python - Size: 77.1 KB - Last synced at: 7 months ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

sandrociceros/SnakeDumper Fork of digilist/SnakeDumper

Anonymize your database dumps.

Language: PHP - Size: 219 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

sandrociceros/anonymizer-1 Fork of linkorb/anonymizer

Anonymizer: scrambles your confidential production data for use in test environments

Language: PHP - Size: 16.6 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

sandrociceros/neuralyzer Fork of edyan/neuralyzer

Neuralyzer is a library and a command line tool to anonymize databases (by updating existing data or populating a table with fake data)

Language: PHP - Size: 13.8 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

sandrociceros/DataDefender Fork of armenak/DataDefender

Sensitive Data Management: Data Discovery and Anonymization toolkit

Language: Java - Size: 136 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 1

lorenzomagazzini/meg-ctf

MATLAB code for extracting, converting and anonymising files in CTF MEG proprietary format.

Language: Matlab - Size: 4.88 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

Related Keywords
data-anonymization 60 anonymization 19 privacy 12 python 11 gdpr 8 data-science 7 data-privacy 6 machine-learning 6 privacy-protection 6 data-protection 5 pii 5 differential-privacy 5 data-masking 5 de-identification 4 pii-detection 4 pandas 4 pii-anonymization 4 python3 4 security 4 synthetic-data 4 open-source 4 synthetic-dataset-generation 3 text-anonymization 3 privacy-tools 3 anonymize 3 data 3 k-anonymity 3 anonymity 2 deidentification 2 masking 2 deep-learning 2 object-detection 2 privacy-enhancing-technologies 2 data-analysis 2 data-anonymized 2 data-anonymity 2 data-security 2 anonymization-service 2 privacy-preserving-machine-learning 2 quasi-identifiers 2 jupyter-notebook 2 data-generation 2 sensitive-data 2 ai 2 package 2 data-minimization 2 anonymizer 2 data-tokenization 2 data-analytics 2 data-loss-prevention 2 data-scrubbing 2 dlp 2 ccpa 2 compliance 2 database 2 encryption 2 anonymization-api 2 anonymized 1 tensorflow-lite 1 jupyter-notebooks 1 microservice 1 data-extraction 1 ctf 1 gaussian-naive-bayes-implementation 1 streamlit 1 ai-communication 1 audio-security 1 privacy-preserving 1 real-time-audio-processing 1 secure-communications 1 streaming-audio 1 data-generator 1 gaussian-mechanism 1 gaussian-noise 1 autoencoder 1 pseudonymization 1 gan 1 hackathon 1 android 1 crypto 1 password-hashing 1 cryptography 1 mri-data 1 dataset 1 datasets 1 datasets-csv 1 datasets-preparation 1 sensitive 1 meg 1 matlab 1 data-preprocessing 1 devsecaiops 1 devsecops-ai 1 data-manipulation 1 tensorflow2 1 llm-privacy 1 observability 1 rag 1 stream-processing 1 anonymisation 1