An open API service providing repository metadata for many open source software ecosystems.

Topic: "data-anonymization"

microsoft/presidio

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

Language: Python - Size: 223 MB - Last synced at: 1 day ago - Pushed at: 5 days ago - Stars: 4,899 - Forks: 677

securitybunker/databunker

Secure Vault for Customer PII/PHI/PCI/KYC Records

Language: Go - Size: 11.1 MB - Last synced at: 1 day ago - Pushed at: 11 days ago - Stars: 1,308 - Forks: 82

arx-deidentifier/arx

ARX is a comprehensive open source data anonymization tool aiming to provide scalability and usability. It supports various anonymization techniques, methods for analyzing data quality and re-identification risks and it supports well-known privacy models, such as k-anonymity, l-diversity, t-closeness and differential privacy.

Language: Java - Size: 375 MB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 654 - Forks: 220

ArtLabss/open-data-anonymizer

Python Data Anonymization & Masking Library For Data Science Tasks

Language: Python - Size: 40.2 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 266 - Forks: 34

makinacorpus/DbToolsBundle

A PHP library to back up, restore and anonymize databases

Language: PHP - Size: 1.65 MB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 208 - Forks: 16

BMW-InnovationLab/BMW-Anonymization-API

This repository allows you to anonymize sensitive information in images/videos. The solution is fully compatible with the DL-based training/inference solutions that we already published/will publish for Object Detection and Semantic Segmentation.

Language: Python - Size: 44.7 MB - Last synced at: 20 days ago - Pushed at: over 1 year ago - Stars: 186 - Forks: 18

Mobile-IoT-Security-Lab/HideDroid

HideDroid is an Android app that allows the per-app anonymization of collected personal data according to a privacy level chosen by the user.

Language: Java - Size: 9.12 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 117 - Forks: 8

privateai/deid-examples

Examples scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask and synthesize PII in text.

Language: Jupyter Notebook - Size: 37.8 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 81 - Forks: 1

jftuga/deidentification

Deidentify people's names and gender specific pronouns

Language: Python - Size: 284 KB - Last synced at: 24 days ago - Pushed at: about 2 months ago - Stars: 35 - Forks: 2

IFCA-Advanced-Computing/anjana

ANJANA is a Python library for anonymizing sensitive data

Language: Python - Size: 1.24 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 32 - Forks: 3

snaplet/docs

Snaplet Documentation

Language: HTML - Size: 13.7 MB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 28 - Forks: 10

thoughtworks-datakind/anonymizer

Library for identification, anonymization and de-anonymization of PII data

Language: Python - Size: 112 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 22 - Forks: 5

KI-AIM/Cinnamon

Cinnamon is a modular application designed to offer robust functionalities for data anonymization, synthetization, and evaluation.

Language: Java - Size: 40.5 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 21 - Forks: 1

nikosgalanis/bsc-thesis

🎓🔒 Creating, Analyzing and Testing Differential Privacy Protocols, aiming in Data Protection and Anonymization.

Language: Jupyter Notebook - Size: 18.3 MB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 15 - Forks: 1

aliengiraffe/deidentify

Simple yet powerful tool for identifying and anonymizing personal information in various formats.

Language: Go - Size: 92.8 KB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 12 - Forks: 0

yevh/anonymizer

Anonymize sensitive data in your datasets.

Language: Python - Size: 1.16 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 12 - Forks: 1

stefanrmmr/differentially_private_synthetic_data

Differentially Private Synthetic Data Generation [DP-SDG] - Experimental Setups & Knowledge Base - WORK IN PROGRESS

Language: Jupyter Notebook - Size: 5.23 MB - Last synced at: 3 days ago - Pushed at: almost 3 years ago - Stars: 12 - Forks: 2

fabriziosalmi/csv-anonymizer

CSV fuzzer/anonymizer

Language: JavaScript - Size: 260 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 10 - Forks: 0

fgmacedo/datanonymizer

Anonymizer tool for datasets such CSV files

Language: Python - Size: 21.5 KB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 9 - Forks: 0

eriknovak/anonipy

Data anonymization package, supporting different anonymization strategies

Language: Python - Size: 1.1 MB - Last synced at: 9 days ago - Pushed at: about 2 months ago - Stars: 7 - Forks: 3

OsgiliathEnterprise/data-migrator

Generate anonymized test dataset from production data and configurable anonymization sequences. Execute base to base (vendor agnostic) export and import

Language: Java - Size: 2.11 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 2

Aymane11/anonymize

Data anonymization made easy

Language: Python - Size: 168 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 2

ryokugyu/One-Pass-KMeans-Algorithms

Implementation of An Efficient Clustering Method for k-Anonymization in Python 2.7

Language: Python - Size: 5.1 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 2

jaimedantas/data-anonymization-diabetes

Impacts of data anonymization on model prediction for diabetes

Language: MATLAB - Size: 460 KB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 3

sonbachmi/NgAnonymize

Data anonymization using Angular 2+

Language: TypeScript - Size: 2.48 MB - Last synced at: 24 days ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

sandrociceros/DataMasker Fork of Steveiwonder/DataMasker

A free data masking and/or anonymizer library

Language: C# - Size: 46.9 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 3 - Forks: 1

0xsarwagya/site

My portfolio website built with Next.js and ShadcnUI. Displays My Projects and Work Experience

Language: TypeScript - Size: 3.52 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 2 - Forks: 0

Club-Innovate/GenAI-SQL-CLI

GenAI-SQL is a modular, extensible suite of AI-powered tools for automating SQL code improvement, documentation, and validation. Built for developers, analysts, and data engineers, it leverages Azure OpenAI (GPT-4o) to analyze, refactor, comment, explain, test, and audit SQL — all within a secure, asynchronous, and HIPAA-compliant framework.

Language: Python - Size: 1.03 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

pavanad/beegen

BeeGen is an intelligent command-line tool designed to assist developers with everyday tasks, leveraging the power of generative AI.

Language: Python - Size: 407 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

Vishwamitra/data-anonymizer

DataAnonymizer is an open-source personal data anonymization tool designed for GDPR compliancy

Language: TypeScript - Size: 228 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

data-protection-helpers/induction-anonymization

Induction to anonymization of data

Language: Jupyter Notebook - Size: 13.9 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

henryhamon/iris-disguise

Data Anonymization tool for InterSystems IRIS

Language: ObjectScript - Size: 119 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 3

adaptant-labs/go-minimizer

Data minimization, pseudonymization, and anonymization helpers for Go

Language: Go - Size: 9.77 KB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

tonyvargese/data-anonymisation-differential-privacy

Language: Python - Size: 5.16 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

prathwik0/real-incognito

Runner Up AIML - HackToFuture, SJEC, Mangalore (Rs. 20,000)

Language: Svelte - Size: 6.24 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 2

jagumiel/Artificial-Intelligence

First steps on AI. May help for learners like me.

Language: Jupyter Notebook - Size: 31.6 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

thearchitector/stoplight 📦

Data anonymization signals for Tortoise ORM.

Language: Python - Size: 141 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

ngshya/anon-ae

Data anonymization

Language: Jupyter Notebook - Size: 1.11 MB - Last synced at: 8 days ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1

TTitcombe/PrivacyPanda

Anonymize your Pandas data. Preserve privacy.

Language: Python - Size: 77.1 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 1

adaptant-labs/data-minimization-service

A simple data minimization and anonymization microservice wrapped around go-minimizer

Language: Go - Size: 12.7 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

jamesstonehill/anonymous

Data anonymization for ActiveRecord

Language: Ruby - Size: 19.5 KB - Last synced at: 6 days ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 4

sandrociceros/arx Fork of arx-deidentifier/arx

ARX is a comprehensive open source data anonymization tool aiming to provide scalability and usability. It supports various anonymization techniques, methods for analyzing data quality and re-identification risks and it supports well-known privacy models, such as k-anonymity, l-diversity, t-closeness and differential privacy.

Language: Java - Size: 375 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 1 - Forks: 1

mitchelllisle/redacted

📛 An experimental data anonymisation library

Language: Python - Size: 60.5 KB - Last synced at: 19 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

SakinaJaffri/CommonWealth_Bank_Virtual_Internship

• Data Aggregation and Analysis • Data Anonymisation • Propose Data Analysis Approaches • Designing a Database

Language: Jupyter Notebook - Size: 19.5 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

ausdfrost/anonymizePy

🌱 anonymizePy helps you anonymize your data with ease

Language: Python - Size: 8.71 MB - Last synced at: 28 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

CogNetSys/Sonarum

Sonarum revolutionizes human-machine communication by securing real-time text, audio, and video streams while remaining fast, secure, and lightweight. It detects and controls sensitive and secure data on-the-fly, ensuring privacy and security without compromising quality.

Size: 1.95 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Lefteris-Souflas/Census-Privacy-Analysis

Exploring US Census microdata, tackling privacy issues, and anonymization. Exercise A delves into quasi-identifiers, anonymization methods, identification risks, and differential privacy. Exercise B involves data loading, k-anonymity, histograms, adding noise for privacy, computing private averages, and analyzing privacy parameter impacts.

Language: Jupyter Notebook - Size: 3.9 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Wenox/distributed-anonymisation

Distributed Anonymization Platform for SQL databases

Language: Java - Size: 110 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

VijayKrishnanSR/VijayKrishnanSR.github.io

A selected collection of my work samples

Size: 126 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

dmatsanganis/Data_Anonimization_Privacy_Threats_in_US_Census_Microdata_Analysis

This repository contains an analysis of the US Census Bureau's microdata from the 2010 census. The current analysis focuses on understanding the privacy threats associated with the non-anonymized dataset and exploring techniques to preserve privacy while analyzing the data.

Language: HTML - Size: 19 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1

vishal-kumar-paswan/Data-Anonymization-using-Python

Anonymizing confidential data using the concept of masking.

Language: Jupyter Notebook - Size: 17.6 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

DatafoundationSystem/ashe

Data anonymity made simple

Language: JavaScript - Size: 105 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

F4de78/hkp-coherence

DPP - "Anonymizing Transaction Databases for Publication" - AA 2022/23

Language: TeX - Size: 14.3 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

athulck/Data-Anonymization-Tool

M.Tech final year project to create a data anonymization tool.

Language: Python - Size: 1.34 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Daniel-Hinz/Database-Security-Visualizer

A fully responsive, full stack web application with a working login system designed to demonstrate the benefits of password hashing, salting, and data anonymization.

Language: Python - Size: 847 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

gorskii/excel-hashing-notebook

Hash data stored in Excel spreadsheet using pandas and Python's hashlib library

Language: Jupyter Notebook - Size: 17.6 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

hamza1886/data-anonymization

Anonymize data using AES-128 encryption/decryption algorithm.

Language: Python - Size: 8.79 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

markblue777/AnonDataGenerator

A project for generating data defined by a data definition file, the data would be a representation of real data that would be expected. Also enables the anonymisation of Personal identifiable information of data provided in either CSV or SQL connection.

Language: C# - Size: 823 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

sandrociceros/SnakeDumper Fork of digilist/SnakeDumper

Anonymize your database dumps.

Language: PHP - Size: 219 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

sandrociceros/DataDefender Fork of armenak/DataDefender

Sensitive Data Management: Data Discovery and Anonymization toolkit

Language: Java - Size: 136 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 1

sandrociceros/neuralyzer Fork of edyan/neuralyzer

Neuralyzer is a library and a command line tool to anonymize databases (by updating existing data or populating a table with fake data)

Language: PHP - Size: 13.8 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

sandrociceros/anonymizer-1 Fork of linkorb/anonymizer

Anonymizer: scrambles your confidential production data for use in test environments

Language: PHP - Size: 16.6 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 0 - Forks: 0

lorenzomagazzini/meg-ctf

MATLAB code for extracting, converting and anonymising files in CTF MEG proprietary format.

Language: Matlab - Size: 4.88 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

Related Topics
anonymization 20 privacy 12 python 10 gdpr 9 data-masking 8 data-privacy 8 data-science 6 differential-privacy 5 data-protection 5 machine-learning 5 pii 5 security 4 python3 4 pii-detection 4 synthetic-data 4 privacy-protection 4 sensitive-data 4 de-identification 4 privacy-tools 4 pandas 4 compliance 3 k-anonymity 3 pii-anonymization 3 data-security 3 open-source 3 anonymize 3 deidentification 3 data 3 named-entity-recognition 2 data-tokenization 2 artificial-intelligence 2 anonymizer 2 database 2 sql 2 cli 2 nlp 2 anonymity 2 data-analysis 2 anonymization-service 2 data-anonymized 2 data-minimization 2 masking 2 natural-language-processing 2 data-anonymity 2 data-generation 2 privacy-enhancing-technologies 2 package 2 anonymization-api 2 data-analytics 2 object-detection 2 privacy-preserving-machine-learning 2 text-anonymization 2 synthetic-dataset-generation 2 developer-tools 2 quasi-identifiers 2 redaction 2 jupyter-notebook 2 deep-learning 2 encryption 2 ccpa 2 llama 1 redact 1 mock-api 1 ollama 1 langchain 1 openai 1 readme-generator 1 semantic-search 1 terminal-chat 1 hipaa 1 translation-tool 1 data-obfuscation 1 data-redaction 1 dlp 1 de-identify 1 data-loss-prevention 1 cpra 1 pategan 1 privacy-preserving-synthetic-data 1 dsgvokonform 1 sensitive-data-security 1 dpwgan 1 dpsdg 1 synthetic-data-generation 1 dpgan 1 ai-assistant 1 automation 1 differentially-private 1 cross-platform 1 arx 1 term-extraction 1 code-snippets 1 faiss 1 tokenize 1 gemini 1 synthetic-data-generator 1 generative-ai 1 legaltech 1 passportjs 1 piidata 1