Topic: "adversarial-attacks"
BishopFox/sliver
Adversary Emulation Framework
Language: Go - Size: 165 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 9,288 - Forks: 1,255

Trusted-AI/adversarial-robustness-toolbox
Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
Language: Python - Size: 610 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 5,234 - Forks: 1,210

makcedward/nlpaug
Data augmentation for NLP
Language: Jupyter Notebook - Size: 3.21 MB - Last synced at: 17 days ago - Pushed at: 11 months ago - Stars: 4,551 - Forks: 468

QData/TextAttack
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/
Language: Python - Size: 25.3 MB - Last synced at: 3 days ago - Pushed at: 10 months ago - Stars: 3,160 - Forks: 414

bethgelab/foolbox
A Python toolbox to create adversarial examples that fool neural networks in PyTorch, TensorFlow, and JAX
Language: Python - Size: 10.7 MB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 2,852 - Forks: 432

microsoft/promptbench
A unified evaluation framework for large language models
Language: Python - Size: 5.56 MB - Last synced at: 5 days ago - Pushed at: 13 days ago - Stars: 2,606 - Forks: 190

Harry24k/adversarial-attacks-pytorch
PyTorch implementation of adversarial attacks [torchattacks]
Language: Python - Size: 50.6 MB - Last synced at: 1 day ago - Pushed at: 11 months ago - Stars: 2,016 - Forks: 360

thunlp/TAADpapers
Must-read Papers on Textual Adversarial Attack and Defense
Language: Python - Size: 295 KB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 1,546 - Forks: 194

advboxes/AdvBox
Advbox is a toolbox to generate adversarial examples that fool neural networks in PaddlePaddle、PyTorch、Caffe2、MxNet、Keras、TensorFlow and Advbox can benchmark the robustness of machine learning models. Advbox give a command line tool to generate adversarial examples with Zero-Coding.
Language: Jupyter Notebook - Size: 99.3 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 1,391 - Forks: 265

ThuCCSLab/Awesome-LM-SSP
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
Size: 2.46 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,386 - Forks: 88

BorealisAI/advertorch
A Toolbox for Adversarial Robustness Research
Language: Jupyter Notebook - Size: 8.19 MB - Last synced at: 21 days ago - Pushed at: over 1 year ago - Stars: 1,334 - Forks: 198

DSE-MSU/DeepRobust
A pytorch adversarial library for attack and defense methods on images and graphs
Language: Python - Size: 11.9 MB - Last synced at: 28 days ago - Pushed at: 10 months ago - Stars: 1,035 - Forks: 193

shubhomoydas/ad_examples
A collection of anomaly detection methods (iid/point-based, graph and time series) including active learning for anomaly detection/discovery, bayesian rule-mining, description for diversity/explanation/interpretability. Analysis of incorporating label feedback with ensemble and tree-based detectors. Includes adversarial attacks with Graph Convolutional Network.
Language: Python - Size: 125 MB - Last synced at: about 2 months ago - Pushed at: 12 months ago - Stars: 855 - Forks: 184

safe-graph/graph-adversarial-learning-literature
A curated list of adversarial attacks and defenses papers on graph-structured data.
Size: 544 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 851 - Forks: 132

thunlp/OpenAttack
An Open-Source Package for Textual Adversarial Attack.
Language: Python - Size: 4.65 MB - Last synced at: about 17 hours ago - Pushed at: almost 2 years ago - Stars: 727 - Forks: 127

fra31/auto-attack
Code relative to "Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks"
Language: Python - Size: 39.7 MB - Last synced at: 10 days ago - Pushed at: 12 months ago - Stars: 694 - Forks: 118

hendrycks/natural-adv-examples
A Harder ImageNet Test Set (CVPR 2021)
Language: Python - Size: 2.1 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 603 - Forks: 52

MadryLab/photoguard
Raising the Cost of Malicious AI-Powered Image Editing
Language: Jupyter Notebook - Size: 17.1 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 589 - Forks: 48

jind11/TextFooler
A Model for Natural Language Attack on Text Classification and Inference
Language: Python - Size: 2.77 MB - Last synced at: 24 days ago - Pushed at: over 2 years ago - Stars: 509 - Forks: 82

thu-ml/ares
A Python library for adversarial machine learning focusing on benchmarking adversarial robustness.
Language: Python - Size: 378 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 502 - Forks: 88

Koukyosyumei/AIJack
Security and Privacy Risk Simulator for Machine Learning (arXiv:2312.17667)
Language: C++ - Size: 152 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 388 - Forks: 63

ChandlerBang/awesome-graph-attack-papers
Adversarial attacks and defenses on Graph Neural Networks.
Size: 90.8 KB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 381 - Forks: 31

deadbits/vigil-llm
⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language Model (LLM) inputs
Language: Python - Size: 548 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 370 - Forks: 41

sarathknv/adversarial-examples-pytorch
Implementation of Papers on Adversarial Examples
Language: Python - Size: 254 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 367 - Forks: 74

agencyenterprise/PromptInject
PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to adversarial prompt attacks. 🏆 Best Paper Awards @ NeurIPS ML Safety Workshop 2022
Language: Python - Size: 222 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 365 - Forks: 36

HuntDownProject/HEDnsExtractor
A suite for hunting suspicious targets, expose domains and phishing discovery
Language: Go - Size: 3.11 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 333 - Forks: 45

natanielruiz/disrupting-deepfakes
🔥🔥Defending Against Deepfakes Using Adversarial Attacks on Conditional Image Translation Networks
Language: Python - Size: 49.2 MB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 326 - Forks: 47

hbaniecki/adversarial-explainable-ai
💡 Adversarial attacks on explanations and how to defend them
Size: 2.62 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 314 - Forks: 48

pumpbin/pumpbin
🎃 PumpBin is an Implant Generation Platform.
Language: Rust - Size: 2.31 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 304 - Forks: 33

ChandlerBang/Pro-GNN
Implementation of the KDD 2020 paper "Graph Structure Learning for Robust Graph Neural Networks"
Language: Python - Size: 9.86 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 293 - Forks: 45

ain-soph/trojanzoo
TrojanZoo provides a universal pytorch platform to conduct security researches (especially backdoor attacks/defenses) of image classification in deep learning.
Language: Python - Size: 15.7 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 288 - Forks: 63

automorphic-ai/aegis
Self-hardening firewall for large language models
Language: Python - Size: 21.5 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 264 - Forks: 6

1Konny/FGSM
Simple pytorch implementation of FGSM and I-FGSM
Language: Python - Size: 14.3 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 250 - Forks: 69

haofanwang/Awesome-Computer-Vision
Awesome Resources for Advanced Computer Vision Topics
Size: 93.8 KB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 230 - Forks: 43

kabkabm/defensegan
Defense-GAN: Protecting Classifiers Against Adversarial Attacks Using Generative Models (published in ICLR2018)
Language: Python - Size: 2.72 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 221 - Forks: 62

VinAIResearch/Anti-DreamBooth
Anti-DreamBooth: Protecting users from personalized text-to-image synthesis (ICCV 2023)
Language: Python - Size: 106 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 220 - Forks: 19

danielzuegner/nettack
Implementation of the paper "Adversarial Attacks on Neural Networks for Graph Data".
Language: Python - Size: 485 KB - Last synced at: 6 months ago - Pushed at: almost 3 years ago - Stars: 218 - Forks: 56

ryderling/DEEPSEC
DEEPSEC: A Uniform Platform for Security Analysis of Deep Learning Model
Language: Python - Size: 172 MB - Last synced at: 23 days ago - Pushed at: almost 6 years ago - Stars: 215 - Forks: 70

The-Z-Labs/bof-launcher
Beacon Object File (BOF) launcher - library for executing BOF files in C/C++/Zig applications
Language: Zig - Size: 790 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 204 - Forks: 16

tao-bai/attack-and-defense-methods
A curated list of papers on adversarial machine learning (adversarial examples and defense methods).
Language: TeX - Size: 17.4 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 197 - Forks: 25

bosch-aisecurity-aishield/watchtower
AIShield Watchtower: Dive Deep into AI's Secrets! 🔍 Open-source tool by AIShield for AI model insights & vulnerability scans. Secure your AI supply chain today! ⚙️🛡️
Language: PureBasic - Size: 21.1 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 191 - Forks: 15

a1600012888/YOPO-You-Only-Propagate-Once
Code for our nips19 paper: You Only Propagate Once: Accelerating Adversarial Training Via Maximal Principle
Language: Python - Size: 770 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 173 - Forks: 30

ashafahi/free_adv_train
Official TensorFlow Implementation of Adversarial Training for Free! which trains robust models at no extra cost compared to natural training.
Language: Python - Size: 48.8 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 170 - Forks: 30

csdongxian/AWP
Codes for NeurIPS 2020 paper "Adversarial Weight Perturbation Helps Robust Generalization"
Language: Python - Size: 372 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 163 - Forks: 22

safreita1/TIGER
Python toolbox to evaluate graph vulnerability and robustness (CIKM 2021)
Language: Python - Size: 22.6 MB - Last synced at: 11 days ago - Pushed at: over 2 years ago - Stars: 158 - Forks: 26

sisinflab/adversarial-recommender-systems-survey
The goal of this survey is two-fold: (i) to present recent advances on adversarial machine learning (AML) for the security of RS (i.e., attacking and defense recommendation models), (ii) to show another successful application of AML in generative adversarial networks (GANs) for generative applications, thanks to their ability for learning (high-dimensional) data distributions. In this survey, we provide an exhaustive literature review of 74 articles published in major RS and ML journals and conferences. This review serves as a reference for the RS community, working on the security of RS or on generative models using GANs to improve their quality.
Size: 203 KB - Last synced at: 10 months ago - Pushed at: about 4 years ago - Stars: 156 - Forks: 32

PKU-YuanGroup/Hallucination-Attack
Attack to induce LLMs within hallucinations
Language: Python - Size: 2.73 MB - Last synced at: 19 days ago - Pushed at: 12 months ago - Stars: 155 - Forks: 19

shangtse/robust-physical-attack
Physical adversarial attack for fooling the Faster R-CNN object detector
Language: Jupyter Notebook - Size: 11.9 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 154 - Forks: 49

jeromerony/adversarial-library
Library containing PyTorch implementations of various adversarial attacks and resources
Language: Python - Size: 201 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 153 - Forks: 20

Harry24k/PGD-pytorch
A pytorch implementation of "Towards Deep Learning Models Resistant to Adversarial Attacks"
Language: Jupyter Notebook - Size: 621 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 153 - Forks: 37

danielzuegner/gnn-meta-attack
Implementation of the paper "Adversarial Attacks on Graph Neural Networks via Meta Learning".
Language: Python - Size: 1.15 MB - Last synced at: 6 months ago - Pushed at: over 3 years ago - Stars: 142 - Forks: 26

safellama/plexiglass
A toolkit for detecting and protecting against vulnerabilities in Large Language Models (LLMs).
Language: Python - Size: 20.6 MB - Last synced at: about 14 hours ago - Pushed at: over 1 year ago - Stars: 136 - Forks: 15

OmidPoursaeed/Generative_Adversarial_Perturbations
Generative Adversarial Perturbations (CVPR 2018)
Language: Python - Size: 388 KB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 136 - Forks: 23

git-disl/TOG
Real-time object detection is one of the key applications of deep neural networks (DNNs) for real-world mission-critical systems. While DNN-powered object detection systems celebrate many life-enriching opportunities, they also open doors for misuse and abuse. This project presents a suite of adversarial objectness gradient attacks, coined as TOG, which can cause the state-of-the-art deep object detection networks to suffer from untargeted random attacks or even targeted attacks with three types of specificity: (1) object-vanishing, (2) object-fabrication, and (3) object-mislabeling. Apart from tailoring an adversarial perturbation for each input image, we further demonstrate TOG as a universal attack, which trains a single adversarial perturbation that can be generalized to effectively craft an unseen input with a negligible attack time cost. Also, we apply TOG as an adversarial patch attack, a form of physical attacks, showing its ability to optimize a visually confined patch filled with malicious patterns, deceiving well-trained object detectors to misbehave purposefully.
Language: Jupyter Notebook - Size: 59 MB - Last synced at: 21 days ago - Pushed at: about 2 years ago - Stars: 135 - Forks: 41

jeromerony/fast_adversarial
Code for the CVPR 2019 article "Decoupling Direction and Norm for Efficient Gradient-Based L2 Adversarial Attacks and Defenses"
Language: Python - Size: 234 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 134 - Forks: 14

nebula-beta/awesome-adversarial-deep-learning
A list of awesome resources for adversarial attack and defense method in deep learning
Size: 150 KB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 133 - Forks: 11

EdisonLeeeee/RS-Adversarial-Learning
A curated collection of adversarial attack and defense on recommender systems.
Size: 62.5 KB - Last synced at: 8 days ago - Pushed at: about 3 years ago - Stars: 133 - Forks: 7

max-andr/square-attack
Square Attack: a query-efficient black-box adversarial attack via random search [ECCV 2020]
Language: Python - Size: 12.5 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 128 - Forks: 24

declare-lab/dialogue-understanding
This repository contains PyTorch implementation for the baseline models from the paper Utterance-level Dialogue Understanding: An Empirical Study
Language: Python - Size: 216 MB - Last synced at: 28 days ago - Pushed at: about 2 years ago - Stars: 126 - Forks: 21

gmh14/RobNets
[CVPR 2020] When NAS Meets Robustness: In Search of Robust Architectures against Adversarial Attacks
Language: Python - Size: 324 KB - Last synced at: about 12 hours ago - Pushed at: over 4 years ago - Stars: 124 - Forks: 15

wy1iu/DCNets
Implementation for <Decoupled Networks> in CVPR'18.
Language: Python - Size: 479 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 115 - Forks: 31

vita-epfl/s-attack
[CVPR 2025] Official implementation of three papers "Certified Human Trajectory Prediction", "Vehicle trajectory prediction works, but not everywhere", and "Are socially-aware trajectory prediction models really socially-aware?".
Language: Python - Size: 108 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 106 - Forks: 16

ShawnXYang/Face-Robustness-Benchmark
An adversarial robustness evaluation library on face recognition.
Language: Python - Size: 19.5 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 106 - Forks: 15

cuge1995/awesome-3D-point-cloud-attacks
List of state of the art papers, code, and other resources
Size: 38.1 KB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 106 - Forks: 14

chs20/RobustVLM
[ICML 2024] Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models
Language: Python - Size: 10.2 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 98 - Forks: 3

Eaphan/Robust3DOD
A Comprehensive Study of the Robustness for LiDAR-based 3D Object Detectors against Adversarial Attacks
Language: Python - Size: 1.57 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 96 - Forks: 6

DmitryRyumin/WACV-2024-Papers
WACV 2024 Papers: Discover cutting-edge research from WACV 2024, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support visual intelligence development!
Language: Python - Size: 7.31 MB - Last synced at: 30 days ago - Pushed at: 8 months ago - Stars: 96 - Forks: 13

iArunava/scratchai
scratchai is a Deep Learning library that aims to store all Deep Learning algorithms. With easy calls to do all the common tasks in AI.
Language: Python - Size: 17.6 MB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 96 - Forks: 18

reds-lab/Narcissus
The official implementation of the CCS'23 paper, Narcissus clean-label backdoor attack -- only takes THREE images to poison a face recognition dataset in a clean-label way and achieves a 99.89% attack success rate.
Language: Python - Size: 143 KB - Last synced at: 11 months ago - Pushed at: about 2 years ago - Stars: 96 - Forks: 10

THUDM/grb
Graph Robustness Benchmark: A scalable, unified, modular, and reproducible benchmark for evaluating the adversarial robustness of Graph Machine Learning.
Language: Python - Size: 12.7 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 94 - Forks: 17

zqzqz/AdvTrajectoryPrediction
Implementation of CVPR 2022 paper "On Adversarial Robustness of Trajectory Prediction for Autonomous Vehicles" https://arxiv.org/abs/2201.05057
Language: Python - Size: 84.8 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 91 - Forks: 17

chenhongge/StateAdvDRL
[NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"
Size: 4.8 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 91 - Forks: 16

thunlp/SememePSO-Attack
Code and data of the ACL 2020 paper "Word-level Textual Adversarial Attacking as Combinatorial Optimization"
Language: Python - Size: 58.7 MB - Last synced at: 3 days ago - Pushed at: about 4 years ago - Stars: 88 - Forks: 14

nebula-beta/torchadver
A PyTorch Toolbox for creating adversarial examples that fool neural networks.
Language: Python - Size: 38.9 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 88 - Forks: 7

dipanjanS/adversarial-learning-robustness
Contains materials for workshops pertaining to adversarial robustness in deep learning.
Language: Jupyter Notebook - Size: 77.6 MB - Last synced at: 12 days ago - Pushed at: about 4 years ago - Stars: 86 - Forks: 40

EdisonLeeeee/GreatX
A graph reliability toolbox based on PyTorch and PyTorch Geometric (PyG).
Language: Python - Size: 8.14 MB - Last synced at: 7 days ago - Pushed at: 7 months ago - Stars: 85 - Forks: 13

FAKEBOB-adversarial-attack/FAKEBOB
Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems" (IEEE S&P 2021)
Language: Python - Size: 12.1 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 84 - Forks: 24

qilong-zhang/Patch-wise-iterative-attack
Patch-wise iterative attack (accepted by ECCV 2020) to improve the transferability of adversarial examples.
Language: Python - Size: 145 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 84 - Forks: 21

as791/Adversarial-Example-Attack-and-Defense
This repository contains the implementation of three adversarial example attack methods FGSM, IFGSM, MI-FGSM and one Distillation as defense against all attacks using MNIST dataset.
Language: Jupyter Notebook - Size: 1.71 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 84 - Forks: 21

moohax/Proof-Pudding
Copy cat model for Proofpoint
Language: Python - Size: 20.1 MB - Last synced at: 12 days ago - Pushed at: about 5 years ago - Stars: 83 - Forks: 4

AI-secure/InfoBERT
[ICLR 2021] "InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective" by Boxin Wang, Shuohang Wang, Yu Cheng, Zhe Gan, Ruoxi Jia, Bo Li, Jingjing Liu
Language: Python - Size: 72.3 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 81 - Forks: 6

WenRichard/DIAC2019-Adversarial-Attack-Share
DIAC2019基于Adversarial Attack的问题等价性判别比赛
Size: 114 KB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 81 - Forks: 12

392781/FaceOff
Steps towards physical adversarial attacks on facial recognition
Language: Python - Size: 132 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 80 - Forks: 14

neu-autonomy/nfl_veripy
Formal Verification of Neural Feedback Loops (NFLs)
Language: Python - Size: 66.4 MB - Last synced at: 20 days ago - Pushed at: 8 months ago - Stars: 79 - Forks: 15

ForeverPs/Robust-Classification
CVPR 2022 Workshop Robust Classification
Language: Python - Size: 145 MB - Last synced at: 4 months ago - Pushed at: almost 3 years ago - Stars: 79 - Forks: 3

openopt/chop
CHOP: An optimization library based on PyTorch, with applications to adversarial examples and structured neural network training.
Language: Python - Size: 378 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 77 - Forks: 14

hfawaz/ijcnn19attacks
Adversarial Attacks on Deep Neural Networks for Time Series Classification
Language: Jupyter Notebook - Size: 4.77 MB - Last synced at: about 1 month ago - Pushed at: almost 5 years ago - Stars: 77 - Forks: 28

layumi/Awesome-Fools
:skull: A collection of methods to fool the deep neural network :skull:
Size: 41 KB - Last synced at: 8 days ago - Pushed at: 8 months ago - Stars: 76 - Forks: 8

jinzhuoran/RWKU
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024
Language: Python - Size: 3.82 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 72 - Forks: 7

davide97l/rl-policies-attacks-defenses
Adversarial attacks on Deep Reinforcement Learning (RL)
Language: Jupyter Notebook - Size: 346 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 72 - Forks: 12

OPTML-Group/Diffusion-MU-Attack
The official implementation of ECCV'24 paper "To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images ... For Now". This work introduces one fast and effective attack method to evaluate the harmful-content generation ability of safety-driven unlearned diffusion models.
Language: Python - Size: 11.9 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 71 - Forks: 3

bhheo/BSS_distillation
Knowledge Distillation with Adversarial Samples Supporting Decision Boundary (AAAI 2019)
Language: Python - Size: 1.3 MB - Last synced at: 12 days ago - Pushed at: over 5 years ago - Stars: 71 - Forks: 11

UCSC-VLAA/vllm-safety-benchmark
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
Language: Python - Size: 3.17 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 70 - Forks: 3

hyperion-ml/hyperion
Python toolkit for speech processing
Language: Python - Size: 154 MB - Last synced at: 10 days ago - Pushed at: about 1 month ago - Stars: 68 - Forks: 21

dylan-slack/Fooling-LIME-SHAP
Adversarial Attacks on Post Hoc Explanation Techniques (LIME/SHAP)
Language: Jupyter Notebook - Size: 1.06 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 68 - Forks: 16

ai4ce/FLAT
[ICCV2021 Oral] Fooling LiDAR by Attacking GPS Trajectory
Language: Python - Size: 48.9 MB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 68 - Forks: 10

val-iisc/GD-UAP
Generalized Data-free Universal Adversarial Perturbations
Language: Python - Size: 8.66 MB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 68 - Forks: 13

Harry24k/FGSM-pytorch
A pytorch implementation of "Explaining and harnessing adversarial examples"
Language: Jupyter Notebook - Size: 698 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 67 - Forks: 16

snakers4/msc-2018-final
Language: Python - Size: 251 KB - Last synced at: 6 days ago - Pushed at: almost 7 years ago - Stars: 66 - Forks: 16

jfc43/robust-ood-detection
Robust Out-of-distribution Detection in Neural Networks
Language: Python - Size: 37.1 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 65 - Forks: 7
