adversarial-attacks | Topic | Ecosyste.ms: Repos

Topic: "adversarial-attacks"

BishopFox/sliver

Adversary Emulation Framework

Language: Go - Size: 165 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 9,288 - Forks: 1,255

Trusted-AI/adversarial-robustness-toolbox

Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams

Language: Python - Size: 610 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 5,234 - Forks: 1,210

makcedward/nlpaug

Data augmentation for NLP

Language: Jupyter Notebook - Size: 3.21 MB - Last synced at: 17 days ago - Pushed at: 11 months ago - Stars: 4,551 - Forks: 468

QData/TextAttack

TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/

Language: Python - Size: 25.3 MB - Last synced at: 3 days ago - Pushed at: 10 months ago - Stars: 3,160 - Forks: 414

bethgelab/foolbox

A Python toolbox to create adversarial examples that fool neural networks in PyTorch, TensorFlow, and JAX

Language: Python - Size: 10.7 MB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 2,852 - Forks: 432

microsoft/promptbench

A unified evaluation framework for large language models

Language: Python - Size: 5.56 MB - Last synced at: 5 days ago - Pushed at: 13 days ago - Stars: 2,606 - Forks: 190

Harry24k/adversarial-attacks-pytorch

PyTorch implementation of adversarial attacks [torchattacks]

Language: Python - Size: 50.6 MB - Last synced at: 1 day ago - Pushed at: 11 months ago - Stars: 2,016 - Forks: 360

thunlp/TAADpapers

Must-read Papers on Textual Adversarial Attack and Defense

Language: Python - Size: 295 KB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 1,546 - Forks: 194

Advbox is a toolbox to generate adversarial examples that fool neural networks in PaddlePaddle、PyTorch、Caffe2、MxNet、Keras、TensorFlow and Advbox can benchmark the robustness of machine learning models. Advbox give a command line tool to generate adversarial examples with Zero-Coding.

Language: Jupyter Notebook - Size: 99.3 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 1,391 - Forks: 265

ThuCCSLab/Awesome-LM-SSP

A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

Size: 2.46 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,386 - Forks: 88

BorealisAI/advertorch

A Toolbox for Adversarial Robustness Research

Language: Jupyter Notebook - Size: 8.19 MB - Last synced at: 21 days ago - Pushed at: over 1 year ago - Stars: 1,334 - Forks: 198

DSE-MSU/DeepRobust

A pytorch adversarial library for attack and defense methods on images and graphs

Language: Python - Size: 11.9 MB - Last synced at: 28 days ago - Pushed at: 10 months ago - Stars: 1,035 - Forks: 193

shubhomoydas/ad_examples

A collection of anomaly detection methods (iid/point-based, graph and time series) including active learning for anomaly detection/discovery, bayesian rule-mining, description for diversity/explanation/interpretability. Analysis of incorporating label feedback with ensemble and tree-based detectors. Includes adversarial attacks with Graph Convolutional Network.

Language: Python - Size: 125 MB - Last synced at: about 2 months ago - Pushed at: 12 months ago - Stars: 855 - Forks: 184

safe-graph/graph-adversarial-learning-literature

A curated list of adversarial attacks and defenses papers on graph-structured data.

Size: 544 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 851 - Forks: 132

thunlp/OpenAttack

An Open-Source Package for Textual Adversarial Attack.

Language: Python - Size: 4.65 MB - Last synced at: about 17 hours ago - Pushed at: almost 2 years ago - Stars: 727 - Forks: 127

fra31/auto-attack

Code relative to "Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks"

Language: Python - Size: 39.7 MB - Last synced at: 10 days ago - Pushed at: 12 months ago - Stars: 694 - Forks: 118

hendrycks/natural-adv-examples

A Harder ImageNet Test Set (CVPR 2021)

Language: Python - Size: 2.1 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 603 - Forks: 52

MadryLab/photoguard

Raising the Cost of Malicious AI-Powered Image Editing

Language: Jupyter Notebook - Size: 17.1 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 589 - Forks: 48

jind11/TextFooler

A Model for Natural Language Attack on Text Classification and Inference

Language: Python - Size: 2.77 MB - Last synced at: 24 days ago - Pushed at: over 2 years ago - Stars: 509 - Forks: 82

thu-ml/ares

A Python library for adversarial machine learning focusing on benchmarking adversarial robustness.

Language: Python - Size: 378 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 502 - Forks: 88

Koukyosyumei/AIJack

Security and Privacy Risk Simulator for Machine Learning (arXiv:2312.17667)

Language: C++ - Size: 152 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 388 - Forks: 63

ChandlerBang/awesome-graph-attack-papers

Adversarial attacks and defenses on Graph Neural Networks.

Size: 90.8 KB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 381 - Forks: 31

deadbits/vigil-llm

⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language Model (LLM) inputs

Language: Python - Size: 548 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 370 - Forks: 41

sarathknv/adversarial-examples-pytorch

Implementation of Papers on Adversarial Examples

Language: Python - Size: 254 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 367 - Forks: 74

agencyenterprise/PromptInject

PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to adversarial prompt attacks. 🏆 Best Paper Awards @ NeurIPS ML Safety Workshop 2022

Language: Python - Size: 222 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 365 - Forks: 36

HuntDownProject/HEDnsExtractor

A suite for hunting suspicious targets, expose domains and phishing discovery

Language: Go - Size: 3.11 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 333 - Forks: 45

natanielruiz/disrupting-deepfakes

🔥🔥Defending Against Deepfakes Using Adversarial Attacks on Conditional Image Translation Networks

Language: Python - Size: 49.2 MB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 326 - Forks: 47

hbaniecki/adversarial-explainable-ai

💡 Adversarial attacks on explanations and how to defend them

Size: 2.62 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 314 - Forks: 48

pumpbin/pumpbin

🎃 PumpBin is an Implant Generation Platform.

Language: Rust - Size: 2.31 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 304 - Forks: 33

ChandlerBang/Pro-GNN

Implementation of the KDD 2020 paper "Graph Structure Learning for Robust Graph Neural Networks"

Language: Python - Size: 9.86 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 293 - Forks: 45

ain-soph/trojanzoo

TrojanZoo provides a universal pytorch platform to conduct security researches (especially backdoor attacks/defenses) of image classification in deep learning.

Language: Python - Size: 15.7 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 288 - Forks: 63

automorphic-ai/aegis

Self-hardening firewall for large language models

Language: Python - Size: 21.5 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 264 - Forks: 6

1Konny/FGSM

Simple pytorch implementation of FGSM and I-FGSM

Language: Python - Size: 14.3 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 250 - Forks: 69

haofanwang/Awesome-Computer-Vision

Awesome Resources for Advanced Computer Vision Topics

Size: 93.8 KB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 230 - Forks: 43

kabkabm/defensegan

Defense-GAN: Protecting Classifiers Against Adversarial Attacks Using Generative Models (published in ICLR2018)

Language: Python - Size: 2.72 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 221 - Forks: 62

VinAIResearch/Anti-DreamBooth

Anti-DreamBooth: Protecting users from personalized text-to-image synthesis (ICCV 2023)

Language: Python - Size: 106 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 220 - Forks: 19

danielzuegner/nettack

Implementation of the paper "Adversarial Attacks on Neural Networks for Graph Data".

Language: Python - Size: 485 KB - Last synced at: 6 months ago - Pushed at: almost 3 years ago - Stars: 218 - Forks: 56

ryderling/DEEPSEC

DEEPSEC: A Uniform Platform for Security Analysis of Deep Learning Model

Language: Python - Size: 172 MB - Last synced at: 23 days ago - Pushed at: almost 6 years ago - Stars: 215 - Forks: 70

The-Z-Labs/bof-launcher

Beacon Object File (BOF) launcher - library for executing BOF files in C/C++/Zig applications

Language: Zig - Size: 790 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 204 - Forks: 16

tao-bai/attack-and-defense-methods

A curated list of papers on adversarial machine learning (adversarial examples and defense methods).

Language: TeX - Size: 17.4 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 197 - Forks: 25

bosch-aisecurity-aishield/watchtower

AIShield Watchtower: Dive Deep into AI's Secrets! 🔍 Open-source tool by AIShield for AI model insights & vulnerability scans. Secure your AI supply chain today! ⚙️🛡️

Language: PureBasic - Size: 21.1 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 191 - Forks: 15

a1600012888/YOPO-You-Only-Propagate-Once

Code for our nips19 paper: You Only Propagate Once: Accelerating Adversarial Training Via Maximal Principle

Language: Python - Size: 770 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 173 - Forks: 30

ashafahi/free_adv_train

Official TensorFlow Implementation of Adversarial Training for Free! which trains robust models at no extra cost compared to natural training.

Language: Python - Size: 48.8 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 170 - Forks: 30

csdongxian/AWP

Codes for NeurIPS 2020 paper "Adversarial Weight Perturbation Helps Robust Generalization"

Language: Python - Size: 372 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 163 - Forks: 22

safreita1/TIGER

Python toolbox to evaluate graph vulnerability and robustness (CIKM 2021)

Language: Python - Size: 22.6 MB - Last synced at: 11 days ago - Pushed at: over 2 years ago - Stars: 158 - Forks: 26

sisinflab/adversarial-recommender-systems-survey

The goal of this survey is two-fold: (i) to present recent advances on adversarial machine learning (AML) for the security of RS (i.e., attacking and defense recommendation models), (ii) to show another successful application of AML in generative adversarial networks (GANs) for generative applications, thanks to their ability for learning (high-dimensional) data distributions. In this survey, we provide an exhaustive literature review of 74 articles published in major RS and ML journals and conferences. This review serves as a reference for the RS community, working on the security of RS or on generative models using GANs to improve their quality.

Size: 203 KB - Last synced at: 10 months ago - Pushed at: about 4 years ago - Stars: 156 - Forks: 32

PKU-YuanGroup/Hallucination-Attack

Attack to induce LLMs within hallucinations

Language: Python - Size: 2.73 MB - Last synced at: 19 days ago - Pushed at: 12 months ago - Stars: 155 - Forks: 19

shangtse/robust-physical-attack

Physical adversarial attack for fooling the Faster R-CNN object detector

Language: Jupyter Notebook - Size: 11.9 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 154 - Forks: 49

jeromerony/adversarial-library

Library containing PyTorch implementations of various adversarial attacks and resources

Language: Python - Size: 201 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 153 - Forks: 20

Harry24k/PGD-pytorch

A pytorch implementation of "Towards Deep Learning Models Resistant to Adversarial Attacks"

Language: Jupyter Notebook - Size: 621 KB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 153 - Forks: 37

danielzuegner/gnn-meta-attack

Implementation of the paper "Adversarial Attacks on Graph Neural Networks via Meta Learning".

Language: Python - Size: 1.15 MB - Last synced at: 6 months ago - Pushed at: over 3 years ago - Stars: 142 - Forks: 26

safellama/plexiglass

A toolkit for detecting and protecting against vulnerabilities in Large Language Models (LLMs).

Language: Python - Size: 20.6 MB - Last synced at: about 14 hours ago - Pushed at: over 1 year ago - Stars: 136 - Forks: 15

OmidPoursaeed/Generative_Adversarial_Perturbations

Generative Adversarial Perturbations (CVPR 2018)

Language: Python - Size: 388 KB - Last synced at: 5 months ago - Pushed at: over 4 years ago - Stars: 136 - Forks: 23

git-disl/TOG

Real-time object detection is one of the key applications of deep neural networks (DNNs) for real-world mission-critical systems. While DNN-powered object detection systems celebrate many life-enriching opportunities, they also open doors for misuse and abuse. This project presents a suite of adversarial objectness gradient attacks, coined as TOG, which can cause the state-of-the-art deep object detection networks to suffer from untargeted random attacks or even targeted attacks with three types of specificity: (1) object-vanishing, (2) object-fabrication, and (3) object-mislabeling. Apart from tailoring an adversarial perturbation for each input image, we further demonstrate TOG as a universal attack, which trains a single adversarial perturbation that can be generalized to effectively craft an unseen input with a negligible attack time cost. Also, we apply TOG as an adversarial patch attack, a form of physical attacks, showing its ability to optimize a visually confined patch filled with malicious patterns, deceiving well-trained object detectors to misbehave purposefully.

Language: Jupyter Notebook - Size: 59 MB - Last synced at: 21 days ago - Pushed at: about 2 years ago - Stars: 135 - Forks: 41

jeromerony/fast_adversarial

Code for the CVPR 2019 article "Decoupling Direction and Norm for Efficient Gradient-Based L2 Adversarial Attacks and Defenses"

Language: Python - Size: 234 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 134 - Forks: 14

nebula-beta/awesome-adversarial-deep-learning

A list of awesome resources for adversarial attack and defense method in deep learning

Size: 150 KB - Last synced at: 12 days ago - Pushed at: over 2 years ago - Stars: 133 - Forks: 11

EdisonLeeeee/RS-Adversarial-Learning

A curated collection of adversarial attack and defense on recommender systems.

Size: 62.5 KB - Last synced at: 8 days ago - Pushed at: about 3 years ago - Stars: 133 - Forks: 7

max-andr/square-attack

Square Attack: a query-efficient black-box adversarial attack via random search [ECCV 2020]

Language: Python - Size: 12.5 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 128 - Forks: 24

declare-lab/dialogue-understanding

This repository contains PyTorch implementation for the baseline models from the paper Utterance-level Dialogue Understanding: An Empirical Study

Language: Python - Size: 216 MB - Last synced at: 28 days ago - Pushed at: about 2 years ago - Stars: 126 - Forks: 21

gmh14/RobNets

[CVPR 2020] When NAS Meets Robustness: In Search of Robust Architectures against Adversarial Attacks

Language: Python - Size: 324 KB - Last synced at: about 12 hours ago - Pushed at: over 4 years ago - Stars: 124 - Forks: 15

wy1iu/DCNets

Implementation for <Decoupled Networks> in CVPR'18.

Language: Python - Size: 479 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 115 - Forks: 31

vita-epfl/s-attack

[CVPR 2025] Official implementation of three papers "Certified Human Trajectory Prediction", "Vehicle trajectory prediction works, but not everywhere", and "Are socially-aware trajectory prediction models really socially-aware?".

Language: Python - Size: 108 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 106 - Forks: 16

ShawnXYang/Face-Robustness-Benchmark

An adversarial robustness evaluation library on face recognition.

Language: Python - Size: 19.5 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 106 - Forks: 15

cuge1995/awesome-3D-point-cloud-attacks

List of state of the art papers, code, and other resources

Size: 38.1 KB - Last synced at: 4 days ago - Pushed at: over 2 years ago - Stars: 106 - Forks: 14

chs20/RobustVLM

[ICML 2024] Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models

Language: Python - Size: 10.2 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 98 - Forks: 3

Eaphan/Robust3DOD

A Comprehensive Study of the Robustness for LiDAR-based 3D Object Detectors against Adversarial Attacks

Language: Python - Size: 1.57 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 96 - Forks: 6

DmitryRyumin/WACV-2024-Papers

WACV 2024 Papers: Discover cutting-edge research from WACV 2024, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ support visual intelligence development!

Language: Python - Size: 7.31 MB - Last synced at: 30 days ago - Pushed at: 8 months ago - Stars: 96 - Forks: 13

iArunava/scratchai

scratchai is a Deep Learning library that aims to store all Deep Learning algorithms. With easy calls to do all the common tasks in AI.

Language: Python - Size: 17.6 MB - Last synced at: 6 months ago - Pushed at: about 1 year ago - Stars: 96 - Forks: 18

reds-lab/Narcissus

The official implementation of the CCS'23 paper, Narcissus clean-label backdoor attack -- only takes THREE images to poison a face recognition dataset in a clean-label way and achieves a 99.89% attack success rate.

Language: Python - Size: 143 KB - Last synced at: 11 months ago - Pushed at: about 2 years ago - Stars: 96 - Forks: 10

THUDM/grb

Graph Robustness Benchmark: A scalable, unified, modular, and reproducible benchmark for evaluating the adversarial robustness of Graph Machine Learning.

Language: Python - Size: 12.7 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 94 - Forks: 17

zqzqz/AdvTrajectoryPrediction

Implementation of CVPR 2022 paper "On Adversarial Robustness of Trajectory Prediction for Autonomous Vehicles" https://arxiv.org/abs/2201.05057

Language: Python - Size: 84.8 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 91 - Forks: 17

chenhongge/StateAdvDRL

[NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"

Size: 4.8 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 91 - Forks: 16

thunlp/SememePSO-Attack

Code and data of the ACL 2020 paper "Word-level Textual Adversarial Attacking as Combinatorial Optimization"

Language: Python - Size: 58.7 MB - Last synced at: 3 days ago - Pushed at: about 4 years ago - Stars: 88 - Forks: 14

nebula-beta/torchadver

A PyTorch Toolbox for creating adversarial examples that fool neural networks.

Language: Python - Size: 38.9 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 88 - Forks: 7

dipanjanS/adversarial-learning-robustness

Contains materials for workshops pertaining to adversarial robustness in deep learning.

Language: Jupyter Notebook - Size: 77.6 MB - Last synced at: 12 days ago - Pushed at: about 4 years ago - Stars: 86 - Forks: 40

EdisonLeeeee/GreatX

A graph reliability toolbox based on PyTorch and PyTorch Geometric (PyG).

Language: Python - Size: 8.14 MB - Last synced at: 7 days ago - Pushed at: 7 months ago - Stars: 85 - Forks: 13

FAKEBOB-adversarial-attack/FAKEBOB

Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems" (IEEE S&P 2021)

Language: Python - Size: 12.1 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 84 - Forks: 24

qilong-zhang/Patch-wise-iterative-attack

Patch-wise iterative attack (accepted by ECCV 2020) to improve the transferability of adversarial examples.

Language: Python - Size: 145 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 84 - Forks: 21

as791/Adversarial-Example-Attack-and-Defense

This repository contains the implementation of three adversarial example attack methods FGSM, IFGSM, MI-FGSM and one Distillation as defense against all attacks using MNIST dataset.

Language: Jupyter Notebook - Size: 1.71 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 84 - Forks: 21

moohax/Proof-Pudding

Copy cat model for Proofpoint

Language: Python - Size: 20.1 MB - Last synced at: 12 days ago - Pushed at: about 5 years ago - Stars: 83 - Forks: 4

AI-secure/InfoBERT

[ICLR 2021] "InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective" by Boxin Wang, Shuohang Wang, Yu Cheng, Zhe Gan, Ruoxi Jia, Bo Li, Jingjing Liu

Language: Python - Size: 72.3 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 81 - Forks: 6

WenRichard/DIAC2019-Adversarial-Attack-Share

DIAC2019基于Adversarial Attack的问题等价性判别比赛

Size: 114 KB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 81 - Forks: 12

392781/FaceOff

Steps towards physical adversarial attacks on facial recognition

Language: Python - Size: 132 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 80 - Forks: 14

neu-autonomy/nfl_veripy

Formal Verification of Neural Feedback Loops (NFLs)

Language: Python - Size: 66.4 MB - Last synced at: 20 days ago - Pushed at: 8 months ago - Stars: 79 - Forks: 15

ForeverPs/Robust-Classification

CVPR 2022 Workshop Robust Classification

Language: Python - Size: 145 MB - Last synced at: 4 months ago - Pushed at: almost 3 years ago - Stars: 79 - Forks: 3

openopt/chop

CHOP: An optimization library based on PyTorch, with applications to adversarial examples and structured neural network training.

Language: Python - Size: 378 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 77 - Forks: 14

hfawaz/ijcnn19attacks

Adversarial Attacks on Deep Neural Networks for Time Series Classification

Language: Jupyter Notebook - Size: 4.77 MB - Last synced at: about 1 month ago - Pushed at: almost 5 years ago - Stars: 77 - Forks: 28

layumi/Awesome-Fools

:skull: A collection of methods to fool the deep neural network :skull:

Size: 41 KB - Last synced at: 8 days ago - Pushed at: 8 months ago - Stars: 76 - Forks: 8

jinzhuoran/RWKU

RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024

Language: Python - Size: 3.82 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 72 - Forks: 7

davide97l/rl-policies-attacks-defenses

Adversarial attacks on Deep Reinforcement Learning (RL)

Language: Jupyter Notebook - Size: 346 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 72 - Forks: 12

OPTML-Group/Diffusion-MU-Attack

The official implementation of ECCV'24 paper "To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images ... For Now". This work introduces one fast and effective attack method to evaluate the harmful-content generation ability of safety-driven unlearned diffusion models.

Language: Python - Size: 11.9 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 71 - Forks: 3