GitHub topics: harmful
HKU-TASR/Sanitizer
[EuroS&P 2025] Sanitizer is a server-side method that ensures client-embedded backdoors can only be used for contribution demonstration in federated learning but not be triggered on natural queries in harmful ways.
Language: Python - Size: 78 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3 - Forks: 0

git-disl/awesome_LLM-harmful-fine-tuning-papers
A survey on harmful fine-tuning attack for large language model
Size: 5.64 MB - Last synced at: 8 days ago - Pushed at: 17 days ago - Stars: 157 - Forks: 4

git-disl/Booster
This is the official code for the paper "Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturbation" (ICLR2025).
Language: Shell - Size: 293 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 19 - Forks: 0

git-disl/Virus
This is the official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation"
Language: Python - Size: 90.3 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 41 - Forks: 2

QiBowen2008/WindowsVirusCodes
94种病毒的源代码
Language: Pascal - Size: 1.26 MB - Last synced at: 22 days ago - Pushed at: 9 months ago - Stars: 2 - Forks: 1

git-disl/Vaccine
This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024)
Language: Shell - Size: 730 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 19 - Forks: 0

git-disl/Lisa
This is the official code for the paper "Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning"
Language: Python - Size: 46.9 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 5 - Forks: 0

alessandromonolo/Classification-Of-Days-Exceeding-The-PM-Threshold
The research mainly aims to identify through classification algorithms if one day, based on its climatic features and concentrations of harmful elements in the air, it turns out to be harmful (or not) to the health of citizens in the Milan metropolis. A second prediction model was adopted to predict daily mean PM2.5 values.
Language: Jupyter Notebook - Size: 5.82 MB - Last synced at: 2 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

HalilDeniz/44-PythonRansom Fork of atilsamancioglu/44-PythonRansom
Educational Ransomware Simulation
Language: Python - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

almaas-izdihar/warmingup
your device may get warm while visiting this site
Language: HTML - Size: 7.81 KB - Last synced at: 9 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

bogdzn/very-bad-scripts
run at your own risk
Language: Shell - Size: 1000 Bytes - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

oniraf/pol
ruby unbounded pooling
Language: Ruby - Size: 8.79 KB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

oniraf/rimc
dead simple and stupid ruby in memory cache
Language: Ruby - Size: 14.6 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0
