Topic: "ai-interpretability"
kou-saki/i-asked-it-to-forget
I Asked It to Forget, but It Didn't — A Case of Miscommunication Between AI and Humans
Size: 25.4 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Wondermongering/LinguisticPerturber
Probing linguistic robustness in transformers: a quantum-inspired approach to AI interpretability
Language: Python - Size: 14.6 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

AlexTMjugador/redwoodresearch-interp-docker
📦 Redwood Research's transformer interpretability tools, conveniently packaged in a Docker container for simple and reproducible deployments.
Language: Dockerfile - Size: 5.86 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0
