causal-intervention | Topic | Ecosyste.ms: Repos

Topic: "causal-intervention"

Evaluate interpretability methods on localizing and disentangling concepts in LLMs.

Language: Python - Size: 661 KB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 43 - Forks: 7

CausalVLR: A Toolbox and Benchmark for Visual-Linguistic Causal Reasoning

Size: 37.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 15 - Forks: 4

Demystifying Verbatim Memorization in Large Language Models

Language: Python - Size: 428 KB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 4 - Forks: 2

A framework for evaluating auto-interp pipelines, i.e., natural language explanations of neurons.

Language: Python - Size: 495 KB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

[EMNLP 2023] A Causal View of Entity Bias in (Large) Language Models

Language: Python - Size: 385 KB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

A causal intervention framework to learn robust and interpretable character representations inside subword-based language models

Language: Jupyter Notebook - Size: 23.5 MB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0