An open API service providing repository metadata for many open-source software ecosystems.

Topic: "causal-intervention"

explanare/ravel

Evaluate interpretability methods on localizing and disentangling concepts in LLMs.

Language: Python - Size: 661 KB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 43 - Forks: 7

YangLiu9208/CausalVLR

CausalVLR: A Toolbox and Benchmark for Visual-Linguistic Causal Reasoning

Size: 37.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 15 - Forks: 4

explanare/verbatim-memorization

Demystifying Verbatim Memorization in Large Language Models

Language: Python - Size: 428 KB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 4 - Forks: 2

explanare/eval-neuron-explanation

A framework for evaluating auto-interp pipelines, i.e., natural language explanations of neurons.

Language: Python - Size: 495 KB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

luka-group/Causal-View-of-Entity-Bias

[EMNLP 2023] A Causal View of Entity Bias in (Large) Language Models

Language: Python - Size: 385 KB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

explanare/char-iit

A causal intervention framework to learn robust and interpretable character representations inside subword-based language models

Language: Jupyter Notebook - Size: 23.5 MB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0