GitHub topics: interpretability

Repositories

SeldonIO/alibi

Algorithms for explaining machine learning models

Language: Python - Size: 30.3 MB - Last synced at: 38 minutes ago - Pushed at: 5 days ago - Stars: 2,500 - Forks: 257

jphall663/awesome-machine-learning-interpretability

A curated list of awesome responsible machine learning resources.

Size: 4.45 MB - Last synced at: 44 minutes ago - Pushed at: 27 days ago - Stars: 3,783 - Forks: 599

Dependable-Intelligent-Systems-Lab/xwhy

Explaining black boxes with a SMILE: Statistical Model-agnostic Interpretability with Local Explanations

Language: JavaScript - Size: 24.9 MB - Last synced at: about 9 hours ago - Pushed at: about 10 hours ago - Stars: 10 - Forks: 2

csinva/imodels

Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).

Language: Jupyter Notebook - Size: 162 MB - Last synced at: about 9 hours ago - Pushed at: 2 months ago - Stars: 1,454 - Forks: 124

AdamCoscia/KnowledgeVIS

Visually compare fill-in-the-blank LLM prompts to uncover learned biases and associations!

Language: JavaScript - Size: 6.59 MB - Last synced at: about 14 hours ago - Pushed at: about 15 hours ago - Stars: 8 - Forks: 1

boniolp/kGraph

Graph Embedding for Interpretable Time Series Clustering

Language: Python - Size: 49.4 MB - Last synced at: about 17 hours ago - Pushed at: about 18 hours ago - Stars: 28 - Forks: 1

frgfm/torch-cam

Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-CAM)

Language: Python - Size: 10.1 MB - Last synced at: about 11 hours ago - Pushed at: 2 days ago - Stars: 2,189 - Forks: 220

hijohnnylin/neuronpedia

Open-source interpretability platform 🧠

Language: TypeScript - Size: 10.3 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 114 - Forks: 11

stellargraph/stellargraph

StellarGraph - Machine Learning on Graphs

Language: Python - Size: 92.5 MB - Last synced at: about 2 hours ago - Pushed at: about 1 year ago - Stars: 3,004 - Forks: 434

shap/shap

A game theoretic approach to explain the output of any machine learning model.

Language: Jupyter Notebook - Size: 301 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 23,858 - Forks: 3,359

ndif-team/nnsight

The nnsight package enables interpreting and manipulating the internals of deep learned models.

Language: Jupyter Notebook - Size: 49.8 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 562 - Forks: 50

jacobgil/pytorch-grad-cam

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Language: Python - Size: 134 MB - Last synced at: 2 days ago - Pushed at: about 1 month ago - Stars: 11,610 - Forks: 1,635

pytorch/captum

Model interpretability and understanding for PyTorch

Language: Python - Size: 306 MB - Last synced at: about 15 hours ago - Pushed at: 6 days ago - Stars: 5,209 - Forks: 517

sicara/tf-explain

Interpretability Methods for tf.keras models with Tensorflow 2.x

Language: Python - Size: 931 KB - Last synced at: 1 day ago - Pushed at: 11 months ago - Stars: 1,026 - Forks: 110

poloclub/webshap

JavaScript library to explain any machine learning models anywhere!

Language: TypeScript - Size: 35.5 MB - Last synced at: about 11 hours ago - Pushed at: about 2 years ago - Stars: 56 - Forks: 11

chaoyanghe/Awesome-Federated-Learning

FedML - The Research and Production Integrated Federated Learning Library: https://fedml.ai

Size: 210 KB - Last synced at: about 1 hour ago - Pushed at: over 2 years ago - Stars: 1,957 - Forks: 329

stanfordnlp/axbench

Stanford NLP Python library for benchmarking the utility of LLM interpretability methods

Language: Python - Size: 617 MB - Last synced at: about 2 hours ago - Pushed at: about 2 months ago - Stars: 78 - Forks: 6

boniolp/graphit

Graph-based Time Series Clustering Visualisation Tools

Language: Python - Size: 5.75 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

faded53222/NSWord

RNA Modification Detection using Nanopore Direct RNA Sequencing via improved Transformer

Language: Jupyter Notebook - Size: 19.1 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2 - Forks: 0

stanfordnlp/pyreft

Stanford NLP Python library for Representation Finetuning (ReFT)

Language: Python - Size: 104 MB - Last synced at: about 14 hours ago - Pushed at: 3 months ago - Stars: 1,466 - Forks: 125

alvinwan/neural-backed-decision-trees

Making decision trees competitive with neural networks on CIFAR10, CIFAR100, TinyImagenet200, Imagenet

Language: Python - Size: 2.57 MB - Last synced at: 3 days ago - Pushed at: about 2 years ago - Stars: 621 - Forks: 132

iancovert/sage

For calculating global feature importance using Shapley values.

Language: Python - Size: 7.93 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 270 - Forks: 35

vanderschaarlab/autoprognosis

A system for automating the design of predictive modeling pipelines tailored for clinical prognosis.

Language: Python - Size: 960 KB - Last synced at: 1 day ago - Pushed at: about 2 months ago - Stars: 147 - Forks: 28

google-deepmind/penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Language: Python - Size: 484 MB - Last synced at: about 7 hours ago - Pushed at: 18 days ago - Stars: 1,779 - Forks: 62

g8a9/ferret

A Python package for benchmarking interpretability techniques on Transformers.

Language: Python - Size: 1.52 MB - Last synced at: about 5 hours ago - Pushed at: 8 months ago - Stars: 211 - Forks: 15

SteveKGYang/MentalLLaMA

This repository introduces MentaLLaMA, the first open-source instruction-following large language model for interpretable mental health analysis.

Language: Python - Size: 13.2 MB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 260 - Forks: 27

MAIF/shapash

🔅 Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models

Language: Jupyter Notebook - Size: 61.8 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 2,872 - Forks: 346

IAAR-Shanghai/Awesome-Attention-Heads

An awesome repository and a comprehensive survey on the interpretability of LLM attention heads.

Language: TeX - Size: 6.07 MB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 348 - Forks: 12

ZFancy/awesome-activation-engineering

A curated list of resources for activation engineering

Size: 174 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 69 - Forks: 1

ML4BM-Lab/SENA

Official repository for the SENA-discrepancy-VAE model.

Language: Python - Size: 27.3 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 4 - Forks: 1

interpretml/interpret

Fit interpretable models. Explain blackbox machine learning.

Language: C++ - Size: 14.7 MB - Last synced at: 2 days ago - Pushed at: 19 days ago - Stars: 6,486 - Forks: 746

bartbussmann/BatchTopK

Implementation of the BatchTopK activation function for training sparse autoencoders (SAEs)

Language: Python - Size: 22.5 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 38 - Forks: 6

evandez/neuron-descriptions

Natural Language Descriptions of Deep Visual Features, ICLR 2022

Language: Python - Size: 3.04 MB - Last synced at: 4 days ago - Pushed at: almost 2 years ago - Stars: 65 - Forks: 7

DavidUdell/sparse_circuit_discovery

Circuit discovery in GPT-2 small, using sparse autoencoding

Language: Python - Size: 19.2 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 7 - Forks: 1

nathanwang16/Fractal

Models learn representations, and world patterns. Fractal is beautiful, but not the key.

Language: Jupyter Notebook - Size: 4.47 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

ndif-team/ndif-website

The website for NDIF, the National Deep Inference Fabric

Language: HTML - Size: 39.8 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2 - Forks: 1

EthicalML/xai

XAI - An eXplainability toolbox for machine learning

Language: Python - Size: 17.8 MB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 1,168 - Forks: 179

ndif-team/ndif

The NDIF server, which performs deep inference and serves nnsight requests remotely

Language: Python - Size: 18.6 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 25 - Forks: 7

microsoft/responsible-ai-toolbox

Responsible AI Toolbox is a suite of tools providing model and data exploration and assessment user interfaces and libraries that enable a better understanding of AI systems. These interfaces and libraries empower developers and stakeholders of AI systems to develop and monitor AI more responsibly, and take better data-driven actions.

Language: TypeScript - Size: 111 MB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 1,529 - Forks: 402

google/yggdrasil-decision-forests

A library to train, evaluate, interpret, and productionize decision forest models such as Random Forest and Gradient Boosted Decision Trees.

Language: C++ - Size: 39.5 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 569 - Forks: 60

Oxid15/xai-benchmark

Open and extensible benchmark for XAI methods

Language: Python - Size: 1.78 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 6 - Forks: 0

wangyongjie-ntu/Awesome-explainable-AI

A collection of research materials on explainable AI/ML

Language: Markdown - Size: 1.93 MB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 1,494 - Forks: 203

pietrobarbiero/pytorch_explain

PyTorch Explain: Interpretable Deep Learning in Python.

Language: Jupyter Notebook - Size: 42.1 MB - Last synced at: 3 days ago - Pushed at: 12 months ago - Stars: 154 - Forks: 14

EthicalML/awesome-production-machine-learning

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

Size: 2.36 MB - Last synced at: 7 days ago - Pushed at: 10 days ago - Stars: 18,412 - Forks: 2,342

tensorflow/decision-forests

A collection of state-of-the-art algorithms for the training, serving and interpretation of Decision Forest models in Keras.

Language: Python - Size: 5.87 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 680 - Forks: 113

ModelOriented/hstats

Friedman's H-statistics

Language: R - Size: 217 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 30 - Forks: 1

understandable-machine-intelligence-lab/Quantus

Quantus is an eXplainable AI toolkit for responsible evaluation of neural network explanations

Language: Jupyter Notebook - Size: 147 MB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 598 - Forks: 77

ModelOriented/DALEX

moDel Agnostic Language for Exploration and eXplanation

Language: Python - Size: 798 MB - Last synced at: 4 days ago - Pushed at: 3 months ago - Stars: 1,420 - Forks: 168

tensorflow/tcav

Code for the TCAV ML interpretability project

Language: Jupyter Notebook - Size: 625 KB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 640 - Forks: 152

mmschlk/shapiq

Shapley Interactions and Shapley Values for Machine Learning

Language: Python - Size: 309 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 514 - Forks: 34

jorge-martinez-gil/graphcodebert-interpretability

Augmenting the Interpretability of GraphCodeBERT for Code Similarity Tasks

Language: Python - Size: 9.71 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 5 - Forks: 0

chr5tphr/zennit

Zennit is a high-level framework in Python using PyTorch for explaining/exploring neural networks using attribution methods like LRP.

Language: Python - Size: 2.28 MB - Last synced at: 2 days ago - Pushed at: 10 months ago - Stars: 225 - Forks: 34

deel-ai/xplique

👋 Xplique is a Neural Networks Explainability Toolbox

Language: Python - Size: 33.4 MB - Last synced at: about 13 hours ago - Pushed at: 7 months ago - Stars: 688 - Forks: 58

stanfordnlp/pyvene

Stanford NLP Python library for understanding and improving PyTorch models via interventions

Language: Python - Size: 25.4 MB - Last synced at: about 14 hours ago - Pushed at: 13 days ago - Stars: 740 - Forks: 82

jasonjmcghee/livelove

Love2D LSP (VS Code / Neovim / Zed / etc.) extension for live coding and live variable tracking

Language: JavaScript - Size: 5.33 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 130 - Forks: 2

rmovva/HypotheSAEs

Hypothesizing interpretable relationships in text datasets using sparse autoencoders.

Language: Jupyter Notebook - Size: 11.5 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 24 - Forks: 2

bgreenwell/ebm

Explainable Boosting Machines

Language: R - Size: 44.5 MB - Last synced at: 6 days ago - Pushed at: 2 months ago - Stars: 3 - Forks: 1

KempnerInstitute/overcomplete

👋 Overcomplete is a Vision-based SAE Toolbox

Language: Python - Size: 57.2 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 53 - Forks: 1

rigvedrs/YOLO-V11-CAM

Wanna know what your model sees? Here's a package for applying EigenCAM and generating heatmaps from the new YOLO V11 model

Language: Jupyter Notebook - Size: 40 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 201 - Forks: 42

Sorades/CLAT

[TMI 2024] Code for "Concept-based Lesion Aware Transformer for Interpretable Retinal Disease Diagnosis"

Language: Python - Size: 617 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 19 - Forks: 0

ChicagoHAI/hypothesis-generation

This is the official repository for HypoGeniC (Hypothesis Generation in Context) and HypoRefine, which are automated, data-driven tools that leverage large language models to generate hypotheses for open-domain research.

Language: Python - Size: 121 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 65 - Forks: 8

microsoft/automated-brain-explanations

Generating and validating natural-language explanations for the brain.

Language: Jupyter Notebook - Size: 1.06 GB - Last synced at: 1 day ago - Pushed at: about 2 months ago - Stars: 52 - Forks: 6

poloclub/timbertrek

Explore and compare 1K+ accurate decision trees in your browser!

Language: TypeScript - Size: 36.9 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 161 - Forks: 10

tensorflow/lucid 📦

A collection of infrastructure and tools for research in neural network interpretability.

Language: Jupyter Notebook - Size: 141 MB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 4,694 - Forks: 654

PKU-Alignment/aligner

[NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct

Language: Python - Size: 16.3 MB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 170 - Forks: 8

JoaoLages/diffusers-interpret

Diffusers-Interpret 🤗🧨🕵️‍♀️: Model explainability for 🤗 Diffusers. Get explanations for your generated images.

Language: Jupyter Notebook - Size: 77.5 MB - Last synced at: about 9 hours ago - Pushed at: over 2 years ago - Stars: 276 - Forks: 14

trustyai-explainability/trustyai-explainability

TrustyAI Explainability Toolkit

Language: Java - Size: 19 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 39 - Forks: 42

JHoelli/Awesome-Time-Series-Explainability

A list of (post-hoc) XAI for time series

Size: 424 KB - Last synced at: 13 days ago - Pushed at: 8 months ago - Stars: 134 - Forks: 16

OpenMOSS/Language-Model-SAEs

For OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research.

Language: Python - Size: 10.2 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 113 - Forks: 13

dobriban/Principles-of-AI-LLMs

Materials for the course Principles of AI: LLMs at UPenn (Stat 9911, Spring 2025). LLM architectures, training paradigms (pre- and post-training, alignment), test-time computation, reasoning, safety and robustness (jailbreaking, oversight, uncertainty), representations, interpretability (circuits), etc.

Size: 188 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 31 - Forks: 2