GitHub topics: interpretability-and-explainability

Repositories

HennyJie/IBGNN

MICCAI 2022 (Oral): Interpretable Graph Neural Networks for Connectome-Based Brain Disorder Analysis

Language: Python - Size: 890 KB - Last synced at: 3 days ago - Pushed at: about 2 years ago - Stars: 59 - Forks: 7

bishwamittra/nus_thesis

My PhD thesis in NUS. Making it public so that future graduate students may benefit.

Language: TeX - Size: 63.5 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

tail-unica/hopwise

hopwise: A Python Library for Explainable Recommendation based on Path Reasoning over Knowledge Graphs

Language: Python - Size: 19.1 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 4 - Forks: 1

vdlad/Remarkable-Robustness-of-LLMs

Codebase the paper "The Remarkable Robustness of LLMs: Stages of Inference?"

Language: Jupyter Notebook - Size: 4.05 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 18 - Forks: 1

cwangrun/MGProto

[TPAMI 2025] Mixture of Gaussian-distributed Prototypes with Generative Modelling for Interpretable and Trustworthy Image Recognition

Language: Python - Size: 2.5 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 7 - Forks: 0

Trustworthy-ML-Lab/posthoc-generative-cbm

[CVPR 2025] Concept Bottleneck Autoencoder (CB-AE) -- efficiently transform any pretrained (black-box) image generative model into an interpretable generative concept bottleneck model (CBM) with minimal concept supervision, while preserving image quality

Language: Jupyter Notebook - Size: 3 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 7 - Forks: 1

shuyang-dong/Counterfactual_Explanation_for_RL

Counterfactual explanations for continuous reinforcement learning with simulation in T1D and Gym environments.

Language: Python - Size: 33.9 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 1

bgreenwell/ebm

Explainable Boosting Machines

Language: R - Size: 44.5 MB - Last synced at: 24 days ago - Pushed at: 4 months ago - Stars: 3 - Forks: 1

ruizheliUOA/Awesome-Interpretability-in-Large-Language-Models

This repository collects all relevant resources about interpretability in LLMs

Size: 63.5 KB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 343 - Forks: 24

warisgill/TraceFL

TraceFL is a novel mechanism for Federated Learning that achieves interpretability by tracking neuron provenance. It identifies clients responsible for global model predictions, achieving 99% accuracy across diverse datasets (e.g., medical imaging) and neural networks (e.g., GPT).

Language: Python - Size: 3.96 MB - Last synced at: 7 days ago - Pushed at: 8 months ago - Stars: 10 - Forks: 0

Skyyyy0920/FGAI

Language: Python - Size: 1.89 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

liugangcode/GREA

[KDD'22] Source codes of "Graph Rationalization with Environment-based Augmentations"

Language: Python - Size: 145 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 38 - Forks: 6

cwangrun/ST-ProtoPNet

[ICCV 2023] Learning Support and Trivial Prototypes for Interpretable Image Classification

Language: Python - Size: 841 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 20 - Forks: 2

fguzman82/PhD-Thesis

Interpretability: Methods for Identification and Retrieval of Concepts in CNN Networks

Language: Jupyter Notebook - Size: 101 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

MattScicluna/interpretable_tsne

Implementation of the gradient-based t-SNE sttribution method described in our GLBIO oral presentation: 'Towards Computing Attributions for Dimensionality Reduction Techniques'

Language: Python - Size: 7.8 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

Wuyxin/DISC

Official code of "Discover and Cure: Concept-aware Mitigation of Spurious Correlation" (ICML 2023)

Language: Python - Size: 294 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 32 - Forks: 5

goz1985/RST-ARM-GLM_-Research

Work on combining Logit model with an information granulation method for better interpretability

Language: R - Size: 286 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

DimitrisReppas/On_visual_explanation_of_supervised_and_self-supervised_learning

Visualization methods to interpret CNNs and Vision Transformers, trained in a supervised or self-supervised way. The methods are based on CAM or on the attention mechanism of Transformers. The results are evaluated qualitatively and quantitatively.

Language: Python - Size: 14.5 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

WanyuGroup/CVPR2022-OrphicX

Official code for the CVPR 2022 (oral) paper "OrphicX: A Causality-Inspired Latent Variable Model for Interpreting Graph Neural Networks."

Language: Python - Size: 1.94 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 23 - Forks: 10

VictorNico/NNs_from_scratch

Build a Neural net from scratch without keras or pytorch just by using numpy for calculus, pandas for data loading.

Language: Jupyter Notebook - Size: 29.3 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

Related Keywords

interpretability-and-explainability 20 explainable-ai 4 graph-neural-networks 4 machine-learning 4 interpretability 3 mechanistic-interpretability 2 interpretable-deep-learning 2 computer-vision 2 interpretable-ai 2 pytorch 2 image-recognition 1 interpretable-neural-networks 1 cnn 1 dimensionality-reduction 1 research-tool 1 data-augmentation 1 generalization 1 fine-grained-classification 1 rationalization 1 polymer-property-prediction 1 molecular-property-prediction 1 gnns 1 testing 1 software-engineering 1 federated-learning 1 explainability 1 differential-privacy 1 debugging 1 generative-ai 1 simulated-annealing-algorithm 1 pandas 1 oop-principles 1 numpy 1 neural-network 1 mnist 1 gradiantdescent 1 eda 1 docstrings 1 causality 1 transformers 1 supervised-learning 1 self-supervised-learning 1 deep-neural-networks 1 cnns 1 rough-sets 1 glm 1 boosting-ensemble 1 bagging-ensemble 1 stable-diffusion 1 spurious-correlation 1 icml-2023 1 deep-learning 1 concept-bottleneck-models 1 trustworthy-ai 1 prototypical-networks 1 out-of-distribution-detection 1 interpretable-image-recognition 1 intepretable-machine-learning 1 horse-racing-ai 1 generative-modelling 1 game-theory-algorithms 1 ai-safety 1 recommender-systems 1 path-reasoning 1 link-prediction 1 knowledge-graph-embeddings 1 formal-methods 1 fairness 1 miccai2022 1 healthcare 1 brain 1 accountability 1 sparse-autoencoder 1 dictionary-learning 1 xai 1 interpretable-models 1 interpretable-ml 1 interpretable-machine-learning 1 interpretable 1 iml 1 glassbox 1 explainable-ml 1 explainable-machine-learning 1 blackbox 1 ai 1 temporal-logic 1 t1d-simulator 1 reinforcement-learning 1 python 1 openai-gym 1 human-centered-ai 1 counterfactual-explanation 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos