Fit interpretable models. Explain blackbox machine learning.
Model interpretability and understanding for PyTorch
TrustyAI Explainability Toolkit
Sparse Autoencoder (SAE) research code from the OpenMOSS Mechanistic Interpretability Team.
A game theoretic approach to explain the output of any machine learning model.
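The game-theoretic idea behind such SHAP-style explainers can be sketched in pure Python: each feature's attribution is its Shapley value, the average marginal contribution of that feature across all orderings in which features are revealed to the model. A minimal illustrative sketch follows; the toy value function `f` and the feature names are hypothetical, and real SHAP libraries approximate this computation rather than enumerating every ordering.

```python
from itertools import permutations

def f(features_present):
    """Toy 'model': maps a set of present features to a prediction.

    Hypothetical values: feature "a" adds 2, "b" adds 1, and their
    interaction adds 1 more when both are present.
    """
    v = 0.0
    if "a" in features_present:
        v += 2.0
    if "b" in features_present:
        v += 1.0
    if "a" in features_present and "b" in features_present:
        v += 1.0  # interaction term, to be split between a and b
    return v

def shapley_values(players, value_fn):
    """Exact Shapley values: average marginal contribution over all orderings."""
    contrib = {p: 0.0 for p in players}
    orderings = list(permutations(players))
    for order in orderings:
        present = set()
        for p in order:
            before = value_fn(present)
            present.add(p)
            contrib[p] += value_fn(present) - before
    return {p: c / len(orderings) for p, c in contrib.items()}

phi = shapley_values(["a", "b"], f)
print(phi)  # the 1.0 interaction is split evenly: a -> 2.5, b -> 1.5
```

Note the efficiency property: the attributions sum exactly to `f({"a", "b"})`, which is why Shapley values are attractive for explaining individual predictions. Exact enumeration is factorial in the number of features, which is why practical libraries rely on sampling or model-specific shortcuts.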
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning models.
An attribution library for LLMs
A JAX research toolkit for building, editing, and visualizing neural networks.
Responsible AI Toolbox is a suite of user interfaces and libraries for model and data exploration and assessment, enabling a better understanding of AI systems and empowering developers and stakeholders to develop and monitor AI more responsibly and to take better data-driven actions.
The website for NDIF, the National Deep Inference Fabric
For calculating global feature importance using Shapley values.
PyTorch library to compare similarity between NN representations
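A common metric for comparing representations across networks is linear CKA (centered kernel alignment). The sketch below is an illustrative pure-Python implementation of that metric, not the API of the library listed above; the toy matrices `X` and `Y` (rows = examples, columns = features) are hypothetical.

```python
def _center(X):
    """Subtract each column's mean (rows = examples, cols = features)."""
    n = len(X)
    means = [sum(row[j] for row in X) / n for j in range(len(X[0]))]
    return [[x - means[j] for j, x in enumerate(row)] for row in X]

def _gram(A, B):
    """Compute A^T B as a nested list."""
    n, p, q = len(A), len(A[0]), len(B[0])
    return [[sum(A[k][i] * B[k][j] for k in range(n)) for j in range(q)]
            for i in range(p)]

def _fro(M):
    """Frobenius norm of a nested-list matrix."""
    return sum(x * x for row in M for x in row) ** 0.5

def linear_cka(X, Y):
    """Linear CKA = ||Y^T X||_F^2 / (||X^T X||_F * ||Y^T Y||_F), after centering."""
    Xc, Yc = _center(X), _center(Y)
    num = _fro(_gram(Yc, Xc)) ** 2
    den = _fro(_gram(Xc, Xc)) * _fro(_gram(Yc, Yc))
    return num / den

# Two toy "representations" of four examples with two features each:
X = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]]
Y = [[2.0, 0.1], [0.1, 2.0], [2.1, 2.1], [0.0, 0.0]]
print(linear_cka(X, X))  # identical representations score 1.0
print(linear_cka(X, Y))  # a similar representation scores close to 1.0
```

Linear CKA lies in [0, 1] by the Cauchy-Schwarz inequality and is invariant to orthogonal transformations and isotropic scaling of either representation, which is what makes it suitable for comparing layers of different networks.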
A library to train, evaluate, interpret, and productionize decision forest models such as Random Forest and Gradient Boosted Decision Trees.
ReFT: Representation Finetuning for Language Models
Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions
Implementation of Interpretable and Editable Programmatic Tree Policies for Reinforcement Learning (Kohler, Delfosse, et al., 2024).
Implementations of different SHAP algorithms
Evaluate interpretability methods on localizing and disentangling concepts in LLMs.
👋 Xplique is a Neural Networks Explainability Toolbox
Robust multimodal image registration via keypoints