- BauKit
- light, simple, and well loved
-
- uses jaxtyping, aliases models into a common interface, not as huggingface compatible as other libs
- an extremely opinionated toolkit for doing whatever you want to specific models
- Tuned Lens
- tools for looking at how transformer predictions are built layer-by-layer
- vgel/repeng - A library for making RepE control vectors
-
-
- aims to eventually be as simple as BauKit, and to support remote mech interp. HuggingFace compatible
-
- penzai
- jax-based, not HuggingFace-native
- ViT-Prisma
- mechanistic interpretability for vision and video transformers
- Transformer Debugger (OpenAI)
- not HuggingFace-native
- Graphpatch
- promising but abandoned
- cupbearer - A library for mechanistic anomaly detection
- Overcomplete
- vision SAE toolbox
- vLLM-Hook
- program internal states of vLLM-served models
- vllm-lens
- extract residual stream activations and apply steering vectors in vLLM
- Neuronpedia - public feature/neuron browser
- Docent - interactive model explanation and steering interface
- captum
- inseq
- Explabox (2022)
- IBM: AIX360 (2019)
- Microsoft: Responsible AI Toolbox (2021)
- Dashboard that integrates: Error analysis, Fairlearn, InterpretML, DiCE, EconML and Data Balance
- InterpretML
- SHAP, Mimic and LIME explainers. Permutation feature importance.
- MI2.ai
- DrWhy (2019)
- DALEX, survex, Arena, fairmodels
- Currently working on: ARES, xSurvival, Large Model Analysis
- XAI (2018)
- ELI5
- NN-SVG
- Neptune-AI blog
- AI Ethics tool landscape
See this lit review of Adapter intervention types
TODO format https://github.com/vgel/repeng https://github.com/IBM/AISteer360 https://github.com/wassname/ssteer-eval-aware https://github.com/IBM/activation-steering https://github.com/chili-lab/Spherical-Steering https://github.com/safety-research/weight-steering
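The repositories above all deal with steering a model by editing its activations or weights. As a rough illustration of the basic idea only, here is a toy sketch that builds a steering direction as a difference of means and adds it to a hidden state. The function names (`mean_difference`, `steer`) and the 3-dimensional toy vectors are made up for this note; none of this is the API of any linked repository, and those projects may use other methods to find the direction.

```python
from typing import List

Vector = List[float]


def mean_difference(pos: List[Vector], neg: List[Vector]) -> Vector:
    """Steering direction = mean(pos activations) - mean(neg activations)."""
    dim = len(pos[0])
    mean_pos = [sum(v[i] for v in pos) / len(pos) for i in range(dim)]
    mean_neg = [sum(v[i] for v in neg) / len(neg) for i in range(dim)]
    return [p - n for p, n in zip(mean_pos, mean_neg)]


def steer(hidden: Vector, direction: Vector, strength: float) -> Vector:
    """Add the scaled direction to a single hidden-state vector."""
    return [h + strength * d for h, d in zip(hidden, direction)]


# Toy 3-dimensional "activations" standing in for real residual stream values
# (assumed data, for illustration only).
positive = [[1.0, 0.0, 2.0], [3.0, 0.0, 2.0]]
negative = [[0.0, 0.0, 2.0], [0.0, 2.0, 2.0]]

direction = mean_difference(positive, negative)
steered = steer([0.5, 0.5, 0.5], direction, strength=2.0)
```

In a real setup the vectors would come from a chosen layer of the model on contrasting prompts, and `steer` would run inside a forward hook at that layer.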
- jsonformer
- doesn't support enums. HuggingFace only
- prob_jsonformer - Jsonformer, but it can output the probability of each choice in a single pass. Supports enums
- outlines
- Microsoft Guidance
- lmql.ai
- llama.cpp grammar
- langchain output_parsers
- salute - typescript
- TypeChat - typescript
- guardrails
- clownfish - 2023 Modifying Transformers to Follow a JSON Schema - not updated
- relm - 2023 Regular Expression engine for Language Models - not updated
- Constrained-Text-Generation-Studio
- kor
- lm-format-enforcer - remote APIs
- instructor - for remote APIs without logits
- Promptify
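Several of the tools above that have access to logits constrain generation by restricting the next-token distribution to the values a schema allows. A toy sketch of that idea for an enum field, assuming each choice is a single token (real enums usually span several tokens, and real libraries handle that): mask out everything that is not allowed, then renormalize. The function name `choice_probabilities` and the toy logits are made up for this note and are not taken from any library listed here.

```python
import math
from typing import Dict, List


def choice_probabilities(logits: Dict[str, float], allowed: List[str]) -> Dict[str, float]:
    """Softmax restricted to `allowed`; every other token is dropped."""
    kept = {tok: logits[tok] for tok in allowed if tok in logits}
    if not kept:
        raise ValueError("none of the allowed choices appear in the logits")
    peak = max(kept.values())  # subtract the max for numerical stability
    weights = {tok: math.exp(val - peak) for tok, val in kept.items()}
    total = sum(weights.values())
    return {tok: w / total for tok, w in weights.items()}


# Toy next-token logits; a real model would score its whole vocabulary.
toy_logits = {"red": 2.0, "green": 1.0, "blue": 0.0, "banana": 5.0}
probs = choice_probabilities(toy_logits, allowed=["red", "green", "blue"])
best = max(probs, key=probs.get)
```

Here `banana` has the highest raw logit but can never be selected, because it is not among the allowed choices. Tools that only see a remote API without logits cannot do this masking and have to validate or retry on the returned text instead.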