attribution-graphs

Here are 2 public repositories matching this topic...

circuits-research / CLT-Forge

A Mechanistic Interpretability Toolkit for Cross-Layer Transcoder Training and Attribution-Graph Visualization

transcoder visual-interface mechanistic-interpretability ai-interpretability attribution-graphs auto-interpretability cross-layer-transcoder transformer-circuits

Updated Mar 18, 2026
Python

peppinob-ol / attribution-graph-probing

Star

Automates attribution-graph analysis via probe prompting: circuit-trace a prompt, auto-generate concept probes, profile feature activations, cluster supernodes.

graph-analysis sparse-autoencoders mechanistic-interpretability llm-interpretability research-tooling circuit-tracing attribution-graphs probe-prompting prompt-probing neuronpedia feature-activation supernodes cross-layer-transcoder

Updated Mar 19, 2026
Python

Improve this page

Add a description, image, and links to the attribution-graphs topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the attribution-graphs topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly