Inspectus allows you to create interactive visualizations of attention matrices with just a few lines of Python code. It’s designed to run smoothly in Jupyter notebooks through an easy-to-use Python API. Inspectus provides multiple views to help you understand language model behaviors. If you have any questions, feel free to ask!
https://neuralblog.github.io/llama3-neurons/neuron_viewer.ht...
Golden Gate Claude - https://news.ycombinator.com/item?id=40459543 - (60 comments, 16 days ago)
Extracting Concepts from GPT-4 - https://news.ycombinator.com/item?id=40599749 (144 comments, 2 days ago)
Inspectus, on the other hand is a general tool to visualize how transformer models pay attention to different parts of the data they process.
For an example if you're working on a Q&A model, you can check which tokens in the prompt contributed to the output. It's possible to detect issues like output not paying attention to any important part of the prompt.
Deleted Comment