TransformerLens v3 #153

Draft
shayansadeghieh wants to merge 23 commits into hijohnnylin:main from shayansadeghieh:tlens-v3

Conversation

@shayansadeghieh
Contributor

Problem

Brief description of problem being resolved. If there are multiple or subpoints, use bullet points.

Fix

Brief description of fix being applied. If there are multiple or subpoints, use bullet points.

Testing

Brief description of testing that is done or added. If there are multiple or subpoints, use bullet points.

@shayansadeghieh changed the title from "Transformer Lens v3" to "TransformerLens v3" on Aug 23, 2025
@codecov

codecov bot commented Aug 23, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 5.18%. Comparing base (374a059) to head (6cddecd).

Additional details and impacted files
@@           Coverage Diff            @@
##            main    #153      +/-   ##
========================================
- Coverage   7.93%   5.18%   -2.75%     
========================================
  Files        124     104      -20     
  Lines      17502   16035    -1467     
  Branches     382     169     -213     
========================================
- Hits        1388     832     -556     
+ Misses     16103   15203     -900     
+ Partials      11       0      -11     
Flag         Coverage Δ
autointerp   ?
inference    ?
webapp       5.18% <ø> (ø)

Flags with carried forward coverage won't be shown.

Comment on lines -189 to -195
def add_hook_in_to_mlp(mlp):  # type: ignore
    mlp.hook_in = HookPoint()
    original_forward = mlp.forward
    mlp.forward = lambda x: original_forward(mlp.hook_in(x))

for block in model.blocks:
    add_hook_in_to_mlp(block.mlp)
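For reference, the removed snippet monkey-patches each MLP's forward so its input first flows through a freshly attached hook point. A self-contained sketch of that pattern, using toy stand-ins (the `HookPoint` and `ToyMLP` classes below are simplified placeholders, not the real transformer_lens classes):

```python
class HookPoint:
    """Minimal identity module that records the values passing through it."""
    def __init__(self):
        self.captured = []

    def __call__(self, x):
        self.captured.append(x)
        return x


class ToyMLP:
    """Stand-in for an MLP block: just doubles its input."""
    def forward(self, x):
        return x * 2


mlp = ToyMLP()

# The monkey-patch: attach a hook point, then wrap forward so the input
# is routed through the hook before the original computation runs.
mlp.hook_in = HookPoint()
original_forward = mlp.forward
mlp.forward = lambda x: original_forward(mlp.hook_in(x))

print(mlp.forward(3))        # the doubled output
print(mlp.hook_in.captured)  # the inputs seen by the hook
```

The wrapped `forward` behaves identically to the original, but the hook now observes (and could modify) every input; this is why re-applying the patch on a model that already ships a `hook_in` is redundant at best.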
Contributor Author

@shayansadeghieh shayansadeghieh Aug 29, 2025


@hijohnnylin I don't think we need this anymore with tlens >= 3; please correct me if I'm wrong. It is causing infinite loops with model_bridge.

Try running this in a notebook and you'll see that the new version of tlens automatically has an mlp_block.hook_in attribute, so we shouldn't need to add it:

!pip install git+https://github.com/shayansadeghieh/[email protected]
from transformer_lens.model_bridge import TransformerBridge
from transformer_lens import HookedTransformer

print("\n=== NEW: TransformerBridge ===")
new_model = TransformerBridge.boot_transformers("gpt2-small")
new_model.enable_compatibility_mode(disable_warnings=True)

# Test if hook_in exists
mlp_block = new_model.blocks[0].mlp
print(f"MLP has hook_in: {hasattr(mlp_block, 'hook_in')}")
print(f"MLP has hook_out: {hasattr(mlp_block, 'hook_out')}")

# Try to access it
try:
    hook = mlp_block.hook_in
    print(f"hook_in type: {type(hook)}")
    print(f"hook_in name: {hook.name}")
except AttributeError as e:
    print(f"ERROR: {e}")

print("=== OLD: HookedTransformer ===")
old_model = HookedTransformer.from_pretrained("gpt2-small")

# Test if hook_in exists
mlp_block = old_model.blocks[0].mlp
print(f"MLP has hook_in: {hasattr(mlp_block, 'hook_in')}")
print(f"MLP has hook_out: {hasattr(mlp_block, 'hook_out')}")

# Try to access it
try:
    hook = mlp_block.hook_in
    print(f"hook_in type: {type(hook)}")
except AttributeError as e:
    print(f"ERROR: {e}")

