Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

"Algebraic value editing" raises exception #82

Open
alexander-turner opened this issue Jul 12, 2023 · 1 comment
Open

"Algebraic value editing" raises exception #82

alexander-turner opened this issue Jul 12, 2023 · 1 comment

Comments

@alexander-turner
Copy link

On default settings, I selected the AVE analysis, and got the following:

AssertionError: This app has encountered an error. The original error message is redacted to prevent data leaks. Full error details have been recorded in the logs (if you're on Streamlit Cloud, click on 'Manage app' in the lower right of your app).
Traceback:

File "/home/appuser/venv/lib/python3.9/site-packages/streamlit/runtime/scriptrunner/script_runner.py", line 565, in _run_script
    exec(code, module.__dict__)
File "/app/decisiontransformerinterpretability/app.py", line 193, in <module>
    show_algebraic_value_editing(dt, logit_dir=logit_dir, original_cache=cache)
File "/app/decisiontransformerinterpretability/src/streamlit_app/causal_analysis_components.py", line 780, in show_algebraic_value_editing
    corrupted_tokens = get_corrupted_tokens(dt, key="avec")
File "/app/decisiontransformerinterpretability/src/streamlit_app/causal_analysis_components.py", line 410, in get_corrupted_tokens
    corrupted_tokens = get_modified_tokens_from_app_state(
File "/app/decisiontransformerinterpretability/src/streamlit_app/environment.py", line 147, in get_modified_tokens_from_app_state
    assert not torch.all(
@jbloomAus
Copy link
Owner

Ahh, this is the issue where I default to a value that is useful half the time and breaks the other half (setting the instruction to key, but if it's already a key, then corrupt and clean are the same). You can't see the error because streamlit hides errors. I'll update default to something else.

Haven't iterated on AVE stuff recently (it's still only residual stream level). Was hoping to compare to more well established techniques. Have some thoughts on this but it's ongoing.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants