docs: fix typo in calculator function description #10

Closed
dsmedia wants to merge 1 commit into anthropics:master from dsmedia:patch-2

Conversation

dsmedia commented Jun 13, 2024

Corrected a typo in the documentation. Changed "indepent" to "independent" in the description of defining the calculator function.

Before:
"The first step is to define the actual calculator function and make sure it works, indepent of Claude. We'll write a VERY simple function that expects three arguments:\n"

After:
"The first step is to define the actual calculator function and make sure it works, independent of Claude. We'll write a VERY simple function that expects three arguments:\n"

dsmedia closed this by deleting the head repository on Feb 2, 2025
ilonae added a commit to ilonae/courses that referenced this pull request Apr 28, 2026
Adds a new notebook covering practical techniques for evaluating whether LLM explanation traces are faithful to the model's actual reasoning. Directly inspired by Anthropic's 2025 mechanistic interpretability research (circuit tracing / attribution graphs).

Techniques covered:
- Counterfactual probing (remove claimed features, measure output change)
- Motivated reasoning detection (misleading hint experiment)
- SHAP attribution comparison (local DistilBERT as reference model)
- Model-graded reasoning evaluation (second-Claude-as-judge)
- Combined faithfulness scorecard

Handles anthropics#155