docs: fix typo in calculator function description by dsmedia · Pull Request #10 · anthropics/courses

dsmedia · 2024-06-13T01:47:33Z

Corrected a typo in the documentation. Changed "indepent" to "independent" in the description of defining the calculator function.

Before:
"The first step is to define the actual calculator function and make sure it works, indepent of Claude. We'll write a VERY simple function that expects three arguments:\n"

After:
"The first step is to define the actual calculator function and make sure it works, independent of Claude. We'll write a VERY simple function that expects three arguments:\n"


Adds a new notebook covering practical techniques for evaluating whether LLM explanation traces are faithful to the model's actual reasoning. Directly inspired by Anthropic's 2025 mechanistic interpretability research (circuit tracing / attribution graphs).

Techniques covered:
- Counterfactual probing (remove claimed features, measure output change)
- Motivated reasoning detection (misleading hint experiment)
- SHAP attribution comparison (local DistilBERT as reference model)
- Model-graded reasoning evaluation (second-Claude-as-judge)
- Combined faithfulness scorecard

Handles anthropics#155

dsmedia closed this by deleting the head repository Feb 2, 2025

dsmedia wants to merge 1 commit into anthropics:master from dsmedia:patch-2
