Skip to content

For Manual Testers

github-actions[bot] edited this page Jun 1, 2026 · 4 revisions

Canary for Manual Testers

You've spent years finding bugs, writing test cases, and knowing exactly what "this should do X but it does Y" looks like in practice. That skill set is exactly what Canary needs from you.

Canary handles the coding. You handle the thinking.

The Mental Model

In manual testing, you write test cases: a description of the steps to take and the expected outcome. In Canary, you do the same thing — you just describe it in plain English, and Canary writes the code.

Manual Testing With Canary
Write a test case in a spreadsheet Describe the test in a prompt
Execute the test by hand Canary generates and can run it
Note "Expected: 200, Actual: 500" Canary asserts the expected outcome
File a bug if it fails Canary flags the failure for you

You don't need to know Python, JavaScript, or any testing framework to use Canary. If you can describe what you're testing clearly, Canary can write the test.

Your First Test — Step by Step

Let's say you want to test that a user can log in successfully. Here's how you'd think through it:

  1. What are you testing? The login feature.
  2. What's the action? Submitting valid credentials.
  3. What's the expected outcome? The user is redirected to the dashboard.

Turn that into a prompt:

Test that submitting valid credentials on the login page
redirects the user to the dashboard

Run it — in Claude Code:

/canary-write-test  Test that submitting valid credentials on the
login page redirects the user to the dashboard

Canary classifies this as an end-to-end UI test and generates a Playwright test for you.

Describing Tests You Know Well

You've probably tested dozens of scenarios by hand. Pick one you know inside and out — that familiarity makes you the best person to prompt Canary on it, because you already know:

  • What the expected behavior is
  • What a failure looks like
  • What edge cases exist

For example, a manual tester who has tested a search feature hundreds of times can prompt:

Test that searching for an empty string shows a "no results"
message rather than an error

That's a much stronger prompt than a developer who's never clicked through the search feature would write.

Types of Tests Canary Can Generate

You don't need to know what these mean technically — just pick the one that sounds closest to what you're testing:

  • End-to-end (E2E): Testing a user flow in a browser. "A user goes to X, clicks Y, sees Z."
  • API: Testing what a server returns when called. "When we send this request, we expect this response."
  • Unit: Testing a single function or calculation. "Given this input, the output should be X."
  • Performance/Load: Testing how the system handles traffic. "Under N users, the page should load in under X seconds."

If you're not sure which type applies, just describe what you want to test. Canary will classify it — and if it guesses wrong, you can tell it which type you meant and try again.

What to Do with the Generated Test

Canary writes the test; you review it. Here's a simple checklist:

  1. Does it test what I asked? Read the assertions — do they match the expected behavior you described?
  2. Are there any placeholder values? Look for things like "YOUR_URL_HERE" or "TODO" — fill those in.
  3. Does it pass? Run it. If it fails, check whether the failure is in the test or in the feature being tested.
  4. Is it worth keeping? If yes, promote it to the committed suite. If not, discard it and regenerate with a better prompt.

You don't need to understand every line of the generated code to evaluate whether the test is correct. Read the assertions (the parts that say "expect X to equal Y") — those are the parts that matter.

Tips from Manual Testing That Apply Here

  • Be specific. "Test the login page" is too vague. "Test that logging in with an expired password shows an error message" is much better.
  • Test one thing at a time. Manual test cases that cover one scenario are better than tests that cover five. The same is true for Canary prompts.
  • Include the expected outcome. "The user sees the dashboard" or "the API returns 201 with the order ID" — Canary uses this to write the assertion.
  • Edge cases matter. If you've found a bug in a weird edge case before, that's exactly the kind of test worth generating. Those tests are the most valuable to have automated.

When Canary Gets It Wrong

Canary sometimes misclassifies a test type or generates a test that's close but not quite right. This is normal — it's a first draft, not a final product.

If the type is wrong, tell it explicitly:

/canary-write-test  Test that the cart persists after page refresh
-- this is an end-to-end browser test

If the generated test is almost right but misses something, either adjust your prompt and regenerate, or (for small fixes) edit the file directly before promoting it.

See Writing Good Prompts for detailed guidance on getting it right the first time.

Using Canary Outside a Project Repo

You don't need to be inside a test repo to use Canary's skills. Some skills — like login helpers or environment guides — are useful anywhere: in the Claude web extension, in a scratch directory, or during an exploratory testing session.

Installing a skill globally

Ask your team lead or SDET for the skill directory, then install it once:

mkdir -p ~/.canary/skills/my-skill
cp /path/to/SKILL.md ~/.canary/skills/my-skill/

After this, Claude can reference the skill from anywhere — including the Claude web extension — without you needing to open a terminal or navigate to a repo.

Example: login helper

If your team has a login-helper skill installed globally, you can ask Claude from any context:

"Help me log in as a test user on stage"

Claude will walk you through the HealthSafe ID flow step by step, including which OTP code to enter and what to do if the T&C or passkey pages appear. No extension, no spreadsheet, no guessing.

You're Not Replacing Yourself

Generating a test with Canary still requires you to:

  • Know what the correct behavior is
  • Review whether the test actually checks that behavior
  • Decide if the test is worth keeping

Canary speeds up the work of writing the code. It doesn't replace the judgment of someone who understands the product. That's you.