Improve Speaker Identification docs: clearer overview, decision guide, and page split by devin-ai-integration[bot] · Pull Request #778 · AssemblyAI/assemblyai-api-spec

devin-ai-integration · 2026-03-17T19:36:09Z

Summary

Documentation-only changes to the Speaker Identification page (fern/pages/speech-understanding/speech-understanding.mdx) to improve clarity and scannability, plus a structural refactor to reduce code duplication by splitting Method 1 and Method 2 into separate pages.

Clarity improvements (commit 1)

Overview rewrite: Leads with the value proposition ("Replace generic labels with real names or roles, no voice enrollment needed") instead of a feature description.
Prerequisite callout upgrade: Changed the Speaker Diarization dependency from a <Note> to a <Warning> with an explicit speaker_labels: true instruction, making it harder to miss.
known_values vs speakers surfaced earlier: Added a new "Choosing how to identify speakers" subsection in the Overview that explains the two approaches upfront, rather than burying the distinction in the Advanced Usage section.
Decision guide: Added a quick "name vs role" and "when to use speakers with descriptions" guide so users can pick the right approach at a glance.
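
As a rough illustration of the name vs role choice described above, the sketch below builds one request body per approach. The keys `speaker_labels`, `speech_understanding.request.speaker_identification`, `speaker_type`, and `known_values` are taken from this PR description. The `audio_url` key, the `"role"` value for `speaker_type`, the helper name `build_request`, and all example values are assumptions for illustration only; the docs pages remain the authority on the exact schema.

```python
import json


def build_request(audio_url, speaker_type, known_values):
    """Build an illustrative request body with Speaker Identification enabled."""
    if speaker_type not in ("name", "role"):
        raise ValueError("speaker_type must be 'name' or 'role'")
    return {
        "audio_url": audio_url,  # assumed key name, not stated in this PR
        "speaker_labels": True,  # hard prerequisite called out in the <Warning>
        "speech_understanding": {
            "request": {
                "speaker_identification": {
                    "speaker_type": speaker_type,
                    "known_values": list(known_values),
                }
            }
        },
    }


# Hypothetical example values: real names vs. roles for the same audio file.
by_name = build_request("https://example.com/interview.mp3", "name", ["Alice Chen", "Bob Ortiz"])
by_role = build_request("https://example.com/interview.mp3", "role", ["Interviewer", "Interviewee"])

print(json.dumps(by_role, indent=2))
```

Keeping `speaker_labels` hard-coded to `True` in the helper mirrors the prerequisite callout: a caller cannot forget it.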

Page split (commit 2)

Main page now shows only Method 1 (transcribe + identify in one request). All Method 2 code examples, curl snippets, and dual-method Advanced Usage snippets have been removed from the main page.
New page created: speaker-identification-existing-transcript.mdx — "Using Speaker Identification on an existing transcript" — contains the Method 2 full examples (Python + JS), its own Advanced Usage section, and the Method 2 API reference (curl).
Cross-links added: The main page links to the new page via a <Note> callout in the "How to use" section. The new page links back to the main page for the shared request parameters and response reference tables.
docs.yml updated: Speaker Identification changed from a page to a section with the new page as a child, registered at slug speaker-identification-existing-transcript.

Content refinements (commits 3–5)

"Output format details" section removed from the main page — it was redundant with the before/after already shown in the Overview.
"Identify by name" and "Identify by role" subsections added under "How to use" on both pages — each has a brief description and full Python + JS tabbed examples. Previously the code examples had no name/role subsection headings, and role-based identification was only covered in Advanced Usage.
"Advanced usage" renamed to "Adding speaker metadata" on both pages — promoted to a ## heading, with the former ### sub-heading ("Adding speaker metadata with speakers") removed. The #### Simple usage and #### Advanced usage sub-headings within are also flattened into flowing prose, making the section more scannable.
Common role combinations list added after the "Identify by role" examples on both pages.
"Choosing how to identify speakers" simplified — removed the known_values vs speakers intro paragraph and bullet points. The section now leads directly with the name/role decision bullets, each with a "Click here to learn more." anchor link to the corresponding #identify-by-name / #identify-by-role / #adding-speaker-metadata section.
API reference callout removed — the <Note> linking to the existing-transcript page from the API reference section was removed (the cross-link in the "How to use" section remains).

Polish pass (commits 6–7)

Warning/Note split on main page: The <Warning> box previously combined the hard prerequisite (speaker_labels: true required) with an audio quality tip. These are now separated — <Warning> for the requirement, <Note> for the quality guidance — so the critical prerequisite stands out.
"Python for brevity" note added to the "Adding speaker metadata" section on both pages, since those examples are Python-only while the rest of the page has Python + JS tabs.
Role-based custom properties code block replaced with a sentence on both pages: "You can use the same custom properties with role-based identification by replacing name with role in each speaker object." — eliminates a low-value code snippet that only demonstrated a narrow point.
"How to use" intro simplified on main page: "to transcribe and identify speakers in a single step" → "to identify speakers" — the "single step" comparison was a leftover from when both methods lived on the same page.
Response JSON truncated in the API reference on the main page: reduced from two full utterances (~56 lines) to one utterance with truncated text and a // ... more utterances comment. The response fields table still documents every field.
"Key differences from standard transcription" table replaced with a sentence: "With Speaker Identification, the speaker field in utterances and words contains the identified name or role instead of generic labels like "A", "B", "C"." The two-row table was redundant given how little information it conveyed.

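To make the sentence that replaced the "Key differences" table concrete, here is a minimal sketch of reading the `speaker` field from a response. The response dictionary is a hypothetical, heavily trimmed stand-in for the truncated JSON in the API reference; only the `utterances`, `speaker`, and `text` fields are used. That an unmatched speaker keeps a generic single-letter label is an assumption made for the example.

```python
import re

# Hypothetical, trimmed response; the real payload contains many more fields.
response = {
    "utterances": [
        {"speaker": "Interviewer", "text": "Thanks for joining us today."},
        {"speaker": "Interviewee", "text": "Happy to be here."},
        {"speaker": "C", "text": "Sorry to interrupt."},
    ]
}


def format_transcript(resp):
    """Render each utterance as 'Speaker: text'."""
    return [f"{u['speaker']}: {u['text']}" for u in resp["utterances"]]


def unidentified_speakers(resp):
    """Return speaker values that still look like generic labels ("A", "B", "C")."""
    return sorted(
        {u["speaker"] for u in resp["utterances"] if re.fullmatch(r"[A-Z]", u["speaker"])}
    )


for line in format_transcript(response):
    print(line)
print("Still generic:", unidentified_speakers(response))
```
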
Final polish (commit 8)

Request parameters table shortened on main page: removed the three container-object rows (speech_understanding, .request, .speaker_identification) and dropped the speaker_identification. prefix from the remaining keys. An introductory sentence ("The following parameters are nested under speech_understanding.request.speaker_identification:") replaces them. This gives the Description column substantially more horizontal space.
Redundant speakers vs known_values <Note> removed from both pages — this distinction was already covered in the "Choosing" decision guide, the "Adding speaker metadata" intro paragraph, and the request parameters table descriptions.
"Choosing how to identify speakers" section added to the sub page — previously only on the main page. Users landing directly on the sub page now get the same name/role/metadata decision guide with anchor links.
Sub page Overview tightened: "This is useful for more complex workflows…" → "This is especially useful when you want to re-identify speakers with different parameters, or when your workflow separates transcription from post-processing." Leads with the stronger use case.
Role-based Before/After example added to the Overview on the main page — a second "After (by role)" block showing Speaker A → Interviewer, reinforcing that roles are a first-class option.
"default approach" → "most common approach" in "Identify by name" description on both pages — avoids implying speaker_type: "name" is a default parameter value (the parameter is required).

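The shortened table relies on readers mentally re-attaching the nesting path to each short key. A small sketch of that mapping, assuming only the path and key names quoted in this description (the helper names `full_key` and `nest_under` and the example values are made up for illustration):

```python
NESTING_PATH = "speech_understanding.request.speaker_identification"


def full_key(short_key):
    """Expand a short table key into its full dotted path."""
    return f"{NESTING_PATH}.{short_key}"


def nest_under(path, params):
    """Wrap a flat dict of short keys in the nested objects named by a dotted path."""
    nested = dict(params)
    for part in reversed(path.split(".")):
        nested = {part: nested}
    return nested


# Hypothetical parameter values using the short keys from the table.
short_params = {"speaker_type": "name", "known_values": ["Alice Chen", "Bob Ortiz"]}

print(full_key("speaker_type"))
print(nest_under(NESTING_PATH, short_params))
```
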
Review & Testing Checklist for Human

Verify anchor links on both pages: The "Click here to learn more." links point to #identify-by-name, #identify-by-role, and #adding-speaker-metadata. These anchors now exist on both the main page and the sub page. Confirm Fern generates matching slugs from the headings on each page — if Fern slugifies differently, these links will be broken.
Verify the shortened Request parameters table renders correctly: The table now uses short keys like speaker_type, known_values, speakers[].<custom>. Confirm the table renders with better proportions (wider Description column) and that the introductory sentence about the nesting path is visible.
Verify the section change in docs.yml: Speaker Identification changed from page to section. Confirm sidebar navigation, URL routing (/docs/speech-understanding/speaker-identification), and TOC still work as expected.
Check cross-page links resolve correctly: Main page → /docs/speech-understanding/speaker-identification-existing-transcript; sub page → /docs/speech-understanding/speaker-identification#request-parameters and #response.
Verify the role-based Before/After renders cleanly: The Overview now has three code blocks in sequence (Before, After by name, After by role). Confirm they don't visually blend together or look cluttered.
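
For the anchor-link item in the checklist, a quick local sanity check could look like the sketch below. It uses a common GitHub-style slug rule (lowercase, drop punctuation, replace whitespace with hyphens). Whether Fern slugifies headings the same way is exactly the open question, so treat `slugify` as an assumption and the deploy preview as the real test.

```python
import re


def slugify(heading):
    """Approximate a heading slug: lowercase, strip punctuation, hyphenate whitespace."""
    slug = heading.strip().lower()
    slug = re.sub(r"[^a-z0-9\s-]", "", slug)
    return re.sub(r"\s+", "-", slug)


# Headings and anchors named in this PR description.
EXPECTED = {
    "Identify by name": "#identify-by-name",
    "Identify by role": "#identify-by-role",
    "Adding speaker metadata": "#adding-speaker-metadata",
}

for heading, anchor in EXPECTED.items():
    status = "ok" if "#" + slugify(heading) == anchor else "MISMATCH"
    print(f"{status}: {heading!r} -> #{slugify(heading)}")
```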

Recommended test plan:

Open the deploy preview and navigate to Speaker Identification.
Verify the Before/After shows both name and role examples.
Verify <Warning> and <Note> render as separate boxes.
Click all three "Click here to learn more." anchor links.
Scroll to the Request parameters table and confirm the shorter keys render with a wider Description column.
Scroll to the Response JSON and confirm the truncated example renders cleanly.
Click the "existing transcript" link.
On the sub page, verify the "Choosing how to identify speakers" section exists with working anchor links.
Verify the "most common approach" wording in "Identify by name".
Verify the speakers vs known_values <Note> is gone from both pages.

Notes

Both pages have hidden: true in frontmatter, consistent with the existing pattern.
The new page's API reference defers to the main page for the full request parameters table and response fields table to avoid duplication.
The old #advanced-usage anchor is replaced by #adding-speaker-metadata — this is a breaking change for any existing external links targeting that anchor.
Pre-existing lint warnings are unrelated (unused OpenAPI components).
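
Since the `#advanced-usage` anchor no longer exists, one way to look for stale references in other pages is a simple text scan. This is a sketch only: the sample text and the helper name `find_stale_links` are hypothetical, and a real check would read the `.mdx` files from the repository rather than an inline string.

```python
import re

OLD_ANCHOR = "#advanced-usage"


def find_stale_links(text, anchor=OLD_ANCHOR):
    """Return (line_number, line) pairs that reference the removed anchor exactly."""
    # The negative lookahead avoids matching longer anchors such as #advanced-usage-notes.
    pattern = re.compile(re.escape(anchor) + r"(?![\w-])")
    return [
        (number, line.strip())
        for number, line in enumerate(text.splitlines(), start=1)
        if pattern.search(line)
    ]


# Hypothetical page content used only to exercise the scan.
sample = """See [metadata](/docs/speech-understanding/speaker-identification#advanced-usage).
See [metadata](/docs/speech-understanding/speaker-identification#adding-speaker-metadata).
See [other](/docs/other#advanced-usage-notes)."""

for number, line in find_stale_links(sample):
    print(f"line {number}: {line}")
```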

Link to Devin session: https://app.devin.ai/sessions/73e913af2ee5457797441017325f14d7
Requested by: @LeeVaughn

…allout, decision guide

- Rewrite overview to lead with the value prop (replace generic labels with real names/roles, no voice enrollment needed)
- Upgrade Speaker Diarization prerequisite from Note to Warning with explicit speaker_labels: true instruction
- Add 'Choosing how to identify speakers' section surfacing known_values vs speakers choice earlier
- Add decision guide for name vs role identification and when to use speakers with descriptions

Co-Authored-By: Lee Vaughn <dlvprogramming@gmail.com>

devin-ai-integration · 2026-03-17T19:36:12Z

🤖 Devin AI Engineer

I'll be helping with this pull request! Here's what you should know:

✅ I will automatically:

Address comments on this PR. Add '(aside)' to your comment to have me ignore it.
Look at CI failures and help fix them

Note: I can only respond to comments from users who have write access to this repository.

github-actions · 2026-03-17T19:37:25Z

🌿 Preview your docs: https://assemblyai-preview-14ea94fa-fcac-4aae-8340-5d8d180301be.docs.buildwithfern.com/docs

github-actions · 2026-03-17T19:57:45Z

🌿 Preview your docs: https://assemblyai-preview-c7a8073d-368a-449a-a930-480d1732d19d.docs.buildwithfern.com/docs

github-actions · 2026-03-17T20:14:41Z

🌿 Preview your docs: https://assemblyai-preview-5d319d07-9bc0-4bed-a9b5-f414bd1e5343.docs.buildwithfern.com/docs

github-actions · 2026-03-17T20:31:43Z

🌿 Preview your docs: https://assemblyai-preview-b47b9161-2400-471a-bec9-92d032cceac1.docs.buildwithfern.com/docs

github-actions · 2026-03-17T20:44:34Z

🌿 Preview your docs: https://assemblyai-preview-8b8536b5-abd2-4c07-b2fd-a92091db1b01.docs.buildwithfern.com/docs

github-actions · 2026-03-17T20:51:50Z

🌿 Preview your docs: https://assemblyai-preview-b446078c-b978-494c-8fda-74ae4397d0a9.docs.buildwithfern.com/docs

github-actions · 2026-03-17T21:05:12Z

🌿 Preview your docs: https://assemblyai-preview-f27da89b-80c7-4a37-82a8-7db9509c239c.docs.buildwithfern.com/docs

github-actions · 2026-03-17T23:34:56Z

🌿 Preview your docs: https://assemblyai-preview-3f0d375f-5ad1-4af9-a33c-70182de7e30c.docs.buildwithfern.com/docs

github-actions · 2026-03-17T23:53:01Z

🌿 Preview your docs: https://assemblyai-preview-e86ff74b-2086-4277-a32b-fd696b36d808.docs.buildwithfern.com/docs

github-actions · 2026-03-17T23:57:06Z

🌿 Preview your docs: https://assemblyai-preview-2e95ad5c-104a-4af9-8bd4-caecac46312c.docs.buildwithfern.com/docs

github-actions · 2026-03-18T05:20:58Z

🌿 Preview your docs: https://assemblyai-preview-6d7097b0-8e23-4073-9792-7c4ccecee237.docs.buildwithfern.com/docs

devin-ai-integration

✅ Devin Review: No Issues Found

Devin Review analyzed this PR and found no potential bugs to report.

View in Devin Review to see 2 additional findings.

github-actions · 2026-03-18T15:22:55Z

🌿 Preview your docs: https://assemblyai-preview-7d3d61ea-f8c0-48f3-9667-8a6373c3be34.docs.buildwithfern.com/docs

devin-ai-integration bot assigned LeeVaughn Mar 17, 2026

Split Speaker Identification into two pages: main page (Method 1) and existing transcript page (Method 2)

b4fcd40

Co-Authored-By: Lee Vaughn <dlvprogramming@gmail.com>

devin-ai-integration bot changed the title ~~Improve Speaker Identification docs: clearer overview, prerequisite callout, decision guide~~ Improve Speaker Identification docs: clearer overview, decision guide, and page split Mar 17, 2026

Refine Speaker ID docs: add role examples, remove output format details, slim Advanced Usage

ee8994b

Co-Authored-By: Lee Vaughn <dlvprogramming@gmail.com>

Add 'Identify by name' sections, rename Advanced Usage to 'Adding speaker metadata'

a292497

Co-Authored-By: Lee Vaughn <dlvprogramming@gmail.com>

Simplify Choosing section, add anchor links, remove API reference callout

4afdcf0

Co-Authored-By: Lee Vaughn <dlvprogramming@gmail.com>

Add 'Click here to learn more' link to 'Need better accuracy' bullet

9100d9c

Co-Authored-By: Lee Vaughn <dlvprogramming@gmail.com>

Refine docs: Python-only note, replace role snippet with sentence, split Warning/Note, simplify intro

94b5f20

Co-Authored-By: Lee Vaughn <dlvprogramming@gmail.com>

Truncate Response JSON, replace Key differences table with sentence

cf12901

Co-Authored-By: Lee Vaughn <dlvprogramming@gmail.com>

Polish: shorten param table, add role example, add Choosing section to sub page, remove redundant Note

b09f56e

Co-Authored-By: Lee Vaughn <dlvprogramming@gmail.com>

Merge branch 'main' into devin/1773776058-speaker-id-docs-improvements

d0067be

Merge branch 'main' into devin/1773776058-speaker-id-docs-improvements

ae1b325

devin-ai-integration bot commented Mar 18, 2026

dylan-duan-aai self-requested a review March 18, 2026 15:11

Fix broken link to existing-transcript sub-page (missing speaker-identification slug segment)

e7bec2f

dylan-duan-aai approved these changes Mar 18, 2026

dylan-duan-aai merged commit 94ae60d into main Mar 18, 2026
4 of 5 checks passed

dylan-duan-aai deleted the devin/1773776058-speaker-id-docs-improvements branch March 18, 2026 15:24
