fix: IpynbConverter.accepts() returns False instead of raising on non-decodable content by wali-reheman · Pull Request #1910 · microsoft/markitdown

wali-reheman · 2026-05-23T13:06:24Z

Handles UnicodeDecodeError and ValueError gracefully in accepts() when a file with an application/json mimetype can't be decoded as text. Instead of crashing the full conversion pipeline, accepts() returns False so the converter skips the file.

Fixes #1894.

…-decodable content

Wraps decode in try/except to catch UnicodeDecodeError and ValueError when a file with application/json mimetype contains non-notebook content (e.g. a binary or non-ASCII file misidentified as JSON). This prevents the converter from crashing the entire conversion pipeline when it encounters files it should simply skip.

Fixes microsoft#1894
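For readers who want to see the shape of the change, here is a minimal sketch of the pattern described above. It is not the actual markitdown code: the function name `looks_like_notebook`, its parameters, and the `nbformat` marker check are assumptions made for illustration only.

```python
import io
from typing import BinaryIO


def looks_like_notebook(file_stream: BinaryIO, mimetype: str = "", extension: str = "") -> bool:
    """Hypothetical stand-in for an accepts()-style check (not the real markitdown API)."""
    if extension.lower() == ".ipynb":
        return True
    if not mimetype.lower().startswith("application/json"):
        return False

    cur_pos = file_stream.tell()
    try:
        # Decoding may fail for binary or mislabeled content.
        text = file_stream.read().decode("utf-8")
    except (UnicodeDecodeError, ValueError):
        # UnicodeDecodeError is a subclass of ValueError; both are listed
        # to mirror the PR description. Decline the file instead of raising.
        return False
    finally:
        # Restore the stream position so later converters can read the file.
        file_stream.seek(cur_pos)

    # Assumed heuristic: notebook JSON carries these two keys.
    return '"nbformat"' in text and '"nbformat_minor"' in text
```

With this shape, a stream whose bytes are not valid UTF-8 makes the check return False and leaves the stream position unchanged, so the remaining converters in the pipeline still get a chance at the file.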

wali-reheman closed this May 23, 2026

serejaris mentioned this pull request May 27, 2026

fix: IpynbConverter.accepts() raises UnicodeDecodeError on non-ASCII files (e.g. French PDFs) #1895 (Open, 1 task)

wali-reheman wants to merge 1 commit into microsoft:main from wali-reheman:fix/upstream-ipynb-accepts-unicode