Parse header with email.parser #145113

m5k10 · 2025-05-17T14:47:54Z

Proposed change

Current implementation is using regular expression to extract filename from Content-Disposition. It truncates the filename if it contains space character.

Type of change

Dependency upgrade
Bugfix (non-breaking change which fixes an issue)
New integration (thank you!)
New feature (which adds functionality to an existing integration)
Deprecation (breaking change to happen in the future)
Breaking change (fix/feature causing existing functionality to break)
Code quality improvements to existing code or addition of tests

Additional information

This PR fixes or closes issue: fixes #
This PR is related to issue:
Link to documentation pull request:
Link to developer documentation pull request:
Link to frontend pull request:

Checklist

The code change is tested and works locally.
Local tests pass. Your PR cannot be merged unless tests pass
There is no commented out code in this PR.
I have followed the development checklist
I have followed the perfect PR recommendations
The code has been formatted using Ruff (ruff format homeassistant tests)
Tests have been added to verify that the new code works.

If user exposed functionality or configuration variables are added/changed:

Documentation added/updated for www.home-assistant.io

If the code communicates with devices, web services, or third-party tools:

The manifest file has all fields filled out correctly.
Updated and included derived files by running: python3 -m script.hassfest.
New or updated dependencies have been added to requirements_all.txt.
Updated by running python3 -m script.gen_requirements_all.
For the updated dependencies - a link to the changelog, or at minimum a diff between library versions is added to the PR description.

To help with the load of incoming pull requests:

I have reviewed two other open pull requests in this repository.

home-assistant

Hi @soldierkam

It seems you haven't yet signed a CLA. Please do so here.

Once you do that we will be able to review and accept this pull request.

Thanks!

home-assistant · 2025-05-17T14:47:58Z

Please take a look at the requested changes, and use the Ready for review button when you are done, thanks 👍

Learn more about our pull request process.

home-assistant · 2025-05-17T14:48:00Z

Hey there @erwindouna, mind taking a look at this pull request as it has been labeled with an integration (downloader) you are listed as a code owner for? Thanks!

Code owner commands

Code owners of downloader can trigger bot actions by commenting:

@home-assistant close Closes the pull request.
@home-assistant rename Awesome new title Renames the pull request.
@home-assistant reopen Reopen the pull request.
@home-assistant unassign downloader Removes the current integration label and assignees on the pull request, add the integration domain after the command.
@home-assistant add-label needs-more-information Add a label (needs-more-information, problem in dependency, problem in custom component) to the pull request.
@home-assistant remove-label needs-more-information Remove a label (needs-more-information, problem in dependency, problem in custom component) on the pull request.

erwindouna · 2025-05-17T19:23:22Z

homeassistant/components/downloader/__init__.py


 from __future__ import annotations

+from email.parser import HeaderParser


Why this library?

I was evaluating modules in the Python Standard Library for parsing headers. I identified two candidates: email and cgi. Since cgi is deprecated, email was the only viable option. I wanted to avoid introducing additional dependencies.

Is it not possible to use aiohttp library? https://github.com/aio-libs/aiohttp/blob/e5d1240babff6db6297e0d92f174446de19cf76d/aiohttp/client_reqrep.py#L892-L899

disposition_type, params_dct = multipart.parse_content_disposition(raw) params = MappingProxyType(params_dct) filename = multipart.content_disposition_filename(params) return ContentDisposition(disposition_type, params, filename)

It is a core dependency of Home Assistant so it wouldn't be considered "introducing additional dependencies"

Also (related but not dependant) it seems that "downloader" is still using "requests". Would it not make sense to migrate to aiohttp instead and gain all the benefits of async downloads?

async with session.get(url, stream=True, timeout=10) as req: if req.status != HTTPStatus.OK: ... return ... filename = req.content_disposition.filename

github-actions · 2025-07-18T08:08:30Z

There hasn't been any activity on this pull request recently. This pull request has been automatically marked as stale because of that and will be closed if no further activity occurs within 7 days.
If you are the author of this PR, please leave a comment if you want to keep it open. Also, please rebase your PR onto the latest dev branch to ensure that it's up to date with the latest changes.
Thank you for your contribution!

home-assistant bot added bugfix cla-needed integration: downloader small-pr PRs with less than 30 lines. labels May 17, 2025

home-assistant bot marked this pull request as draft May 17, 2025 14:47

home-assistant bot requested changes May 17, 2025

View reviewed changes

home-assistant bot added Quality Scale: internal cla-recheck cla-signed and removed cla-recheck cla-needed labels May 17, 2025

Parse header with email.parser

bf0d293

m5k10 force-pushed the downloader_header_parsing branch from 8d883b0 to bf0d293 Compare May 17, 2025 14:54

erwindouna suggested changes May 17, 2025

View reviewed changes

github-actions bot added the stale label Jul 18, 2025

github-actions bot closed this Jul 25, 2025

github-actions bot locked and limited conversation to collaborators Jul 26, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Parse header with email.parser #145113

Parse header with email.parser #145113

Uh oh!

m5k10 commented May 17, 2025

Uh oh!

home-assistant bot left a comment

Uh oh!

home-assistant bot commented May 17, 2025

Uh oh!

home-assistant bot commented May 17, 2025

Uh oh!

erwindouna May 17, 2025

Uh oh!

m5k10 May 18, 2025

Uh oh!

epenet May 19, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Jul 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants


		from __future__ import annotations

		from email.parser import HeaderParser

Uh oh!

Parse header with email.parser #145113

Parse header with email.parser #145113

Uh oh!

Conversation

m5k10 commented May 17, 2025

Proposed change

Type of change

Additional information

Checklist

Uh oh!

home-assistant bot left a comment

Choose a reason for hiding this comment

Uh oh!

home-assistant bot commented May 17, 2025

Uh oh!

home-assistant bot commented May 17, 2025

Uh oh!

erwindouna May 17, 2025

Choose a reason for hiding this comment

Uh oh!

m5k10 May 18, 2025

Choose a reason for hiding this comment

Uh oh!

epenet May 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Jul 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

epenet May 19, 2025 •

edited

Loading