NUTCH-2729 protocol-okhttp: fix marking of truncated content#462

Merged
sebastian-nagel merged 3 commits into apache:master from sebastian-nagel:NUTCH-2729-protocol-okhttp-mark-truncated on Sep 1, 2019
@sebastian-nagel
Contributor

  • request one byte more than the configured content limit (http.content.limit) to detect truncations reliably
  • add unit tests for marking of truncations, also for gzip Content-Encoding and chunked Transfer-Encoding
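The first bullet can be illustrated with a minimal sketch. Reading exactly `http.content.limit` bytes cannot tell a document of exactly that size from a longer one that was cut off; asking for one extra byte can. The class and method names below (`TruncationSketch`, `readLimited`, `Result`) are hypothetical and are not the actual protocol-okhttp code; the sketch also assumes a non-negative limit smaller than `Integer.MAX_VALUE`.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.Arrays;

/** Hypothetical helper illustrating the "limit + 1" idea; not the plugin's real code. */
class TruncationSketch {

  /** Kept bytes plus a flag telling whether the source had more than the limit. */
  static final class Result {
    final byte[] content;
    final boolean truncated;

    Result(byte[] content, boolean truncated) {
      this.content = content;
      this.truncated = truncated;
    }
  }

  /** Reads at most limit + 1 bytes and keeps at most limit of them. */
  static Result readLimited(InputStream in, int limit) throws IOException {
    ByteArrayOutputStream out = new ByteArrayOutputStream();
    byte[] buf = new byte[8192];
    int wanted = limit + 1; // one byte more than the configured limit
    while (out.size() < wanted) {
      int n = in.read(buf, 0, Math.min(buf.length, wanted - out.size()));
      if (n < 0) {
        break; // end of stream reached before the extra byte
      }
      out.write(buf, 0, n);
    }
    byte[] all = out.toByteArray();
    if (all.length > limit) {
      // The extra byte arrived, so the content is longer than the limit.
      return new Result(Arrays.copyOf(all, limit), true);
    }
    return new Result(all, false);
  }

  public static void main(String[] args) throws IOException {
    Result exact = readLimited(new ByteArrayInputStream(new byte[10]), 10);
    Result longer = readLimited(new ByteArrayInputStream(new byte[11]), 10);
    System.out.println(exact.truncated + " " + longer.truncated);
  }
}
```

With a limit of 10, a 10-byte body is returned whole and unmarked, while an 11-byte body is cut to 10 bytes and flagged as truncated.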

@sebastian-nagel sebastian-nagel force-pushed the NUTCH-2729-protocol-okhttp-mark-truncated branch from 4782908 to 78da166 Compare August 26, 2019 15:27
@sebastian-nagel sebastian-nagel force-pushed the NUTCH-2729-protocol-okhttp-mark-truncated branch from 78da166 to 5c45172 Compare August 28, 2019 15:48
@sebastian-nagel
Contributor Author

Rebased to master and merged with changes to protocol-okhttp related to NUTCH-2927/#460.

@jorgelbg
Member

+1 LGTM

- log the exception that caused a partial fetch when the content is kept as
  truncated (marked as "disconnect")
- add a unit test to verify that partial fetches are kept if
  http.partial.truncated == true
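
The behavior described in this commit message could look roughly like the following sketch: if the connection fails mid-body and partial content is allowed, the bytes read so far are kept, the exception is logged, and the result is marked with the reason "disconnect". All names here (`PartialFetchSketch`, `readAll`, `Fetched`, `FailingAfter`) are hypothetical, and the `keepPartial` flag merely stands in for the `http.partial.truncated` property; the real plugin code may differ.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;

/** Hypothetical illustration of keeping a partial fetch; not the plugin's real code. */
class PartialFetchSketch {

  /** Fetched bytes plus the truncation reason (null if the body was read completely). */
  static final class Fetched {
    final byte[] content;
    final String truncatedReason;

    Fetched(byte[] content, String truncatedReason) {
      this.content = content;
      this.truncatedReason = truncatedReason;
    }
  }

  /** Test stream that delivers n bytes and then simulates a dropped connection. */
  static final class FailingAfter extends InputStream {
    private int remaining;

    FailingAfter(int n) {
      this.remaining = n;
    }

    @Override
    public int read() throws IOException {
      if (remaining > 0) {
        remaining--;
        return 'x';
      }
      throw new IOException("connection reset (simulated)");
    }
  }

  /** Reads the whole stream; on failure keeps the partial content only if keepPartial is set. */
  static Fetched readAll(InputStream in, boolean keepPartial) throws IOException {
    ByteArrayOutputStream out = new ByteArrayOutputStream();
    byte[] buf = new byte[4096];
    try {
      int n;
      while ((n = in.read(buf)) >= 0) {
        out.write(buf, 0, n);
      }
    } catch (IOException e) {
      if (!keepPartial) {
        throw e; // partial content not wanted: the fetch fails
      }
      // Log the cause, then keep what was read so far as truncated content.
      System.err.println("Partial fetch, keeping " + out.size() + " bytes: " + e);
      return new Fetched(out.toByteArray(), "disconnect");
    }
    return new Fetched(out.toByteArray(), null);
  }

  public static void main(String[] args) throws IOException {
    Fetched f = readAll(new FailingAfter(5), true);
    System.out.println(f.content.length + " bytes, reason=" + f.truncatedReason);
  }
}
```

With `keepPartial` set to `false`, the same failing stream makes `readAll` rethrow the `IOException` instead of returning content.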
@sebastian-nagel sebastian-nagel merged commit 9a9f425 into apache:master Sep 1, 2019
@sebastian-nagel sebastian-nagel deleted the NUTCH-2729-protocol-okhttp-mark-truncated branch October 15, 2019 12:14