change CrlClientOnline to ignore content-type #42

janflora · 2018-06-15T10:13:48Z

Using CrlClientOnline to fetch CRLs fails if the webserver publishing the CRL does not provide a content-type header in the response. Since the response is basically just converted to an input stream and is used in its raw format, the content type is not really needed. The case where I've seen this break is while fetching the CRL from Adobe CDS: http://crl.adobe.com/cds.crl

iText-CI · 2018-06-15T10:13:49Z

Can one of the admins verify this patch?

blagae · 2018-06-18T06:46:11Z

While I'm not an expert on this piece of code, it makes more sense to me to at least try and get the regular content type, and only fall back on failure. So either try { getContent() } catch (SomeException se) { getInputStream() } or by using a null check on the return value of getContent.

janflora · 2018-06-18T08:35:47Z

@blagae The original code would make use of getContent() which essentially tries to parse the content using the content-type of the response, only to cast this into an input stream. The way I see it is that the only benefit you get here is that the content type might be validated against the actual data being returned (I'm not sure if any validation takes place).

If you encapsulate the the getContent() call in a try-catch block and ultimately just get the input stream content if it fails, then you'd accept the returned content whether it matches the content-type or not. Isn't this essentially the same as just reading the response as an input stream to begin with?

blagae · 2018-06-18T08:52:29Z

I'm just looking at the code as I see it, and it seems to be not guaranteed that UrlConnection.getContent() and UrlConnection.getInputStream() will return the same object. In fact, it isn't even guaranteed that getContent() will return an InputStream, hence the cast (which may also need to be caught ?). From the source code in java.net, this is the implementation of getContent():

    public Object getContent() throws IOException {
        getInputStream(); // this is not the returned object !
        return getContentHandler().getContent(this);
    }

As far as I can see, we can't make inferences of the ContentHandler's behavior because the object's type depends on the mimetype of the content-type header.

Again, I'm not the expert here and I'll defer to one of my colleagues who have more knowledge of dig-sig, but I think if we're trying to improve behavior, we should take into account the uncertainties that are not solved by the java.net package.

avlemos · 2019-04-30T08:02:12Z

Hi @janflora,

Right now itext/itextpdf has been deprecated in favor of itext/itext7, although we will continue to incorporate security fixes in itext/itextpdf (iText5).

Could you check if the issue exists in itext/itext7 so we can incorporate your fix?

Thank you for contribution, it is appreciated.

avlemos closed this Apr 30, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

change CrlClientOnline to ignore content-type #42

change CrlClientOnline to ignore content-type #42

janflora commented Jun 15, 2018

iText-CI commented Jun 15, 2018

blagae commented Jun 18, 2018

janflora commented Jun 18, 2018

blagae commented Jun 18, 2018 •

edited

avlemos commented Apr 30, 2019

change CrlClientOnline to ignore content-type #42

change CrlClientOnline to ignore content-type #42

Conversation

janflora commented Jun 15, 2018

iText-CI commented Jun 15, 2018

blagae commented Jun 18, 2018

janflora commented Jun 18, 2018

blagae commented Jun 18, 2018 • edited

avlemos commented Apr 30, 2019

blagae commented Jun 18, 2018 •

edited