ArticleTextExtractor.getNodes() questions #3

tlvince · 2012-03-11T13:16:49Z

Not an issue as such, a few questions.

Why in ArticleTextExtractor.getNodes() do you:

1. Use a Map, generate a hashCode and then only return the map values? Wouldn't a Set do the same job?
2. Add the parent of each element?


karussell · 2012-03-11T20:54:03Z

Regarding 1: yes, you are right. But it wouldn't make a difference in terms of CPU or memory, as HashSet uses even more memory than HashMap and the hashCode would still be calculated under the hood by HashSet. Thinking about it, though, this could be improved by using an IdentityHashMap. I'll see if I can get all tests passing.

Regarding 2: Thanks! Really not necessary.

karussell · 2012-03-11T21:15:09Z

The linked hashmap cannot be replaced by an identity hashmap as the order of insertion is important.
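To illustrate the ordering point, here is a minimal sketch, not the actual getNodes() code. It assumes the map is used only for its keys (values left as null), and it uses plain strings in place of document elements; the class and method names are hypothetical. LinkedHashMap iterates in insertion order, while the IdentityHashMap documentation makes no guarantee about iteration order.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class InsertionOrderSketch {
    // Hypothetical stand-in for node collection: keys are the items, values are unused.
    static List<String> collect(List<String> items) {
        Map<String, Object> seen = new LinkedHashMap<>();
        for (String item : items) {
            // Putting an existing key again does not change its position.
            seen.put(item, null);
        }
        // The key set iterates in first-insertion order.
        return new ArrayList<>(seen.keySet());
    }

    public static void main(String[] args) {
        System.out.println(collect(Arrays.asList("p", "div", "p", "span")));
    }
}
```

Duplicates are dropped and the first-seen order is kept, which is the property an IdentityHashMap would not promise.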

karussell · 2012-03-11T21:18:27Z

Is this easier to understand now?

38203f2#diff-0

tlvince · 2012-03-12T01:54:10Z

Yes, definitely clearer but I'm still not convinced a Map is suitable here; you are filling the values with null and returning a Set... This may be a matter of preference though (especially if HashMap has better performance than HashSet).

karussell · 2012-03-12T07:30:15Z

Yeah, ok. I'll see if it would have significant perf or memory differences. BTW: hashset is implemented via hashmap ...
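For comparison, a rough sketch of the two approaches discussed here, assuming insertion order must be preserved and again using strings as hypothetical stand-ins for elements. It is not taken from the project; it only shows that a LinkedHashSet (which is itself backed by a LinkedHashMap in the JDK) yields the same ordered, de-duplicated result as a LinkedHashMap with null values.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

public class SetVersusMapSketch {
    // Map variant: keys carry the data, values are filled with null.
    static List<String> viaMap(List<String> items) {
        Map<String, Object> seen = new LinkedHashMap<>();
        for (String item : items) {
            seen.put(item, null);
        }
        return new ArrayList<>(seen.keySet());
    }

    // Set variant: same de-duplication and ordering, without the unused values.
    static List<String> viaSet(List<String> items) {
        Set<String> seen = new LinkedHashSet<>();
        for (String item : items) {
            seen.add(item);
        }
        return new ArrayList<>(seen);
    }

    public static void main(String[] args) {
        List<String> input = Arrays.asList("a", "b", "a", "c", "b");
        System.out.println(viaMap(input).equals(viaSet(input)));
    }
}
```

Since both variants behave the same here, the choice is mostly about readability; any performance or memory difference would need to be measured.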


tlvince closed this as completed Mar 12, 2012

arunkumar9t2 pushed a commit to arunkumar9t2/crux that referenced this issue Sep 16, 2018

d3c27c4: Fixed a ConcurrentModificationException in removeDisallowedAttributes. (karussell#3)
