You can clone with
HTTPS or Subversion.
I know it's only a very minor thing, but I was looking at the implementation and it struck me.
If for example I feed the single character '\xEF', the code path that checks for BOMs will not find anything. If I then feed the 2 characters '\xBB\xBF' (completing the UTF-8 BOM), the BOM checking code path is skipped. If the detector is then closed, the UTF-8 BOM is detected as windows-1252 with 95% confidence...
Allows streaming of UTF w/ BOM strings [gh-6]
UTF strings with BOM were only being detected if the entire DOM was part of the first feed().
If a string were to be streamed and the BOM given in different feed() calls the detector would incorrectly detect a different encoding.