chardet dependency #171

Closed
pitman opened this Issue Jan 30, 2012 · 3 comments

Comments

Projects
None yet
2 participants

pitman commented Jan 30, 2012

I note /bibserver/parsers/BibTexParser.py
is importing chardet
I presume this is http://pypi.python.org/pypi/chardet

I note that the chardet homepage
http://chardet.feedparser.org/
is returning a 410 Gone response which is not a good sign. Seems like we should be curating our own version of chardet
if the original developer is no longer supporting it.

Contributor

epoz commented Jan 30, 2012

We are modifying the parsers to improve unicode handling. The character set detection is being moved in to the parser base class. Looking into using the detection code from http://www.crummy.com/software/BeautifulSoup/.

Contributor

epoz commented Jan 30, 2012

I suspect the disappearance of chardet is related to Mark Pilgrim's decision to remove his presence on the Internet (in so far as that is possible)

pitman commented Jan 30, 2012

Etienne Posthumus reply@reply.github.com wrote:

I suspect the disappearance of chardet is related to Mark Pilgrim's decision to remove his presence on the Internet (in so far as that is possible)

Fascinating. More about this at http://en.wikipedia.org/wiki/Mark_Pilgrim_(software_developer)
Looks like he has taken down a very substantial amount of free documentation, e.g.
"Dive into Greasemonkey" http://diveintogreasemonkey.org/ a free book on Greasemonkey.
Raises some serious issues about the meaning of "publication" of such material.

--Jim

@epoz epoz closed this Jan 31, 2012

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment