ImportError with html5lib 0.99999999 (2016-07-14) #332

Open
ivan opened this issue Jul 15, 2016 · 1 comment · May be fixed by #393

ivan commented Jul 15, 2016

html5lib 0.99999999 renamed html5lib.tokenizer to html5lib._tokenizer. From its changelog:

Move a whole load of stuff (inputstream, ihatexml, trie, tokenizer, utils) to be underscore prefixed to clarify their status as private.

https://pypi.python.org/pypi/html5lib

and thus wpull crashes with ImportError on startup:

  File "/home/travis/virtualenv/python3.4.2/lib/python3.4/site-packages/click/core.py", line 716, in __call__
    return self.main(*args, **kwargs)
  File "/home/travis/virtualenv/python3.4.2/lib/python3.4/site-packages/click/core.py", line 696, in main
    rv = self.invoke(ctx)
  File "/home/travis/virtualenv/python3.4.2/lib/python3.4/site-packages/click/core.py", line 889, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "/home/travis/virtualenv/python3.4.2/lib/python3.4/site-packages/click/core.py", line 534, in invoke
    return callback(*args, **kwargs)
  File "/home/travis/virtualenv/python3.4.2/lib/python3.4/site-packages/libgrabsite/main.py", line 368, in main
    wpull.__main__.main()
  File "/home/travis/virtualenv/python3.4.2/lib/python3.4/site-packages/wpull/__main__.py", line 29, in main
    builder.build()
  File "/home/travis/virtualenv/python3.4.2/lib/python3.4/site-packages/wpull/builder.py", line 195, in build
    self._build_html_parser()
  File "/home/travis/virtualenv/python3.4.2/lib/python3.4/site-packages/wpull/builder.py", line 692, in _build_html_parser
    from wpull.document.htmlparse.html5lib_ import HTMLParser
  File "/home/travis/virtualenv/python3.4.2/lib/python3.4/site-packages/wpull/document/htmlparse/html5lib_.py", line 3, in <module>
    import html5lib.tokenizer
ImportError: No module named 'html5lib.tokenizer'

(That's a traceback from wpull 1.2.3, but it looks like wpull 2.0.1 has the same import.)
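One possible way to tolerate the rename is a fallback import that tries the old public module name first and the underscore-prefixed name second. The sketch below is not taken from wpull or from any of the referenced commits; `import_first_available` is a hypothetical helper, and relying on `html5lib._tokenizer` means depending on an API that html5lib has explicitly marked private.

```python
import importlib


def import_first_available(module_names):
    """Return the first module in module_names that can be imported.

    Hypothetical helper for illustration; not part of wpull or html5lib.
    """
    last_error = None
    for name in module_names:
        try:
            return importlib.import_module(name)
        except ImportError as exc:
            # Remember the failure and try the next candidate name.
            last_error = exc
    raise ImportError(
        "none of these modules could be imported: " + ", ".join(module_names)
    ) from last_error


# Possible use in place of `import html5lib.tokenizer` (assumes html5lib
# is installed; the second name is the private, post-rename module):
#
#   tokenizer = import_first_available(
#       ["html5lib.tokenizer", "html5lib._tokenizer"]
#   )
```

Whether the private module still behaves the way wpull expects is a separate question that this sketch does not answer.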

ivan added a commit to ArchiveTeam/grab-site that referenced this issue Jul 15, 2016
chfoo added the bug label Aug 19, 2016
JustAnotherArchivist added a commit to JustAnotherArchivist/wpull that referenced this issue Oct 9, 2018
Version 0.99999999 (eight nines) and up do not provide the `html5lib.tokenizer` API used by wpull.

anarcat commented Oct 31, 2018

Workaround: pin html5lib to 0.9999999 (seven nines) or older:

pip3 install 'html5lib<=0.9999999'
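Because the affected and unaffected versions differ only in the number of nines, it can help to compare them as integer tuples, which is roughly how pip orders release numbers. The following is a simplified sketch, not a full PEP 440 parser: it ignores pre-release suffixes, and the names `release_tuple` and `lacks_public_tokenizer` are made up for illustration.

```python
import re

# Last release that still ships html5lib.tokenizer, per the pin above
# (seven nines).
LAST_WORKING_RELEASE = (0, 9999999)


def release_tuple(version):
    """Turn the leading dotted-number part of a version into an int tuple."""
    match = re.match(r"\d+(?:\.\d+)*", version)
    if match is None:
        raise ValueError("not a dotted numeric version: %r" % version)
    return tuple(int(part) for part in match.group(0).split("."))


def lacks_public_tokenizer(version):
    """True if this html5lib version is newer than the last working one."""
    return release_tuple(version) > LAST_WORKING_RELEASE


print(lacks_public_tokenizer("0.9999999"))   # seven nines
print(lacks_public_tokenizer("0.99999999"))  # eight nines
```

The second component is compared as a whole integer, so 99999999 (eight nines) sorts after 9999999 (seven nines) even though the two strings look nearly identical.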

It would be great to see a release with this and #384 because right now, out of the box, wpull just fails to pip install itself...

anarcat pushed a commit to anarcat/wpull that referenced this issue Nov 2, 2018
Version 0.99999999 (eight nines) and up do not provide the `html5lib.tokenizer` API used by wpull.
anarcat mentioned this issue Nov 2, 2018
JustAnotherArchivist added a commit to JustAnotherArchivist/wpull that referenced this issue Nov 3, 2018
Version 0.99999999 (eight nines) and up do not provide the `html5lib.tokenizer` API used by wpull.
JustAnotherArchivist added a commit to JustAnotherArchivist/wpull that referenced this issue Jan 16, 2023
Version 0.99999999 (eight nines) and up do not provide the `html5lib.tokenizer` API used by wpull.