Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PDF-Link broken & Dependency Error? #2

Closed
Studentenfutter opened this issue Aug 16, 2020 · 1 comment
Closed

PDF-Link broken & Dependency Error? #2

Studentenfutter opened this issue Aug 16, 2020 · 1 comment

Comments

@Studentenfutter
Copy link

Leider scheint der Parser aufgrund einer geänderten URL auf Seiten des Bundestages nicht mehr zu funktionieren:

DEBUG=scraper node ./scraper.js dump
  scraper outdir is /home/user/Dokumente/scraper-lobbyliste/dump +0ms
(node:14036) [DEP0022] DeprecationWarning: os.tmpDir() is deprecated. Use os.tmpdir() instead.
Error: Unable to locate PDF link
    at /home/user/Dokumente/scraper-lobbyliste/scraper.js:224:80
    at /home/user/Dokumente/scraper-lobbyliste/node_modules/scrapyard/lib/scrapyard.js:110:22
    at Object.callback (/home/user/Dokumente/scraper-lobbyliste/node_modules/scrapyard/lib/scrapyard.js:123:4)
    at /home/user/Dokumente/scraper-lobbyliste/node_modules/async/dist/async.js:1311:26
    at /home/user/Dokumente/scraper-lobbyliste/node_modules/async/dist/async.js:321:20
    at /home/user/Dokumente/scraper-lobbyliste/node_modules/scrapyard/lib/scrapyard.js:55:4
    at /home/user/Dokumente/scraper-lobbyliste/node_modules/scrapyard/lib/scrapyard.js:160:4
    at Request._callback (/home/user/Dokumente/scraper-lobbyliste/node_modules/scrapyard/lib/scrapyard.js:193:7)
    at Request.self.callback (/home/user/Dokumente/scraper-lobbyliste/node_modules/request/request.js:185:22)

Wenn ich die BASEURL auf das aktuelle PDF ändere erhalte ich ebenfalls einen Error:

  scraper outdir is /home/user/Dokumente/scraper-lobbyliste/dump +0ms
(node:14735) [DEP0022] DeprecationWarning: os.tmpDir() is deprecated. Use os.tmpdir() instead.
/home/user/Dokumente/scraper-lobbyliste/node_modules/domutils/lib/querying.js:87
		if(test(elems[i])) result.push(elems[i]);
		   ^

RangeError: Maximum call stack size exceeded
    at findAll (/home/user/Dokumente/scraper-lobbyliste/node_modules/domutils/lib/querying.js:87:6)
    at findAll (/home/user/Dokumente/scraper-lobbyliste/node_modules/domutils/lib/querying.js:90:27)
    at findAll (/home/user/Dokumente/scraper-lobbyliste/node_modules/domutils/lib/querying.js:90:27)
    at findAll (/home/user/Dokumente/scraper-lobbyliste/node_modules/domutils/lib/querying.js:90:27)
    at findAll (/home/user/Dokumente/scraper-lobbyliste/node_modules/domutils/lib/querying.js:90:27)
    at findAll (/home/user/Dokumente/scraper-lobbyliste/node_modules/domutils/lib/querying.js:90:27)
    at findAll (/home/user/Dokumente/scraper-lobbyliste/node_modules/domutils/lib/querying.js:90:27)
    at findAll (/home/user/Dokumente/scraper-lobbyliste/node_modules/domutils/lib/querying.js:90:27)
    at findAll (/home/user/Dokumente/scraper-lobbyliste/node_modules/domutils/lib/querying.js:90:27)
    at findAll (/home/user/Dokumente/scraper-lobbyliste/node_modules/domutils/lib/querying.js:90:27)

Vielleicht liegt dies an #19? Leider habe ich keine JavaScript-Kenntnisse, könnte mir aber vorstellen, dass ein Problem bei den Dependencies vorliegt.

@yetzt
Copy link
Collaborator

yetzt commented Aug 18, 2020

ja, der code ist 5 jahre alt und wird nicht mehr gepflegt.

@yetzt yetzt closed this as completed Aug 18, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants