-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create clean/ca/los_angeles_pd.py #18
Comments
MAIN VISION FOR LA: LA has two systems for getting to the next request page:
In both cases, all URLs come structured (displayed directly on the HTML, no need for reconstruction) Need two distinct approaches to tackle this:
Brainstorming the second method:
Now that we understand the terms, let's move on to the plan of attack. PART 1: main index page scrape -> second index page scrape from there, two possible paths: PART 2A: child page scrape for current year To achieve that, here are some goals:
|
The text was updated successfully, but these errors were encountered: