Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parse descriptions from HTML commented out #3

Open
elvispoz opened this issue May 18, 2020 · 14 comments
Open

Parse descriptions from HTML commented out #3

elvispoz opened this issue May 18, 2020 · 14 comments

Comments

@elvispoz
Copy link

Crawler does not download event descriptions...

@DaWe35
Copy link
Owner

DaWe35 commented May 20, 2020

You may only need one xpath correction in

event_description = tree.xpath('//div[@id="unit_id_886302548152152"]/div[2]/text()')

@elvispoz
Copy link
Author

I know, but what is correct ?

@elvispoz
Copy link
Author

Im testing with new selector but still does not effects...

event_description = tree.xpath('//div[@id="unit_id_886302548152152"]/section[1]/text()')

Have you got any idea?

@DaWe35
Copy link
Owner

DaWe35 commented May 22, 2020

section[1]

That's weird, section[1] need to work. Anyway, I think there is a Facebook release slipping - in my old facebook account <div> works, in my new account <section>, so we need to support all of them. Cool....

Can you try out event_description = tree.xpath('//div[@id="unit_id_886302548152152"]/section/text()') ? Anyway I don't understand why section[1] not works...

@elvispoz
Copy link
Author

Descriptions in mysql are still empty :(

@elvispoz
Copy link
Author

Have you got any idea and solution... ?

@DaWe35
Copy link
Owner

DaWe35 commented May 29, 2020

Have you got any idea and solution... ?

Sorry @elvispoz, today I have no time, I'll check it out later. Sometimes I need to work for money also :)

@DaWe35
Copy link
Owner

DaWe35 commented May 31, 2020

@elvispoz I just registered a new Facebook account, and everything works fine. Can you give me your Facebook account to have a try? If you followed my guide, you registered one only for FB crawler, so there is no personal data.
You can find me here: https://discord.gg/69SZC4v (I'm DaWe)

@elvispoz
Copy link
Author

elvispoz commented Jul 13, 2020

Hi,
Have you got any time for this? I see thet fb make some change in code and put description to:
<!-- xxx -->

@elvispoz
Copy link
Author

@DaWe35 DaWe35 mentioned this issue Jul 29, 2020
@DaWe35
Copy link
Owner

DaWe35 commented Jul 29, 2020

Is there anyone having the same issue?

@DaWe35 DaWe35 changed the title No descriptions Parse descriptions from HTML commented out Sep 26, 2020
@XinminHu
Copy link

XinminHu commented Jan 7, 2021

Is there anyone having the same issue?

Hello, I followed all your guidances, and the crawler worked smoothly.
But there are no records downloaded, the 'events' table is always empty...
I am not sure which part went wrong...

@oblab
Copy link

oblab commented Nov 12, 2021

Same here, the crawler is working, but 'events" table still empty...

@oblab
Copy link

oblab commented Nov 12, 2021

python3 Crawlr.py                                                                                                                                                         
Logging in to facebook...                                                                                                                                                                     
0 old row deleted                                                                                                                                                                             
Pages are already updated less than an hour ago, no new events queried  

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants