Cant crawl, it print "-Only few posts" always #34

Closed
s1011448 opened this issue Feb 13, 2018 · 10 comments

@s1011448

The user has more than 12 posts, but after running the program it only prints "-Only few posts".

@timgrossmann
Collaborator

@s1011448 Hi, I'm really sorry you're encountering this issue...
However, right now I unfortunately cannot put time into this project.
I'll try to fix all the persisting bugs as soon as I'm back in university and have more time to spare.

If you want to fix it yourself, I'm pretty sure a lot of people, myself included, would be interested in having the fix merged into this repository.

Thank you

@icebergyy

icebergyy commented Feb 16, 2018

I am getting the same issue. Any idea what could be causing the problem? I will take a look at the code.

It looks like the error occurs at:
https://github.com/timgrossmann/instagram-profilecrawl/blob/0a4415e3b76538401b326b87a265c217c1013645/util/extractor.py#L94

@s1011448
Author

@timgrossmann Sorry for bothering you. I am trying to fix it myself but am still running into problems. It seems that the load_button element no longer exists on the page and that some class names have changed. I would appreciate it very much if you have time to take a look at this project and give me some advice.

Thank you~

@deniskrafczyk

same issue for me :/

@hadifar

hadifar commented Mar 15, 2018

same issue here :(

@justdvl
Contributor

justdvl commented Mar 15, 2018

Hey guys, I fixed the bug, I'll try to contribute to it now (it is my first time working with GitHub so let me figure out how to do that).
prev_divs needs to be changed to: prev_divs = browser.find_elements_by_tag_name('main')
The way the page is loaded also needs to be changed.
With both changes, it works.
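
For readers applying this change on a newer Selenium install, here is a minimal sketch of the same lookup. It assumes `browser` is a Selenium WebDriver; the helper name `find_main_containers` is hypothetical and not part of the project. The `find_elements_by_tag_name` call quoted above is the Selenium 3 style, which later Selenium 4 releases removed in favor of `find_elements(By.TAG_NAME, ...)`. The string `"tag name"` is the value of `By.TAG_NAME`, used here so the snippet runs without importing Selenium.

```python
def find_main_containers(browser):
    """Return the <main> elements of the page currently loaded in `browser`.

    Assumption: `browser` is a Selenium WebDriver (or any object exposing the
    same lookup methods). This helper is illustrative, not project code.
    """
    legacy_lookup = getattr(browser, "find_elements_by_tag_name", None)
    if callable(legacy_lookup):
        # Selenium 3 style call, as quoted in the comment above.
        return legacy_lookup("main")
    # Selenium 4 style call; "tag name" is the value of By.TAG_NAME.
    return browser.find_elements("tag name", "main")
```

In the crawler this would be used as `prev_divs = find_main_containers(browser)`, leaving the rest of the extraction code unchanged.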

@justdvl
Contributor

justdvl commented Mar 15, 2018

What took me 2 hours to figure out is that Instagram only ever keeps 30 pictures in the page's markup at the same time. That is why I was only getting 30 links (and why the like-by-user feature in InstaPy also only does 30 likes).
I fixed it by collecting the links every time the page is scrolled down, so that all of them are gathered line by line.
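
The approach described here can be sketched roughly as follows. This is not the code from the pull request, only an illustration under stated assumptions: `browser` is a Selenium WebDriver with the profile page already loaded, post URLs contain `/p/`, and the function name and the `max_scrolls`, `pause`, and `patience` parameters are hypothetical.

```python
import time


def collect_post_links(browser, max_scrolls=100, pause=1.0, patience=3):
    """Scroll the page and accumulate post links as they come into the DOM.

    Only a window of posts is present in the markup at any time, so links are
    harvested after every scroll instead of once at the end.
    """
    links = []         # links in the order they were first seen
    seen = set()       # fast membership check to avoid duplicates
    idle_rounds = 0    # consecutive passes that found nothing new

    for _ in range(max_scrolls):
        found_new = False
        # "tag name" is the value of selenium's By.TAG_NAME.
        for anchor in browser.find_elements("tag name", "a"):
            href = anchor.get_attribute("href")
            # Assumption: a post link is any href containing "/p/".
            if href and "/p/" in href and href not in seen:
                seen.add(href)
                links.append(href)
                found_new = True

        idle_rounds = 0 if found_new else idle_rounds + 1
        if idle_rounds >= patience:
            break  # several passes without new links: assume the end

        # Scroll to the bottom so the next batch of posts gets rendered.
        browser.execute_script("window.scrollTo(0, document.body.scrollHeight);")
        time.sleep(pause)

    return links
```

Stopping after a few passes without new links is a simple heuristic; a slow connection may need a longer `pause` or a higher `patience` to avoid ending early.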

@justdvl
Contributor

justdvl commented Mar 16, 2018

Everything is working flawlessly.
I managed to open a pull request.
It is my first time ever, so I hope I did everything correctly 0-)

@guifelix

@justdvl since you managed to fix it, did you do a pull request just in this project or in instapy as well?
btw, tnx for the fix!

@justdvl
Contributor

justdvl commented Mar 22, 2018

They already fixed it in InstaPy; pull requests just take longer to get through because they want to make sure everything is fine. Just be patient.
But honestly, this fix is super simple and I don't see what could go wrong there.
Meanwhile, I also figured out an even shorter way to write it.

@timmoh closed this as completed Jun 28, 2018