Unable to retrieve the full page of a URL #1142
Comments
Could you try the latest HTTPie version (2.5.0 at the time)?
I created a simple script that uses HTTPie and then grep to parse the HTML file and retrieve only the links. It should make testing easier for you; I am attaching it as a zip file.
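The attached script itself is not reproduced in this thread. As a rough stand-in, here is a minimal Python sketch of the same idea: parse a saved HTML file and keep only the links to WAV files. The names `LinkCollector` and `extract_links` are hypothetical, and the sketch assumes the links appear as ordinary `<a href="...">` attributes in the saved page, which may not hold for every Dropbox page.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag in an HTML document."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                self.links.append(value)


def extract_links(html, suffix=".wav"):
    """Return unique hrefs whose path (before any '?') ends with `suffix`."""
    parser = LinkCollector()
    parser.feed(html)
    unique = []
    for link in parser.links:
        # Ignore the query string (e.g. "?dl=0") when checking the extension.
        path = link.split("?", 1)[0]
        if path.lower().endswith(suffix) and link not in unique:
            unique.append(link)
    return unique
```

Reading `page.html` (the file written by the `http ... -o page.html` command) and passing its contents to `extract_links` would give the list of links, and `len(...)` of that list the count being compared in this thread.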
There are 50 links to WAV files in that Dropbox folder, but HTTPie only gets 31. Your output was the same as what I got with 2.5.0.
Oh, let me recheck and try to fix it.
@peterpt hey, I tried extracting the URLs manually by downloading the source HTML page locally and running the script on that file. It still shows 31 links and not 48, which is the total number of files, so I think it's something on Dropbox's side and not HTTPie 😅
The reason there are only 30-something entries with HTTPie and 50-something in the browser is that once the page is fully loaded, the browser starts executing JavaScript, which does light pagination to load more entries. If you want to test this yourself, simply block the … I am afraid this is something HTTPie does not support, but it can be achieved by other means (e.g. using a headless browser via Selenium to fully load the page and then do the extraction).
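A minimal sketch of the headless-browser route mentioned above, assuming Selenium 4 and a local Chrome install. The function names, the scroll count, and the pause are hypothetical choices, and whether scrolling is what triggers Dropbox's pagination is an assumption, not something confirmed in this thread.

```python
from urllib.parse import urlparse


def fetch_rendered_hrefs(url, scrolls=5, pause=2.0):
    """Load `url` in headless Chrome, scroll a few times, return all hrefs."""
    # Imported lazily so only_wav() below works without Selenium installed.
    import time
    from selenium import webdriver
    from selenium.webdriver.chrome.options import Options
    from selenium.webdriver.common.by import By

    options = Options()
    options.add_argument("--headless=new")
    driver = webdriver.Chrome(options=options)
    try:
        driver.get(url)
        # Assumption: scrolling to the bottom makes the page load more entries.
        for _ in range(scrolls):
            driver.execute_script("window.scrollTo(0, document.body.scrollHeight);")
            time.sleep(pause)
        anchors = driver.find_elements(By.TAG_NAME, "a")
        hrefs = [a.get_attribute("href") for a in anchors]
        return [h for h in hrefs if h]
    finally:
        driver.quit()


def only_wav(hrefs):
    """Keep unique links whose URL path ends in .wav, preserving order."""
    unique = []
    for href in hrefs:
        if urlparse(href).path.lower().endswith(".wav") and href not in unique:
            unique.append(href)
    return unique
```

Calling `only_wav(fetch_rendered_hrefs(url))` with the Dropbox folder URL would then give the links visible after the JavaScript has run. Fixed sleeps are the simplest thing that could work here; an explicit wait on the number of entries would be more robust.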
I am using HTTPie 0.9.8, and when I try to download a Dropbox web page, the page is only partially downloaded.
Using this Dropbox link:
https://www.dropbox.com/sh/erv1tycztizfvyd/AADeXwemV9sK37MSHqxmYz_5a?dl=0
and using the command:
http https://www.dropbox.com/sh/erv1tycztizfvyd/AADeXwemV9sK37MSHqxmYz_5a?dl=0 -o page.html
I get links for 30 files, whereas the folder actually contains 50 files when opened in a normal web browser like Firefox or Chrome.
Any ideas?