getValidSnapshots does not find date from html code #258
Comments
The best solution here is probably "don't mess with the standard layout".
Dear @Hong-Revo, thank you for your timely, friendly comment. Could you please specify what you mean by "standard layout"? I guess the problem is that our local Apache server has a different (but standard, I assure you!) configuration from whatever web server the default MRAN is using. Therefore, when I point my browser at the right snapshot address on our local server, the page looks different (and has a different HTML structure) than when I point my browser at https://mran.microsoft.com/snapshot/. I don't know of a good, robust way to query subfolders from a webpage. Just reading the directory itself may lead to unexpected results.
Standard as in whatever mran.microsoft.com does. The code for listing snapshots could be made smarter, but I'm not going to go down the rabbit hole of parsing HTML with regex. The good news is that the next version of checkpoint won't try to get the list of snapshots every time you run
Fair enough. Is there any documentation of "whatever mran.microsoft.com does"?
One could also say that getting a list of subdirectories via HTTP is flawed from the start. The question is how to work around that easily. The obvious solution would be to use FTP instead of HTTP. Another way would be to add a file that contains index information about the available checkpoints. Of course, that file would have to be maintained, which adds overhead.
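To make the index-file idea concrete, here is a minimal sketch in R. It assumes a plain-text file with one snapshot date (YYYY-MM-DD) per line; neither that file format nor the function name read_snapshot_index is part of checkpoint or MRAN, they are made up for illustration.

```r
# Hypothetical helper (not part of checkpoint): turn the lines of a
# plain-text index file into a sorted vector of valid snapshot dates.
read_snapshot_index <- function(lines) {
  lines <- trimws(lines)                      # tolerate stray whitespace
  lines <- lines[nzchar(lines)]               # drop empty lines
  dates <- as.Date(lines, format = "%Y-%m-%d")
  format(sort(dates[!is.na(dates)]))          # discard entries that are not dates
}

# Example with in-memory lines; on a real server these would come from
# something like readLines() on the (assumed) index file URL.
read_snapshot_index(c("2014-09-18", " 2014-09-17 ", "", "not-a-date"))
```

The client then no longer depends on how the web server renders its directory listing, at the cost of keeping the index file in sync with the snapshot folders.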
With a custom snapshot server, in the call to getValidSnapshots, text[idx] becomes:
The gsub statement then fails to isolate the checkpoint dates, because the lines do not start with "<a href..." but with "...".
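One possible way to make the extraction independent of what precedes the link is to search each line for a date-shaped href instead of assuming the line starts with it. The sketch below is not the code used in checkpoint; the function name extract_snapshot_dates and both sample lines are invented for illustration, and it assumes snapshot folders are linked as href="YYYY-MM-DD/".

```r
# Hypothetical helper (not part of checkpoint): find date-shaped hrefs
# anywhere in each line of a directory listing.
extract_snapshot_dates <- function(text) {
  pattern <- "href=\"[0-9]{4}-[0-9]{2}-[0-9]{2}/?\""
  hits <- unlist(regmatches(text, gregexpr(pattern, text)))
  dates <- gsub("href=\"|/?\"", "", hits)     # keep only the date itself
  sort(unique(dates))
}

# Made-up sample lines: one that starts with the link, one where the
# link is wrapped in table markup.
plain_line <- "<a href=\"2014-09-17/\">2014-09-17/</a>"
table_line <- "<tr><td><a href=\"2014-09-18/\">2014-09-18/</a></td></tr>"
extract_snapshot_dates(c(plain_line, table_line))
```

This is still regex matching on HTML, so it only helps as long as the server links each snapshot folder by its date.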