Mtlarchive.com parser #1312

Closed
TheFirstProgenitor opened this issue May 12, 2024 · 11 comments

Comments

@TheFirstProgenitor

Can you please create a parser for mtlarchive.com?

@Kiradien
Collaborator

Kiradien commented May 12, 2024

Looks like this site is also behind Cloudflare (CF) and hits the same 403 errors when creating a parser that Fanfiction.net currently has.

The generic parser parameters look like they would have worked fine if WebToEpub (w2e) could open the page.

The fix for #1306 might need to be a bit more far-reaching than originally intended.

@dteviot
Owner

dteviot commented May 12, 2024

@Kiradien

Actually, looking at the site, it would need a custom parser. The chapter content is not in the HTML pages; instead, it's JSON obtained by a REST call. (So I don't know why they're bothering with CF.)
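
To illustrate why a generic HTML parser would not be enough here, the following is a minimal sketch of turning a chapter's JSON payload into HTML. It assumes a hypothetical payload shape `{ title, content }` where `content` is plain text with one paragraph per line; the real field names returned by mtlarchive.com are not shown in this thread, and `escapeHtml` and `chapterJsonToHtml` are illustrative names, not WebToEpub functions.

```javascript
// Escape the characters that would otherwise be read as HTML markup.
function escapeHtml(text) {
    return text
        .replace(/&/g, "&amp;")
        .replace(/</g, "&lt;")
        .replace(/>/g, "&gt;");
}

// Convert a parsed chapter JSON object into an HTML fragment.
// ASSUMPTION: `chapter.title` and `chapter.content` are hypothetical fields.
function chapterJsonToHtml(chapter) {
    const paragraphs = chapter.content
        .split(/\r?\n/)                      // one candidate paragraph per line
        .map(line => line.trim())
        .filter(line => line.length > 0)     // drop blank lines
        .map(line => `<p>${escapeHtml(line)}</p>`);
    return `<h1>${escapeHtml(chapter.title)}</h1>\n${paragraphs.join("\n")}`;
}
```

In a real parser the object passed in would come from the REST response body rather than from the page's DOM, which is the part a generic parser cannot handle.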

@dteviot
Owner

dteviot commented May 12, 2024

Also, this is a duplicate of #1310.

@dteviot
Owner

dteviot commented May 25, 2024

@TheFirstProgenitor @Kiradien

Test versions for Firefox and Chrome have been uploaded to https://drive.google.com/drive/folders/1B_X2WcsaI_eg9yA-5bHJb8VeTZGKExl8?usp=sharing. Pick the one suitable for you, follow the "How to install from Source (for people who are not developers)" instructions at https://github.com/dteviot/WebToEpub/tree/ExperimentalTabMode#user-content-how-to-install-from-source-for-people-who-are-not-developers and let me know how it goes.
Tested with:

Note: I did not try to fetch chapters above 100, as those required a subscription.

For my notes: 171 minutes of work.
Some time was lost dealing with a bug in handling Cloudflare cookies when the site doesn't have cookies.
But most of it was due to the difficulty of the REST calls needed to get the chapters.
Having to search an array of objects for the novel ID was ugly.

@1k-7

1k-7 commented May 25, 2024

[Screenshot: Screenshot_20240525_104409_Firefox Nightly]

@1k-7

1k-7 commented May 25, 2024

Already logged in

@1k-7

1k-7 commented May 25, 2024

@dteviot it doesn't work?

@dteviot
Owner

dteviot commented May 25, 2024

@1k-7
You're using Firefox? You might need to open a chapter normally to trigger the Cloudflare "are you a human" page.
I just tested the desktop version. Got chapters 1 to 3. (Which is all I tried for.)

@dteviot
Owner

dteviot commented May 25, 2024

I did have to make a fix for https://mtlarchive.com/novel/i-can-copy-talent-novel. The novel ID was in position 23 of the array. Updated test versions are on Google Drive.
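
For illustration, a position-independent lookup could look something like the sketch below. It is not the code in the parser: `findNovelId` and the `novelId` property name are hypothetical, since the actual shape of the array is not shown in this thread. The idea is only that scanning every entry avoids depending on the ID sitting at a fixed index.

```javascript
// Return the first novel ID found in an array of mixed entries, or null.
// ASSUMPTION: `key` ("novelId" by default) is a hypothetical property name.
function findNovelId(entries, key = "novelId") {
    for (const entry of entries) {
        // Skip nulls and primitives; only objects can carry the key.
        if (entry !== null && typeof entry === "object" && key in entry) {
            return entry[key];
        }
    }
    return null;
}

// Usage: the ID is found whether it sits at index 0 or index 23.
const sample = [{ other: "data" }, null, { novelId: 1234 }];
console.log(findNovelId(sample));
```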

@1k-7

1k-7 commented May 25, 2024

@1k-7 You're using Firefox? You might need to open a chapter normally to trigger the Cloudflare "are you a human" page. I just tested the desktop version. Got chapters 1 to 3. (Which is all I tried for.)

It works...

@dteviot
Owner

dteviot commented Jun 16, 2024

@TheFirstProgenitor

Updated version (0.0.0.160) has been submitted to Firefox and Chrome stores.
Firefox version is available now.
Chrome might be available in a few hours to 21 days.

@dteviot dteviot closed this as completed Jun 16, 2024