Add more comic strips #86

ArtskydJ · 2018-08-12T03:46:35Z

Before you write a scraper for comicsrss, please know that I don't want comicsrss to have some types of comic strips.

I don't want comicsrss to have sexually-suggestive comics. For example, I've considered killing the rss feed for 9 Chickweek Lane, and I still might kill it someday. I'm going to avoid adding anything to comicsrss that's more suggestive than that.

I might kill off political comics. I haven't yet, but I've been strongly considering it for a while now. Internet politics discussions tend to be tribal and echo-chambery, but political comics step that up a few notches.

List of comic strips/websites that folks have requested, and who requested them

Planned:

Dilbert http://dilbert.com/
Arcamax https://www.arcamax.com/comics
Comics Kingdom https://www.comicskingdom.com/
Creators.com https://www.creators.com/categories/comics
- Mike P (email) - Spectickles
The Far Side https://www.thefarside.com
- Brian W (email)
- Coleman (email)
Ctrl+Alt+Delete https://cad-comic.com/feed/
- Milan A (email)

ghost · 2018-11-15T09:09:07Z

I'd love to see some Comics Kingdom strips added, if possible. (For me, personally, mainly Bizarro, Rhymes with Orange, and Darrin Bell.)

ArtskydJ · 2018-11-24T20:12:26Z

Arcamax has Bizarro, Dilbert, and Rhymes with Orange, and Darrin Bell.

Both Comics Kingdom, and Arcamax look like they will be much more difficult to scrape than gocomics.

ArtskydJ · 2018-11-24T20:13:07Z

Added Dilbert today.

ArtskydJ · 2019-01-04T15:41:25Z

I don't remember why I thought Arcamax would be particularly difficult. It doesn't look like it will be that hard...

<a class="prev" href="/thefunnies/brilliantmindofedisonlee/s-2160999" title="Brilliant Mind of Edison Lee 1/3/2019"><span class="entypo-left-open"></span></a>
  <span class="cur">January  4</span>
<a class="next-off" href="#"><span class="entypo-right-open"></span></a>

<!-- ... -->

<figure class="comic">
  <img id="comic-zoom" data-zoom-image="/newspics/168/16885/1688589.gif" src="/newspics/168/16885/1688589.gif"  data-width="600" data-height="187" alt="" class="img-responsive the-comic" title="click or tap to zoom" />
  <cite class="comic-copyright">(c) 2019 John Hambrock.  Dist. by King Features Syndicate, Inc.</cite>
</figure>

Hopefully I'll get around to it within a few weeks.

infinitytec · 2019-06-12T13:31:49Z

Could I request Sherman's Lagoon and Freefall (the latter is a webcomic found at freefall.purrsia.com)?

ArtskydJ · 2019-06-12T14:35:34Z

Sherman's lagoon is on Comics Kingdom. If/when I add comics Kingdom, I can @ you in this thread.

I doubt I'll add Freefall unless it is part of a larger site like Comics Kingdom or Arcamax. If there's enough demand for it, I might add it.

Or you could look into adding it similar to dilbert was added:
https://github.com/ArtskydJ/comicsrss.com/blob/gh-pages/_generator/scraper-dilbert/index.js
There isn't really an API for making a scraper... ☹️

This is what I did for dilbert (and the process would be similar on freefall):

Grab a page that shows multiple comics, including the latest comic
a. For dilbert it was https://dilbert.com
b. For freefall it might be http://freefall.purrsia.com/lastthree.htm
Parse the HTML to turn it into an array like this:

[
    {
        "titleAuthorDate": "Freefall by Tugrik for Wednesday 6/12/2019",
        "url": "http://freefall.purrsia.com/ff3300/fc03290.htm",
        "date": "2019-06-12",
        "comicImageUrl": "http://freefall.purrsia.com/ff3300/fc03290.png"
    },
    ...
]

Open the cached version of that array, and merge them together. (If I don't have the latest comic in the cached array, then I need to push it onto the array.)
Write the cached file to disk.
Integrate it with the rest of the system. (If you do everything else I would be more than happy to integrate your scraper.)

infinitytec · 2019-06-12T15:57:40Z

Thanks for the information! I'll look into it and see what I can do!

ArtskydJ · 2019-06-24T13:28:23Z

I made an API and published it in the README.

jgbishop · 2020-01-01T14:42:56Z

Any progress on this? I've looked into scraping Comics Kingdom in the past year myself, and it's pretty difficult. Lots of the page gets loaded dynamically when first visited in a web browser. The publishers are clearly trying their best to prevent scraping, but my scraping knowledge is fairly limited when it comes to dynamic data. Maybe the arcamax website would be easier?

ArtskydJ · 2020-01-08T05:54:15Z

@jgbishop Very little progress. You can see in _generator/site-scrapers/ that there are 2 Work In Progress folders. I haven't done anything since then.

Getting a functional scraper is probably around 2-10 hours of work. (Depending on how smoothly it goes, and if you run into any issues, like rate-limiting.) The reason that I haven't made another site scraper is not because of a technical issue blocking the way. It's just I haven't made it a priority.

And I personally don't have a ton of incentive to expand comicsrss since it does all that I need. I still want to scrape more sites.

If you have a specific comic strip that you're wanting, you could try making a scraper just for it, instead of the entire arcamax/comics kingdom site. And that might be a nice starting point for me to expand it to the whole site.

One more thing to note is that if/when arcamax or comics kingdom is added, the site generator will have to avoid making two entries when a comic is in both gocomics.com and the added site.

ArtskydJ · 2020-06-23T14:50:20Z

@jgbishop I finally added Arcamax comics.

jgbishop · 2020-06-23T15:06:24Z

Woo-hoo! Thanks! 👏 🍰

ghost · 2020-07-05T12:50:17Z

Beetle Bailey and Hagar the Horrible, at last!

infinitytec · 2021-10-14T12:01:10Z

Well, I may have figured out something for Comics Kingdom: https://jsfiddle.net/p0tojns1/1/

Not a full scraper, and only for Sherman's Lagoon, but it may help.

ArtskydJ · 2021-10-14T16:57:18Z

Interesting...

Earlier, I'd decided not to write a scraper for Comics Kingdom, because I remembered Comics Kingdom being very dynamic. But it looks quite do-able to scrape that site now?

So I'm now planning to write a scraper for Comics Kingdom. I'm not promising anything. 😁 Difficulties might come up where I change my mind again, and abandon Comics Kingdom again. But I hope to get it working!

jalberto · 2021-11-15T12:40:23Z

I would like to suggest https://workchronicles.com

ArtskydJ · 2021-11-16T15:31:14Z

I would like to suggest workchronicles.com

They already have an RSS feed: https://workchronicles.com/feed/

jalberto · 2021-11-16T15:35:13Z

Totally missed it, thanks

…

On Tue, 16 Nov 2021 at 16:31, Joseph Dykstra ***@***.***> wrote: I would like to suggest workchronicles.com They already have an RSS feed: https://workchronicles.com/feed/ — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#86 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAAYMV33XI4D6YYNC73DR3LUMJ2MZANCNFSM4FPFSS4Q> . Triage notifications on the go with GitHub Mobile for iOS <https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675> or Android <https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.

twizzayy · 2022-04-07T20:58:20Z

Cant wait for The Far side to be added. Thanks for this awesome resource. :)

infinitytec · 2022-07-02T13:45:33Z

Hey, looks like Sherman's Lagoon is now on GoComics so it's being scraped!

ArtskydJ · 2022-08-09T18:29:43Z

I added Comics Kingdom strips to https://www.comicsrss.com/

@infinitytec

tylerbenson · 2023-05-25T16:11:12Z

Would it be difficult to add support for https://tinyview.com/ and https://www.webtoons.com/ hosted comics?

Webtoons has an RSS feed, but usually only shows the first pane of the comic.

Thanks!

tylerbenson · 2023-10-27T03:34:46Z

I tried to add additional details for tinyview: #141.

ArtskydJ · 2023-10-27T15:21:24Z

I just updated the original post.

Webtoons has some "mature"-rated comics, which I don't want on comicsrss. The "young adult"-rated comics varied a lot in their suggestiveness. Webtoons, by nature of its user-generated content, is difficult to categorize. If someone wrote a scraper for webtoons, even with the "mature"-rated comics filtered out, I'm not sure if I'd merge it into comicsrss.

I'd probably merge a scraper for tinyview. Most seemed fine. Maybe I'd filter out "Eggs n' Ben", IDK.

tylerbenson · 2023-10-27T20:28:14Z

Makes sense... For the record, I was interested in some of the family friendly cartoons for each, and I totally respect your desire to keep things clean. (I've sent my teen son to your site to find comics to read.)

ArtskydJ added the high-priority label Mar 29, 2019

ArtskydJ mentioned this issue May 22, 2019

Some gocomics strips are not showing up #107

Closed

Repository owner deleted a comment May 22, 2019

ArtskydJ mentioned this issue Jun 12, 2019

Make a scraper API #109

Closed

ArtskydJ pinned this issue Jul 17, 2019

ArtskydJ changed the title ~~Add other comic strips~~ Add more comic strips Jul 17, 2019

ArtskydJ removed the high-priority label Jun 23, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add more comic strips #86

Add more comic strips #86

ArtskydJ commented Aug 12, 2018 •

edited

ghost commented Nov 15, 2018

ArtskydJ commented Nov 24, 2018

ArtskydJ commented Nov 24, 2018

ArtskydJ commented Jan 4, 2019

infinitytec commented Jun 12, 2019

ArtskydJ commented Jun 12, 2019

infinitytec commented Jun 12, 2019

ArtskydJ commented Jun 24, 2019

jgbishop commented Jan 1, 2020

ArtskydJ commented Jan 8, 2020

ArtskydJ commented Jun 23, 2020

jgbishop commented Jun 23, 2020

ghost commented Jul 5, 2020

infinitytec commented Oct 14, 2021

ArtskydJ commented Oct 14, 2021

jalberto commented Nov 15, 2021

ArtskydJ commented Nov 16, 2021

jalberto commented Nov 16, 2021 via email

twizzayy commented Apr 7, 2022

infinitytec commented Jul 2, 2022

ArtskydJ commented Aug 9, 2022

tylerbenson commented May 25, 2023

tylerbenson commented Oct 27, 2023

ArtskydJ commented Oct 27, 2023

tylerbenson commented Oct 27, 2023 •

edited

Add more comic strips #86

Add more comic strips #86

Comments

ArtskydJ commented Aug 12, 2018 • edited

ghost commented Nov 15, 2018

ArtskydJ commented Nov 24, 2018

ArtskydJ commented Nov 24, 2018

ArtskydJ commented Jan 4, 2019

infinitytec commented Jun 12, 2019

ArtskydJ commented Jun 12, 2019

infinitytec commented Jun 12, 2019

ArtskydJ commented Jun 24, 2019

jgbishop commented Jan 1, 2020

ArtskydJ commented Jan 8, 2020

ArtskydJ commented Jun 23, 2020

jgbishop commented Jun 23, 2020

ghost commented Jul 5, 2020

infinitytec commented Oct 14, 2021

ArtskydJ commented Oct 14, 2021

jalberto commented Nov 15, 2021

ArtskydJ commented Nov 16, 2021

jalberto commented Nov 16, 2021 via email

twizzayy commented Apr 7, 2022

infinitytec commented Jul 2, 2022

ArtskydJ commented Aug 9, 2022

tylerbenson commented May 25, 2023

tylerbenson commented Oct 27, 2023

ArtskydJ commented Oct 27, 2023

tylerbenson commented Oct 27, 2023 • edited

ArtskydJ commented Aug 12, 2018 •

edited

tylerbenson commented Oct 27, 2023 •

edited