You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I think #117 should be able to handle this, but do you have a few sample lines from your log and how these should be parsed and what output you're expecting? Thanks!
Some entries from my logs, I di not validate all of them, but at least from the name they look like rss readers:
# N changes based on the amount of subscribers
"NewsBlur Feed Fetcher - N subscriber - https://www.newsblur.com/site/8128934/website (\"Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/14.0.1 Safari/605.1.15\")"
"Mozilla/5.0 (compatible; BazQux/2.4; +https://bazqux.com/fetcher; N subscribers)"
"Mozilla/5.0 (compatible; inoreader.com; N subscribers)"
"FreshRSS/1.21.0 (Linux; https://freshrss.org)"
"Newsboat/2.34.0 (Macintosh; Intel Mac OS X)"
"Newsboat/2.34.0 (Linux x86_64)"
"Newsboat/2.30.1 (FreeBSD amd64)"
"Tiny Tiny RSS/1.15.3 (http://tt-rss.org/)"
"SpaceCowboys Android RSS Reader / 2.6.0(284)"
"Mozilla/5.0 (compatible; WireReaderBot/1.0; +https://wirereader.app)"
"KOReader/2022.03.1 (https://koreader.rocks/) LuaSocket/3.0-rc1"
"NextCloud-News/1.0"
"Aggregator/2.22.000 (Android/8.0.0; universal8890)"
"Feedly/1.0 (+http://www.feedly.com/fetcher.html; 1 subscribers; like FeedFetcher-Google)"
"Vienna/8268 (Macintosh; Intel macOS 10_15_6)"
"Microsoft Office/16.0 (Windows NT 10.0; Microsoft Outlook 16.0.17029; Pro)"
"Microsoft Office Outlook 2014 (16.0.13127) Windows NT 10.0"
# not sure about this one, could be outlook too?
"Mozilla/4.0 (compatible; ms-office; MSOffice 16)"
# ???
"Rome Client (http://tinyurl.com/64t5n) Ver: UNKNOWN"
There are the options
--crawlers-only
and--ignore-crawlers
, but I'm actually interested in one particular category of "bots": rss readers.Contrary from crawlers, they represent users, but have a different behavior from browsers.
Has it been considered to add an option for filtering only rss readers; something like
--rss-only
?Would it make sense to add it to
goacces
?The text was updated successfully, but these errors were encountered: