Skip to content

Commit

Permalink
Added new sites (#480)
Browse files Browse the repository at this point in the history
  • Loading branch information
soxoj committed May 14, 2022
1 parent e88d71d commit 246c770
Show file tree
Hide file tree
Showing 2 changed files with 70 additions and 17 deletions.
56 changes: 53 additions & 3 deletions maigret/resources/data.json
Original file line number Diff line number Diff line change
Expand Up @@ -11490,7 +11490,8 @@
"url": "https://psnprofiles.com/{username}",
"errorUrl": "https://psnprofiles.com/?psnId={username}",
"usernameClaimed": "blue",
"usernameUnclaimed": "noonewouldeverusethis"
"usernameUnclaimed": "noonewouldeverusethis",
"disabled": true
},
"Packagist": {
"tags": [
Expand Down Expand Up @@ -29145,6 +29146,16 @@
"forum"
]
},
"mindmachine.ru": {
"urlSubpath": "/forum",
"urlMain": "https://mindmachine.ru/",
"engine": "phpBB/Search",
"usernameClaimed": "\u0410\u043b\u0435\u043a\u0441\u0430\u043d\u0434\u0440_",
"usernameUnclaimed": "noonewouldeverusethis7",
"tags": [
"forum"
]
},
"holodforum.ru": {
"urlMain": "https://holodforum.ru",
"engine": "vBulletin",
Expand Down Expand Up @@ -31286,11 +31297,14 @@
"usernameUnclaimed": "noonewouldeverusethis7",
"url": "https://encyc.org/wiki/User:{username}"
},
"es.pixilart.com": {
"Pixilart": {
"tags": [
"art"
],
"checkType": "status_code",
"usernameClaimed": "adam",
"usernameUnclaimed": "noonewouldeverusethis7",
"url": "https://es.pixilart.com/{username}"
"url": "https://pixilart.com/{username}"
},
"eve.community": {
"checkType": "status_code",
Expand Down Expand Up @@ -32067,6 +32081,42 @@
"tags": [
"coding"
]
},
"dolap": {
"absenceStrs": [
" role="
],
"presenseStrs": [
"setEmail"
],
"url": "https://dolap.com/profil/{username}",
"urlMain": "https://dolap.com",
"usernameClaimed": "burcakmeric",
"usernameUnclaimed": "noonewouldeverusethis7",
"checkType": "message",
"alexaRank": 46439,
"tags": [
"shopping",
"tr"
]
},
"Gardrops": {
"absenceStrs": [
"> <title> Gardrops"
],
"presenseStrs": [
"/reviews"
],
"url": "https://www.gardrops.com/{username}",
"urlMain": "https://www.gardrops.com",
"usernameClaimed": "blue",
"usernameUnclaimed": "noonewouldeverusethis7",
"checkType": "message",
"alexaRank": 114405,
"tags": [
"shopping",
"tr"
]
}
},
"engines": {
Expand Down
31 changes: 17 additions & 14 deletions sites.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@

## List of supported sites (search methods): total 2966
## List of supported sites (search methods): total 2969

Rank data fetched from Alexa by domains.

Expand Down Expand Up @@ -335,7 +335,7 @@ Rank data fetched from Alexa by domains.
1. ![](https://www.google.com/s2/favicons?domain=https://pedsovet.su/) [Pedsovet (https://pedsovet.su/)](https://pedsovet.su/)*: top 10K, ru*, search is disabled
1. ![](https://www.google.com/s2/favicons?domain=https://www.kwork.ru/) [kwork (https://www.kwork.ru/)](https://www.kwork.ru/)*: top 10K, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://about.me/) [About.me (https://about.me/)](https://about.me/)*: top 10K, blog, in*
1. ![](https://www.google.com/s2/favicons?domain=https://www.fark.com/) [Fark (https://www.fark.com/)](https://www.fark.com/)*: top 10K, forums, news*
1. ![](https://www.google.com/s2/favicons?domain=https://www.fark.com/) [Fark (https://www.fark.com/)](https://www.fark.com/)*: top 10K, forum, news*
1. ![](https://www.google.com/s2/favicons?domain=https://www.reverbnation.com/) [ReverbNation (https://www.reverbnation.com/)](https://www.reverbnation.com/)*: top 10K, us*
1. ![](https://www.google.com/s2/favicons?domain=https://www.glavbukh.ru) [Scorcher (https://www.glavbukh.ru)](https://www.glavbukh.ru)*: top 10K, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://www.trakt.tv/) [Trakt (https://www.trakt.tv/)](https://www.trakt.tv/)*: top 10K, de, fr*
Expand Down Expand Up @@ -468,7 +468,7 @@ Rank data fetched from Alexa by domains.
1. ![](https://www.google.com/s2/favicons?domain=https://www.trueachievements.com) [TrueAchievements (https://www.trueachievements.com)](https://www.trueachievements.com)*: top 100K, us*
1. ![](https://www.google.com/s2/favicons?domain=https://jimdosite.com/) [Jimdo (https://jimdosite.com/)](https://jimdosite.com/)*: top 100K, jp*
1. ![](https://www.google.com/s2/favicons?domain=https://club.cnews.ru/) [club.cnews.ru (https://club.cnews.ru/)](https://club.cnews.ru/)*: top 100K, blog, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://psnprofiles.com/) [PSNProfiles.com (https://psnprofiles.com/)](https://psnprofiles.com/)*: top 100K, gaming*
1. ![](https://www.google.com/s2/favicons?domain=https://psnprofiles.com/) [PSNProfiles.com (https://psnprofiles.com/)](https://psnprofiles.com/)*: top 100K, gaming*, search is disabled
1. ![](https://www.google.com/s2/favicons?domain=https://donorbox.org) [donorbox (https://donorbox.org)](https://donorbox.org)*: top 100K, finance*
1. ![](https://www.google.com/s2/favicons?domain=https://www.sbazar.cz/) [Sbazar.cz (https://www.sbazar.cz/)](https://www.sbazar.cz/)*: top 100K, cz, shopping*
1. ![](https://www.google.com/s2/favicons?domain=https://www.euro-football.ru) [EuroFootball (https://www.euro-football.ru)](https://www.euro-football.ru)*: top 100K, ru*
Expand Down Expand Up @@ -621,6 +621,7 @@ Rank data fetched from Alexa by domains.
1. ![](https://www.google.com/s2/favicons?domain=https://forum.dwg.ru/) [dwg (https://forum.dwg.ru/)](https://forum.dwg.ru/)*: top 100K, forum, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://stereo.ru/) [Stereo (https://stereo.ru/)](https://stereo.ru/)*: top 100K, nl, ru*
1. ![](https://www.google.com/s2/favicons?domain=http://lurkmore.to) [Lurkmore (http://lurkmore.to)](http://lurkmore.to)*: top 100K, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://dolap.com) [dolap (https://dolap.com)](https://dolap.com)*: top 100K, shopping, tr*
1. ![](https://www.google.com/s2/favicons?domain=https://www.webnode.cz/) [WebNode (https://www.webnode.cz/)](https://www.webnode.cz/)*: top 100K, cz, us*
1. ![](https://www.google.com/s2/favicons?domain=http://promodj.com/) [PromoDJ (http://promodj.com/)](http://promodj.com/)*: top 100K, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://www.sythe.org) [Sythe (https://www.sythe.org)](https://www.sythe.org)*: top 100K, ca, forum, us*
Expand Down Expand Up @@ -782,6 +783,7 @@ Rank data fetched from Alexa by domains.
1. ![](https://www.google.com/s2/favicons?domain=https://php.ru/forum/) [php.ru (https://php.ru/forum/)](https://php.ru/forum/)*: top 10M, forum, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://noblogs.org/) [Noblogs (https://noblogs.org/)](https://noblogs.org/)*: top 10M, blog*
1. ![](https://www.google.com/s2/favicons?domain=https://forums.eagle.ru) [forums.eagle.ru (https://forums.eagle.ru)](https://forums.eagle.ru)*: top 10M, ca, forum, gaming, gb, in, us*, search is disabled
1. ![](https://www.google.com/s2/favicons?domain=https://www.gardrops.com) [Gardrops (https://www.gardrops.com)](https://www.gardrops.com)*: top 10M, shopping, tr*
1. ![](https://www.google.com/s2/favicons?domain=https://schlock.ru/) [Schlock (https://schlock.ru/)](https://schlock.ru/)*: top 10M, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://monitoringminecraft.ru) [monitoringminecraft.ru (https://monitoringminecraft.ru)](https://monitoringminecraft.ru)*: top 10M, gaming*
1. ![](https://www.google.com/s2/favicons?domain=https://www.stratege.ru) [Stratege (https://www.stratege.ru)](https://www.stratege.ru)*: top 10M, forum, gaming, news, ru*
Expand Down Expand Up @@ -2688,6 +2690,7 @@ Rank data fetched from Alexa by domains.
1. ![](https://www.google.com/s2/favicons?domain=https://wigle.net) [Wigle (https://wigle.net)](https://wigle.net)*: top 100M, forum*
1. ![](https://www.google.com/s2/favicons?domain=https://camas.github.io/reddit-search/) [Reddit Search (Pushshift) (https://camas.github.io/reddit-search/)](https://camas.github.io/reddit-search/)*: top 100M, discussion, news*
1. ![](https://www.google.com/s2/favicons?domain=https://pornsavant.com) [pornsavant.com (https://pornsavant.com)](https://pornsavant.com)*: top 100M, forum*
1. ![](https://www.google.com/s2/favicons?domain=https://mindmachine.ru/) [mindmachine.ru (https://mindmachine.ru/)](https://mindmachine.ru/)*: top 100M, forum*
1. ![](https://www.google.com/s2/favicons?domain=https://holodforum.ru) [holodforum.ru (https://holodforum.ru)](https://holodforum.ru)*: top 100M, forum*
1. ![](https://www.google.com/s2/favicons?domain=https://weibo.com) [Weibo (https://weibo.com)](https://weibo.com)*: top 100M, cn, networking*
1. ![](https://www.google.com/s2/favicons?domain=http://profile.hatena.com) [Hatena (http://profile.hatena.com)](http://profile.hatena.com)*: top 100M, bookmarks, jp*
Expand Down Expand Up @@ -2846,7 +2849,7 @@ Rank data fetched from Alexa by domains.
1. ![](https://www.google.com/s2/favicons?domain=) [en.uncyclopedia.co ()]()*: top 100M*
1. ![](https://www.google.com/s2/favicons?domain=) [en.wikifur.com ()]()*: top 100M*
1. ![](https://www.google.com/s2/favicons?domain=) [encyc.org ()]()*: top 100M*
1. ![](https://www.google.com/s2/favicons?domain=) [es.pixilart.com ()]()*: top 100M*
1. ![](https://www.google.com/s2/favicons?domain=) [Pixilart ()]()*: top 100M, art*
1. ![](https://www.google.com/s2/favicons?domain=) [eve.community ()]()*: top 100M*
1. ![](https://www.google.com/s2/favicons?domain=) [exploretalent.com ()]()*: top 100M*
1. ![](https://www.google.com/s2/favicons?domain=) [fandalism.com ()]()*: top 100M*
Expand Down Expand Up @@ -2970,20 +2973,20 @@ Rank data fetched from Alexa by domains.
1. ![](https://www.google.com/s2/favicons?domain=) [zeldadungeon.net ()]()*: top 100M*
1. ![](https://www.google.com/s2/favicons?domain=) [zoig.com ()]()*: top 100M*

The list was updated at (2022-05-14 07:36:46.496379 UTC)
The list was updated at (2022-05-14 08:48:39.154815 UTC)
## Statistics

Enabled/total sites: 2795/2966 = 94.23%
Enabled/total sites: 2797/2969 = 94.21%

Incomplete message checks: 511/2795 = 18.28% (false positive risks)
Incomplete message checks: 511/2797 = 18.27% (false positive risks)

Status code checks: 733/2795 = 26.23% (false positive risks)
Status code checks: 733/2797 = 26.21% (false positive risks)

False positive risk (total): 44.510000000000005%
False positive risk (total): 44.480000000000004%

Top 20 profile URLs:
- (796) `{urlMain}/index/8-0-{username} (uCoz)`
- (289) `/{username}`
- (290) `/{username}`
- (221) `{urlMain}{urlSubpath}/members/?username={username} (XenForo)`
- (160) `/user/{username}`
- (134) `{urlMain}{urlSubpath}/member.php?username={username} (vBulletin)`
Expand All @@ -2993,7 +2996,7 @@ Top 20 profile URLs:
- (87) `{urlMain}/u/{username}/summary (Discourse)`
- (54) `/wiki/User:{username}`
- (48) `/@{username}`
- (44) `{urlMain}{urlSubpath}/search.php?author={username} (phpBB/Search)`
- (45) `{urlMain}{urlSubpath}/search.php?author={username} (phpBB/Search)`
- (42) `SUBDOMAIN`
- (41) `/members/?username={username}`
- (29) `/members/{username}`
Expand All @@ -3004,8 +3007,8 @@ Top 20 profile URLs:
- (17) `/search.php?keywords=&terms=all&author={username}`

Top 20 tags:
- (293) `NO_TAGS` (non-standard)
- (271) `forum`
- (292) `NO_TAGS` (non-standard)
- (272) `forum`
- (50) `gaming`
- (24) `photo`
- (24) `coding`
Expand All @@ -3016,9 +3019,9 @@ Top 20 tags:
- (13) `freelance`
- (12) `sharing`
- (12) `finance`
- (11) `art`
- (11) `shopping`
- (10) `dating`
- (10) `art`
- (9) `hobby`
- (9) `movies`
- (7) `sport`
Expand Down

0 comments on commit 246c770

Please sign in to comment.