mirror of
https://github.com/soxoj/maigret.git
synced 2026-05-06 14:08:59 +00:00
Fixed/Disabled sites. Update requirements.txt (#1517)
* Fixed/Disabled sites. Update requirements.txt fixed_sites: AllRecipes, Linktree, CreativeMarket, ImgInn, Shutterstock, Contently disabled_sites: Forums.ea.com. CrunchyRoll, Windy, MetaCritic, InfosecInstitute, Armchairgm.fandom.com, Bleach.fandom.com Update requirements to prevent dependency conflicts. * Update requirements.txt Update requirements.txt to prevent dependency conflicts * Update requirements.txt * Update sites.md * fixed_sites: Armchairgm.fandom.com, Bleach.fandom.com, Battleraprus. disabled_sites: MicrosoftTechNet, club.cnews.ru, Scorcher * fixed_sites: Armchairgm.fandom.com, Bleach.fandom.com, Battleraprus. disabled_sites: MicrosoftTechNet, club.cnews.ru, Scorcher
This commit is contained in:
@@ -14,7 +14,7 @@ Rank data fetched from Alexa by domains.
|
||||
1.  [Wikipedia (https://www.wikipedia.org/)](https://www.wikipedia.org/)*: top 50, wiki*
|
||||
1.  [Reddit (https://www.reddit.com/)](https://www.reddit.com/)*: top 50, discussion, news*
|
||||
1.  [social.msdn.microsoft.com (https://social.msdn.microsoft.com)](https://social.msdn.microsoft.com)*: top 50, us*
|
||||
1.  [MicrosoftTechNet (https://social.technet.microsoft.com)](https://social.technet.microsoft.com)*: top 50, us*
|
||||
1.  [MicrosoftTechNet (https://social.technet.microsoft.com)](https://social.technet.microsoft.com)*: top 50, us*, search is disabled
|
||||
1.  [Weibo (https://weibo.com)](https://weibo.com)*: top 50, cn, networking*
|
||||
1.  [GitHubGist (https://gist.github.com)](https://gist.github.com)*: top 50, coding, sharing*
|
||||
1.  [VK (https://vk.com/)](https://vk.com/)*: top 50, ru*
|
||||
@@ -127,7 +127,7 @@ Rank data fetched from Alexa by domains.
|
||||
1.  [TripAdvisor (https://tripadvisor.com/)](https://tripadvisor.com/)*: top 500, travel*
|
||||
1.  [Academia.edu (https://www.academia.edu/)](https://www.academia.edu/)*: top 500, id*
|
||||
1.  [mercadolivre (https://www.mercadolivre.com.br)](https://www.mercadolivre.com.br)*: top 500, br*
|
||||
1.  [Crunchyroll (https://www.crunchyroll.com/)](https://www.crunchyroll.com/)*: top 500, forum, movies, us*
|
||||
1.  [Crunchyroll (https://www.crunchyroll.com/)](https://www.crunchyroll.com/)*: top 500, forum, movies, us*, search is disabled
|
||||
1.  [WordPressOrg (https://wordpress.org/)](https://wordpress.org/)*: top 500, in*
|
||||
1.  [Ameblo (https://ameblo.jp)](https://ameblo.jp)*: top 500, blog, jp*
|
||||
1.  [Unsplash (https://unsplash.com/)](https://unsplash.com/)*: top 500, art, photo*
|
||||
@@ -242,7 +242,7 @@ Rank data fetched from Alexa by domains.
|
||||
1.  [Pastebin (https://pastebin.com/)](https://pastebin.com/)*: top 5K, sharing*
|
||||
1.  [gfycat (https://gfycat.com/)](https://gfycat.com/)*: top 5K, photo, sharing*
|
||||
1.  [last.fm (https://last.fm/)](https://last.fm/)*: top 5K, music*
|
||||
1.  [Windy (https://windy.com/)](https://windy.com/)*: top 5K, in, jp, kr, pl, us*
|
||||
1.  [Windy (https://windy.com/)](https://windy.com/)*: top 5K, in, jp, kr, pl, us*, search is disabled
|
||||
1.  [profile.hatena.ne.jp (https://profile.hatena.ne.jp)](https://profile.hatena.ne.jp)*: top 5K, jp*
|
||||
1.  [BodyBuilding (https://bodyspace.bodybuilding.com/)](https://bodyspace.bodybuilding.com/)*: top 5K, us*
|
||||
1.  [community.icons8.com (https://community.icons8.com)](https://community.icons8.com)*: top 5K, forum, in*
|
||||
@@ -258,7 +258,7 @@ Rank data fetched from Alexa by domains.
|
||||
1.  [jsfiddle.net (https://jsfiddle.net)](https://jsfiddle.net)*: top 5K, coding, sharing*
|
||||
1.  [Pathofexile (https://ru.pathofexile.com)](https://ru.pathofexile.com)*: top 5K, ru, us*
|
||||
1.  [VC.ru (https://vc.ru)](https://vc.ru)*: top 5K, ru*
|
||||
1.  [metacritic (https://www.metacritic.com/)](https://www.metacritic.com/)*: top 5K, us*
|
||||
1.  [metacritic (https://www.metacritic.com/)](https://www.metacritic.com/)*: top 5K, us*, search is disabled
|
||||
1.  [DigitalOcean (https://www.digitalocean.com/)](https://www.digitalocean.com/)*: top 5K, forum, in, tech*
|
||||
1.  [jeuxvideo (http://www.jeuxvideo.com)](http://www.jeuxvideo.com)*: top 5K, fr, gaming*
|
||||
1.  [ShiftDelete (https://forum.shiftdelete.net)](https://forum.shiftdelete.net)*: top 5K, forum, tr*, search is disabled
|
||||
@@ -337,7 +337,7 @@ Rank data fetched from Alexa by domains.
|
||||
1.  [BuyMeACoffee (https://www.buymeacoffee.com/)](https://www.buymeacoffee.com/)*: top 5K, in*
|
||||
1.  [Muckrack (https://muckrack.com)](https://muckrack.com)*: top 5K, us*
|
||||
1.  [fixya (https://www.fixya.com)](https://www.fixya.com)*: top 5K, us*
|
||||
1.  [Lolchess (https://lolchess.gg/)](https://lolchess.gg/)*: top 5K, kr*
|
||||
1.  [Lolchess (https://lolchess.gg/)](https://lolchess.gg/)*: top 5K, kr*, search is disabled
|
||||
1.  [IFTTT (https://www.ifttt.com/)](https://www.ifttt.com/)*: top 5K, tech*
|
||||
1.  [www.minds.com (https://www.minds.com)](https://www.minds.com)*: top 5K, in*
|
||||
1.  [forums.imore.com (https://forums.imore.com)](https://forums.imore.com)*: top 5K, forum, us*, search is disabled
|
||||
@@ -396,7 +396,7 @@ Rank data fetched from Alexa by domains.
|
||||
1.  [About.me (https://about.me/)](https://about.me/)*: top 10K, blog, in*
|
||||
1.  [Fark (https://www.fark.com/)](https://www.fark.com/)*: top 10K, forum, news*
|
||||
1.  [ReverbNation (https://www.reverbnation.com/)](https://www.reverbnation.com/)*: top 10K, us*
|
||||
1.  [Scorcher (https://www.glavbukh.ru)](https://www.glavbukh.ru)*: top 10K, ru*
|
||||
1.  [Scorcher (https://www.glavbukh.ru)](https://www.glavbukh.ru)*: top 10K, ru*, search is disabled
|
||||
1.  [Trakt (https://www.trakt.tv/)](https://www.trakt.tv/)*: top 10K, de, fr*
|
||||
1.  [Hotcopper (https://hotcopper.com.au)](https://hotcopper.com.au)*: top 10K, au*
|
||||
1.  [Pandia (https://pandia.ru)](https://pandia.ru)*: top 10K, news, ru*
|
||||
@@ -515,7 +515,7 @@ Rank data fetched from Alexa by domains.
|
||||
1.  [forums.indiegala.com (https://forums.indiegala.com)](https://forums.indiegala.com)*: top 100K, forum, us*
|
||||
1.  [Picarto (https://ptvintern.picarto.tv)](https://ptvintern.picarto.tv)*: top 100K, art, streaming*
|
||||
1.  [Neoseeker (https://www.neoseeker.com)](https://www.neoseeker.com)*: top 100K, us*
|
||||
1.  [InfosecInstitute (https://community.infosecinstitute.com)](https://community.infosecinstitute.com)*: top 100K, us*
|
||||
1.  [InfosecInstitute (https://community.infosecinstitute.com)](https://community.infosecinstitute.com)*: top 100K, us*, search is disabled
|
||||
1.  [Armorgames (https://armorgames.com)](https://armorgames.com)*: top 100K, gaming, us*
|
||||
1.  [giters.com (https://giters.com)](https://giters.com)*: top 100K, coding*
|
||||
1.  [teamtreehouse.com (https://teamtreehouse.com)](https://teamtreehouse.com)*: top 100K, us*
|
||||
@@ -556,7 +556,7 @@ Rank data fetched from Alexa by domains.
|
||||
1.  [DonationsAlerts (https://www.donationalerts.com/)](https://www.donationalerts.com/)*: top 100K, finance, ru*
|
||||
1.  [TrueAchievements (https://www.trueachievements.com)](https://www.trueachievements.com)*: top 100K, us*
|
||||
1.  [Jimdo (https://jimdosite.com/)](https://jimdosite.com/)*: top 100K, jp*
|
||||
1.  [club.cnews.ru (https://club.cnews.ru/)](https://club.cnews.ru/)*: top 100K, blog, ru*
|
||||
1.  [club.cnews.ru (https://club.cnews.ru/)](https://club.cnews.ru/)*: top 100K, blog, ru*, search is disabled
|
||||
1.  [PSNProfiles.com (https://psnprofiles.com/)](https://psnprofiles.com/)*: top 100K, gaming*, search is disabled
|
||||
1.  [donorbox (https://donorbox.org)](https://donorbox.org)*: top 100K, finance*
|
||||
1.  [Sbazar.cz (https://www.sbazar.cz/)](https://www.sbazar.cz/)*: top 100K, cz, shopping*
|
||||
@@ -3100,20 +3100,20 @@ Rank data fetched from Alexa by domains.
|
||||
1.  [ngl.link (https://ngl.link)](https://ngl.link)*: top 100M, q&a*
|
||||
1.  [bitpapa.com (https://bitpapa.com)](https://bitpapa.com)*: top 100M, crypto*
|
||||
|
||||
The list was updated at (2023-10-27 19:46:13.899883 UTC)
|
||||
The list was updated at (2024-05-13 20:09:33.626841+00:00 UTC)
|
||||
## Statistics
|
||||
|
||||
Enabled/total sites: 2802/3096 = 90.5%
|
||||
Enabled/total sites: 2794/3096 = 90.25%
|
||||
|
||||
Incomplete message checks: 447/2802 = 15.95% (false positive risks)
|
||||
Incomplete message checks: 438/2794 = 15.68% (false positive risks)
|
||||
|
||||
Status code checks: 720/2802 = 25.7% (false positive risks)
|
||||
Status code checks: 722/2794 = 25.84% (false positive risks)
|
||||
|
||||
False positive risk (total): 41.65%
|
||||
False positive risk (total): 41.52%
|
||||
|
||||
Top 20 profile URLs:
|
||||
- (796) `{urlMain}/index/8-0-{username} (uCoz)`
|
||||
- (294) `/{username}`
|
||||
- (295) `/{username}`
|
||||
- (221) `{urlMain}{urlSubpath}/members/?username={username} (XenForo)`
|
||||
- (158) `/user/{username}`
|
||||
- (133) `{urlMain}{urlSubpath}/member.php?username={username} (vBulletin)`
|
||||
@@ -3138,16 +3138,16 @@ Top 20 tags:
|
||||
- (279) `forum`
|
||||
- (49) `gaming`
|
||||
- (25) `coding`
|
||||
- (22) `photo`
|
||||
- (21) `photo`
|
||||
- (19) `news`
|
||||
- (18) `blog`
|
||||
- (16) `music`
|
||||
- (15) `music`
|
||||
- (14) `tech`
|
||||
- (13) `freelance`
|
||||
- (12) `freelance`
|
||||
- (11) `sharing`
|
||||
- (11) `art`
|
||||
- (11) `finance`
|
||||
- (10) `dating`
|
||||
- (10) `art`
|
||||
- (10) `shopping`
|
||||
- (9) `movies`
|
||||
- (8) `hobby`
|
||||
|
||||
Reference in New Issue
Block a user