Fixed 3 sites, disabed 3, added (#1539)

* Fixed/Disabled sites. Update requirements.txt

fixed_sites: AllRecipes, Linktree, CreativeMarket, ImgInn, Shutterstock, Contently

disabled_sites: Forums.ea.com. CrunchyRoll, Windy, MetaCritic, InfosecInstitute, Armchairgm.fandom.com, Bleach.fandom.com

Update requirements to prevent dependency conflicts.

* Update requirements.txt

Update requirements.txt to prevent dependency conflicts

* Update requirements.txt

* Update sites.md

* fixed_sites: Armchairgm.fandom.com, Bleach.fandom.com, Battleraprus. disabled_sites: MicrosoftTechNet, club.cnews.ru, Scorcher

* fixed_sites: Armchairgm.fandom.com, Bleach.fandom.com, Battleraprus. disabled_sites: MicrosoftTechNet, club.cnews.ru, Scorcher

* fixed 2 sites, disabled 22 sites, and added 1 site

* fixed 3 sites, disabled 28, added 4 sites

* update sites.md

* Added 2 more sites

* fixed 3 sites, disabled 3 sites, added 1 site

* fix Twitch. Update snapcraft.yaml. Add pyproject.toml. Remove setup.py, requirements.txt, test-requirements.txt, as they are already specified in pyproject.toml

* fix Twitch. Update snapcraft.yaml. Add pyproject.toml. Remove setup.py, requirements.txt, test-requirements.txt, as they are already specified in pyproject.toml

* fix Twitch. Update snapcraft.yaml. Add pyproject.toml. Remove setup.py, requirements.txt, test-requirements.txt, as they are already specified in pyproject.toml

* fix Twitch. Update snapcraft.yaml. Add pyproject.toml. Remove setup.py, requirements.txt, test-requirements.txt, as they are already specified in pyproject.toml

* Update sites.md

* fix Twitch. Update snapcraft.yaml. Add pyproject.toml. Remove setup.py, requirements.txt, test-requirements.txt, as they are already specified in pyproject.toml

* Update sites.md

* fix forums.drom.ru

* Add EduGeek

* Add EduGeek

* Update python-package.yml

Fix dependency installation

* Update python-package.yml

* Update python-package.yml
This commit is contained in:
Richard Mwewa
2024-05-24 14:51:27 +02:00
committed by GitHub
parent 9399737ee6
commit 034153791b
15 changed files with 2628 additions and 220 deletions
+17 -16
View File
@@ -1,5 +1,5 @@
## List of supported sites (search methods): total 3102
## List of supported sites (search methods): total 3104
Rank data fetched from Alexa by domains.
@@ -21,7 +21,7 @@ Rank data fetched from Alexa by domains.
1. ![](https://www.google.com/s2/favicons?domain=https://vk.com/) [VK (by id) (https://vk.com/)](https://vk.com/)*: top 50, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://pt.bongacams.com) [BongaCams (https://pt.bongacams.com)](https://pt.bongacams.com)*: top 50, cz, webcam*
1. ![](https://www.google.com/s2/favicons?domain=https://www.instagram.com/) [Instagram (https://www.instagram.com/)](https://www.instagram.com/)*: top 50, photo*, search is disabled
1. ![](https://www.google.com/s2/favicons?domain=https://www.twitch.tv/) [Twitch (https://www.twitch.tv/)](https://www.twitch.tv/)*: top 50, streaming*
1. ![](https://www.google.com/s2/favicons?domain=https://www.twitch.tv/) [Twitch (https://www.twitch.tv/)](https://www.twitch.tv/)*: top 50, streaming, us*
1. ![](https://www.google.com/s2/favicons?domain=https://yandex.ru/collections/) [YandexCollections API (https://yandex.ru/collections/)](https://yandex.ru/collections/)*: top 50, ru, sharing*
1. ![](https://www.google.com/s2/favicons?domain=https://stackoverflow.com) [StackOverflow (https://stackoverflow.com)](https://stackoverflow.com)*: top 50, coding*
1. ![](https://www.google.com/s2/favicons?domain=https://www.ebay.com/) [Ebay (https://www.ebay.com/)](https://www.ebay.com/)*: top 50, shopping, us*
@@ -155,7 +155,7 @@ Rank data fetched from Alexa by domains.
1. ![](https://www.google.com/s2/favicons?domain=https://duolingo.com/) [Duolingo (https://duolingo.com/)](https://duolingo.com/)*: top 1K, us*
1. ![](https://www.google.com/s2/favicons?domain=https://www.rottentomatoes.com) [Rottentomatoes (https://www.rottentomatoes.com)](https://www.rottentomatoes.com)*: top 1K, movies, us*
1. ![](https://www.google.com/s2/favicons?domain=https://www.kickstarter.com) [Kickstarter (https://www.kickstarter.com)](https://www.kickstarter.com)*: top 1K, finance, us*
1. ![](https://www.google.com/s2/favicons?domain=https://forums.ea.com) [forums.ea.com (https://forums.ea.com)](https://forums.ea.com)*: top 1K, forum, gaming, us*
1. ![](https://www.google.com/s2/favicons?domain=https://forums.ea.com) [forums.ea.com (https://forums.ea.com)](https://forums.ea.com)*: top 1K, forum, gaming, us*, search is disabled
1. ![](https://www.google.com/s2/favicons?domain=https://forums.envato.com) [Envato (https://forums.envato.com)](https://forums.envato.com)*: top 1K, au, forum, in*
1. ![](https://www.google.com/s2/favicons?domain=https://ultimate-guitar.com/) [Ultimate-Guitar (https://ultimate-guitar.com/)](https://ultimate-guitar.com/)*: top 1K, us*
1. ![](https://www.google.com/s2/favicons?domain=https://www.freelancer.com/) [Freelancer.com (https://www.freelancer.com/)](https://www.freelancer.com/)*: top 1K, freelance, us*
@@ -199,7 +199,7 @@ Rank data fetched from Alexa by domains.
1. ![](https://www.google.com/s2/favicons?domain=https://freelance.habr.com/) [Freelance.habr (https://freelance.habr.com/)](https://freelance.habr.com/)*: top 5K, freelance, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://qna.habr.com/) [Toster (https://qna.habr.com/)](https://qna.habr.com/)*: top 5K, coding, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://habr.com/) [Habr (https://habr.com/)](https://habr.com/)*: top 5K, blog, discussion, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://forums.drom.ru/) [forums.drom.ru (https://forums.drom.ru/)](https://forums.drom.ru/)*: top 5K, forum, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://www.forumsdrom.ru/) [forums.drom.ru (https://www.forumsdrom.ru/)](https://www.forumsdrom.ru/)*: top 5K, forum, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://users.software.informer.com) [SoftwareInformer (https://users.software.informer.com)](https://users.software.informer.com)*: top 5K, in*
1. ![](https://www.google.com/s2/favicons?domain=https://www.freecodecamp.org/forum/) [Freecodecamp (https://www.freecodecamp.org/forum/)](https://www.freecodecamp.org/forum/)*: top 5K, coding, education, forum*
1. ![](https://www.google.com/s2/favicons?domain=https://www.zomato.com/) [Zomato (https://www.zomato.com/)](https://www.zomato.com/)*: top 5K, geosocial, in*
@@ -298,7 +298,7 @@ Rank data fetched from Alexa by domains.
1. ![](https://www.google.com/s2/favicons?domain=https://community.getpostman.com) [community.getpostman.com (https://community.getpostman.com)](https://community.getpostman.com)*: top 5K, forum, in, tech*
1. ![](https://www.google.com/s2/favicons?domain=https://www.templatemonster.com) [TemplateMonster (https://www.templatemonster.com)](https://www.templatemonster.com)*: top 5K, coding*
1. ![](https://www.google.com/s2/favicons?domain=https://www.weforum.org) [Weforum (https://www.weforum.org)](https://www.weforum.org)*: top 5K, forum, us*
1. ![](https://www.google.com/s2/favicons?domain=https://banki.ru) [banki.ru (https://banki.ru)](https://banki.ru)*: top 5K, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://banki.ru) [banki.ru (https://banki.ru)](https://banki.ru)*: top 5K, ru*, search is disabled
1. ![](https://www.google.com/s2/favicons?domain=https://angel.co) [angel.co (https://angel.co)](https://angel.co)*: top 5K, business*
1. ![](https://www.google.com/s2/favicons?domain=https://fotostrana.ru) [fotostrana.ru (https://fotostrana.ru)](https://fotostrana.ru)*: top 5K, ru*
1. ![](https://www.google.com/s2/favicons?domain=http://www.techspot.com/community/) [techspot.com (http://www.techspot.com/community/)](http://www.techspot.com/community/)*: top 5K, forum, us*
@@ -585,7 +585,7 @@ Rank data fetched from Alexa by domains.
1. ![](https://www.google.com/s2/favicons?domain=https://www.nn.ru/) [NN.RU (https://www.nn.ru/)](https://www.nn.ru/)*: top 100K, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://cent.co/) [Cent (https://cent.co/)](https://cent.co/)*: top 100K, art, us, writing*
1. ![](https://www.google.com/s2/favicons?domain=https://www.forumophilia.com) [Forumophilia (https://www.forumophilia.com)](https://www.forumophilia.com)*: top 100K, forum, porn*
1. ![](https://www.google.com/s2/favicons?domain=https://robertsspaceindustries.com/) [Star Citizen (https://robertsspaceindustries.com/)](https://robertsspaceindustries.com/)*: top 100K, de, us*
1. ![](https://www.google.com/s2/favicons?domain=https://robertsspaceindustries.com/) [Star Citizen (https://robertsspaceindustries.com/)](https://robertsspaceindustries.com/)*: top 100K, de, us*, search is disabled
1. ![](https://www.google.com/s2/favicons?domain=https://eva.ru/) [Eva (https://eva.ru/)](https://eva.ru/)*: top 100K, ru*, search is disabled
1. ![](https://www.google.com/s2/favicons?domain=https://e621.net) [E621 (https://e621.net)](https://e621.net)*: top 100K, us*
1. ![](https://www.google.com/s2/favicons?domain=https://www.metacafe.com/) [Metacafe (https://www.metacafe.com/)](https://www.metacafe.com/)*: top 100K, in, us*, search is disabled
@@ -705,7 +705,7 @@ Rank data fetched from Alexa by domains.
1. ![](https://www.google.com/s2/favicons?domain=https://tripster.ru) [Tripster (https://tripster.ru)](https://tripster.ru)*: top 100K, de, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://forum.ss-iptv.com) [forum.ss-iptv.com (https://forum.ss-iptv.com)](https://forum.ss-iptv.com)*: top 100K, forum, tr*
1. ![](https://www.google.com/s2/favicons?domain=http://gribnikikybani.mybb.ru) [Gribnikikybani (http://gribnikikybani.mybb.ru)](http://gribnikikybani.mybb.ru)*: top 100K, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://waytothelight.mybb.ru/) [Waytothelight (https://waytothelight.mybb.ru/)](https://waytothelight.mybb.ru/)*: top 100K, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://waytothelight.mybb.ru/) [Waytothelight (https://waytothelight.mybb.ru/)](https://waytothelight.mybb.ru/)*: top 100K, ru*, search is disabled
1. ![](https://www.google.com/s2/favicons?domain=https://www.thefastlaneforum.com) [TheFastlaneForum (https://www.thefastlaneforum.com)](https://www.thefastlaneforum.com)*: top 100K, forum, us*, search is disabled
1. ![](https://www.google.com/s2/favicons?domain=https://shor.by) [shor.by (https://shor.by)](https://shor.by)*: top 100K, links*
1. ![](https://www.google.com/s2/favicons?domain=https://www.liveexpert.ru) [Liveexpert (https://www.liveexpert.ru)](https://www.liveexpert.ru)*: top 100K, ru*
@@ -828,7 +828,7 @@ Rank data fetched from Alexa by domains.
1. ![](https://www.google.com/s2/favicons?domain=https://forum.gorod.dp.ua/) [Gorod.dp.ua (https://forum.gorod.dp.ua/)](https://forum.gorod.dp.ua/)*: top 100K, de, forum, ua*
1. ![](https://www.google.com/s2/favicons?domain=https://www.homebrewtalk.com) [homebrewtalk.com (https://www.homebrewtalk.com)](https://www.homebrewtalk.com)*: top 100K*
1. ![](https://www.google.com/s2/favicons?domain=https://www.rappad.co) [Rappad (https://www.rappad.co)](https://www.rappad.co)*: top 100K, music*
1. ![](https://www.google.com/s2/favicons?domain=https://maxpark.com) [Maxpark (https://maxpark.com)](https://maxpark.com)*: top 100K, news, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://maxpark.com) [Maxpark (https://maxpark.com)](https://maxpark.com)*: top 100K, news, ru*, search is disabled
1. ![](https://www.google.com/s2/favicons?domain=https://www.fishingsib.ru/) [Fishingsib (https://www.fishingsib.ru/)](https://www.fishingsib.ru/)*: top 100K, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://miped.ru) [Miped (https://miped.ru)](https://miped.ru)*: top 100K, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://site.gipsyteam.ru/) [GipsysTeam (https://site.gipsyteam.ru/)](https://site.gipsyteam.ru/)*: top 100K, ru*
@@ -1893,7 +1893,7 @@ Rank data fetched from Alexa by domains.
1. ![](https://www.google.com/s2/favicons?domain=http://igra-online.ucoz.com) [igra-online.ucoz.com (http://igra-online.ucoz.com)](http://igra-online.ucoz.com)*: top 10M*
1. ![](https://www.google.com/s2/favicons?domain=https://joby.su) [Joby (https://joby.su)](https://joby.su)*: top 10M, freelance, ru*, search is disabled
1. ![](https://www.google.com/s2/favicons?domain=http://forum.scuba-divers.ru/) [scuba (http://forum.scuba-divers.ru/)](http://forum.scuba-divers.ru/)*: top 10M, forum, ru*
1. ![](https://www.google.com/s2/favicons?domain=https://boominfo.ru) [BoomInfo (https://boominfo.ru)](https://boominfo.ru)*: top 10M, ru, ua*
1. ![](https://www.google.com/s2/favicons?domain=https://boominfo.ru) [BoomInfo (https://boominfo.ru)](https://boominfo.ru)*: top 10M, ru, ua*, search is disabled
1. ![](https://www.google.com/s2/favicons?domain=http://musicbunker.ru) [musicbunker.ru (http://musicbunker.ru)](http://musicbunker.ru)*: top 10M*
1. ![](https://www.google.com/s2/favicons?domain=http://chastysc.ucoz.ru) [chastysc.ucoz.ru (http://chastysc.ucoz.ru)](http://chastysc.ucoz.ru)*: top 10M, ru*
1. ![](https://www.google.com/s2/favicons?domain=http://rottweiler.ucoz.ru) [rottweiler.ucoz.ru (http://rottweiler.ucoz.ru)](http://rottweiler.ucoz.ru)*: top 10M*
@@ -2128,6 +2128,7 @@ Rank data fetched from Alexa by domains.
1. ![](https://www.google.com/s2/favicons?domain=https://forum.djangoproject.co) [Djangoproject.co (https://forum.djangoproject.co)](https://forum.djangoproject.co)*: top 100M, coding, forum*
1. ![](https://www.google.com/s2/favicons?domain=https://www.domestika.org) [domestika.org (https://www.domestika.org)](https://www.domestika.org)*: top 100M, education*
1. ![](https://www.google.com/s2/favicons?domain=https://www.dublikat.shop) [Dublikat (https://www.dublikat.shop)](https://www.dublikat.shop)*: top 100M, ru*, search is disabled
1. ![](https://www.google.com/s2/favicons?domain=https://www.edugeek.net) [EduGeek (https://www.edugeek.net)](https://www.edugeek.net)*: top 100M, education, us*
1. ![](https://www.google.com/s2/favicons?domain=) [Elftown ()]()*: top 100M*
1. ![](https://www.google.com/s2/favicons?domain=) [Engadget ()]()*: top 100M*
1. ![](https://www.google.com/s2/favicons?domain=) [8tracks.com ()]()*: top 100M*
@@ -2255,6 +2256,7 @@ Rank data fetched from Alexa by domains.
1. ![](https://www.google.com/s2/favicons?domain=https://community.spiceworks.co) [SpiceWorks (https://community.spiceworks.co)](https://community.spiceworks.co)*: top 100M, forum, tech*
1. ![](https://www.google.com/s2/favicons?domain=) [Splice ()]()*: top 100M*
1. ![](https://www.google.com/s2/favicons?domain=) [Sportlerfrage ()]()*: top 100M*
1. ![](https://www.google.com/s2/favicons?domain=https://robertsspaceindustries.com/) [Star Citizens Community (https://robertsspaceindustries.com/)](https://robertsspaceindustries.com/)*: top 100M, de, us*
1. ![](https://www.google.com/s2/favicons?domain=http://statistika.ru) [Statistika (http://statistika.ru)](http://statistika.ru)*: top 100M, forum, ru*, search is disabled
1. ![](https://www.google.com/s2/favicons?domain=) [Suzuri.jp ()]()*: top 100M*
1. ![](https://www.google.com/s2/favicons?domain=) [Swapd ()]()*: top 100M*
@@ -2892,7 +2894,7 @@ Rank data fetched from Alexa by domains.
1. ![](https://www.google.com/s2/favicons?domain=) [chiefdelphi.com ()]()*: top 100M*
1. ![](https://www.google.com/s2/favicons?domain=) [choice.community ()]()*: top 100M*
1. ![](https://www.google.com/s2/favicons?domain=) [cloudromance.com ()]()*: top 100M*, search is disabled
1. ![](https://www.google.com/s2/favicons?domain=) [club.myce.com ()]()*: top 100M*
1. ![](https://www.google.com/s2/favicons?domain=) [club.myce.com ()]()*: top 100M*, search is disabled
1. ![](https://www.google.com/s2/favicons?domain=) [cnblogs.com ()]()*: top 100M*
1. ![](https://www.google.com/s2/favicons?domain=) [commons.commondreams.org ()]()*: top 100M*
1. ![](https://www.google.com/s2/favicons?domain=) [community.cartalk.com ()]()*: top 100M*
@@ -3106,17 +3108,16 @@ Rank data fetched from Alexa by domains.
1. ![](https://www.google.com/s2/favicons?domain=https://ngl.link) [ngl.link (https://ngl.link)](https://ngl.link)*: top 100M, q&a*
1. ![](https://www.google.com/s2/favicons?domain=https://bitpapa.com) [bitpapa.com (https://bitpapa.com)](https://bitpapa.com)*: top 100M, crypto*
The list was updated at (2024-05-17 19:44:47.837150+00:00 UTC)
The list was updated at (2024-05-23 19:05:28.745384+00:00 UTC)
## Statistics
Enabled/total sites: 2775/3102 = 89.46%
Enabled/total sites: 2770/3104 = 89.24%
Incomplete message checks: 424/2775 = 15.28% (false positive risks)
Incomplete message checks: 424/2770 = 15.31% (false positive risks)
Status code checks: 722/2775 = 26.02% (false positive risks)
Status code checks: 721/2770 = 26.03% (false positive risks)
False positive risk (total): 41.30%
False positive risk (total): 41.34%
Top 20 profile URLs:
- (796) `{urlMain}/index/8-0-{username} (uCoz)`