Compare commits

...

337 Commits

Author SHA1 Message Date
Soxoj f7c7809d8d Bump to 0.4.4 (#621) 2022-09-03 14:30:24 +03:00
dependabot[bot] 80bd7f21eb Bump tqdm from 4.64.0 to 4.64.1 (#618)
Bumps [tqdm](https://github.com/tqdm/tqdm) from 4.64.0 to 4.64.1.
- [Release notes](https://github.com/tqdm/tqdm/releases)
- [Commits](https://github.com/tqdm/tqdm/compare/v4.64.0...v4.64.1)

---
updated-dependencies:
- dependency-name: tqdm
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-09-03 14:22:56 +03:00
fen0s 994d79244e add ProtonMail, disable 3 broken sites (#619)
* fixed false positives
2022-09-03 14:22:42 +03:00
dependabot[bot] 4b2d2c07bd Bump pycountry from 22.1.10 to 22.3.5 (#607)
Bumps [pycountry](https://github.com/flyingcircusio/pycountry) from 22.1.10 to 22.3.5.
- [Release notes](https://github.com/flyingcircusio/pycountry/releases)
- [Changelog](https://github.com/flyingcircusio/pycountry/blob/master/HISTORY.txt)
- [Commits](https://github.com/flyingcircusio/pycountry/compare/22.1.10...22.3.5)

---
updated-dependencies:
- dependency-name: pycountry
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-09-03 14:16:12 +03:00
dependabot[bot] 938d05f812 Bump cloudscraper from 1.2.63 to 1.2.64 (#614)
Bumps [cloudscraper](https://github.com/venomous/cloudscraper) from 1.2.63 to 1.2.64.
- [Release notes](https://github.com/venomous/cloudscraper/releases)
- [Commits](https://github.com/venomous/cloudscraper/commits)

---
updated-dependencies:
- dependency-name: cloudscraper
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-09-03 13:35:38 +03:00
OSINT Tactical 487c4e0dbf Update sites.md -Gitmemory.com suppression (#610)
* Update sites.md

* Add files via upload
2022-09-03 13:35:29 +03:00
dependabot[bot] 09dce2046a Bump pytest from 7.1.2 to 7.1.3 (#613)
Bumps [pytest](https://github.com/pytest-dev/pytest) from 7.1.2 to 7.1.3.
- [Release notes](https://github.com/pytest-dev/pytest/releases)
- [Changelog](https://github.com/pytest-dev/pytest/blob/main/CHANGELOG.rst)
- [Commits](https://github.com/pytest-dev/pytest/compare/7.1.2...7.1.3)

---
updated-dependencies:
- dependency-name: pytest
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-09-03 13:35:18 +03:00
dependabot[bot] 65963e5647 Bump pypdf2 from 2.5.0 to 2.10.4 (#606)
Bumps [pypdf2](https://github.com/py-pdf/PyPDF2) from 2.5.0 to 2.10.4.
- [Release notes](https://github.com/py-pdf/PyPDF2/releases)
- [Changelog](https://github.com/py-pdf/PyPDF2/blob/main/CHANGELOG.md)
- [Commits](https://github.com/py-pdf/PyPDF2/compare/2.5.0...2.10.4)

---
updated-dependencies:
- dependency-name: pypdf2
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-09-03 13:34:53 +03:00
dependabot[bot] 69f220a7e4 Bump pytest-asyncio from 0.18.2 to 0.19.0 (#601)
Bumps [pytest-asyncio](https://github.com/pytest-dev/pytest-asyncio) from 0.18.2 to 0.19.0.
- [Release notes](https://github.com/pytest-dev/pytest-asyncio/releases)
- [Changelog](https://github.com/pytest-dev/pytest-asyncio/blob/master/CHANGELOG.rst)
- [Commits](https://github.com/pytest-dev/pytest-asyncio/compare/v0.18.2...v0.19.0)

---
updated-dependencies:
- dependency-name: pytest-asyncio
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-09-03 13:26:13 +03:00
dependabot[bot] 722d3039dc Bump attrs from 21.4.0 to 22.1.0 (#597)
Bumps [attrs](https://github.com/python-attrs/attrs) from 21.4.0 to 22.1.0.
- [Release notes](https://github.com/python-attrs/attrs/releases)
- [Changelog](https://github.com/python-attrs/attrs/blob/main/CHANGELOG.rst)
- [Commits](https://github.com/python-attrs/attrs/compare/21.4.0...22.1.0)

---
updated-dependencies:
- dependency-name: attrs
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-09-03 13:26:01 +03:00
dependabot[bot] 420c29610d Bump flake8 from 4.0.1 to 5.0.4 (#598)
Bumps [flake8](https://github.com/pycqa/flake8) from 4.0.1 to 5.0.4.
- [Release notes](https://github.com/pycqa/flake8/releases)
- [Commits](https://github.com/pycqa/flake8/compare/4.0.1...5.0.4)

---
updated-dependencies:
- dependency-name: flake8
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-09-02 21:18:35 +03:00
dependabot[bot] 6b53fac424 Bump cloudscraper from 1.2.60 to 1.2.63 (#600)
Bumps [cloudscraper](https://github.com/venomous/cloudscraper) from 1.2.60 to 1.2.63.
- [Release notes](https://github.com/venomous/cloudscraper/releases)
- [Commits](https://github.com/venomous/cloudscraper/commits)

---
updated-dependencies:
- dependency-name: cloudscraper
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-08-27 17:11:12 +03:00
dependabot[bot] 37c54735f1 Bump chardet from 4.0.0 to 5.0.0 (#550)
Bumps [chardet](https://github.com/chardet/chardet) from 4.0.0 to 5.0.0.
- [Release notes](https://github.com/chardet/chardet/releases)
- [Commits](https://github.com/chardet/chardet/compare/4.0.0...5.0.0)

---
updated-dependencies:
- dependency-name: chardet
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-08-27 17:07:29 +03:00
dependabot[bot] 2f0a0b49f3 Bump colorama from 0.4.4 to 0.4.5 (#548)
Bumps [colorama](https://github.com/tartley/colorama) from 0.4.4 to 0.4.5.
- [Release notes](https://github.com/tartley/colorama/releases)
- [Changelog](https://github.com/tartley/colorama/blob/master/CHANGELOG.rst)
- [Commits](https://github.com/tartley/colorama/compare/0.4.4...0.4.5)

---
updated-dependencies:
- dependency-name: colorama
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-08-27 17:03:30 +03:00
dependabot[bot] 1a8b06385a Bump typing-extensions from 4.2.0 to 4.3.0 (#549)
Bumps [typing-extensions](https://github.com/python/typing_extensions) from 4.2.0 to 4.3.0.
- [Release notes](https://github.com/python/typing_extensions/releases)
- [Changelog](https://github.com/python/typing_extensions/blob/main/CHANGELOG.md)
- [Commits](https://github.com/python/typing_extensions/compare/4.2.0...4.3.0)

---
updated-dependencies:
- dependency-name: typing-extensions
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-08-27 17:03:19 +03:00
dependabot[bot] 22d7c204f8 Bump pytest-httpserver from 1.0.4 to 1.0.5 (#583)
Bumps [pytest-httpserver](https://github.com/csernazs/pytest-httpserver) from 1.0.4 to 1.0.5.
- [Release notes](https://github.com/csernazs/pytest-httpserver/releases)
- [Changelog](https://github.com/csernazs/pytest-httpserver/blob/master/CHANGES.rst)
- [Commits](https://github.com/csernazs/pytest-httpserver/compare/1.0.4...1.0.5)

---
updated-dependencies:
- dependency-name: pytest-httpserver
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-08-27 17:03:09 +03:00
fen0s a6ae0723f9 False positives fixes (#591) 2022-08-24 18:26:01 +03:00
dependabot[bot] aa4f94ac01 Bump certifi from 2022.5.18.1 to 2022.6.15 (#551)
Bumps [certifi](https://github.com/certifi/python-certifi) from 2022.5.18.1 to 2022.6.15.
- [Release notes](https://github.com/certifi/python-certifi/releases)
- [Commits](https://github.com/certifi/python-certifi/compare/2022.05.18.1...2022.06.15)

---
updated-dependencies:
- dependency-name: certifi
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-08-20 12:13:32 +03:00
fen0s 1153a9bf01 disable Instagram, fix two false positives (#578)
* Update data.json

* Update data.json

* Update data.json
2022-08-15 15:45:53 +03:00
fen0s 3d878131b9 fix false positives (#577) 2022-08-13 13:12:22 +03:00
fen0s 20746a0fc3 disable yandex music + set utf8 encoding (#562)
* Update report.py

* Update data.json

* Update data.json
2022-07-26 02:37:26 +03:00
dependabot[bot] ce062d915e Bump lxml from 4.9.0 to 4.9.1 (#538)
Bumps [lxml](https://github.com/lxml/lxml) from 4.9.0 to 4.9.1.
- [Release notes](https://github.com/lxml/lxml/releases)
- [Changelog](https://github.com/lxml/lxml/blob/master/CHANGES.txt)
- [Commits](https://github.com/lxml/lxml/compare/lxml-4.9.0...lxml-4.9.1)

---
updated-dependencies:
- dependency-name: lxml
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-07-17 12:45:37 +03:00
dependabot[bot] c057c5c478 Bump xhtml2pdf from 0.2.7 to 0.2.8 (#522)
Bumps [xhtml2pdf](https://github.com/xhtml2pdf/xhtml2pdf) from 0.2.7 to 0.2.8.
- [Release notes](https://github.com/xhtml2pdf/xhtml2pdf/releases)
- [Commits](https://github.com/xhtml2pdf/xhtml2pdf/compare/v0.2.7...v0.2.8)

---
updated-dependencies:
- dependency-name: xhtml2pdf
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-07-17 12:45:29 +03:00
dependabot[bot] eab0ec48da Bump pypdf2 from 2.0.0 to 2.5.0 (#542)
Bumps [pypdf2](https://github.com/py-pdf/PyPDF2) from 2.0.0 to 2.5.0.
- [Release notes](https://github.com/py-pdf/PyPDF2/releases)
- [Changelog](https://github.com/py-pdf/PyPDF2/blob/main/CHANGELOG.md)
- [Commits](https://github.com/py-pdf/PyPDF2/compare/2.0.0...2.5.0)

---
updated-dependencies:
- dependency-name: pypdf2
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-07-17 12:45:24 +03:00
dependabot[bot] 5b40eac230 Bump requests from 2.27.1 to 2.28.1 (#530)
Bumps [requests](https://github.com/psf/requests) from 2.27.1 to 2.28.1.
- [Release notes](https://github.com/psf/requests/releases)
- [Changelog](https://github.com/psf/requests/blob/main/HISTORY.md)
- [Commits](https://github.com/psf/requests/compare/v2.27.1...v2.28.1)

---
updated-dependencies:
- dependency-name: requests
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-07-17 12:45:15 +03:00
dependabot[bot] 2d782379ab Bump reportlab from 3.6.9 to 3.6.11 (#543)
Bumps [reportlab](http://www.reportlab.com/) from 3.6.9 to 3.6.11.

---
updated-dependencies:
- dependency-name: reportlab
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-07-17 12:45:05 +03:00
fen0s 042981d8bb Update data.json (#540)
* Update data.json

* disable false positives

* Update data.json

* Update data.json
2022-07-12 14:31:22 +03:00
fen0s 2c2017c7db Update data.json (#539) 2022-07-10 12:49:03 +03:00
fen0s 4aeba4d648 Fixes july third (#535)
* fix falsepositives on megafon

* token spotify
2022-07-06 23:57:10 +03:00
fen0s de34e29188 yazbel, aboutcar, zhihu (#531)
* fix some sites and delete abandoned

* disable aboutcar, fix zhihu, add yazbel

* yazbel quickfix

* Squashed commit of the following:

commit 932152edac2765391e0203d6e75f6bffda73d643
Author: fen0s <37670363+fen0s@users.noreply.github.com>
Date:   Fri Jul 1 17:36:58 2022 +0300

    Update data.json

* fix forumsmotri,  teamtreehouse, sourceforge, tomshardware, disable codeby

* 2 sites disasbled, 6 fixed

disabled echomsk (dead), disabled chipmaker (weird search-based username detection while usernames are searched not by exact match), fixed rutracker, kloomba, mobypicture, gamefaqs, eporner, 1337x, tried to fix myfitnesspal but didn't work
2022-07-03 11:49:48 +03:00
fen0s 0c127a97d5 Fixesjulyfirst (#533)
* fix some sites and delete abandoned

* disable aboutcar, fix zhihu, add yazbel

* yazbel quickfix

* Squashed commit of the following:

commit 932152edac2765391e0203d6e75f6bffda73d643
Author: fen0s <37670363+fen0s@users.noreply.github.com>
Date:   Fri Jul 1 17:36:58 2022 +0300

    Update data.json

* fix forumsmotri,  teamtreehouse, sourceforge, tomshardware, disable codeby

* 2 sites disasbled, 6 fixed

disabled echomsk (dead), disabled chipmaker (weird search-based username detection while usernames are searched not by exact match), fixed rutracker, kloomba, mobypicture, gamefaqs, eporner, 1337x, tried to fix myfitnesspal but didn't work
2022-07-02 18:11:05 +03:00
fen0s 11f047b1ae fix some sites and delete abandoned (#526) 2022-06-23 13:18:02 +03:00
Soxoj 43f8adef66 Downgrade pycountry due to problems with wheels
Python 3.10 related problem
2022-06-18 20:07:31 +03:00
Sergey Mamadjanov 2ffb77823d feat: add *.log & *.bak files to gitignore (#511) 2022-06-08 01:45:50 +03:00
Soxoj 7ba8af0247 Compatibility with Python 10 (#509) 2022-06-05 01:12:54 +03:00
dependabot[bot] 814544e1a0 Bump lxml from 4.8.0 to 4.9.0
Bumps [lxml](https://github.com/lxml/lxml) from 4.8.0 to 4.9.0.
- [Release notes](https://github.com/lxml/lxml/releases)
- [Changelog](https://github.com/lxml/lxml/blob/master/CHANGES.txt)
- [Commits](https://github.com/lxml/lxml/compare/lxml-4.8.0...lxml-4.9.0)

---
updated-dependencies:
- dependency-name: lxml
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
2022-06-04 17:38:36 -04:00
Soxoj 477e62a5c5 Updated sites list, added disabled Anilist 2022-06-04 17:38:28 -04:00
dependabot[bot] 0a629614c2 Bump pefile from 2021.9.3 to 2022.5.30
Bumps [pefile](https://github.com/erocarrera/pefile) from 2021.9.3 to 2022.5.30.
- [Release notes](https://github.com/erocarrera/pefile/releases)
- [Commits](https://github.com/erocarrera/pefile/compare/v2021.9.3...v2022.5.30)

---
updated-dependencies:
- dependency-name: pefile
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
2022-06-04 17:38:17 -04:00
dependabot[bot] e2d623f0d7 Bump pypdf2 from 1.28.2 to 2.0.0
Bumps [pypdf2](https://github.com/py-pdf/PyPDF2) from 1.28.2 to 2.0.0.
- [Release notes](https://github.com/py-pdf/PyPDF2/releases)
- [Changelog](https://github.com/py-pdf/PyPDF2/blob/main/CHANGELOG)
- [Commits](https://github.com/py-pdf/PyPDF2/compare/1.28.2...2.0.0)

---
updated-dependencies:
- dependency-name: pypdf2
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
2022-06-04 17:38:01 -04:00
kustermariocoding 5145bfe820 added regexchecks for realmeye and realmeye-graveyard to prevent false positives. 2022-06-01 02:12:02 +03:00
kustermariocoding 58f66f5c3c added Bezuzyteczna and Znanylekarz.pl 2022-06-01 02:12:02 +03:00
kustermariocoding 746b74238b added forum.dangerousthings.com 2022-06-01 02:12:02 +03:00
kustermariocoding ae56a927cf added Wiki.vg 2022-06-01 02:12:02 +03:00
kustermariocoding 40ed0a7535 added Watchmemore.com 2022-06-01 02:12:02 +03:00
kustermariocoding beb4d740c7 removed Anilist because it's not working properly 2022-06-01 02:12:02 +03:00
dependabot[bot] a47b6a705e Bump pypdf2 from 1.28.1 to 1.28.2 (#493)
Bumps [pypdf2](https://github.com/py-pdf/PyPDF2) from 1.28.1 to 1.28.2.
- [Release notes](https://github.com/py-pdf/PyPDF2/releases)
- [Changelog](https://github.com/py-pdf/PyPDF2/blob/1.28.2/CHANGELOG)
- [Commits](https://github.com/py-pdf/PyPDF2/compare/1.28.1...1.28.2)

---
updated-dependencies:
- dependency-name: pypdf2
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-05-26 00:59:55 +03:00
dependabot[bot] 3bfb2db6df Bump pypdf2 from 1.27.12 to 1.28.1 (#491)
Bumps [pypdf2](https://github.com/py-pdf/PyPDF2) from 1.27.12 to 1.28.1.
- [Release notes](https://github.com/py-pdf/PyPDF2/releases)
- [Changelog](https://github.com/py-pdf/PyPDF2/blob/main/CHANGELOG)
- [Commits](https://github.com/py-pdf/PyPDF2/compare/1.27.12...1.28.1)

---
updated-dependencies:
- dependency-name: pypdf2
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-05-23 17:16:11 +03:00
dependabot[bot] d30ef15a79 Bump psutil from 5.9.0 to 5.9.1 (#490)
Bumps [psutil](https://github.com/giampaolo/psutil) from 5.9.0 to 5.9.1.
- [Release notes](https://github.com/giampaolo/psutil/releases)
- [Changelog](https://github.com/giampaolo/psutil/blob/master/HISTORY.rst)
- [Commits](https://github.com/giampaolo/psutil/compare/release-5.9.0...release-5.9.1)

---
updated-dependencies:
- dependency-name: psutil
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-05-23 17:16:03 +03:00
dependabot[bot] 1ebf0ca5cf Bump certifi from 2021.10.8 to 2022.5.18.1 (#488)
Bumps [certifi](https://github.com/certifi/python-certifi) from 2021.10.8 to 2022.5.18.1.
- [Release notes](https://github.com/certifi/python-certifi/releases)
- [Commits](https://github.com/certifi/python-certifi/compare/2021.10.08...2022.05.18.1)

---
updated-dependencies:
- dependency-name: certifi
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-05-22 23:17:45 +03:00
Soxoj eaa545a2c4 Disabled sites with false positives results (#482) 2022-05-14 20:13:31 +03:00
Soxoj cbe1f09536 Added new forums, updated ranks, some utils improvements (#481)
* Added new forums, updated ranks, some utils improvements

* Updated requirements
2022-05-14 13:29:48 +03:00
Soxoj 246c770d5c Added new sites (#480) 2022-05-14 11:51:15 +03:00
Soxoj e88d71d792 New sites added, some tags/rank update (#477) 2022-05-14 10:58:27 +03:00
Soxoj 929366cc81 Improved usability of external progressbar func (#476) 2022-05-14 02:06:33 +03:00
Soxoj bb6ed59e44 Updated logic of false positive risk estimating (#475) 2022-05-10 14:54:09 +03:00
fen0s 6400d83a46 Social analyzer websites, also fixing presense strs (#471)
* add a lot of new sites from social analyzer, fix presenceStr

* add social-analyzer sites

* fix username claimed

* update site list

* Update data.json
2022-05-10 12:37:23 +03:00
dependabot[bot] 507d0dac3a Bump pyvis from 0.2.0 to 0.2.1 (#472)
Bumps [pyvis](https://github.com/WestHealth/pyvis) from 0.2.0 to 0.2.1.
- [Release notes](https://github.com/WestHealth/pyvis/releases)
- [Commits](https://github.com/WestHealth/pyvis/compare/v0.2.0...v0.2.1)

---
updated-dependencies:
- dependency-name: pyvis
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-05-10 11:06:26 +03:00
Soxoj f058ee0daf Fixed new false positives, updated sites list (#469) 2022-05-05 02:16:29 +03:00
dependabot[bot] a66c25452a Bump pypdf2 from 1.27.10 to 1.27.12 (#466)
Bumps [pypdf2](https://github.com/py-pdf/PyPDF2) from 1.27.10 to 1.27.12.
- [Release notes](https://github.com/py-pdf/PyPDF2/releases)
- [Changelog](https://github.com/py-pdf/PyPDF2/blob/main/CHANGELOG)
- [Commits](https://github.com/py-pdf/PyPDF2/compare/1.27.10...1.27.12)

---
updated-dependencies:
- dependency-name: pypdf2
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-05-03 13:29:43 +03:00
dependabot[bot] bfc682f758 Bump pypdf2 from 1.27.9 to 1.27.10 (#465)
Bumps [pypdf2](https://github.com/py-pdf/PyPDF2) from 1.27.9 to 1.27.10.
- [Release notes](https://github.com/py-pdf/PyPDF2/releases)
- [Changelog](https://github.com/py-pdf/PyPDF2/blob/main/CHANGELOG)
- [Commits](https://github.com/py-pdf/PyPDF2/compare/1.27.9...1.27.10)

---
updated-dependencies:
- dependency-name: pypdf2
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-05-02 19:10:17 +03:00
fen0s aedbe927cb fix Figma username definition, add a bunch of sites (#464)
* Add files via upload

Co-authored-by: fen0s <fen0s@example.com>
2022-05-01 19:52:20 +03:00
fen0s 340d8b45fe Add BYOND, Figma, BeatStars (#462)
* Add files via upload

* fix forums

* Add BYOND, Figma, BeatStars

Co-authored-by: fen0s <fen0s@example.com>
2022-05-01 00:45:57 +03:00
fen0s c95f0fdfbb Ubisoft forums addition (#461)
* Add files via upload

* fix forums

Co-authored-by: fen0s <fen0s@example.com>
2022-04-30 16:34:27 +03:00
dependabot[bot] a5b73d1108 Bump jinja2 from 3.1.1 to 3.1.2 (#460)
Bumps [jinja2](https://github.com/pallets/jinja) from 3.1.1 to 3.1.2.
- [Release notes](https://github.com/pallets/jinja/releases)
- [Changelog](https://github.com/pallets/jinja/blob/main/CHANGES.rst)
- [Commits](https://github.com/pallets/jinja/compare/3.1.1...3.1.2)

---
updated-dependencies:
- dependency-name: jinja2
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-04-29 15:27:02 +03:00
dependabot[bot] 6157c5ff3d Bump pytest from 7.0.1 to 7.1.2 (#457)
Bumps [pytest](https://github.com/pytest-dev/pytest) from 7.0.1 to 7.1.2.
- [Release notes](https://github.com/pytest-dev/pytest/releases)
- [Changelog](https://github.com/pytest-dev/pytest/blob/main/CHANGELOG.rst)
- [Commits](https://github.com/pytest-dev/pytest/compare/7.0.1...7.1.2)

---
updated-dependencies:
- dependency-name: pytest
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-04-27 01:15:36 +03:00
dependabot[bot] e0f0dd5d4d Bump pypdf2 from 1.27.8 to 1.27.9 (#456)
Bumps [pypdf2](https://github.com/py-pdf/PyPDF2) from 1.27.8 to 1.27.9.
- [Release notes](https://github.com/py-pdf/PyPDF2/releases)
- [Changelog](https://github.com/py-pdf/PyPDF2/blob/main/CHANGELOG)
- [Commits](https://github.com/py-pdf/PyPDF2/compare/1.27.8...1.27.9)

---
updated-dependencies:
- dependency-name: pypdf2
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-04-27 01:15:28 +03:00
Soxoj 059c8198a1 False positive fixes 24.04.22 (#455)
* Fixed some false positives
2022-04-24 17:14:07 +03:00
Soxoj 34073d12f4 XMind 8 report warning and some docs update (#452) 2022-04-23 01:28:31 +03:00
dependabot[bot] d24d80ab43 Bump pypdf2 from 1.27.7 to 1.27.8 (#450)
Bumps [pypdf2](https://github.com/py-pdf/PyPDF2) from 1.27.7 to 1.27.8.
- [Release notes](https://github.com/py-pdf/PyPDF2/releases)
- [Changelog](https://github.com/py-pdf/PyPDF2/blob/main/CHANGELOG)
- [Commits](https://github.com/py-pdf/PyPDF2/compare/1.27.7...1.27.8)

---
updated-dependencies:
- dependency-name: pypdf2
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-04-23 01:01:19 +03:00
Soxoj 123ec35569 Update bug.md 2022-04-21 10:34:57 +03:00
dependabot[bot] 73aa8b649b Bump pypdf2 from 1.27.6 to 1.27.7 (#449)
Bumps [pypdf2](https://github.com/py-pdf/PyPDF2) from 1.27.6 to 1.27.7.
- [Release notes](https://github.com/py-pdf/PyPDF2/releases)
- [Changelog](https://github.com/py-pdf/PyPDF2/blob/main/CHANGELOG)
- [Commits](https://github.com/py-pdf/PyPDF2/compare/1.27.6...1.27.7)

---
updated-dependencies:
- dependency-name: pypdf2
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-04-21 01:15:39 +03:00
dependabot[bot] 28aa74d83a Bump soupsieve from 2.3.2 to 2.3.2.post1 (#444)
Bumps [soupsieve](https://github.com/facelessuser/soupsieve) from 2.3.2 to 2.3.2.post1.
- [Release notes](https://github.com/facelessuser/soupsieve/releases)
- [Commits](https://github.com/facelessuser/soupsieve/compare/2.3.2...2.3.2.post1)

---
updated-dependencies:
- dependency-name: soupsieve
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-04-19 23:40:35 +03:00
dependabot[bot] d4780d2840 Bump typing-extensions from 4.1.1 to 4.2.0 (#447)
Bumps [typing-extensions](https://github.com/python/typing) from 4.1.1 to 4.2.0.
- [Release notes](https://github.com/python/typing/releases)
- [Changelog](https://github.com/python/typing/blob/master/typing_extensions/CHANGELOG)
- [Commits](https://github.com/python/typing/compare/4.1.1...4.2.0)

---
updated-dependencies:
- dependency-name: typing-extensions
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-04-19 23:40:24 +03:00
dependabot[bot] 4c7b6d82cf Bump pypdf2 from 1.27.4 to 1.27.6 (#448)
Bumps [pypdf2](https://github.com/py-pdf/PyPDF2) from 1.27.4 to 1.27.6.
- [Release notes](https://github.com/py-pdf/PyPDF2/releases)
- [Changelog](https://github.com/py-pdf/PyPDF2/blob/main/CHANGELOG)
- [Commits](https://github.com/py-pdf/PyPDF2/compare/1.27.4...1.27.6)

---
updated-dependencies:
- dependency-name: pypdf2
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-04-19 23:40:15 +03:00
dependabot[bot] 37d6b9a949 Bump pyvis from 0.1.9 to 0.2.0 (#443)
Bumps [pyvis](https://github.com/WestHealth/pyvis) from 0.1.9 to 0.2.0.
- [Release notes](https://github.com/WestHealth/pyvis/releases)
- [Commits](https://github.com/WestHealth/pyvis/commits/v0.2.0)

---
updated-dependencies:
- dependency-name: pyvis
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-04-19 23:39:55 +03:00
dependabot[bot] 2664094f65 Bump pypdf2 from 1.26.0 to 1.27.4 (#442)
Bumps [pypdf2](https://github.com/py-pdf/PyPDF2) from 1.26.0 to 1.27.4.
- [Release notes](https://github.com/py-pdf/PyPDF2/releases)
- [Changelog](https://github.com/py-pdf/PyPDF2/blob/main/CHANGELOG)
- [Commits](https://github.com/py-pdf/PyPDF2/compare/1.26.0...1.27.4)

---
updated-dependencies:
- dependency-name: pypdf2
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-04-14 01:51:34 +03:00
dependabot[bot] d884fea00b Bump soupsieve from 2.3.1 to 2.3.2 (#436)
Bumps [soupsieve](https://github.com/facelessuser/soupsieve) from 2.3.1 to 2.3.2.
- [Release notes](https://github.com/facelessuser/soupsieve/releases)
- [Commits](https://github.com/facelessuser/soupsieve/compare/2.3.1...2.3.2)

---
updated-dependencies:
- dependency-name: soupsieve
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-04-14 01:50:28 +03:00
dependabot[bot] 4a4fa69e93 Bump jinja2 from 3.0.3 to 3.1.1 (#441)
Bumps [jinja2](https://github.com/pallets/jinja) from 3.0.3 to 3.1.1.
- [Release notes](https://github.com/pallets/jinja/releases)
- [Changelog](https://github.com/pallets/jinja/blob/main/CHANGES.rst)
- [Commits](https://github.com/pallets/jinja/compare/3.0.3...3.1.1)

---
updated-dependencies:
- dependency-name: jinja2
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-04-14 01:50:14 +03:00
dependabot[bot] 801bc388e4 Bump tqdm from 4.63.0 to 4.64.0 (#440)
Bumps [tqdm](https://github.com/tqdm/tqdm) from 4.63.0 to 4.64.0.
- [Release notes](https://github.com/tqdm/tqdm/releases)
- [Commits](https://github.com/tqdm/tqdm/compare/v4.63.0...v4.64.0)

---
updated-dependencies:
- dependency-name: tqdm
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-04-14 01:48:13 +03:00
Soxoj 48fcfcb89b Update GH actions (#439) 2022-04-14 01:46:50 +03:00
dependabot[bot] 07db3ce463 Bump pypdf2 from 1.26.0 to 1.27.4 (#438)
Bumps [pypdf2](https://github.com/py-pdf/PyPDF2) from 1.26.0 to 1.27.4.
- [Release notes](https://github.com/py-pdf/PyPDF2/releases)
- [Changelog](https://github.com/py-pdf/PyPDF2/blob/main/CHANGELOG)
- [Commits](https://github.com/py-pdf/PyPDF2/compare/1.26.0...1.27.4)

---
updated-dependencies:
- dependency-name: pypdf2
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Soxoj <31013580+soxoj@users.noreply.github.com>
2022-04-14 01:46:03 +03:00
dependabot[bot] f9f4449079 Bump pycountry from 22.1.10 to 22.3.5 (#384)
Bumps [pycountry](https://github.com/flyingcircusio/pycountry) from 22.1.10 to 22.3.5.
- [Release notes](https://github.com/flyingcircusio/pycountry/releases)
- [Changelog](https://github.com/flyingcircusio/pycountry/blob/master/HISTORY.txt)
- [Commits](https://github.com/flyingcircusio/pycountry/compare/22.1.10...22.3.5)

---
updated-dependencies:
- dependency-name: pycountry
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-04-14 01:36:30 +03:00
dependabot[bot] 0d4236e2d4 Bump markupsafe from 2.0.1 to 2.1.1 (#389)
Bumps [markupsafe](https://github.com/pallets/markupsafe) from 2.0.1 to 2.1.1.
- [Release notes](https://github.com/pallets/markupsafe/releases)
- [Changelog](https://github.com/pallets/markupsafe/blob/main/CHANGES.rst)
- [Commits](https://github.com/pallets/markupsafe/compare/2.0.1...2.1.1)

---
updated-dependencies:
- dependency-name: markupsafe
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-04-14 01:36:22 +03:00
dependabot[bot] b2db783620 Bump reportlab from 3.6.6 to 3.6.9 (#403)
Bumps [reportlab](http://www.reportlab.com/) from 3.6.6 to 3.6.9.

---
updated-dependencies:
- dependency-name: reportlab
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-04-14 01:36:09 +03:00
dependabot[bot] b27c53b5b6 Bump xhtml2pdf from 0.2.5 to 0.2.7 (#409)
Bumps [xhtml2pdf](https://github.com/xhtml2pdf/xhtml2pdf) from 0.2.5 to 0.2.7.
- [Release notes](https://github.com/xhtml2pdf/xhtml2pdf/releases)
- [Commits](https://github.com/xhtml2pdf/xhtml2pdf/compare/0.2.5...v0.2.7)

---
updated-dependencies:
- dependency-name: xhtml2pdf
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-04-14 01:35:59 +03:00
Soxoj 6691b26674 Drop Python 3.6 support (#434) 2022-04-14 01:32:30 +03:00
Soxoj 131b96ddb3 Fixed some false positives (#433)
* Fixed some false positives

* Updated site list and statistics
2022-04-14 01:11:42 +03:00
Soxoj 0803d8ebaa Merge pull request #432 from soxoj/0.4.3
Bump to 0.4.3
2022-04-13 22:58:45 +03:00
Soxoj 19956f74ca Bump to 0.4.3 2022-04-13 22:58:21 +03:00
Soxoj dd57019c80 Merge pull request #431 from soxoj/fix-false-positives-13-04-22
Fixed actual false positives
2022-04-13 22:51:15 +03:00
Maigret autoupdate 9fb265ea85 Updated site list and statistics 2022-04-13 19:50:19 +00:00
Soxoj 0f9fdfc639 Fixed actual false positives 2022-04-13 22:47:02 +03:00
Soxoj 0de087d751 Merge pull request #424 from soxoj/false-positive-fixes-08-04-22
Fixed last false positives
2022-04-08 00:19:04 +03:00
Maigret autoupdate 600e58f8ef Updated site list and statistics 2022-04-07 21:18:14 +00:00
Soxoj 16131c58f9 Fixed last false positives 2022-04-08 00:17:05 +03:00
Soxoj 5106d32342 Merge pull request #422 from soxoj/houzz
Disabled houzz.com, updated sites statistics
2022-04-08 00:07:09 +03:00
Maigret autoupdate 1456ff6bc1 Updated site list and statistics 2022-04-07 21:04:32 +00:00
Soxoj b94fb65809 Disabled houzz.com, updated sites statistics 2022-04-08 00:03:28 +03:00
Soxoj e283d8b561 Merge pull request #413 from kustermariocoding/main
Added new Websites to data.json
2022-04-08 00:02:19 +03:00
kustermariocoding 7cd727bbff updated sites.md 2022-04-05 13:19:13 +02:00
kustermariocoding 5532c00b04 Merge branch 'main' into site_adds 2022-04-05 11:28:56 +02:00
kustermariocoding 8846b8b225 added Traktrain 2022-04-05 11:26:54 +02:00
kustermariocoding 7307c98029 added Sportlerfrage 2022-04-05 11:24:13 +02:00
kustermariocoding 4d129c2c6b added Splice 2022-04-05 11:22:17 +02:00
kustermariocoding 1e772b7dd4 added Swapd 2022-04-05 11:19:58 +02:00
kustermariocoding 81bb0a01b2 added Reisefrage 2022-04-05 11:17:16 +02:00
kustermariocoding 7ae8b58e1a added RcloneForum 2022-04-05 11:14:09 +02:00
kustermariocoding dde8bf8af0 added Polymart 2022-04-05 11:10:26 +02:00
kustermariocoding dc4addd985 added Needrom 2022-04-05 11:01:39 +02:00
kustermariocoding 803f62f7b7 added Motorradfrage 2022-04-05 10:58:12 +02:00
kustermariocoding 91596b31ec added Mapify.travel 2022-04-05 10:55:31 +02:00
kustermariocoding a27fea4ba4 added Lottiefiles 2022-04-05 10:43:23 +02:00
kustermariocoding ba9a94debc added Listed.to 2022-04-05 10:40:46 +02:00
kustermariocoding ac80d26cab added Lesswrong 2022-04-05 10:34:06 +02:00
kustermariocoding e4aea719fa added Keakr 2022-04-05 10:30:48 +02:00
kustermariocoding 4b18ecbd4b added JoplinApp 2022-04-05 10:28:44 +02:00
kustermariocoding c2a4c64640 added IonicFrameWorks 2022-04-05 10:24:22 +02:00
kustermariocoding 47045dd653 added Grailed 2022-04-05 10:04:24 +02:00
kustermariocoding b65a85368b added Gitbook 2022-04-05 10:02:25 +02:00
kustermariocoding daf483b097 added Gesundheitsfrage 2022-04-05 09:59:17 +02:00
kustermariocoding 838a0c5e0c added GeniusArtists 2022-04-05 09:55:26 +02:00
kustermariocoding 0ccaccfcde added G2g.com 2022-04-05 09:51:35 +02:00
kustermariocoding d1e7f5c113 added Finanzfrage 2022-04-05 09:46:42 +02:00
kustermariocoding bfb5b85c41 added Fameswap 2022-04-05 09:44:35 +02:00
kustermariocoding effd753512 added Cryptomator Forum 2022-04-05 09:42:05 +02:00
kustermariocoding cfc777d45d added Bikemap 2022-04-05 09:39:45 +02:00
kustermariocoding 422f65afbe added Autofrage 2022-04-05 09:37:33 +02:00
kustermariocoding 135b554030 added Airbit 2022-04-05 09:35:06 +02:00
kustermariocoding 47edb4427a added Buzznet 2022-04-04 11:27:11 +02:00
kustermariocoding bda6c7c390 added Patronite 2022-04-04 11:17:16 +02:00
kustermariocoding f0f7334f31 added Archive.org Parler Posts 2022-04-04 11:11:38 +02:00
kustermariocoding 669f92c34b added Archiver.org Parler Profiles 2022-04-04 11:09:19 +02:00
kustermariocoding b657c1323d added Ow.ly 2022-04-04 11:02:21 +02:00
Soxoj 692f401043 Merge pull request #406 from soxoj/update-stats
Updated statistics
2022-03-31 00:08:50 +03:00
Soxoj 27f91ddbe3 Updated statistics 2022-03-31 00:07:45 +03:00
Soxoj 72fccb2868 Merge pull request #404 from kustermariocoding/main
Added new Websites to data.json
2022-03-31 00:00:48 +03:00
kustermariocoding a959243282 added Iconfinder 2022-03-30 14:24:19 +02:00
kustermariocoding 42895e81a8 Merge branch 'main' into site_adds
to add new websites to data.json
2022-03-29 10:19:50 +02:00
Soxoj fb9663599e Merge pull request #401 from kustermariocoding/main
Added new Websites to data.json
2022-03-29 00:10:09 +03:00
kustermariocoding 005685e69a added zmarsa.com 2022-03-28 14:30:23 +02:00
kustermariocoding eb70f91db9 added zbiornik.com 2022-03-28 14:18:44 +02:00
kustermariocoding a3eaf6130e added zatrybi.pl 2022-03-28 14:10:25 +02:00
kustermariocoding 2ce65ca45a added xvideos models 2022-03-28 13:58:17 +02:00
kustermariocoding 46a14631ea added xanga 2022-03-28 13:48:58 +02:00
kustermariocoding 2699cd221f added Wordpress Support 2022-03-28 13:45:10 +02:00
kustermariocoding 2a7851c814 added Wordnik 2022-03-28 12:03:17 +02:00
kustermariocoding 1356cc8e3a added WolniSlowianie 2022-03-28 12:00:16 +02:00
kustermariocoding 523966eaf2 added Wimkin Public Profiles 2022-03-28 11:56:14 +02:00
kustermariocoding 21f5db5661 added wicgforum 2022-03-28 11:30:19 +02:00
kustermariocoding 6b52c41b97 added wego.social 2022-03-28 11:17:53 +02:00
kustermariocoding 8c898bd356 added Voice123 2022-03-28 11:01:20 +02:00
kustermariocoding e725a73c8f added vizjer.pl 2022-03-28 10:53:24 +02:00
kustermariocoding 645abfe72c added Vine 2022-03-28 10:45:05 +02:00
kustermariocoding 17886bb9fa added Viddler 2022-03-28 10:42:29 +02:00
kustermariocoding 5b6cf4f15a added usa.life 2022-03-28 10:02:23 +02:00
kustermariocoding ca1d5e3a76 added ulub.pl 2022-03-28 09:57:15 +02:00
kustermariocoding 52789abda7 added ultrasdiary.pl 2022-03-28 09:48:55 +02:00
kustermariocoding 54f1f1feaa added twpro.jp 2022-03-28 09:37:38 +02:00
kustermariocoding ea33f4150f added Archive.org Twitter Tweets 2022-03-28 09:32:07 +02:00
kustermariocoding 7ff52e60a2 added Archive.org TwitterProfiles 2022-03-28 09:29:20 +02:00
kustermariocoding e5420e4639 added Twitcasting 2022-03-28 09:21:06 +02:00
kustermariocoding 393469ddfd added tunefind 2022-03-28 08:58:39 +02:00
kustermariocoding 0b03a7ab00 added tldrlegal.com 2022-03-25 14:15:14 +01:00
kustermariocoding dd13010bb5 added thetattooforum 2022-03-25 14:06:15 +01:00
kustermariocoding e3bd89c9e4 added thegatewaypundit 2022-03-25 14:02:46 +01:00
kustermariocoding 00865db0f6 added tfl.net.pl 2022-03-25 13:59:54 +01:00
kustermariocoding 8635abe79f added tf2items.com 2022-03-25 13:56:57 +01:00
kustermariocoding 8fbe6b42de added tetr.io 2022-03-25 13:42:41 +01:00
kustermariocoding db12e7b563 added tenor.com 2022-03-25 13:34:43 +01:00
kustermariocoding 77c9bda3e5 added teknik.io 2022-03-25 13:24:00 +01:00
kustermariocoding 54547c797a added taskrabbit 2022-03-25 11:43:38 +01:00
kustermariocoding 7e0b20e8fb added tanuki.pl 2022-03-25 11:33:58 +01:00
kustermariocoding 85288dccb5 added szmer.info 2022-03-25 09:34:22 +01:00
kustermariocoding d973831dc1 added szerokikadr.pl 2022-03-25 09:30:10 +01:00
kustermariocoding 12502c020c added suzuri.jp 2022-03-25 09:25:24 +01:00
kustermariocoding ce48c317b2 fixed headers for vimeo -merge conflict 2022-03-22 14:49:57 +01:00
kustermariocoding 41a277237c added Spankpay 2022-03-22 14:32:34 +01:00
kustermariocoding 721ff2874f added Solikick 2022-03-22 14:20:08 +01:00
kustermariocoding 3cdca22b9d added Citizen4 2022-03-22 14:17:07 +01:00
kustermariocoding 346611c5da added slant.co and fixed usernameClaimed for skeb.jp 2022-03-22 14:13:17 +01:00
kustermariocoding a8e538ad29 added Skeb.jp 2022-03-22 14:10:06 +01:00
kustermariocoding 95ff061cf6 added Shanii Writes 2022-03-22 14:05:17 +01:00
kustermariocoding 5bb5e29ffb added Sfd.pl 2022-03-22 14:02:17 +01:00
kustermariocoding ac3e0b16e4 added Seneporno 2022-03-22 13:57:13 +01:00
kustermariocoding 970b75b88d added regexcheck für Hackerrank 2022-03-18 15:05:04 +01:00
kustermariocoding 8f6b40c8d0 added a regexcheck for gumroad 2022-03-18 15:03:08 +01:00
kustermariocoding ccebd677e3 updated data.json 2022-03-18 08:24:24 +01:00
Soxoj 75625f72f8 Merge pull request #397 from soxoj/skip-broken-tests
Skipped broken tests
2022-03-18 01:54:59 +03:00
Maigret autoupdate f6dbe1a6bd Updated site list and statistics 2022-03-17 22:52:22 +00:00
Soxoj a914283a15 Skipped broken tests 2022-03-18 01:51:14 +03:00
Soxoj 2a4f4d47e2 Merge pull request #390 from kustermariocoding/main
added new Websites to data.json
2022-03-18 01:18:20 +03:00
kustermariocoding 50350972a5 fixed url and absense/presence strings for friendfinder-x.com 2022-03-17 11:52:14 +01:00
kustermariocoding cdb69f99a1 added Scoutwiki 2022-03-16 15:04:26 +01:00
kustermariocoding 4786822e6d added Saracartershow 2022-03-16 12:07:12 +01:00
kustermariocoding 9c56f29267 added Salon24.pl 2022-03-16 11:51:03 +01:00
kustermariocoding 1ee4f4c93b added runescape 2022-03-16 11:42:00 +01:00
kustermariocoding 9e302542ed Merge branch 'main' into site_adds 2022-03-16 11:14:55 +01:00
kustermariocoding 3409f8a726 added RumbleUser 2022-03-16 11:14:20 +01:00
kustermariocoding 94bfa4233d added Rumblechannel 2022-03-16 11:11:28 +01:00
kustermariocoding 9c08c34007 added Ourfreedombook 2022-03-16 11:05:13 +01:00
kustermariocoding 880ffb4bf1 added lowcygier.pl 2022-03-16 11:02:01 +01:00
kustermariocoding d987c681b7 added line.me 2022-03-16 10:55:58 +01:00
kustermariocoding 2ef141a5c5 added d3.ru 2022-03-16 10:43:42 +01:00
kustermariocoding 809b97d4f9 changed usernameClaimed for Bugcrowd to a working one 2022-03-16 10:29:40 +01:00
kustermariocoding 4a1342b654 added Justforfans 2022-03-16 10:25:45 +01:00
kustermariocoding fb200875d3 added engadget 2022-03-16 10:15:02 +01:00
kustermariocoding 53bc79938c added elftown 2022-03-16 10:11:51 +01:00
kustermariocoding 3866c1be9e added chamsko.pl 2022-03-16 09:47:46 +01:00
kustermariocoding ca65ffe864 added cda.pl and changed usernameClaimed of cdaction.pl 2022-03-16 09:44:14 +01:00
kustermariocoding c9638f704f added cd-action 2022-03-16 09:34:17 +01:00
kustermariocoding 39c57e7925 added Cash.app 2022-03-16 09:31:30 +01:00
kustermariocoding 1b5c39dc1b added carrd.co 2022-03-16 08:57:09 +01:00
kustermariocoding 379fca8602 added Americanthinker 2022-03-16 08:47:50 +01:00
kustermariocoding 9716f40140 added anonup 2022-03-16 08:31:49 +01:00
kustermariocoding 61d346dd0a added ApexLegends 2022-03-16 08:30:21 +01:00
kustermariocoding 5edfc00b2d added ruby.dating 2022-03-14 14:20:17 +01:00
kustermariocoding 5905dcf384 added rigcz.club 2022-03-14 14:09:33 +01:00
kustermariocoding 67046273c7 added quizlet.com 2022-03-14 14:01:34 +01:00
kustermariocoding b4fd2fe40f added quitter.pl 2022-03-14 13:47:33 +01:00
kustermariocoding 7113824c59 added prv.pl 2022-03-14 13:32:39 +01:00
Soxoj a2e782d07c Merge pull request #386 from kustermariocoding/main
Added Sites to data.json
2022-03-14 01:56:14 +03:00
kustermariocoding 4b2d030d7a added poshmark 2022-03-11 10:50:09 +01:00
kustermariocoding e98c97dbb1 added Pornhub Pornstars 2022-03-11 10:43:44 +01:00
kustermariocoding fd4d570b59 added Polleverywhere 2022-03-11 10:13:20 +01:00
kustermariocoding 9892532aae added policja2009 2022-03-10 12:01:56 +01:00
kustermariocoding 66422332c4 added Polczat.pl 2022-03-10 11:36:41 +01:00
kustermariocoding 8b1eb15939 added pol.social 2022-03-10 11:32:44 +01:00
kustermariocoding 06df4661bc added Piekielni 2022-03-10 11:26:21 +01:00
kustermariocoding eaa126906f added pewex.pl 2022-03-10 11:19:29 +01:00
kustermariocoding 1c7cbbc27d added olx.pl 2022-03-10 10:30:59 +01:00
kustermariocoding 0eed5ced7d added oglaszamy24h 2022-03-10 10:18:53 +01:00
kustermariocoding 30f3ac4889 added nyaa.si 2022-03-10 10:02:08 +01:00
kustermariocoding 0212796696 Merge remote-tracking branch 'origin' into site_adds 2022-03-09 14:21:14 +01:00
kustermariocoding 6c723f8329 added ninjakiwi 2022-03-09 14:18:32 +01:00
kustermariocoding b1bfbbc371 added Naturalnews.com 2022-03-09 14:11:39 +01:00
kustermariocoding ee8eabc5ed added mym.fans 2022-03-09 14:05:33 +01:00
kustermariocoding cf6bb0bd7a added Motokiller.pl 2022-03-09 13:37:44 +01:00
kustermariocoding 93b542dad2 added Mistrzowie 2022-03-09 12:22:57 +01:00
kustermariocoding ec6324473a added Minecraftlist 2022-03-09 12:16:48 +01:00
kustermariocoding 263afb8990 added megamodels.pl 2022-03-09 11:55:37 +01:00
kustermariocoding 7016161206 added medyczka.pl 2022-03-09 11:34:14 +01:00
Soxoj 470ef5721f Merge pull request #385 from soxoj/v0.4.2
Bump to 0.4.2
2022-03-07 20:12:59 +03:00
Maigret autoupdate fd2c8afd33 Updated site list and statistics 2022-03-07 16:44:19 +00:00
cyb3rk0tik 8c007219f5 Bump to 0.4.2 2022-03-07 21:42:34 +05:00
Soxoj a425e5ceff Merge pull request #380 from soxoj/dependabot/pip/pytest-asyncio-0.18.2
Bump pytest-asyncio from 0.18.1 to 0.18.2
2022-03-07 15:36:39 +03:00
Soxoj d0fd3533b5 Merge pull request #374 from soxoj/dependabot/pip/tqdm-4.63.0
Bump tqdm from 4.62.3 to 4.63.0
2022-03-07 15:36:25 +03:00
kustermariocoding 7d225750ac added Mcuuid(Minecraft) 2022-03-07 11:02:06 +01:00
kustermariocoding 286319b6ec added MassageAnywhere 2022-03-07 10:28:44 +01:00
kustermariocoding fef323ab7d added martech 2022-03-07 10:09:47 +01:00
kustermariocoding 05c29c8c77 added marshmallow 2022-03-07 09:58:08 +01:00
kustermariocoding d18d5c96d9 added MapMyTracks 2022-03-07 09:49:02 +01:00
kustermariocoding 1da4345a50 added magabook 2022-03-07 09:40:27 +01:00
kustermariocoding c5b9f4e0fa added maga-chat 2022-03-07 09:28:04 +01:00
Soxoj 5bf361a1ac Merge pull request #382 from soxoj/fix-alexa-rank
Fixed issue with str alexaRank
2022-03-06 16:23:19 +03:00
Maigret autoupdate e07d3b60ba Updated site list and statistics 2022-03-06 13:20:31 +00:00
Soxoj 1e2d5cf742 Fixed issue with str alexaRank 2022-03-06 16:19:25 +03:00
Soxoj 694e024ba1 Merge pull request #375 from kustermariocoding/main
Added new sites to data.json
2022-03-06 16:17:50 +03:00
dependabot[bot] 6862425215 Bump pytest-asyncio from 0.18.1 to 0.18.2
Bumps [pytest-asyncio](https://github.com/pytest-dev/pytest-asyncio) from 0.18.1 to 0.18.2.
- [Release notes](https://github.com/pytest-dev/pytest-asyncio/releases)
- [Commits](https://github.com/pytest-dev/pytest-asyncio/compare/v0.18.1...v0.18.2)

---
updated-dependencies:
- dependency-name: pytest-asyncio
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
2022-03-04 06:05:57 +00:00
kustermariocoding 54c8074e51 tried to fix merge conflicts 2022-02-28 11:41:38 +01:00
dependabot[bot] 71e1fb6dcf Bump tqdm from 4.62.3 to 4.63.0
Bumps [tqdm](https://github.com/tqdm/tqdm) from 4.62.3 to 4.63.0.
- [Release notes](https://github.com/tqdm/tqdm/releases)
- [Commits](https://github.com/tqdm/tqdm/compare/v4.62.3...v4.63.0)

---
updated-dependencies:
- dependency-name: tqdm
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
2022-02-28 06:07:15 +00:00
Soxoj 364187861d Fix false positive and CI (#372)
* Fix false positive and CI
* Updated site list and statistics
2022-02-27 04:44:15 +03:00
Soxoj 8a53a38543 Fixed the rest of false positives for now (#371)
* Fixed the rest of false positives for now

* Fixed tag

* Updated site list and statistics
2022-02-26 16:43:40 +03:00
Soxoj bc787cdf51 Fix false positives (#370)
* Fixed several false positives, improved statistics info

* Disabled some sites, fixed fp percent count method

* Updated site list and statistics
2022-02-26 16:01:22 +03:00
Soxoj dcf5181e28 Fixed several false positives, improved statistics info (#368)
* Fixed several false positives, improved statistics info

* Updated site list and statistics
2022-02-26 15:31:15 +03:00
Soxoj 61452d56d3 Disabled Netvibes and LeetCode (#366)
* Disabled Netvibes and LeetCode

* Specified types of PR for tests in CI

* Updated site list and statistics
2022-02-26 14:49:43 +03:00
Soxoj be204ff119 Wikipedia fix (#365)
* Fixed op.gg sites

* Added testing docs, fixed some error

* Fixed Wikipedia
2022-02-26 14:27:08 +03:00
Soxoj 8a865a1ce6 Op.gg fixes (#363)
* Fixed op.gg sites

* Added testing docs, fixed some error

* Updated site list and statistics
2022-02-26 14:16:13 +03:00
Soxoj a29c3c6abe CI autoupdate (#359)
* CI autoupdate

* Updated site list and statistics
2022-02-26 13:38:15 +03:00
kustermariocoding ea6fd30a30 added kotburger.pl 2022-02-24 11:51:00 +01:00
kustermariocoding 8dbe9a415c added karab.in 2022-02-24 11:45:23 +01:00
kustermariocoding 222398154e added joemonster 2022-02-24 11:25:18 +01:00
kustermariocoding 3030025ea3 added jellyfin weblate 2022-02-24 11:11:40 +01:00
kustermariocoding 40233e66cb added jeja.pl 2022-02-24 10:57:13 +01:00
kustermariocoding 2ea75f7f76 added jbzd 2022-02-24 10:50:33 +01:00
kustermariocoding dbd393da58 added ipolska.pl 2022-02-24 10:34:03 +01:00
kustermariocoding b9f72151ea added Inkbunny 2022-02-24 10:08:59 +01:00
kustermariocoding dc2989a47d added hexrpg 2022-02-24 09:53:56 +01:00
kustermariocoding c86e558a57 added hackerrank 2022-02-24 09:41:14 +01:00
kustermariocoding 3c8c1d1f5a Merge branch 'main' of https://github.com/soxoj/maigret into site_adds 2022-02-24 09:39:58 +01:00
Soxoj 1683e5b744 Added DB statistics autoupdate and write to sites.md (#357) 2022-02-23 18:01:42 +03:00
Soxoj 31fc656721 Added package publishing instruction (#356) 2022-02-23 16:46:58 +03:00
Soxoj 79f872c77c Added some scripts (#355) 2022-02-23 14:33:37 +03:00
kustermariocoding 22f158e749 added gradle 2022-02-22 11:42:39 +01:00
kustermariocoding ff1eac0b20 added gnome vcs 2022-02-22 11:23:16 +01:00
kustermariocoding f2d3fed9c7 added Furaffinity 2022-02-22 10:26:58 +01:00
kustermariocoding cbbdc5a820 added friendfinder-x 2022-02-22 10:15:56 +01:00
kustermariocoding 8a614001fd added friendfinder 2022-02-22 09:49:37 +01:00
kustermariocoding 7a50f2922a Merge branch 'main' of https://github.com/soxoj/maigret into site_adds 2022-02-22 09:15:39 +01:00
kustermariocoding da0f4ae7cf added fotka 2022-02-22 09:15:11 +01:00
kustermariocoding d12310bb53 added fosstodon 2022-02-22 08:59:12 +01:00
cyberkotik 211b8ccfd0 Merge pull request #352 from soxoj/cyb3rk0tik-patch-1
Fix reportlab not only for testing
2022-02-21 23:52:58 +05:00
cyberkotik f352f9f58b Fix reportlab not only for testing 2022-02-21 23:42:49 +05:00
kustermariocoding 0d70ee1abc added forumprawne.org 2022-02-21 14:43:08 +01:00
kustermariocoding 032ca8141a added fedi.lewactwo.pl 2022-02-21 14:28:48 +01:00
kustermariocoding 3acf6e5180 added fansly 2022-02-21 14:20:54 +01:00
kustermariocoding 14f2b0c756 added fancentro.com 2022-02-21 12:50:41 +01:00
cyberkotik e0a4775205 Merge pull request #351 from soxoj/cyb3rk0tik-patch-1
Pin reportlab version
2022-02-21 16:47:25 +05:00
cyberkotik d056eb545f Pin reportlab version 2022-02-21 16:39:56 +05:00
kustermariocoding 10f8e1f597 added faktopedia.pl 2022-02-21 12:12:27 +01:00
kustermariocoding 6cc789d800 added fabswingers 2022-02-21 11:59:34 +01:00
kustermariocoding c214f38841 Merge branch 'main' of https://github.com/soxoj/maigret into site_adds 2022-02-21 11:56:35 +01:00
cyberkotik 392b83c230 Merge pull request #350 from soxoj/dependabot/pip/lxml-4.8.0
Bump lxml from 4.7.1 to 4.8.0
2022-02-21 15:23:31 +05:00
cyberkotik 96bebd49d3 Merge pull request #346 from soxoj/dependabot/pip/typing-extensions-4.1.1
Bump typing-extensions from 4.0.1 to 4.1.1
2022-02-21 15:23:19 +05:00
cyberkotik 92950f1b88 Merge pull request #345 from soxoj/dependabot/pip/pytest-7.0.1
Bump pytest from 7.0.0 to 7.0.1
2022-02-21 15:21:53 +05:00
cyberkotik 07b5874802 Merge pull request #343 from soxoj/dependabot/pip/pytest-asyncio-0.18.1
Bump pytest-asyncio from 0.18.0 to 0.18.1
2022-02-21 15:21:42 +05:00
kustermariocoding 6a62586a59 added dojoverse 2022-02-18 15:00:36 +01:00
kustermariocoding 883abe7877 added demotywatory.pl 2022-02-18 13:53:34 +01:00
kustermariocoding fc58046a34 added cytoid.io 2022-02-18 11:50:54 +01:00
kustermariocoding b6a1eb26e7 added Cults3d 2022-02-18 11:37:16 +01:00
kustermariocoding 42169397fe added chomukij.pl and crowdin.com 2022-02-18 11:18:57 +01:00
kustermariocoding 870d68ec1c added site castingcallclub 2022-02-18 09:22:18 +01:00
kustermariocoding 12ef7f62c2 added site caringbridge 2022-02-18 09:05:16 +01:00
dependabot[bot] 8b7ea67edc Bump lxml from 4.7.1 to 4.8.0
Bumps [lxml](https://github.com/lxml/lxml) from 4.7.1 to 4.8.0.
- [Release notes](https://github.com/lxml/lxml/releases)
- [Changelog](https://github.com/lxml/lxml/blob/master/CHANGES.txt)
- [Commits](https://github.com/lxml/lxml/compare/lxml-4.7.1...lxml-4.8.0)

---
updated-dependencies:
- dependency-name: lxml
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
2022-02-18 06:07:51 +00:00
dependabot[bot] 182a493b6a Bump typing-extensions from 4.0.1 to 4.1.1
Bumps [typing-extensions](https://github.com/python/typing) from 4.0.1 to 4.1.1.
- [Release notes](https://github.com/python/typing/releases)
- [Changelog](https://github.com/python/typing/blob/master/typing_extensions/CHANGELOG)
- [Commits](https://github.com/python/typing/compare/4.0.1...4.1.1)

---
updated-dependencies:
- dependency-name: typing-extensions
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
2022-02-14 06:09:08 +00:00
dependabot[bot] 4f7781b7a2 Bump pytest from 7.0.0 to 7.0.1
Bumps [pytest](https://github.com/pytest-dev/pytest) from 7.0.0 to 7.0.1.
- [Release notes](https://github.com/pytest-dev/pytest/releases)
- [Changelog](https://github.com/pytest-dev/pytest/blob/main/CHANGELOG.rst)
- [Commits](https://github.com/pytest-dev/pytest/compare/7.0.0...7.0.1)

---
updated-dependencies:
- dependency-name: pytest
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
2022-02-14 06:09:03 +00:00
dependabot[bot] 3579f2fd09 Bump pytest-asyncio from 0.18.0 to 0.18.1
Bumps [pytest-asyncio](https://github.com/pytest-dev/pytest-asyncio) from 0.18.0 to 0.18.1.
- [Release notes](https://github.com/pytest-dev/pytest-asyncio/releases)
- [Commits](https://github.com/pytest-dev/pytest-asyncio/compare/v0.18.0...v0.18.1)

---
updated-dependencies:
- dependency-name: pytest-asyncio
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
2022-02-11 06:14:37 +00:00
kustermariocoding 34b8d938f7 added site blogi.pl 2022-02-10 14:32:47 +01:00
kustermariocoding ea963af29b added Bitwarden Forum 2022-02-10 14:17:20 +01:00
kustermariocoding 5ea5f6337d added site Biggerpockets 2022-02-10 14:00:05 +01:00
kustermariocoding 292d0a2665 added site Bentbox 2022-02-10 13:42:33 +01:00
kustermariocoding 057bdce751 added site Bandlab 2022-02-10 13:24:29 +01:00
kustermariocoding f051cc768e added AvidCommunity Site 2022-02-10 12:06:56 +01:00
kustermariocoding 985f4075f4 added site Artistsnclients 2022-02-10 11:29:14 +01:00
kustermariocoding d88abc6271 added site arduino.cc 2022-02-10 11:14:27 +01:00
kustermariocoding 63b99338d7 added new site appian 2022-02-10 10:35:50 +01:00
kustermariocoding bd3503f3c8 added 101010.pl website to data.json 2022-02-08 14:51:59 +01:00
dependabot[bot] d7f94076bf Bump pytest-asyncio from 0.17.2 to 0.18.0 (#340)
Bumps [pytest-asyncio](https://github.com/pytest-dev/pytest-asyncio) from 0.17.2 to 0.18.0.
- [Release notes](https://github.com/pytest-dev/pytest-asyncio/releases)
- [Commits](https://github.com/pytest-dev/pytest-asyncio/compare/v0.17.2...v0.18.0)

---
updated-dependencies:
- dependency-name: pytest-asyncio
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-02-08 12:28:58 +03:00
dependabot[bot] 10879c8bf3 Bump pytest from 6.2.5 to 7.0.0 (#339)
Bumps [pytest](https://github.com/pytest-dev/pytest) from 6.2.5 to 7.0.0.
- [Release notes](https://github.com/pytest-dev/pytest/releases)
- [Changelog](https://github.com/pytest-dev/pytest/blob/main/CHANGELOG.rst)
- [Commits](https://github.com/pytest-dev/pytest/compare/6.2.5...7.0.0)

---
updated-dependencies:
- dependency-name: pytest
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-02-07 12:42:00 +03:00
dependabot[bot] b48d126118 Bump pytest-httpserver from 1.0.3 to 1.0.4 (#334)
Bumps [pytest-httpserver](https://github.com/csernazs/pytest-httpserver) from 1.0.3 to 1.0.4.
- [Release notes](https://github.com/csernazs/pytest-httpserver/releases)
- [Changelog](https://github.com/csernazs/pytest-httpserver/blob/master/CHANGES.rst)
- [Commits](https://github.com/csernazs/pytest-httpserver/compare/1.0.3...1.0.4)

---
updated-dependencies:
- dependency-name: pytest-httpserver
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-01-29 00:10:17 +03:00
dependabot[bot] c2c2707fb6 Bump multidict from 6.0.1 to 6.0.2 (#333)
Bumps [multidict](https://github.com/aio-libs/multidict) from 6.0.1 to 6.0.2.
- [Release notes](https://github.com/aio-libs/multidict/releases)
- [Changelog](https://github.com/aio-libs/multidict/blob/master/CHANGES.rst)
- [Commits](https://github.com/aio-libs/multidict/compare/v6.0.1...v6.0.2)

---
updated-dependencies:
- dependency-name: multidict
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-01-25 23:05:58 +03:00
dependabot[bot] 5e16edc003 Bump multidict from 5.2.0 to 6.0.1 (#332)
* Bump multidict from 5.2.0 to 6.0.1

Bumps [multidict](https://github.com/aio-libs/multidict) from 5.2.0 to 6.0.1.
- [Release notes](https://github.com/aio-libs/multidict/releases)
- [Changelog](https://github.com/aio-libs/multidict/blob/master/CHANGES.rst)
- [Commits](https://github.com/aio-libs/multidict/compare/v5.2.0...v6.0.1)

---
updated-dependencies:
- dependency-name: multidict
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>

* Fixed Python 3.6 compatibility

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Soxoj <soxoj@protonmail.com>
2022-01-25 00:52:11 +03:00
Soxoj e84b5e3d5d Disable kinooh, sites list update workflow added (#329)
* Disable kinooh, sites list update workflow added

* Workflow update
2022-01-22 00:37:49 +03:00
Soxoj 4d65d03074 Disabled Ruboard (#327) 2022-01-21 02:11:08 +03:00
Soxoj 222e8d3d09 Update logo 2022-01-18 23:36:02 +03:00
dependabot[bot] 92c7e41439 Bump pytest-asyncio from 0.17.1 to 0.17.2 (#323)
Bumps [pytest-asyncio](https://github.com/pytest-dev/pytest-asyncio) from 0.17.1 to 0.17.2.
- [Release notes](https://github.com/pytest-dev/pytest-asyncio/releases)
- [Commits](https://github.com/pytest-dev/pytest-asyncio/compare/v0.17.1...v0.17.2)

---
updated-dependencies:
- dependency-name: pytest-asyncio
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-01-18 17:34:08 +03:00
dependabot[bot] 55f941cf18 Bump pytest-asyncio from 0.17.0 to 0.17.1 (#321)
Bumps [pytest-asyncio](https://github.com/pytest-dev/pytest-asyncio) from 0.17.0 to 0.17.1.
- [Release notes](https://github.com/pytest-dev/pytest-asyncio/releases)
- [Commits](https://github.com/pytest-dev/pytest-asyncio/compare/v0.17.0...v0.17.1)

---
updated-dependencies:
- dependency-name: pytest-asyncio
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2022-01-17 19:29:03 +03:00
imgbot[bot] fa6bb1ee17 [ImgBot] Optimize images (#319)
*Total -- 1,424.29kb -> 846.97kb (40.53%)

/static/report_alexaimephotography_xmind_screenshot.png -- 772.80kb -> 351.39kb (54.53%)
/static/report_alexaimephotography_html_screenshot.png -- 606.94kb -> 451.06kb (25.68%)
/static/recursive_search.svg -- 44.55kb -> 44.52kb (0.07%)

Signed-off-by: ImgBotApp <ImgBotHelp@gmail.com>

Co-authored-by: ImgBotApp <ImgBotHelp@gmail.com>
2022-01-15 15:33:42 +03:00
39 changed files with 7431 additions and 1270 deletions
+5 -1
View File
@@ -15,10 +15,14 @@ assignees: soxoj
## Description ## Description
Info about Maigret version you are running and environment (`--version`, operation system, ISP provuder): Info about Maigret version you are running and environment (`--version`, operation system, ISP provider):
<INSERT VERSION INFO HERE> <INSERT VERSION INFO HERE>
How to reproduce this bug (commandline options / conditions): How to reproduce this bug (commandline options / conditions):
<INSERT EXAMPLE OF CLI COMMAND HERE> <INSERT EXAMPLE OF CLI COMMAND HERE>
<DESCRIPTION> <DESCRIPTION>
<PASTE SCREENSHOT>
<ATTACH LOG FILE>
-2
View File
@@ -3,8 +3,6 @@ name: Package exe with PyInstaller - Windows
on: on:
push: push:
branches: [ main ] branches: [ main ]
pull_request:
branches: [ main ]
jobs: jobs:
build: build:
+3 -5
View File
@@ -1,13 +1,11 @@
# This workflow will install Python dependencies, run tests and lint with a variety of Python versions name: Linting and testing
# For more information see: https://help.github.com/actions/language-and-framework-guides/using-python-with-github-actions
name: Python package
on: on:
push: push:
branches: [ main ] branches: [ main ]
pull_request: pull_request:
branches: [ main ] branches: [ main ]
types: [opened, synchronize, reopened]
jobs: jobs:
build: build:
@@ -15,7 +13,7 @@ jobs:
runs-on: ubuntu-latest runs-on: ubuntu-latest
strategy: strategy:
matrix: matrix:
python-version: [3.6.9, 3.7, 3.8, 3.9] python-version: [3.7, 3.8, 3.9]
steps: steps:
- uses: actions/checkout@v2 - uses: actions/checkout@v2
-3
View File
@@ -1,6 +1,3 @@
# This workflow will upload a Python Package using Twine when a release is created
# For more information see: https://help.github.com/en/actions/language-and-framework-guides/using-python-with-github-actions#publishing-to-package-registries
name: Upload Python Package name: Upload Python Package
on: on:
+34
View File
@@ -0,0 +1,34 @@
name: Update sites rating and statistics
on:
pull_request:
branches: [ dev ]
types: [opened, synchronize]
jobs:
build:
runs-on: ubuntu-latest
steps:
- name: Checkout repository
uses: actions/checkout@v2.3.2
with:
ref: ${{ github.event.pull_request.head.sha }}
fetch-depth: 0 # otherwise, there would be errors pushing refs to the destination repository.
- name: build application
run: |
pip3 install .
python3 ./utils/update_site_data.py --empty-only
- name: Commit and push changes
run: |
git config --global user.name "Maigret autoupdate"
git config --global user.email "soxoj@protonmail.com"
echo `git name-rev ${{ github.event.pull_request.head.sha }} --name-only`
export BRANCH=`git name-rev ${{ github.event.pull_request.head.sha }} --name-only | sed 's/remotes\/origin\///'`
echo $BRANCH
git remote -v
git checkout $BRANCH
git add sites.md
git commit -m "Updated site list and statistics"
git push origin $BRANCH
+4
View File
@@ -15,6 +15,10 @@ src/
.ipynb_checkpoints .ipynb_checkpoints
*.ipynb *.ipynb
# Logs and backups
*.log
*.bak
# Output files, except requirements.txt # Output files, except requirements.txt
*.txt *.txt
!requirements.txt !requirements.txt
+134
View File
@@ -2,6 +2,140 @@
## [Unreleased] ## [Unreleased]
## [0.4.4] - 2022-09-03
* Fixed some false positives by @soxoj in https://github.com/soxoj/maigret/pull/433
* Drop Python 3.6 support by @soxoj in https://github.com/soxoj/maigret/pull/434
* Bump xhtml2pdf from 0.2.5 to 0.2.7 by @dependabot in https://github.com/soxoj/maigret/pull/409
* Bump reportlab from 3.6.6 to 3.6.9 by @dependabot in https://github.com/soxoj/maigret/pull/403
* Bump markupsafe from 2.0.1 to 2.1.1 by @dependabot in https://github.com/soxoj/maigret/pull/389
* Bump pycountry from 22.1.10 to 22.3.5 by @dependabot in https://github.com/soxoj/maigret/pull/384
* Bump pypdf2 from 1.26.0 to 1.27.4 by @dependabot in https://github.com/soxoj/maigret/pull/438
* Update GH actions by @soxoj in https://github.com/soxoj/maigret/pull/439
* Bump tqdm from 4.63.0 to 4.64.0 by @dependabot in https://github.com/soxoj/maigret/pull/440
* Bump jinja2 from 3.0.3 to 3.1.1 by @dependabot in https://github.com/soxoj/maigret/pull/441
* Bump soupsieve from 2.3.1 to 2.3.2 by @dependabot in https://github.com/soxoj/maigret/pull/436
* Bump pypdf2 from 1.26.0 to 1.27.4 by @dependabot in https://github.com/soxoj/maigret/pull/442
* Bump pyvis from 0.1.9 to 0.2.0 by @dependabot in https://github.com/soxoj/maigret/pull/443
* Bump pypdf2 from 1.27.4 to 1.27.6 by @dependabot in https://github.com/soxoj/maigret/pull/448
* Bump typing-extensions from 4.1.1 to 4.2.0 by @dependabot in https://github.com/soxoj/maigret/pull/447
* Bump soupsieve from 2.3.2 to 2.3.2.post1 by @dependabot in https://github.com/soxoj/maigret/pull/444
* Bump pypdf2 from 1.27.6 to 1.27.7 by @dependabot in https://github.com/soxoj/maigret/pull/449
* Bump pypdf2 from 1.27.7 to 1.27.8 by @dependabot in https://github.com/soxoj/maigret/pull/450
* XMind 8 report warning and some docs update by @soxoj in https://github.com/soxoj/maigret/pull/452
* False positive fixes 24.04.22 by @soxoj in https://github.com/soxoj/maigret/pull/455
* Bump pypdf2 from 1.27.8 to 1.27.9 by @dependabot in https://github.com/soxoj/maigret/pull/456
* Bump pytest from 7.0.1 to 7.1.2 by @dependabot in https://github.com/soxoj/maigret/pull/457
* Bump jinja2 from 3.1.1 to 3.1.2 by @dependabot in https://github.com/soxoj/maigret/pull/460
* Ubisoft forums addition by @fen0s in https://github.com/soxoj/maigret/pull/461
* Add BYOND, Figma, BeatStars by @fen0s in https://github.com/soxoj/maigret/pull/462
* fix Figma username definition, add a bunch of sites by @fen0s in https://github.com/soxoj/maigret/pull/464
* Bump pypdf2 from 1.27.9 to 1.27.10 by @dependabot in https://github.com/soxoj/maigret/pull/465
* Bump pypdf2 from 1.27.10 to 1.27.12 by @dependabot in https://github.com/soxoj/maigret/pull/466
* Sites fixes 05 05 22 by @soxoj in https://github.com/soxoj/maigret/pull/469
* Bump pyvis from 0.2.0 to 0.2.1 by @dependabot in https://github.com/soxoj/maigret/pull/472
* Social analyzer websites, also fixing presense strs by @fen0s in https://github.com/soxoj/maigret/pull/471
* Updated logic of false positive risk estimating by @soxoj in https://github.com/soxoj/maigret/pull/475
* Improved usability of external progressbar func by @soxoj in https://github.com/soxoj/maigret/pull/476
* New sites added, some tags/rank update by @soxoj in https://github.com/soxoj/maigret/pull/477
* Added new sites by @soxoj in https://github.com/soxoj/maigret/pull/480
* Added new forums, updated ranks, some utils improvements by @soxoj in https://github.com/soxoj/maigret/pull/481
* Disabled sites with false positives results by @soxoj in https://github.com/soxoj/maigret/pull/482
* Bump certifi from 2021.10.8 to 2022.5.18.1 by @dependabot in https://github.com/soxoj/maigret/pull/488
* Bump psutil from 5.9.0 to 5.9.1 by @dependabot in https://github.com/soxoj/maigret/pull/490
* Bump pypdf2 from 1.27.12 to 1.28.1 by @dependabot in https://github.com/soxoj/maigret/pull/491
* Bump pypdf2 from 1.28.1 to 1.28.2 by @dependabot in https://github.com/soxoj/maigret/pull/493
* added and fixed some websites in data.json by @kustermariocoding in https://github.com/soxoj/maigret/pull/494
* Bump pypdf2 from 1.28.2 to 2.0.0 by @dependabot in https://github.com/soxoj/maigret/pull/504
* Bump pefile from 2021.9.3 to 2022.5.30 by @dependabot in https://github.com/soxoj/maigret/pull/499
* Updated sites list, added disabled Anilist by @soxoj in https://github.com/soxoj/maigret/pull/502
* Bump lxml from 4.8.0 to 4.9.0 by @dependabot in https://github.com/soxoj/maigret/pull/503
* Compatibility with Python 10 by @soxoj in https://github.com/soxoj/maigret/pull/509
* feat: add .log & .bak files to gitignore in https://github.com/soxoj/maigret/pull/511
* fix some sites and delete abandoned by @fen0s in https://github.com/soxoj/maigret/pull/526
* Fixesjulyfirst by @fen0s in https://github.com/soxoj/maigret/pull/533
* yazbel, aboutcar, zhihu by @fen0s in https://github.com/soxoj/maigret/pull/531
* Fixes july third by @fen0s in https://github.com/soxoj/maigret/pull/535
* Update data.json by @fen0s in https://github.com/soxoj/maigret/pull/539
* Update data.json by @fen0s in https://github.com/soxoj/maigret/pull/540
* Bump reportlab from 3.6.9 to 3.6.11 by @dependabot in https://github.com/soxoj/maigret/pull/543
* Bump requests from 2.27.1 to 2.28.1 by @dependabot in https://github.com/soxoj/maigret/pull/530
* Bump pypdf2 from 2.0.0 to 2.5.0 by @dependabot in https://github.com/soxoj/maigret/pull/542
* Bump xhtml2pdf from 0.2.7 to 0.2.8 by @dependabot in https://github.com/soxoj/maigret/pull/522
* Bump lxml from 4.9.0 to 4.9.1 by @dependabot in https://github.com/soxoj/maigret/pull/538
* disable yandex music + set utf8 encoding by @fen0s in https://github.com/soxoj/maigret/pull/562
* fix false positives by @fen0s in https://github.com/soxoj/maigret/pull/577
* disable Instagram, fix two false positives by @fen0s in https://github.com/soxoj/maigret/pull/578
* Bump certifi from 2022.5.18.1 to 2022.6.15 by @dependabot in https://github.com/soxoj/maigret/pull/551
* August15 by @fen0s in https://github.com/soxoj/maigret/pull/591
* Bump pytest-httpserver from 1.0.4 to 1.0.5 by @dependabot in https://github.com/soxoj/maigret/pull/583
* Bump typing-extensions from 4.2.0 to 4.3.0 by @dependabot in https://github.com/soxoj/maigret/pull/549
* Bump colorama from 0.4.4 to 0.4.5 by @dependabot in https://github.com/soxoj/maigret/pull/548
* Bump chardet from 4.0.0 to 5.0.0 by @dependabot in https://github.com/soxoj/maigret/pull/550
* Bump cloudscraper from 1.2.60 to 1.2.63 by @dependabot in https://github.com/soxoj/maigret/pull/600
* Bump flake8 from 4.0.1 to 5.0.4 by @dependabot in https://github.com/soxoj/maigret/pull/598
* Bump attrs from 21.4.0 to 22.1.0 by @dependabot in https://github.com/soxoj/maigret/pull/597
* Bump pytest-asyncio from 0.18.2 to 0.19.0 by @dependabot in https://github.com/soxoj/maigret/pull/601
* Bump pypdf2 from 2.5.0 to 2.10.4 by @dependabot in https://github.com/soxoj/maigret/pull/606
* Bump pytest from 7.1.2 to 7.1.3 by @dependabot in https://github.com/soxoj/maigret/pull/613
* Update sites.md -Gitmemory.com suppression by @C3n7ral051nt4g3ncy in https://github.com/soxoj/maigret/pull/610
* Bump cloudscraper from 1.2.63 to 1.2.64 by @dependabot in https://github.com/soxoj/maigret/pull/614
* Bump pycountry from 22.1.10 to 22.3.5 by @dependabot in https://github.com/soxoj/maigret/pull/607
* add ProtonMail, disable 3 broken sites by @fen0s in https://github.com/soxoj/maigret/pull/619
* Bump tqdm from 4.64.0 to 4.64.1 by @dependabot in https://github.com/soxoj/maigret/pull/618
**Full Changelog**: https://github.com/soxoj/maigret/compare/v0.4.3...v0.4.4
## [0.4.3] - 2022-04-13
* Added Sites to data.json by @kustermariocoding in https://github.com/soxoj/maigret/pull/386
* added new Websites to data.json by @kustermariocoding in https://github.com/soxoj/maigret/pull/390
* Skipped broken tests by @soxoj in https://github.com/soxoj/maigret/pull/397
* Added new Websites to data.json by @kustermariocoding in https://github.com/soxoj/maigret/pull/401
* Added new Websites to data.json by @kustermariocoding in https://github.com/soxoj/maigret/pull/404
* Updated statistics by @soxoj in https://github.com/soxoj/maigret/pull/406
* Added new Websites to data.json by @kustermariocoding in https://github.com/soxoj/maigret/pull/413
* Disabled houzz.com, updated sites statistics by @soxoj in https://github.com/soxoj/maigret/pull/422
* Fixed last false positives by @soxoj in https://github.com/soxoj/maigret/pull/424
* Fixed actual false positives by @soxoj in https://github.com/soxoj/maigret/pull/431
**Full Changelog**: https://github.com/soxoj/maigret/compare/v0.4.2...v0.4.3
## [0.4.2] - 2022-03-07
* [ImgBot] Optimize images by @imgbot in https://github.com/soxoj/maigret/pull/319
* Bump pytest-asyncio from 0.17.0 to 0.17.1 by @dependabot in https://github.com/soxoj/maigret/pull/321
* Bump pytest-asyncio from 0.17.1 to 0.17.2 by @dependabot in https://github.com/soxoj/maigret/pull/323
* Disabled Ruboard by @soxoj in https://github.com/soxoj/maigret/pull/327
* Disable kinooh, sites list update workflow added by @soxoj in https://github.com/soxoj/maigret/pull/329
* Bump multidict from 5.2.0 to 6.0.1 by @dependabot in https://github.com/soxoj/maigret/pull/332
* Bump multidict from 6.0.1 to 6.0.2 by @dependabot in https://github.com/soxoj/maigret/pull/333
* Bump pytest-httpserver from 1.0.3 to 1.0.4 by @dependabot in https://github.com/soxoj/maigret/pull/334
* Bump pytest from 6.2.5 to 7.0.0 by @dependabot in https://github.com/soxoj/maigret/pull/339
* Bump pytest-asyncio from 0.17.2 to 0.18.0 by @dependabot in https://github.com/soxoj/maigret/pull/340
* Bump pytest-asyncio from 0.18.0 to 0.18.1 by @dependabot in https://github.com/soxoj/maigret/pull/343
* Bump pytest from 7.0.0 to 7.0.1 by @dependabot in https://github.com/soxoj/maigret/pull/345
* Bump typing-extensions from 4.0.1 to 4.1.1 by @dependabot in https://github.com/soxoj/maigret/pull/346
* Bump lxml from 4.7.1 to 4.8.0 by @dependabot in https://github.com/soxoj/maigret/pull/350
* Pin reportlab version by @cyb3rk0tik in https://github.com/soxoj/maigret/pull/351
* Fix reportlab not only for testing by @cyb3rk0tik in https://github.com/soxoj/maigret/pull/352
* Added some scripts by @soxoj in https://github.com/soxoj/maigret/pull/355
* Added package publishing instruction by @soxoj in https://github.com/soxoj/maigret/pull/356
* Added DB statistics autoupdate and write to sites.md by @soxoj in https://github.com/soxoj/maigret/pull/357
* CI autoupdate by @soxoj in https://github.com/soxoj/maigret/pull/359
* Op.gg fixes by @soxoj in https://github.com/soxoj/maigret/pull/363
* Wikipedia fix by @soxoj in https://github.com/soxoj/maigret/pull/365
* Disabled Netvibes and LeetCode by @soxoj in https://github.com/soxoj/maigret/pull/366
* Fixed several false positives, improved statistics info by @soxoj in https://github.com/soxoj/maigret/pull/368
* Fix false positives by @soxoj in https://github.com/soxoj/maigret/pull/370
* Fixed the rest of false positives for now by @soxoj in https://github.com/soxoj/maigret/pull/371
* Fix false positive and CI by @soxoj in https://github.com/soxoj/maigret/pull/372
* Added new sites to data.json by @kustermariocoding in https://github.com/soxoj/maigret/pull/375
* Fixed issue with str alexaRank by @soxoj in https://github.com/soxoj/maigret/pull/382
* Bump tqdm from 4.62.3 to 4.63.0 by @dependabot in https://github.com/soxoj/maigret/pull/374
* Bump pytest-asyncio from 0.18.1 to 0.18.2 by @dependabot in https://github.com/soxoj/maigret/pull/380
* @imgbot made their first contribution in https://github.com/soxoj/maigret/pull/319
* @kustermariocoding made their first contribution in https://github.com/soxoj/maigret/pull/375
**Full Changelog**: https://github.com/soxoj/maigret/compare/v0.4.1...v0.4.2
## [0.4.1] - 2022-01-15 ## [0.4.1] - 2022-01-15
* Added dozen of sites, improved submit mode by @soxoj in https://github.com/soxoj/maigret/pull/288 * Added dozen of sites, improved submit mode by @soxoj in https://github.com/soxoj/maigret/pull/288
* Bump requests from 2.26.0 to 2.27.0 by @dependabot in https://github.com/soxoj/maigret/pull/292 * Bump requests from 2.26.0 to 2.27.0 by @dependabot in https://github.com/soxoj/maigret/pull/292
+3 -3
View File
@@ -51,7 +51,7 @@ Also you can run Maigret using cloud shells and Jupyter notebooks (see buttons b
### Package installing ### Package installing
**NOTE**: Python 3.6 or higher and pip is required, **Python 3.8 is recommended.** **NOTE**: Python 3.7 or higher and pip is required, **Python 3.8 is recommended.**
```bash ```bash
# install from pypi # install from pypi
@@ -101,7 +101,7 @@ maigret user --tags photo,dating
maigret user1 user2 user3 -a maigret user1 user2 user3 -a
``` ```
Use `maigret --help` to get full options description. Also options are documented in [the Maigret Wiki](https://github.com/soxoj/maigret/wiki/Command-line-options). Use `maigret --help` to get full options description. Also options [are documented](https://maigret.readthedocs.io/en/latest/command-line-options.html).
## Demo with page parsing and recursive username search ## Demo with page parsing and recursive username search
@@ -112,7 +112,7 @@ Use `maigret --help` to get full options description. Also options are documente
![HTML report screenshot](https://raw.githubusercontent.com/soxoj/maigret/main/static/report_alexaimephotography_html_screenshot.png) ![HTML report screenshot](https://raw.githubusercontent.com/soxoj/maigret/main/static/report_alexaimephotography_html_screenshot.png)
![XMind report screenshot](https://raw.githubusercontent.com/soxoj/maigret/main/static/report_alexaimephotography_xmind_screenshot.png) ![XMind 8 report screenshot](https://raw.githubusercontent.com/soxoj/maigret/main/static/report_alexaimephotography_xmind_screenshot.png)
[Full console output](https://raw.githubusercontent.com/soxoj/maigret/main/static/recursive_search.md) [Full console output](https://raw.githubusercontent.com/soxoj/maigret/main/static/recursive_search.md)
+2 -2
View File
@@ -6,8 +6,8 @@ project = 'Maigret'
copyright = '2021, soxoj' copyright = '2021, soxoj'
author = 'soxoj' author = 'soxoj'
release = '0.4.1' release = '0.4.4'
version = '0.4.1' version = '0.4.4'
# -- General configuration # -- General configuration
+101
View File
@@ -0,0 +1,101 @@
.. _development:
Development
==============
Testing
-------
It is recommended use Python 3.7/3.8 for test due to some conflicts in 3.9.
Install test requirements:
.. code-block:: console
pip install -r test-requirements.txt
Use the following commands to check Maigret:
.. code-block:: console
# run linter and typing checks
# order of checks%
# - critical syntax errors or undefined names
# - flake checks
# - mypy checks
make lint
# run testing with coverage html report
# current test coverage is 60%
make text
# open html report
open htmlcov/index.html
How to publish new version of Maigret
-------------------------------------
**Collaborats rights are requires, write Soxoj to get them**.
For new version publishing you must create a new branch in repository
with a bumped version number and actual changelog first. After it you
must create a release, and GitHub action automatically create a new
PyPi package.
- New branch example: https://github.com/soxoj/maigret/commit/e520418f6a25d7edacde2d73b41a8ae7c80ddf39
- Release example: https://github.com/soxoj/maigret/releases/tag/v0.4.1
1. Make a new branch locally with a new version name. Check the current version number here: https://pypi.org/project/maigret/.
**Increase only patch version (third number)** if there are no breaking changes.
.. code-block:: console
git checkout -b 0.4.0
2. Update Maigret version in three files manually:
- setup.py
- maigret/__version__.py
- docs/source/conf.py
3. Create a new empty text section in the beginning of the file `CHANGELOG.md` with a current date:
.. code-block:: console
## [0.4.0] - 2022-01-03
4. Get auto-generate release notes:
- Open https://github.com/soxoj/maigret/releases/new
- Click `Choose a tag`, enter `v0.4.0` (your version)
- Click `Create new tag`
- Press `+ Auto-generate release notes`
- Copy all the text from description text field below
- Paste it to empty text section in `CHANGELOG.txt`
- Remove redundant lines `## What's Changed` and `## New Contributors` section if it exists
- *Close the new release page*
5. Commit all the changes, push, make pull request
.. code-block:: console
git add -p
git commit -m 'Bump to YOUR VERSION'
git push origin head
6. Merge pull request
7. Create new release
- Open https://github.com/soxoj/maigret/releases/new again
- Click `Choose a tag`
- Enter actual version in format `v0.4.0`
- Also enter actual version in the field `Release title`
- Click `Create new tag`
- Press `+ Auto-generate release notes`
- **Press "Publish release" button**
8. That's all, now you can simply wait push to PyPi. You can monitor it in Action page: https://github.com/soxoj/maigret/actions/workflows/python-publish.yml
+3 -1
View File
@@ -24,7 +24,7 @@ Enabled by default, can be disabled with ``--no-recursion``.
Reports Reports
------- -------
Maigret currently supports HTML, PDF, TXT, XMind mindmap, and JSON reports. Maigret currently supports HTML, PDF, TXT, XMind 8 mindmap, and JSON reports.
HTML/PDF reports contain: HTML/PDF reports contain:
@@ -34,6 +34,8 @@ HTML/PDF reports contain:
Also, there is a short text report in the CLI output after the end of a searching phase. Also, there is a short text report in the CLI output after the end of a searching phase.
**Warning**: XMind 8 mindmaps are incompatible with XMind 2022!
Tags Tags
---- ----
+1
View File
@@ -28,3 +28,4 @@ You may be interested in:
tags tags
usage-examples usage-examples
settings settings
development
+12 -1
View File
@@ -3,4 +3,15 @@
Philosophy Philosophy
========== ==========
Username => Dossier TL;DR: Username => Dossier
Maigret is designed to gather all the available information about person by his usernname.
What kind of information is this? First, links to person accounts. Secondly, all the machine-extractable
pieces of info, such as: other usernames, full name, URLs to people's images, birthday, location (country,
city, etc.), gender.
All this infromations forms some dossier, but it also useful for other tools and analytical purposes.
Each collected piece of data has a label of a certain format (for exa,ple, ``follower_count`` for the number
of subscribers or ``created_at`` for account creation time) so that it can be parsed and analyzed by various
systems and stored in databases.
+5 -2
View File
@@ -7,8 +7,11 @@ from maigret.maigret import main
def run(): def run():
try: try:
loop = asyncio.get_event_loop() if sys.version_info.minor >= 10:
loop.run_until_complete(main()) asyncio.run(main())
else:
loop = asyncio.get_event_loop()
loop.run_until_complete(main())
except KeyboardInterrupt: except KeyboardInterrupt:
print('Maigret is interrupted.') print('Maigret is interrupted.')
sys.exit(1) sys.exit(1)
+1 -1
View File
@@ -1,3 +1,3 @@
"""Maigret version file""" """Maigret version file"""
__version__ = '0.4.1' __version__ = '0.4.4'
+5 -2
View File
@@ -132,7 +132,7 @@ class SimpleAiohttpChecker(CheckerBase):
error = CheckError("Unexpected", str(e)) error = CheckError("Unexpected", str(e))
if error == "Invalid proxy response": if error == "Invalid proxy response":
self.logger.debug(e, exc_info=True) self.logger.debug(error, exc_info=True)
return str(html_text), status_code, error return str(html_text), status_code, error
@@ -580,6 +580,8 @@ async def maigret(
cookies=None, cookies=None,
retries=0, retries=0,
check_domains=False, check_domains=False,
*args,
**kwargs,
) -> QueryResultWrapper: ) -> QueryResultWrapper:
"""Main search func """Main search func
@@ -660,7 +662,8 @@ async def maigret(
executor = AsyncioSimpleExecutor(logger=logger) executor = AsyncioSimpleExecutor(logger=logger)
else: else:
executor = AsyncioProgressbarQueueExecutor( executor = AsyncioProgressbarQueueExecutor(
logger=logger, in_parallel=max_connections, timeout=timeout + 0.5 logger=logger, in_parallel=max_connections, timeout=timeout + 0.5,
*args, **kwargs
) )
# make options objects for all the requests # make options objects for all the requests
+22 -2
View File
@@ -81,6 +81,22 @@ class AsyncioProgressbarQueueExecutor(AsyncExecutor):
self.queue = asyncio.Queue(self.workers_count) self.queue = asyncio.Queue(self.workers_count)
self.timeout = kwargs.get('timeout') self.timeout = kwargs.get('timeout')
async def increment_progress(self, count):
update_func = self.progress.update
if asyncio.iscoroutinefunction(update_func):
await update_func(count)
else:
update_func(count)
await asyncio.sleep(0)
async def stop_progress(self):
stop_func = self.progress.close
if asyncio.iscoroutinefunction(stop_func):
await stop_func()
else:
stop_func()
await asyncio.sleep(0)
async def worker(self): async def worker(self):
while True: while True:
try: try:
@@ -96,7 +112,7 @@ class AsyncioProgressbarQueueExecutor(AsyncExecutor):
result = kwargs.get('default') result = kwargs.get('default')
self.results.append(result) self.results.append(result)
self.progress.update(1) await self.increment_progress(1)
self.queue.task_done() self.queue.task_done()
async def _run(self, queries: Iterable[QueryDraft]): async def _run(self, queries: Iterable[QueryDraft]):
@@ -109,10 +125,14 @@ class AsyncioProgressbarQueueExecutor(AsyncExecutor):
workers = [create_task_func()(self.worker()) for _ in range(min_workers)] workers = [create_task_func()(self.worker()) for _ in range(min_workers)]
self.progress = self.progress_func(total=len(queries_list)) self.progress = self.progress_func(total=len(queries_list))
for t in queries_list: for t in queries_list:
await self.queue.put(t) await self.queue.put(t)
await self.queue.join() await self.queue.join()
for w in workers: for w in workers:
w.cancel() w.cancel()
self.progress.close()
await self.stop_progress()
return self.results return self.results
+7 -4
View File
@@ -48,7 +48,7 @@ def notify_about_errors(search_results: QueryResultWrapper, query_notify):
for e in errs: for e in errs:
if not errors.is_important(e): if not errors.is_important(e):
continue continue
text = f'Too many errors of type "{e["err"]}" ({e["perc"]}%)' text = f'Too many errors of type "{e["err"]}" ({round(e["perc"],2)}%)'
solution = errors.solution_of(e['err']) solution = errors.solution_of(e['err'])
if solution: if solution:
text = '. '.join([text, solution.capitalize()]) text = '. '.join([text, solution.capitalize()])
@@ -566,7 +566,7 @@ async def main():
# Database statistics # Database statistics
if args.stats: if args.stats:
print(db.get_db_stats(db.sites_dict)) print(db.get_db_stats())
report_dir = path.join(os.getcwd(), args.folderoutput) report_dir = path.join(os.getcwd(), args.folderoutput)
@@ -719,8 +719,11 @@ async def main():
def run(): def run():
try: try:
loop = asyncio.get_event_loop() if sys.version_info.minor >= 10:
loop.run_until_complete(main()) asyncio.run(main())
else:
loop = asyncio.get_event_loop()
loop.run_until_complete(main())
except KeyboardInterrupt: except KeyboardInterrupt:
print('Maigret is interrupted.') print('Maigret is interrupted.')
sys.exit(1) sys.exit(1)
+1 -1
View File
@@ -67,7 +67,7 @@ def save_txt_report(filename: str, username: str, results: dict):
def save_html_report(filename: str, context: dict): def save_html_report(filename: str, context: dict):
template, _ = generate_report_template(is_pdf=False) template, _ = generate_report_template(is_pdf=False)
filled_template = template.render(**context) filled_template = template.render(**context)
with open(filename, "w") as f: with open(filename, "w", encoding="utf-8") as f:
f.write(filled_template) f.write(filled_template)
+5455 -663
View File
File diff suppressed because it is too large Load Diff
+36 -10
View File
@@ -419,9 +419,8 @@ class MaigretDatabase:
results[_id] = _type results[_id] = _type
return results return results
def get_db_stats(self, sites_dict): def get_db_stats(self, is_markdown=False):
if not sites_dict: sites_dict = self.sites_dict
sites_dict = self.sites_dict()
urls = {} urls = {}
tags = {} tags = {}
@@ -429,6 +428,11 @@ class MaigretDatabase:
disabled_count = 0 disabled_count = 0
total_count = len(sites_dict) total_count = len(sites_dict)
message_checks = 0
message_checks_one_factor = 0
status_checks = 0
for _, site in sites_dict.items(): for _, site in sites_dict.items():
if site.disabled: if site.disabled:
disabled_count += 1 disabled_count += 1
@@ -436,24 +440,46 @@ class MaigretDatabase:
url_type = site.get_url_template() url_type = site.get_url_template()
urls[url_type] = urls.get(url_type, 0) + 1 urls[url_type] = urls.get(url_type, 0) + 1
if site.check_type == 'message' and not site.disabled:
message_checks += 1
if site.absence_strs and site.presense_strs:
continue
message_checks_one_factor += 1
if site.check_type == 'status_code':
status_checks += 1
if not site.tags: if not site.tags:
tags["NO_TAGS"] = tags.get("NO_TAGS", 0) + 1 tags["NO_TAGS"] = tags.get("NO_TAGS", 0) + 1
for tag in filter(lambda x: not is_country_tag(x), site.tags): for tag in filter(lambda x: not is_country_tag(x), site.tags):
tags[tag] = tags.get(tag, 0) + 1 tags[tag] = tags.get(tag, 0) + 1
output += f"Enabled/total sites: {total_count - disabled_count}/{total_count}\n" enabled_count = total_count-disabled_count
output += "Top profile URLs:\n" enabled_perc = round(100*enabled_count/total_count, 2)
for url, count in sorted(urls.items(), key=lambda x: x[1], reverse=True)[:20]: output += f"Enabled/total sites: {enabled_count}/{total_count} = {enabled_perc}%\n\n"
checks_perc = round(100*message_checks_one_factor/enabled_count, 2)
output += f"Incomplete message checks: {message_checks_one_factor}/{enabled_count} = {checks_perc}% (false positive risks)\n\n"
status_checks_perc = round(100*status_checks/enabled_count, 2)
output += f"Status code checks: {status_checks}/{enabled_count} = {status_checks_perc}% (false positive risks)\n\n"
output += f"False positive risk (total): {checks_perc+status_checks_perc}%\n\n"
top_urls_count = 20
output += f"Top {top_urls_count} profile URLs:\n"
for url, count in sorted(urls.items(), key=lambda x: x[1], reverse=True)[:top_urls_count]:
if count == 1: if count == 1:
break break
output += f"{count}\t{url}\n" output += f"- ({count})\t`{url}`\n" if is_markdown else f"{count}\t{url}\n"
output += "Top tags:\n" top_tags_count = 20
for tag, count in sorted(tags.items(), key=lambda x: x[1], reverse=True)[:200]: output += f"\nTop {top_tags_count} tags:\n"
for tag, count in sorted(tags.items(), key=lambda x: x[1], reverse=True)[:top_tags_count]:
mark = "" mark = ""
if tag not in self._tags: if tag not in self._tags:
mark = " (non-standard)" mark = " (non-standard)"
output += f"{count}\t{tag}{mark}\n" output += f"- ({count})\t`{tag}`{mark}\n" if is_markdown else f"{count}\t{tag}{mark}\n"
return output return output
+36 -7
View File
@@ -1,10 +1,11 @@
import asyncio import asyncio
import json import json
import re import re
from typing import List from typing import List, Tuple
import xml.etree.ElementTree as ET import xml.etree.ElementTree as ET
from aiohttp import TCPConnector, ClientSession from aiohttp import TCPConnector, ClientSession
import requests import requests
import cloudscraper
from .activation import import_aiohttp_cookies from .activation import import_aiohttp_cookies
from .checking import maigret from .checking import maigret
@@ -14,6 +15,27 @@ from .sites import MaigretDatabase, MaigretSite, MaigretEngine
from .utils import get_random_user_agent, get_match_ratio from .utils import get_random_user_agent, get_match_ratio
class CloudflareSession:
def __init__(self):
self.scraper = cloudscraper.create_scraper()
async def get(self, *args, **kwargs):
await asyncio.sleep(0)
res = self.scraper.get(*args, **kwargs)
self.last_text = res.text
self.status = res.status_code
return self
def status_code(self):
return self.status
async def text(self):
await asyncio.sleep(0)
return self.last_text
async def close(self):
pass
class Submitter: class Submitter:
HEADERS = { HEADERS = {
"User-Agent": get_random_user_agent(), "User-Agent": get_random_user_agent(),
@@ -141,16 +163,18 @@ class Submitter:
fields['urlSubpath'] = f'/{subpath}' fields['urlSubpath'] = f'/{subpath}'
return fields return fields
async def detect_known_engine(self, url_exists, url_mainpage) -> List[MaigretSite]: async def detect_known_engine(self, url_exists, url_mainpage) -> [List[MaigretSite], str]:
resp_text = '' resp_text = ''
try: try:
r = await self.session.get(url_mainpage) r = await self.session.get(url_mainpage)
resp_text = await r.text() content = await r.content.read()
charset = r.charset or "utf-8"
resp_text = content.decode(charset, "ignore")
self.logger.debug(resp_text) self.logger.debug(resp_text)
except Exception as e: except Exception as e:
self.logger.warning(e) self.logger.warning(e)
print("Some error while checking main page") print("Some error while checking main page")
return [] return [], resp_text
for engine in self.db.engines: for engine in self.db.engines:
strs_to_check = engine.__dict__.get("presenseStrs") strs_to_check = engine.__dict__.get("presenseStrs")
@@ -193,9 +217,9 @@ class Submitter:
) )
sites.append(maigret_site) sites.append(maigret_site)
return sites return sites, resp_text
return [] return [], resp_text
def extract_username_dialog(self, url): def extract_username_dialog(self, url):
url_parts = url.rstrip("/").split("/") url_parts = url.rstrip("/").split("/")
@@ -338,10 +362,15 @@ class Submitter:
print('Detecting site engine, please wait...') print('Detecting site engine, please wait...')
sites = [] sites = []
try: try:
sites = await self.detect_known_engine(url_exists, url_mainpage) sites, text = await self.detect_known_engine(url_exists, url_exists)
except KeyboardInterrupt: except KeyboardInterrupt:
print('Engine detect process is interrupted.') print('Engine detect process is interrupted.')
if 'cloudflare' in text.lower():
print('Cloudflare protection detected. I will use cloudscraper for futher work')
# self.session = CloudflareSession()
if not sites: if not sites:
print("Unable to detect site engine, lets generate checking features") print("Unable to detect site engine, lets generate checking features")
+2 -2
View File
@@ -1,5 +1,5 @@
maigret @ https://github.com/soxoj/maigret/archive/refs/heads/main.zip maigret @ https://github.com/soxoj/maigret/archive/refs/heads/main.zip
pefile==2021.9.3 pefile==2022.5.30
psutil==5.9.0 psutil==5.9.1
pyinstaller @ https://github.com/pyinstaller/pyinstaller/archive/develop.zip pyinstaller @ https://github.com/pyinstaller/pyinstaller/archive/develop.zip
pywin32-ctypes==0.2.0 pywin32-ctypes==0.2.0
+18 -16
View File
@@ -3,35 +3,37 @@ aiohttp==3.8.1
aiohttp-socks==0.7.1 aiohttp-socks==0.7.1
arabic-reshaper==2.1.3 arabic-reshaper==2.1.3
async-timeout==4.0.2 async-timeout==4.0.2
attrs==21.4.0 attrs==22.1.0
certifi==2021.10.8 certifi==2022.6.15
chardet==4.0.0 chardet==5.0.0
colorama==0.4.4 colorama==0.4.5
future==0.18.2 future==0.18.2
future-annotations==1.0.0 future-annotations==1.0.0
html5lib==1.1 html5lib==1.1
idna==3.3 idna==3.3
Jinja2==3.0.3 Jinja2==3.1.2
lxml==4.7.1 lxml==4.9.1
MarkupSafe==2.0.1 MarkupSafe==2.1.1
mock==4.0.3 mock==4.0.3
multidict==5.2.0 multidict==6.0.2
pycountry==22.1.10 pycountry==22.3.5
PyPDF2==1.26.0 PyPDF2==2.10.4
PySocks==1.7.1 PySocks==1.7.1
python-bidi==0.4.2 python-bidi==0.4.2
requests==2.27.1 requests==2.28.1
requests-futures==1.0.0 requests-futures==1.0.0
six==1.16.0 six==1.16.0
socid-extractor>=0.0.21 socid-extractor>=0.0.21
soupsieve==2.3.1 soupsieve==2.3.2.post1
stem==1.8.0 stem==1.8.0
torrequest==0.1.0 torrequest==0.1.0
tqdm==4.62.3 tqdm==4.64.1
typing-extensions==4.0.1 typing-extensions==4.3.0
webencodings==0.5.1 webencodings==0.5.1
xhtml2pdf==0.2.5 xhtml2pdf==0.2.8
XMind==1.2.0 XMind==1.2.0
yarl==1.7.2 yarl==1.7.2
networkx==2.5.1 networkx==2.5.1
pyvis==0.1.9 pyvis==0.2.1
reportlab==3.6.11
cloudscraper==1.2.64
+1 -1
View File
@@ -11,7 +11,7 @@ with open('requirements.txt') as rf:
requires = rf.read().splitlines() requires = rf.read().splitlines()
setup(name='maigret', setup(name='maigret',
version='0.4.1', version='0.4.4',
description='Collect a dossier on a person by username from a huge number of sites', description='Collect a dossier on a person by username from a huge number of sites',
long_description=long_description, long_description=long_description,
long_description_content_type="text/markdown", long_description_content_type="text/markdown",
+1044 -529
View File
File diff suppressed because it is too large Load Diff
Binary file not shown.

Before

Width:  |  Height:  |  Size: 15 KiB

After

Width:  |  Height:  |  Size: 9.0 KiB

File diff suppressed because one or more lines are too long

Before

Width:  |  Height:  |  Size: 44 KiB

After

Width:  |  Height:  |  Size: 44 KiB

Binary file not shown.

Before

Width:  |  Height:  |  Size: 607 KiB

After

Width:  |  Height:  |  Size: 451 KiB

Binary file not shown.

Before

Width:  |  Height:  |  Size: 773 KiB

After

Width:  |  Height:  |  Size: 351 KiB

+5 -4
View File
@@ -1,7 +1,8 @@
flake8==4.0.1 reportlab==3.6.11
pytest==6.2.5 flake8==5.0.4
pytest==7.1.3
pytest-asyncio==0.16.0;python_version<"3.7" pytest-asyncio==0.16.0;python_version<"3.7"
pytest-asyncio==0.17.0;python_version>="3.7" pytest-asyncio==0.19.0;python_version>="3.7"
pytest-cov==3.0.0 pytest-cov==3.0.0
pytest-httpserver==1.0.3 pytest-httpserver==1.0.5
pytest-rerunfailures==10.2 pytest-rerunfailures==10.2
+3
View File
@@ -48,6 +48,7 @@ def test_self_check_db_positive_disable(test_db):
@pytest.mark.slow @pytest.mark.slow
@pytest.mark.skip(reason="broken, fixme")
def test_self_check_db_positive_enable(test_db): def test_self_check_db_positive_enable(test_db):
logger = Mock() logger = Mock()
@@ -78,6 +79,7 @@ def test_self_check_db_negative_disabled(test_db):
assert test_db.sites[0].disabled is True assert test_db.sites[0].disabled is True
@pytest.mark.skip(reason='broken, fixme')
@pytest.mark.slow @pytest.mark.slow
def test_self_check_db_negative_enabled(test_db): def test_self_check_db_negative_enabled(test_db):
logger = Mock() logger = Mock()
@@ -95,6 +97,7 @@ def test_self_check_db_negative_enabled(test_db):
@pytest.mark.slow @pytest.mark.slow
@pytest.mark.skip(reason="broken, fixme")
def test_maigret_results(test_db): def test_maigret_results(test_db):
logger = Mock() logger = Mock()
+2
View File
@@ -2,6 +2,7 @@
import copy import copy
import json import json
import os import os
import pytest
from io import StringIO from io import StringIO
import xmind import xmind
@@ -424,6 +425,7 @@ def test_html_report_broken():
assert SUPPOSED_BROKEN_INTERESTS in report_text assert SUPPOSED_BROKEN_INTERESTS in report_text
@pytest.mark.skip(reason='connection reset, fixme')
def test_pdf_report(): def test_pdf_report():
report_name = 'report_test.pdf' report_name = 'report_test.pdf'
context = generate_report_context(TEST) context = generate_report_context(TEST)
View File
+11 -5
View File
@@ -3,7 +3,7 @@ import random
from argparse import ArgumentParser, RawDescriptionHelpFormatter from argparse import ArgumentParser, RawDescriptionHelpFormatter
from maigret.maigret import MaigretDatabase from maigret.maigret import MaigretDatabase
from maigret.submit import get_alexa_rank from maigret.submit import Submitter
def update_tags(site): def update_tags(site):
@@ -22,7 +22,7 @@ def update_tags(site):
site.disabled = True site.disabled = True
print(f'Old alexa rank: {site.alexa_rank}') print(f'Old alexa rank: {site.alexa_rank}')
rank = get_alexa_rank(site.url_main) rank = Submitter.get_alexa_rank(site.url_main)
if rank: if rank:
print(f'New alexa rank: {rank}') print(f'New alexa rank: {rank}')
site.alexa_rank = rank site.alexa_rank = rank
@@ -36,6 +36,7 @@ if __name__ == '__main__':
parser.add_argument("--base","-b", metavar="BASE_FILE", parser.add_argument("--base","-b", metavar="BASE_FILE",
dest="base_file", default="maigret/resources/data.json", dest="base_file", default="maigret/resources/data.json",
help="JSON file with sites data to update.") help="JSON file with sites data to update.")
parser.add_argument("--name", help="Name of site to check")
pool = list() pool = list()
@@ -45,12 +46,17 @@ if __name__ == '__main__':
db.load_from_file(args.base_file).sites db.load_from_file(args.base_file).sites
while True: while True:
site = random.choice(db.sites) if args.name:
sites = list(db.ranked_sites_dict(names=[args.name]).values())
site = random.choice(sites)
else:
site = random.choice(db.sites)
if site.engine == 'uCoz': if site.engine == 'uCoz':
continue continue
if not 'in' in site.tags: # if not 'in' in site.tags:
continue # continue
update_tags(site) update_tags(site)
+152
View File
@@ -0,0 +1,152 @@
#!/usr/bin/env python3
"""Maigret: Supported Site Listing with Alexa ranking and country tags
This module generates the listing of supported sites in file `SITES.md`
and pretty prints file with sites data.
"""
import aiohttp
import asyncio
import json
import sys
import requests
import logging
import threading
import xml.etree.ElementTree as ET
from datetime import datetime
from argparse import ArgumentParser, RawDescriptionHelpFormatter
import tqdm.asyncio
from maigret.maigret import get_response, site_self_check
from maigret.sites import MaigretSite, MaigretDatabase, MaigretEngine
from maigret.utils import CaseConverter
async def check_engine_of_site(site_name, sites_with_engines, future, engine_name, semaphore, logger):
async with semaphore:
response = await get_response(request_future=future,
site_name=site_name,
logger=logger)
html_text, status_code, error_text, expection_text = response
if html_text and engine_name in html_text:
sites_with_engines.append(site_name)
return True
return False
if __name__ == '__main__':
parser = ArgumentParser(formatter_class=RawDescriptionHelpFormatter
)
parser.add_argument("--base","-b", metavar="BASE_FILE",
dest="base_file", default="maigret/resources/data.json",
help="JSON file with sites data to update.")
parser.add_argument('--engine', '-e', help='check only selected engine', type=str)
args = parser.parse_args()
log_level = logging.INFO
logging.basicConfig(
format='[%(filename)s:%(lineno)d] %(levelname)-3s %(asctime)s %(message)s',
datefmt='%H:%M:%S',
level=log_level
)
logger = logging.getLogger('engines-check')
logger.setLevel(log_level)
db = MaigretDatabase()
sites_subset = db.load_from_file(args.base_file).sites
sites = {site.name: site for site in sites_subset}
with open(args.base_file, "r", encoding="utf-8") as data_file:
sites_info = json.load(data_file)
engines = sites_info['engines']
for engine_name, engine_data in engines.items():
if args.engine and args.engine != engine_name:
continue
if not 'presenseStrs' in engine_data:
print(f'No features to automatically detect sites on engine {engine_name}')
continue
engine_obj = MaigretEngine(engine_name, engine_data)
# setup connections for checking both engine and usernames
connector = aiohttp.TCPConnector(ssl=False)
connector.verify_ssl=False
session = aiohttp.ClientSession(connector=connector)
sem = asyncio.Semaphore(100)
loop = asyncio.get_event_loop()
tasks = []
# check sites without engine if they look like sites on this engine
new_engine_sites = []
for site_name, site_data in sites.items():
if site_data.engine:
continue
future = session.get(url=site_data.url_main,
allow_redirects=True,
timeout=10,
)
check_engine_coro = check_engine_of_site(site_name, new_engine_sites, future, engine_name, sem, logger)
future = asyncio.ensure_future(check_engine_coro)
tasks.append(future)
# progress bar
for f in tqdm.asyncio.tqdm.as_completed(tasks):
loop.run_until_complete(f)
print(f'Total detected {len(new_engine_sites)} sites on engine {engine_name}')
# dict with new found engine sites
new_sites = {site_name: sites[site_name] for site_name in new_engine_sites}
# update sites obj from engine
for site_name, site in new_sites.items():
site.request_future = None
site.engine = engine_name
site.update_from_engine(engine_obj)
async def update_site_data(site_name, site_data, all_sites, logger, no_progressbar):
updates = await site_self_check(site_name, site_data, logger, no_progressbar)
all_sites[site_name].update(updates)
tasks = []
# for new_site_name, new_site_data in new_sites.items():
# coro = update_site_data(new_site_name, new_site_data, new_sites, logger)
# future = asyncio.ensure_future(coro)
# tasks.append(future)
# asyncio.gather(*tasks)
for new_site_name, new_site_data in new_sites.items():
coro = update_site_data(new_site_name, new_site_data, new_sites, logger, no_progressbar=True)
loop.run_until_complete(coro)
updated_sites_count = 0
for s in new_sites:
site = new_sites[s]
site.request_future = None
if site.disabled:
print(f'{site.name} failed username checking of engine {engine_name}')
continue
site = site.strip_engine_data()
db.update_site(site)
updated_sites_count += 1
db.save_to_file(args.base_file)
print(f'Site "{s}": ' + json.dumps(site.json, indent=4))
print(f'Updated total {updated_sites_count} sites!')
print(f'Checking all sites on engine {engine_name}')
loop.run_until_complete(session.close())
print("\nFinished updating supported site listing!")
+280
View File
@@ -0,0 +1,280 @@
#!/usr/bin/env python3
import json
import random
import re
import tqdm.asyncio
from mock import Mock
import requests
from maigret.maigret import *
from maigret.result import QueryStatus
from maigret.sites import MaigretSite
URL_RE = re.compile(r"https?://(www\.)?")
TIMEOUT = 200
async def maigret_check(site, site_data, username, status, logger):
query_notify = Mock()
logger.debug(f'Checking {site}...')
for username, status in [(username, status)]:
results = await maigret(
username,
{site: site_data},
logger,
query_notify,
timeout=TIMEOUT,
forced=True,
no_progressbar=True,
)
if results[site]['status'].status != status:
if results[site]['status'].status == QueryStatus.UNKNOWN:
msg = site_data.absence_strs
etype = site_data.check_type
context = results[site]['status'].context
logger.debug(f'Error while searching {username} in {site}, must be claimed. Context: {context}')
# if site_data.get('errors'):
# continue
return False
if status == QueryStatus.CLAIMED:
logger.debug(f'Not found {username} in {site}, must be claimed')
logger.debug(results[site])
pass
else:
logger.debug(f'Found {username} in {site}, must be available')
logger.debug(results[site])
pass
return False
return site_data
async def check_and_add_maigret_site(site_data, semaphore, logger, ok_usernames, bad_usernames):
async with semaphore:
sitename = site_data.name
positive = False
negative = False
for ok_username in ok_usernames:
site_data.username_claimed = ok_username
status = QueryStatus.CLAIMED
if await maigret_check(sitename, site_data, ok_username, status, logger):
# print(f'{sitename} positive case is okay')
positive = True
break
for bad_username in bad_usernames:
site_data.username_unclaimed = bad_username
status = QueryStatus.AVAILABLE
if await maigret_check(sitename, site_data, bad_username, status, logger):
# print(f'{sitename} negative case is okay')
negative = True
break
if positive and negative:
site_data = site_data.strip_engine_data()
db.update_site(site_data)
print(site_data.json)
try:
db.save_to_file(args.base_file)
except Exception as e:
logging.error(e, exc_info=True)
print(f'Saved new site {sitename}...')
ok_sites.append(site_data)
if __name__ == '__main__':
parser = ArgumentParser(formatter_class=RawDescriptionHelpFormatter
)
parser.add_argument("--base", "-b", metavar="BASE_FILE",
dest="base_file", default="maigret/resources/data.json",
help="JSON file with sites data to update.")
parser.add_argument("--add-engine", dest="add_engine", help="Additional engine to check")
parser.add_argument("--only-engine", dest="only_engine", help="Use only this engine from detected to check")
parser.add_argument('--check', help='only check sites in database', action='store_true')
parser.add_argument('--random', help='shuffle list of urls', action='store_true', default=False)
parser.add_argument('--top', help='top count of records in file', type=int, default=10000)
parser.add_argument('--filter', help='substring to filter input urls', type=str, default='')
parser.add_argument('--username', help='preferable username to check with', type=str)
parser.add_argument(
"--info",
"-vv",
action="store_true",
dest="info",
default=False,
help="Display service information.",
)
parser.add_argument(
"--verbose",
"-v",
action="store_true",
dest="verbose",
default=False,
help="Display extra information and metrics.",
)
parser.add_argument(
"-d",
"--debug",
"-vvv",
action="store_true",
dest="debug",
default=False,
help="Saving debugging information and sites responses in debug.txt.",
)
parser.add_argument("urls_file",
metavar='URLS_FILE',
action="store",
help="File with base site URLs"
)
args = parser.parse_args()
log_level = logging.ERROR
if args.debug:
log_level = logging.DEBUG
elif args.info:
log_level = logging.INFO
elif args.verbose:
log_level = logging.WARNING
logging.basicConfig(
format='[%(filename)s:%(lineno)d] %(levelname)-3s %(asctime)s %(message)s',
datefmt='%H:%M:%S',
level=log_level
)
logger = logging.getLogger('engines-check')
logger.setLevel(log_level)
db = MaigretDatabase()
sites_subset = db.load_from_file(args.base_file).sites
sites = {site.name: site for site in sites_subset}
engines = db.engines
# TODO: usernames extractors
ok_usernames = ['alex', 'god', 'admin', 'red', 'blue', 'john']
if args.username:
ok_usernames = [args.username] + ok_usernames
bad_usernames = ['noonewouldeverusethis7']
with open(args.urls_file, 'r') as urls_file:
urls = urls_file.read().splitlines()
if args.random:
random.shuffle(urls)
urls = urls[:args.top]
raw_maigret_data = json.dumps({site.name: site.json for site in sites_subset})
new_sites = []
for site in tqdm.asyncio.tqdm(urls):
site_lowercase = site.lower()
domain_raw = URL_RE.sub('', site_lowercase).strip().strip('/')
domain_raw = domain_raw.split('/')[0]
if args.filter and args.filter not in domain_raw:
logger.debug('Site %s skipped due to filtering by "%s"', domain_raw, args.filter)
continue
if domain_raw in raw_maigret_data:
logger.debug(f'Site {domain_raw} already exists in the Maigret database!')
continue
if '"' in domain_raw:
logger.debug(f'Invalid site {domain_raw}')
continue
main_page_url = '/'.join(site.split('/', 3)[:3])
site_data = {
'url': site,
'urlMain': main_page_url,
'name': domain_raw,
}
try:
r = requests.get(main_page_url, timeout=5)
except:
r = None
pass
detected_engines = []
for e in engines:
strs_to_check = e.__dict__.get('presenseStrs')
if strs_to_check and r and r.text:
all_strs_in_response = True
for s in strs_to_check:
if not s in r.text:
all_strs_in_response = False
if all_strs_in_response:
engine_name = e.__dict__.get('name')
detected_engines.append(engine_name)
logger.info(f'Detected engine {engine_name} for site {main_page_url}')
if args.only_engine and args.only_engine in detected_engines:
detected_engines = [args.only_engine]
elif not detected_engines and args.add_engine:
logging.debug('Could not detect any engine, applying default engine %s...', args.add_engine)
detected_engines = [args.add_engine]
def create_site_from_engine(sitename, data, e):
site = MaigretSite(sitename, data)
site.update_from_engine(db.engines_dict[e])
site.engine = e
return site
for engine_name in detected_engines:
site = create_site_from_engine(domain_raw, site_data, engine_name)
new_sites.append(site)
logger.debug(site.json)
# if engine_name == "phpBB":
# site_data_with_subpath = dict(site_data)
# site_data_with_subpath["urlSubpath"] = "/forum"
# site = create_site_from_engine(domain_raw, site_data_with_subpath, engine_name)
# new_sites.append(site)
# except Exception as e:
# print(f'Error: {str(e)}')
# pass
print(f'Found {len(new_sites)}/{len(urls)} new sites')
if args.check:
for s in new_sites:
print(s.url_main)
sys.exit(0)
sem = asyncio.Semaphore(20)
loop = asyncio.get_event_loop()
ok_sites = []
tasks = []
for site in new_sites:
check_coro = check_and_add_maigret_site(site, sem, logger, ok_usernames, bad_usernames)
future = asyncio.ensure_future(check_coro)
tasks.append(future)
for f in tqdm.asyncio.tqdm.as_completed(tasks, timeout=TIMEOUT):
try:
loop.run_until_complete(f)
except asyncio.exceptions.TimeoutError:
pass
print(f'Found and saved {len(ok_sites)} sites!')
+36
View File
@@ -0,0 +1,36 @@
import sys
import difflib
import requests
a = requests.get(sys.argv[1]).text
b = requests.get(sys.argv[2]).text
tokens_a = set(a.split('"'))
tokens_b = set(b.split('"'))
a_minus_b = tokens_a.difference(tokens_b)
b_minus_a = tokens_b.difference(tokens_a)
print(a_minus_b)
print(b_minus_a)
print(len(a_minus_b))
print(len(b_minus_a))
desired_strings = ["username", "not found", "пользователь", "profile", "lastname", "firstname", "biography",
"birthday", "репутация", "информация", "e-mail"]
def get_match_ratio(x):
return round(max([
difflib.SequenceMatcher(a=x.lower(), b=y).ratio()
for y in desired_strings
]), 2)
RATIO = 0.6
print(sorted(a_minus_b, key=get_match_ratio, reverse=True)[:10])
print(sorted(b_minus_a, key=get_match_ratio, reverse=True)[:10])
+6 -2
View File
@@ -114,7 +114,7 @@ Rank data fetched from Alexa by domains.
sys.stdout.flush() sys.stdout.flush()
index = index + 1 index = index + 1
sites_full_list = [(s, s.alexa_rank) for s in sites_subset] sites_full_list = [(s, int(s.alexa_rank)) for s in sites_subset]
sites_full_list.sort(reverse=False, key=lambda x: x[1]) sites_full_list.sort(reverse=False, key=lambda x: x[1])
@@ -137,7 +137,11 @@ Rank data fetched from Alexa by domains.
site_file.write(f'1. {favicon} [{site}]({url_main})*: top {valid_rank}{tags}*{note}\n') site_file.write(f'1. {favicon} [{site}]({url_main})*: top {valid_rank}{tags}*{note}\n')
db.update_site(site) db.update_site(site)
site_file.write(f'\nAlexa.com rank data fetched at ({datetime.utcnow()} UTC)\n') site_file.write(f'\nThe list was updated at ({datetime.utcnow()} UTC)\n')
db.save_to_file(args.base_file) db.save_to_file(args.base_file)
statistics_text = db.get_db_stats(is_markdown=True)
site_file.write('## Statistics\n\n')
site_file.write(statistics_text)
print("\nFinished updating supported site listing!") print("\nFinished updating supported site listing!")