GovWayback is a simple tool to quickly access archived versions of government websites from before January 20, 2025 - just add "wayback.com" after ".gov" in any government URL. GovWayback automatically redirects you to that page's archived version from the Internet Archive.
GovWayback only redirects to Internet Archive's archives. Please consider donating to them as they're doing the hard work of preserving websites.
This independent project is not affiliated with the Internet Archive or Wayback Machine, it only redirects to it. And to that end: we strongly recommend donating to the Internet Archive. GovWayback is an independent tool that helps people access the Internet Archive's public records of government websites.
This is a Chrome and Firefox browser extension that strips Google Analytics (i.e. Urchin Tracking Monitor) parameters, and various other click tracking tokens, from URL query strings. This is done before the web request is made and results in both more private browsing as well as more aesthetically pleasing URLs.
Aware of Facebook, Google, Hubspot, Instagram, Mailchimp, Marketo, Microsoft, Olytics, Wicked Reports, Yahoo, Yandex, and a few generic URIs.
This privacy add-on protects your identity by changing tracking parameters.
Addresses containing Google Analytics parameters will be altered to protect your privacy: no buttons to press and no configuration required.
It will scan the page for links and correct them. If a redirected URL contains the tracking parameters, they will be edited. Not functionality is lost.
This add-on is provided as part of Projekt ONI.
Webmasters can use this tool to prevent their site from appearing in the server logs of referred pages as referrer. The operators of the referred pages cannot see where their visitors come from any more.
A self-hosted service that pings webhooks or other URLs on a user-defined schedule. Works a little bit like cron. Can even do things every X minutes or hours, like cron.
Written in PHP, uses SQLite.
If you don't want to set it up yourself: https://hookless.co/
Lightweight asynchronous task execution service. Tries to replace RabbitMQ and Celery for lightweight (Python specifically, for some reason). Randomizes when retriesare done. POST base64 encoded URL to an endpoint and when you want it to run, and it'll hit that URL at the right time.
A python module (Python3, specifically - Python2 support was obsoleted) that tries to be the Requests of HTML scraping. Designed with news sites in mind. Picks out names of authors, publication dates, text, URLs to images, any embedded media. keyword analysis. NLP Picks articles out of websites. URL extraction. Picks out categories. i18n support.
Documentation here: https://newspaper.readthedocs.io/en/latest/