A commandline utility to search text in PDF files. Tries to be compatible with GNU Grep, where it makes sense. Many of your favorite grep options are supported (such as -r, -i, -n or -c).
Git: https://gitlab.com/pdfgrep/pdfgrep
I wonder if I can plug this into SearxNG.
Recoll is a desktop full-text search tool. Finds documents based on their contents as well as their file names. Can search most document formats, even if they're compressed (even Maildir/ and mailboxes). You may need external applications for text extraction. Based on Xapian. Primarily desktop but it could be run server-side. Indices are backwards-compatible.
Source code: https://framagit.org/medoc92/recoll
Flies on solid state storage!
Can be plugged into Searx: https://searx.github.io/searx/admin/engines/recoll.html
Seaborn is a Python visualization library based on matplotlib. It provides a high-level interface for drawing attractive statistical graphics.
The docs include a tutorial, example gallery, API reference, and other useful information.