Fess is a remarkably nifty self-hosted search engine. It's designed for enterprise use (serious enterprise use) but works pretty well for personal use. Written in Java. Uses Elastic Search as its back-end unfortunately, but like reasonably good Java applications everything it needs is bundled with what you want, and is started with a single shell script. It has a nicely modern user interface, too. Takes a while to run but it might just be how much stuff I have. Stores its logs in Elastic Search, too. Offers a JSON API.
Requires Java 11 or later.
MeiliSearch is a powerful, fast, open-source, easy to use, and deploy search engine. The search and indexation are fully customizable and handles features like typo-tolerance, filters, and synonyms. For more details about those features, go to our documentation. Has its own web search interface as well as an API. Searches its indices as you type. Smart enough to figure out typos and synonyms. Customizable. Create an index, then upload documents to it.
A fast, multi-threaded application that takes apart files, indexes them, and shoves them into Elastic Search. Tries to be portable. Relies upon Elastic Search, unfortunately. Indices can be transported elsewhere (say you've indexed offline storage media) and loaded into the engine.
retrocomputing retro commodore computers c64 c128 magazine index links articles reviews
A community-built and maintained database of science fiction, fantasy, and horror that includes bibliographic data, community reviews, ISBN numbers of as many editions as people can find (of use to amateur librarians such as myself), and links to anthologies.
A search engine for Torrents which only uses Magnet links.
python module for extracting text from different documents. Can also be used as a CLI utility. Can work with text-based formats like CSV, JSON, and HTML. Can work with binary formats like MS Word, MP3, and PDF. The list is fairly extensive.
Whoosh is a Python library which implements full text searching and indexing of arbitrary text. Can be used to build custom search engines. Pythonic, fully extensible. Supposedly pretty fast, too. Multithreaded to take advantage of execution across multiple processor cores.
3712 links, including 188 private