Free and Open Source Machine Translation API, entirely self-hosted. Unlike other APIs, it doesn't rely on proprietary providers such as Google or Azure to perform translations. Instead, its translation engine is powered by the open source Argos Translate library.
Supports per-user limit quotas, e.g. you can issue API keys to users so that they can enjoy higher requests limits per minute (if you also set --req-limit). By default all users are rate-limited based on --req-limit, but passing an optional api_key parameter to the REST endpoints allows a user to enjoy higher request limits. To use API keys simply start LibreTranslate with the --api-keys option.
There are also F/OSS mobile clients for Android and browser plugins.
Recoll WebUI is a Python-based web interface for Recoll text search tool for Unix/Linux.
This is an updated clone of Koniu's original version on GitHub, which has not been updated lately, and is now slightly obsolete.
As compared to the original, this version has an additional dependancy when running the webui-standalone.py server (i.e. not with apache): it uses the python3-waitress module which must be installed. This makes webui-standalone quite suitable for moderate multiaccess loads with no authentication needs.
Required to plug Recoll into Searx.
Recoll is a desktop full-text search tool. Finds documents based on their contents as well as their file names. Can search most document formats, even if they're compressed (even Maildir/ and mailboxes). You may need external applications for text extraction. Based on Xapian. Primarily desktop but it could be run server-side. Indices are backwards-compatible.
Source code: https://framagit.org/medoc92/recoll
Flies on solid state storage!
Can be plugged into Searx: https://searx.github.io/searx/admin/engines/recoll.html
Seaborn is a Python visualization library based on matplotlib. It provides a high-level interface for drawing attractive statistical graphics.
The docs include a tutorial, example gallery, API reference, and other useful information.
Migrate YaCy's indexer to an external Solr instance?
jinabox.js is a lightweight, customizable set of components for querying and displaying results from Jina flows. You can use it for searching text, images, videos, audio or any kind of data with a Jina backend.
Jina is geared towards building search systems for any kind of data, including text, images, audio, video and many more. With the modular design & multi-layer abstraction, you can leverage the efficient patterns to build the system by parts, or chaining them into a Flow for an end-to-end experience. Large-scale indexing and querying of unstructured data: video, image, long/short text, music, source code, etc. Decentralized architecture from day one. Scalable & cloud-native by design: enjoy containerizing, distributing, sharding, async, REST/gRPC/WebSocket.
Self hostable?
InvenTree is an open-source Inventory Management System which provides powerful low-level stock control and part tracking. The core of the InvenTree system is a Python/Django database backend which provides an admin interface (web-based) and a JSON API for interaction with external interfaces and applications.
InvenTree is designed to be lightweight and easy to use for SME or hobbyist applications, where many existing stock management solutions are bloated and cumbersome to use. Updating stock is a single-action process and does not require a complex system of work orders or stock transactions.
Has a mobile app.
Look in https://github.com/Zeigren/inventree-docker/blob/master/inventree_vhost.conf to see how to proxy it with Nginx.
Free and Open Source Machine Translation API. 100% self-hosted, no limits, no ties to proprietary services. Run your own API server in just a few minutes. Playing with it a little, it seems like it might be interesting to experiment with. Supports a couple of languages right now, but at least they're useful ones.
API docs: https://libretranslate.com/docs/
Github: https://github.com/uav4geo/LibreTranslate
The back end appears to be written in Python. Examples of use are in Javascript, but that's not necessarily the way it has to be done.
Language models are kept in a different repo: https://github.com/uav4geo/LibreTranslate-Models
Paperless-ngx is a document management system that transforms your physical documents into a searchable online archive so you can keep, well, less paper. Paperless-ngx forked from paperless-ng to continue the great work and distribute responsibility of supporting and advancing the project among a team of people.
Paperless-ngx is a webapp that indexes your scanned documents and allows you to easily search for documents and store metadata alongside your documents. Paperless-ngx does not control your scanner, it only helps you deal with what your scanner produces.
Store archived documents with an embedded OCR text layer, while keeping originals available.
EleutherAI is a grassroots AI research group aimed at democratizing and open sourcing AI research. Multiple projects and usable training corpora. F/OSS model called GPT-Neo.
Several spinoff projects to investigate.
filebrowser provides a file managing interface within a specified directory and it can be used to upload, delete, preview, rename and edit your files. It allows the creation of multiple users and each user can have its own directory. It can be used as a standalone app or as a middleware. Install it on a server, direct it to a path and then access your files through a nice web interface. Login-mediated. Built-in file editor. Upload and download. Custom command execution on the server. Customizable.
Rhasspy (pronounced RAH-SPEE) is an offline, multilingual voice assistant toolkit inspired by Jasper that works well with Home Assistant, Hass.io, and Node-RED. Designed so that you don't have to use any not-self-hosted software under the hood, from speech recognition to TTS. Emits JSON events. Vocabulary can be expanded with the automated assistance feature. Will run on something as simple as a RasPi but doesn't treat x86(-64) like a second-class citizen. Commands/intents are specified in a fairly easy templating language.
Greyprints for 3d printing a modular hard drive rack. Modular. Mount a bunch of standard PC hard drives vertically. Bolt some fans onto the side to blow air across them and keep 'em cool. Seems like this would be an ideal setup to use with some hot-swappable drive connectors.
Downloaded to Windbringer. Fab, rivet into Leandra's chassis to replace the awkward steel drive backet that requires pulling out all the cards.
Self hosted recipe management webapp. Create, store, share, document recipes. Uses Django as its framework. Designed with a small group of people (like a family) in mind. Automatically generates shopping lists. Quickly print recipes.
Has docs explicitly for running without Docker: https://github.com/open-eats/OpenEats/blob/master/docs/Running_the_App_Without_Docker.md
Has a REST API.
A PHP script which not only prettifies file directories shown by your web server, but adds context. Also seems to work around hardcoded limitations (like those in Nginx, which you can only fix by hacking the source and recompiling). Seems designed to use as many versions of PHP as possible for compatibility. Can be customized to make it easier to find things.
Filestash lets you easily manage your data regardless of where it is stored: FTP / SFTP / Git / S3 / Minio / Dropbox / Google Drive
Graphical file manager. Lets you create mountable shared drives on a server for collaboration. Plugin architecture so it can be customized. Has its own text editor. Tries to run on as many platforms as possible, from desktop to mobile. Tries to be lightweight.
A tool which interfaces with Signal on behalf of one of your devices to send text messages. Designed for use in scripts. Written in Java, in the AUR, I probably already have it installed on Leandra.
Free Software for your own Search Engine, Explorer for Discovery of large document collections, Media Monitoring, Text Analytics, Document Analysis & Text Mining platform based on Apache Solr or Elasticsearch open-source enterprise-search and Open Standards for Linked Data, Semantic Web & Linked Open Data integration.
Usage tutorial here: https://www.opensemanticsearch.org/doc/tutorial
Github: https://github.com/opensemanticsearch
Of course it has an API: https://www.opensemanticsearch.org/doc/admin/rest-api
A personal terminal-based dashboard utility, designed for displaying infrequently-needed, but very important, daily data. Updated fairly regularly with new services. Uses YAML files for configuration, so you don't (literally) have to write your custom dashboard from scratch.
Website: https://wtfutil.com/