Created in response to the environs of apathy concerning the use of hypertext search and discovery. In Lieu, the internet is not what is made searchable, but instead one's own neighbourhood. Put differently, Lieu is a neighbourhood search engine, a way for personal webrings to increase serendipitous connexions.
Lieu's crawl & precrawl commands output to standard output, for easy inspection of the data. You typically want to redirect their output to the files Lieu reads from, as defined in the config file. See below for a typical workflow.
Wavelog is a self-hosted PHP application that allows you to log your amateur radio contacts anywhere. All you need is a web browser and active internet connection. Wavelog itself is an enhanced fork of Cloudlog by 2M0SQL.
DrawDB is a robust and user-friendly database entity relationship (DBER) editor right in your browser. Build diagrams with a few clicks, export SQL scripts, customize your editor, and more without creating an account.
Today, I'm taking this request one step further and publishing the entire list of self-hosted software projects I follow and track in a browsable format. The list of software covers a wide range of self-hosted projects (both open- and closed-source) and includes filters for tags and popular hosted alternatives as well as sort options based on the number of repository stars, date of last development activity, and alphabetical order.
Cross-platform, open-source voice assistant and framework to build fully-featured, offline machines you can talk to. Self-hosted. Desktop and mobile clients. Repos of note:
Dicio is a free and open source voice assistant running on Android. It supports many different skills and input/output methods, and it provides both speech and graphical feedback to a question. It uses Vosk for speech to text. It has multilanguage support, and is currently available in these languages: English, French, German, Greek, Italian, Russian, Slovenian and Spanish.
Available on F-Droid, Google Play, and as an .apk file from GitHub.
A Shaarli browser extension using the API for both Firefox and Chrome-based browsers. It features add/edit and search of bookmarks for your Shaarli instance.
Firefox: https://addons.mozilla.org/firefox/addon/shaanti/
Chrome: https://chromewebstore.google.com/detail/shaanti/bfecpppjnokkpdegijfgbldholankami
This is a community-sourced collection of scripts that extend Paperless-ngx in various ways.
Embeddings databases are a union of vector indexes (sparse and dense), graph networks and relational databases. This enables vector search with SQL, topic modeling, retrieval augmented generation and more. Embeddings databases can stand on their own and/or serve as a powerful knowledge source for large language model (LLM) prompts.
Features
Your AI second brain. A copilot to search and chat (using RAG) with your knowledge base (PDF, Markdown, Org). Use powerful online (e.g. GPT-4) or private offline (e.g. Mistral) LLMs. Self-host locally or have it always accessible in the cloud. Access from Obsidian, Emacs, the desktop app, the web or WhatsApp.
Khoj is an AI application to search and chat with your notes and documents. It is open-source, self-hostable and accessible on Desktop, Emacs, Obsidian, Web and WhatsApp. It works with PDF, Markdown, Org-mode and Notion files, and GitHub repositories. It can paint, search the internet and understand speech.
Weaviate is an open source vector database that stores both objects and vectors. It lets you combine vector search with structured filtering, with the fault tolerance and scalability of a cloud-native database, all accessible through GraphQL, REST, and various language clients.
With Weaviate, you can turn your text, images and more into a searchable vector database using state-of-the-art ML models. Weaviate typically performs a 10-nearest-neighbor (10-NN) search out of millions of objects in single-digit milliseconds. You can use Weaviate to conveniently vectorize your data at import time, or alternatively you can upload your own vectors (say, if you download a model from OpenAI or HuggingFace). Weaviate powers lightning-fast vector searches, but it is capable of much more. Some of its other superpowers include recommendation, summarization, and integrations with neural search frameworks.
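To make the "10-NN" idea above concrete: a k-nearest-neighbour search returns the k stored vectors closest to a query vector. The sketch below is a brute-force version in plain Python using cosine similarity. It is not how Weaviate works internally (it relies on an approximate index to reach millisecond latency on millions of objects), and the names cosine_similarity and knn_search and the toy vectors are made up for illustration.

```python
import heapq
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat zero vectors as dissimilar to everything
    return dot / (norm_a * norm_b)


def knn_search(query, vectors, k=10):
    """Return the k (id, score) pairs most similar to query, best first.

    Brute force: every stored vector is scored, so the cost grows
    linearly with the size of the collection.
    """
    scored = (
        (cosine_similarity(query, vec), obj_id)
        for obj_id, vec in vectors.items()
    )
    top = heapq.nlargest(k, scored)
    return [(obj_id, score) for score, obj_id in top]


# Toy data: hypothetical 3-dimensional "embeddings".
vectors = {
    "cat": [0.9, 0.1, 0.0],
    "kitten": [0.8, 0.2, 0.1],
    "car": [0.1, 0.9, 0.3],
    "truck": [0.0, 0.8, 0.5],
}

results = knn_search([1.0, 0.0, 0.0], vectors, k=2)
print(results)  # "cat" first, then "kitten"
```

A real vector database replaces the linear scan with an index, trading a little accuracy for a large speed-up.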
Milvus is an open-source vector database built to power embedding similarity search and AI applications. Milvus makes unstructured data search more accessible, and provides a consistent user experience regardless of the deployment environment.
Millisecond search on trillion vector datasets. Rich APIs designed for data science workflows. Consistent user experience across laptop, local cluster, and cloud. Embed real-time search and analytics into virtually any application. Component-level scalability makes it possible to scale up and down on demand. Milvus can autoscale at a component level according to the load type, making resource scheduling much more efficient.
A privacy-first, self-hosted, fully open source personal knowledge management software, written in TypeScript and Go. Supports fine-grained block-level references and Markdown WYSIWYG editing. Data is saved in the workspace data folder. Synchronizing that folder through third-party sync disks is not supported and may corrupt data. Connecting to third-party cloud storage is supported, however.
Johnny.Decimal is a system to organise your digital life. It’s designed to help you find things quickly, with more confidence, and less stress. In real life, if you stored your stuff in piles of badly-labelled boxes you’d never find anything again. If you put those boxes in boxes, in boxes, you’d never know which box to open to find the next box. It would be chaos. But I just described how you save your computer files.
Imagine your computer as a physical storage space. We can’t put everything on the floor, so we buy some shelves. If we had a limitless number of shelves, we wouldn’t know which one to look on when we wanted to find something. So we get ten shelves. We decide to dedicate each shelf to an area of our life.
Collie is a minimal RSS feed reader application. With Collie, you can:
All you need is a local machine and the Internet. No virtual machine, no cloud infrastructures, no always-on database, and no account registration with privacy information required.
Filter for uBlock Origin to remove spam-website results from DuckDuckGo and Google that just blatantly copy and paste content from well-known websites. This list is just a personal collection of annoying, unhelpful websites.
I think the combined-list.txt file could be used to build a crapsite filtering database.
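One way that idea could look in practice: load a list of blocked domains and drop any search result whose host matches one of them. This is a rough sketch only. It assumes one bare domain per line with "!" or "#" comment lines, which may not match the actual layout of combined-list.txt, and load_blocklist, is_blocked and the example domains are hypothetical.

```python
from urllib.parse import urlsplit


def load_blocklist(lines):
    """Collect lowercase domains from an iterable of text lines.

    Assumed format: one bare domain per line; blank lines and lines
    starting with '!' or '#' are treated as comments.
    """
    domains = set()
    for line in lines:
        entry = line.strip().lower()
        if not entry or entry[0] in "!#":
            continue
        domains.add(entry)
    return domains


def is_blocked(url, blocklist):
    """True if the URL's host, or any parent domain of it, is listed."""
    host = (urlsplit(url).hostname or "").lower()
    parts = host.split(".")
    # Check "a.b.example.com", then "b.example.com", then "example.com", ...
    return any(".".join(parts[i:]) in blocklist for i in range(len(parts)))


# Hypothetical list contents and search results.
blocklist = load_blocklist([
    "! spam mirrors",
    "copycat.example",
    "",
    "scraper.example",
])

results = [
    "https://docs.python.org/3/library/urllib.parse.html",
    "https://www.copycat.example/stolen-article",
    "http://scraper.example/page?id=1",
]

kept = [url for url in results if not is_blocked(url, blocklist)]
print(kept)
```

Since load_blocklist accepts any iterable of lines, an open file handle can be passed to it directly, provided the file really is in the assumed format.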
A database that tries to make it easy to build an LLM-like search database. Super-simple API for loading data and querying it.
You can do everything in your code or run it as a server (chroma run --path /path/to/datastore/on/disk) and use an HTTP client to interact with it.
In today's workplace, essential information is often scattered across the cloud in the form of links. We understand the frustration of endlessly searching through emails, messages, and websites just to find the right link. Links are notorious for being unwieldy, complex, and easily lost in the shuffle. Remembering and sharing them can be a challenge.
That's why we developed Slash, a solution that transforms these links into easily accessible, discoverable, and shareable shortcuts (e.g., s/shortcut). Say goodbye to link chaos and welcome the organizational ease of Slash into your daily online workflow.
Customizable short link generator. Visibility restrictions (logged in or not? team or not?). Has browser extensions. Looks like it uses SQLite as its back end.
Take apart the Dockerfile to figure out how to build the webshit. At least the compilation process is straightforward: CGO_ENABLED=0 go build -o slash ./bin/slash/main.go
Most APIs are undocumented and hidden in plain sight. Being able to find these APIs can provide a rich, reliable, and scalable method of building your own datasets. Learn how to find them in the wild, and how they’ve been used in past investigations.
Ghostfolio is an open source wealth management software built with web technology. The application empowers busy people to keep track of stocks, ETFs or cryptocurrencies and make solid, data-driven investment decisions. The software is designed for personal use in continuous operation.
In theory you can self-host it. It looks like another case of picking apart the Dockerfile.