10 results tagged vectors
perone/vectorvfs https://github.com/perone/vectorvfs
Tue 26 Aug 2025 02:32:36 PM PDT archive.org

VectorVFS is a lightweight Python package that transforms your Linux filesystem into a vector database by leveraging the native VFS (Virtual File System) extended attributes. Rather than maintaining a separate index or external database, VectorVFS stores vector embeddings directly alongside each file—turning your existing directory structure into an efficient and semantically searchable embedding store.
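
The idea of keeping an embedding in a file's extended attributes can be sketched roughly as follows. This is not VectorVFS's actual code or API: the attribute name and all four function names are hypothetical, chosen only for illustration, and the storage calls assume Linux with a filesystem that allows user extended attributes.

```python
import os
import struct

# Hypothetical attribute name; the name VectorVFS really uses is not given above.
XATTR_NAME = "user.demo.embedding"


def pack_embedding(vector):
    """Serialize a list of floats as little-endian float32 values."""
    return struct.pack(f"<{len(vector)}f", *vector)


def unpack_embedding(blob):
    """Inverse of pack_embedding: bytes back to a list of floats."""
    count = len(blob) // 4  # 4 bytes per float32
    return list(struct.unpack(f"<{count}f", blob))


def store_embedding(path, vector):
    """Attach the embedding to the file itself (Linux only)."""
    os.setxattr(path, XATTR_NAME, pack_embedding(vector))


def load_embedding(path):
    """Read the embedding back from the file's extended attributes."""
    return unpack_embedding(os.getxattr(path, XATTR_NAME))
```

Size limits on extended attributes vary by filesystem, so whether a given embedding fits would need checking on the target system.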

linux python filesystems vectors metadata
doganarif/vectordb https://github.com/doganarif/vectordb
Wed 13 Aug 2025 01:56:40 AM PDT archive.org

In-memory vector database with pluggable indexing algorithms, metadata filtering, and a FastAPI-based REST API.

  • Multiple Indexing Algorithms: Linear, KD-Tree, and LSH (Locality Sensitive Hashing).
  • Flexible Similarity Metrics: Cosine similarity and Euclidean distance.
  • Metadata Filtering: Filter search results by custom metadata.
  • Persistence: Snapshot and restore functionality for data durability.
  • Thread-Safe: Custom reader-writer locks for concurrent operations.
  • In-Memory: Fast access with in-memory storage.
  • RESTful API: Full CRUD operations via FastAPI.
  • Python SDK: Native client library for seamless integration.
  • Embeddings API: Integrated Cohere support for text embeddings.
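
To make the combination of a linear index, cosine similarity, and metadata filtering concrete, here is a minimal sketch using only the standard library. It does not use the project's SDK or REST API; the record layout and the names linear_search and where are assumptions made for this example.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def linear_search(records, query, k=3, where=None):
    """Scan every record, keep those whose metadata matches `where`,
    and return the k most similar as (id, score) pairs.

    records: list of (id, vector, metadata_dict) tuples (assumed layout).
    """
    where = where or {}
    scored = [
        (cosine_similarity(vec, query), rec_id)
        for rec_id, vec, meta in records
        if all(meta.get(key) == value for key, value in where.items())
    ]
    scored.sort(reverse=True)  # highest similarity first
    return [(rec_id, score) for score, rec_id in scored[:k]]


records = [
    ("a", [1.0, 0.0], {"lang": "en"}),
    ("b", [0.9, 0.1], {"lang": "de"}),
    ("c", [0.0, 1.0], {"lang": "en"}),
]
```

Calling linear_search(records, [1.0, 0.0], k=2, where={"lang": "en"}) considers only records "a" and "c". A linear scan is exact but touches every vector, which is the cost that KD-trees and LSH try to avoid.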

python servers vectors databases rest api webapps exocortex
unum-cloud/usearch https://github.com/unum-cloud/usearch
Tue 01 Apr 2025 01:35:24 PM PDT archive.org

A fast open source vector search and clustering engine. API bindings for multiple languages. Tries to be simple to use and extensible; if you're using it with C++ you only need to import one header file. Tries to be hardware agnostic; supports half-precision and quarter-precision with 16-bit floats and 8-bit integers, respectively. Can scan very large indices without loading the entire file into memory; implicitly supports serializing indices to disk. Heterogeneous lookups, renaming/relabeling, and on-the-fly deletions. Supports semantic search. Supports exact and approximate search.
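
As a rough illustration of what storing vectors as 8-bit integers involves, the sketch below applies simple per-vector scalar quantization. This is a generic technique, not necessarily the scheme USearch uses internally, and the function names are hypothetical.

```python
def quantize_int8(vector):
    """Map a non-empty list of floats to integers in [-127, 127],
    using one scale factor per vector."""
    peak = max(abs(x) for x in vector)
    scale = peak / 127.0 if peak > 0 else 1.0
    return [round(x / scale) for x in vector], scale


def dequantize_int8(codes, scale):
    """Approximate reconstruction of the original floats."""
    return [code * scale for code in codes]
```

Each component then needs one byte instead of four (float32), at the cost of a rounding error of at most about half the scale factor per component.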

cpp python search vectors api library indexing
Krita https://krita.org/
Tue 24 Sep 2024 02:08:23 PM PDT archive.org

Krita is a professional FREE and open source painting program. It is made by artists that want to see affordable art tools for everyone. Has an intuitive user interface that stays out of your way. The docks and panels can be moved and customized for your specific workflow. Once you have your setup, you can save it as your own workspace. You can also create your own shortcuts for commonly used tools. Over 100 professionally made brushes, stabilizers for them in case your hand isn't the steadiest, built-in vector drawing tools, customizable and constructable brushes, wrap-around mode for seamless textures, and a resource manager to import and export tools and packs from other users.

Supports 2d animation. Multiple layers and audio support, thousands of frames on the video timeline, onion skinning for tweening, drag-and-drop of frames, shortcuts, and performance tweaking.

Source code: https://invent.kde.org/graphics/krita

art images vectors svg editors animation crossplatform videos kde qt
UKPLab/sentence-transformers https://github.com/UKPLab/sentence-transformers
Tue 09 Jul 2024 01:42:17 PM PDT archive.org

This framework provides an easy method to compute dense vector representations for sentences, paragraphs, and images. The models are based on transformer networks like BERT / RoBERTa / XLM-RoBERTa etc. and achieve state-of-the-art performance in various tasks. Text is embedded in vector space such that similar texts are closer together and can be found efficiently using cosine similarity. We provide an increasing number of state-of-the-art pretrained models for more than 100 languages, fine-tuned for various use-cases. Further, this framework allows easy fine-tuning of custom embedding models, to achieve maximal performance on your specific task. CUDA enabled.

Seems to lend itself to research coding. The real winner here is that you can generate embeddings and vectors for arbitrary text, which would make it ideal for writing a utility that could do only this without a lot of heavy lifting.

Comes with pre-trained models for over 100 languages. Has documentation and examples for building your own models.

python modules vectors nlp images models
objectbox/objectbox-python https://github.com/objectbox/objectbox-python
Wed 29 May 2024 12:58:15 PM PDT archive.org

ObjectBox Python is a lightweight yet powerful on-device database & vector database. Store Python objects and vectors directly with an easy-to-use CRUD API while enjoying exceptional speed and efficiency. And because it's an embedded database, there's no setup required. Its advanced vector search empowers AI applications including RAG, generative AI, and similarity searches. Designed for high performance, the ObjectBox database runs locally on-device. As an offline-first solution, ObjectBox makes sure your app reliably works offline as well as online.

ACID compliant. Claims to be scalable and sustainable (i.e., it doesn't need a lot of horsepower to run). Queries can be filtered, even across relations.

python modules embedded databases vectors exocortex
neuml/txtai https://github.com/neuml/txtai
Mon 12 Feb 2024 11:09:06 AM PST archive.org

Embeddings databases are a union of vector indexes (sparse and dense), graph networks and relational databases. This enables vector search with SQL, topic modeling, retrieval augmented generation and more. Embeddings databases can stand on their own and/or serve as a powerful knowledge source for large language model (LLM) prompts.

Features

  • Vector search with SQL, object storage, topic modeling, graph analysis and multimodal indexing
  • Create embeddings for text, documents, audio, images and video
  • Pipelines powered by language models that run LLM prompts, question-answering, labeling, transcription, translation, summarization and more
  • Workflows to join pipelines together and aggregate business logic. txtai processes can be simple microservices or multi-model workflows.
  • Build with Python or YAML. API bindings available for JavaScript, Java, Rust and Go.
  • Run local or scale out with container orchestration

python ai ml vectors databases search exocortex
weaviate/weaviate https://github.com/weaviate/weaviate
Tue 30 Jan 2024 01:45:01 PM PST archive.org

Weaviate is an open source vector database that stores both objects and vectors, allowing for combining vector search with structured filtering with the fault-tolerance and scalability of a cloud-native database, all accessible through GraphQL, REST, and various language clients.

With Weaviate, you can turn your text, images and more into a searchable vector database using state-of-the-art ML models. Weaviate typically performs a 10-nearest-neighbor (10-NN) search out of millions of objects in single-digit milliseconds. You can use Weaviate to conveniently vectorize your data at import time, or alternatively you can upload your own vectors (say, if you download a model from OpenAI or HuggingFace). Weaviate powers lightning-fast vector searches, but it is capable of much more. Some of its other superpowers include recommendation, summarization, and integrations with neural search frameworks.

golang vectors ai ml search exocortex databases
milvus-io/milvus https://github.com/milvus-io/milvus
Tue 30 Jan 2024 01:42:30 PM PST archive.org

Milvus is an open-source vector database built to power embedding similarity search and AI applications. Milvus makes unstructured data search more accessible, and provides a consistent user experience regardless of the deployment environment.

Millisecond search on trillion vector datasets. Rich APIs designed for data science workflows. Consistent user experience across laptop, local cluster, and cloud. Embed real-time search and analytics into virtually any application. Component-level scalability makes it possible to scale up and down on demand. Milvus can autoscale at a component level according to the load type, making resource scheduling much more efficient.

golang search vectors ai ml exocortex
Chroma: The AI-native open-source embedding database https://github.com/chroma-core/chroma
Fri 08 Dec 2023 12:35:53 PM PST archive.org

A database that tries to make it easy to build an LLM-like search database. Super-simple API for loading data and querying it.

You can do everything in your code or run it as a server (chroma run --path /path/to/datastore/on/disk) and use an HTTP client to interact with it.

python modules ai ml databases search integrations llm vectors exocortex