This repository takes a clear, hands-on approach to Retrieval-Augmented Generation (RAG), breaking down advanced techniques into straightforward, understandable implementations. Instead of relying on frameworks like LangChain or FAISS, everything here is built using familiar Python libraries openai, numpy, matplotlib, and a few others.
The goal is simple: provide code that is readable, modifiable, and educational. By focusing on the fundamentals, this project helps demystify RAG and makes it easier to understand how it really works.
Golem is a demonstration of how to distribute content over ActivityPub securely over peer to peer networks.
The problems this demo is trying to address are:
By encrypting the file and splitting it into chunks distributed through the network and only sharing the decryption key with the intended recipient, and by using a URI scheme that captures the appropriate information, we can accomplish all the above. Golem uses the magenc extension of the magnet URI scheme to accomplish the above.
A standard for transferring Files between airgapped units over a simplex transmission agnostic to the medium, transfer speed and distribution network. Most existing Protocols don't handle simplex transmissions at all or very poorly.
The data to be tansmitted is to be collected in a container [this may be a folder or tar archive]. The container is then optionally compressed and encrypted. The container is hashed (SHA3-512 is recommended). A header is used to communicate the essential parameters, including compression algorithm, encryption used, checksum, container size and number of fragments, timestamp, and checksum of the header.
Documentation aside, proof of concept of each feature is part of the repo in the form of shell scripts.
A brilliant proof of concept CAPTCHA system that requires the user to make arbitrary obscene gestures at the camera.
A short PHP script that demonstrates how to do caller ID spoofing in Asterisk. Reading through the code should explain how to do it in other languages.
A utility that, when given a CVE searches Github for a PoC of the vulnerability.
Self-made C++ file archiver and archive extractor programs based on Huffman's lossless compression algorithm.
My first test of RTL-SDR to read signals from a water meter transmitter using the rtl_433 utility.
This is supposed to be a follow-up to my Reddit post in r/RTLSDR sub.
Errol is a file sender that rely on inotify. It can be used to watch a directory and automatically transfers the new files (or modified ones) with XMPP. Requires inotify support (which is a given - Linux). Uses the slixmpp Python module.
Requires XEPs 0198 (stream management), 0060 (pubsub), 0045 (MUC) on the server. Pubsub nodes need to be set to "open." Has links to a couple of PoC tests to make sure the server's set up correctly.
I think I can use this to figure out how to write a File Transfer Bot.
XEP-0363 CLI upload tool. Written in Python, uses SleekXMPP. Give it login credentials to an XMPP server and a filename, and it'll upload the file to the server.