lakeFS is an open-source tool that transforms your object storage into a Git-like repository. It enables you to manage your data lake the way you manage your code. With lakeFS you can build repeatable, atomic, and versioned data lake operations - from complex ETL jobs to data science and analytics. lakeFS supports AWS S3, Azure Blob Storage, and Google Cloud Storage as its underlying storage service. It is API compatible with S3 and works seamlessly with all modern data frameworks such as Spark, Hive, AWS Athena, DuckDB, and Presto.
Doesn't force you to use Docker.
Dolt is a SQL database that you can fork, clone, branch, merge, push and pull just like a git repository. Connect to Dolt just like any MySQL database to run queries or update the data using SQL commands. Use the command line interface to import CSV files, commit your changes, push them to a remote, or merge your teammate's changes.
All the commands you know for Git work exactly the same for Dolt. Git versions files, Dolt versions tables. It's like Git and MySQL had a baby.
Dolt ships with a MySQL compatible database server built in.
A distributed version control system, similar to Git or Mercurial, only it also includes a bugtracker, a wiki, and a blog for collaboration in the same package. Includes a web interface for everything. Can automatically sync with other instances. Self contained.
A short and sweet page about how easy it is to fuck up using git, giving specific examples, and how to get out of those situations.