datahoarder
Who are we?
We are digital librarians. Among us are represented the various reasons to keep data -- legal requirements, competitive requirements, uncertainty of permanence of cloud services, distaste for transmitting your data externally (e.g. government or corporate espionage), cultural and familial archivists, internet collapse preppers, and people who do it themselves so they're sure it's done right. Everyone has their reasons for curating the data they have decided to keep (either forever or For A Damn Long Time). Along the way we have sought out like-minded individuals to exchange strategies, war stories, and cautionary tales of failures.
We are one. We are legion. And we're trying really hard not to forget.
-- 5-4-3-2-1-bang from this thread
view the rest of the comments
Not sure if it's quite what you're looking for but the first thing that comes to mind for me is Ceph. It's not exactly a service in and of itself but it is self hostable/open source. I currently have a cluster set up across 3 machines with 87 TiB total space and a Ceph Filesystem and thought it was relatively easy to set up (I'd recommend first doing so in a virtual environment to get your bearings if possible, though). Said filesystem is set to make 3 replicas (1 on each machine) of any data written to it and I use Unison to sync files between local storage and the cluster (such that the whole setup is analogous to Windows/Onedrive or MacOS/iCloud). I also plan on setting up a node at my parents' house and making a new replication rule for that. As they live in a different state than me, this would amount to having hot offsite backups (for both me and them). Finally, while I haven't seen it done in practice, in theory, multiple Ceph admins may be able to configure a multi-site setup where they could trade some space on their own clusters with each other for a sort of community based storage pool/hot offsite backups (like a community ran version of Google Drive or similar where the buy-in could be some of your own storage space or money). However, it's important to note that while communications are encrypted and the storage drives can be encrypted as part of setup and operation, any data written to a cluster is not automatically encrypted and if one wants privacy, said data would need to be encrypted separately before writing it to a community cluster.
Thanks for sharing your situation/ideas!