Hacker News

I always wondered but never bothered to implement:

What if I implemented S3-backed block storage? Every 16 MB chunk is stored in a separate object, and the filesystem driver downloads/uploads chunks just like it does with an HDD.

And then format this block device with ext4, or use it in any other way (LVM, encrypted FS, RAID, and so on).

Is it a usable concept?
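The core of the idea is just address arithmetic: map a device offset to an object key plus an offset within that chunk. Here's a minimal sketch, assuming fixed 16 MiB chunks and one object per chunk; the key scheme and the `fetch` callback (which would be an S3 GET in practice) are hypothetical:

```python
# Hypothetical sketch: mapping block-device offsets to per-chunk
# object keys, assuming fixed 16 MiB chunks, one object per chunk.
CHUNK_SIZE = 16 * 1024 * 1024

def chunk_key(offset: int) -> tuple[str, int]:
    """Return (object key, offset within that chunk) for a device offset."""
    index = offset // CHUNK_SIZE
    return f"chunks/{index:08d}", offset % CHUNK_SIZE

def read(offset: int, length: int, fetch) -> bytes:
    """Read `length` bytes at `offset`, fetching whole chunks via
    fetch(key) (e.g. an S3 GET) and slicing locally."""
    out = b""
    while length > 0:
        key, within = chunk_key(offset)
        chunk = fetch(key)                       # one GET per chunk touched
        piece = chunk[within:within + length]    # slice is clipped at chunk end
        out += piece
        offset += len(piece)
        length -= len(piece)
    return out
```

Writes would be the expensive part: a 4 KiB filesystem write turns into a 16 MiB read-modify-write of the whole object unless the store supports partial updates.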



What you're describing already exists in Azure Blob Storage (Azure's S3 competitor) as Page Blobs, which allow page-level (i.e. disk-block-level) reads/writes; they're also how VM disks (VHDs) are hosted in Azure.

I'm not clued-in to AWS's equivalent, but a quick google suggests it's AWS EBS (Elastic Block Storage), though it doesn't seem to coexist side-by-side with normal S3 storage, and their API for reading/writing blocks looks considerably more complicated than Azure's: https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ebs-acce...


Azure blob storage took me by surprise when I had to load a custom VHD. My experience with AWS/EC2/S3 led me to think it would take half a day or longer.

Turns out, there are no transformation layers or anything. You upload the image using a special tool, it saturates your uplink for a few minutes, and before you can brew some more coffee the VM will be booting up.


Their FUSE-based filesystem driver even supported CACHED block access, i.e. you would download, cache, and read the blocks you needed locally instead of sending every read over the wire. Close to local-FS performance for little space.
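The caching layer described here can be as simple as an LRU map in front of the network fetch. A minimal sketch (the `BlockCache` class and its `fetch` callback are hypothetical, not from any particular driver):

```python
# Hypothetical sketch of the cached-block idea: keep recently used
# chunks locally so repeated reads skip the network.
from collections import OrderedDict

class BlockCache:
    def __init__(self, fetch, max_blocks: int = 64):
        self.fetch = fetch            # cold-read path, e.g. an HTTP GET
        self.max_blocks = max_blocks
        self.cache = OrderedDict()    # key -> block bytes, in LRU order
        self.hits = self.misses = 0

    def get(self, key: str) -> bytes:
        if key in self.cache:
            self.cache.move_to_end(key)     # mark as recently used
            self.hits += 1
            return self.cache[key]
        self.misses += 1
        block = self.fetch(key)             # only cold reads go over the wire
        self.cache[key] = block
        if len(self.cache) > self.max_blocks:
            self.cache.popitem(last=False)  # evict least recently used
        return block
```

With 16 MiB chunks, even a few dozen cached blocks cover the hot metadata and journal regions a filesystem hammers repeatedly, which is where the "close to local FS" feel comes from.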


How does that handle concurrent writes to the same block? How is the cache invalidated?


The single FUSE process can coordinate all that with an in-memory mutex.

This thread is in the context of filesystems like ext4, which are completely unsafe for concurrent mounts - if you connect from multiple machines then it will just corrupt horrifically. If your filesystem was glusterfs, moose/lizard, or lustre, then they have ways to handle this (with on-disk locks/semaphores in some other sector).
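For the single-mount case, "coordinate with an in-memory mutex" can literally be a per-block lock table inside the one FUSE daemon. A hypothetical sketch (names are illustrative; this does nothing for the multi-machine case described above):

```python
# Hypothetical sketch: a single FUSE daemon serializing writes to the
# same block with per-block locks. Safe ONLY within one mount/process;
# it cannot protect against a second machine mounting the same device.
import threading
from collections import defaultdict

class BlockLocks:
    def __init__(self):
        self._guard = threading.Lock()
        self._locks = defaultdict(threading.Lock)

    def locked_write(self, index: int, write_fn):
        with self._guard:
            lock = self._locks[index]   # one lock per block index
        with lock:                      # writers to the same block queue up
            return write_fn()
```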


Behind the scenes, EBS Snapshots are stored in S3. When you first create an EBS volume from a snapshot, it pulls blocks from S3 on-demand.


Not sure if it's a usable concept but it sounds similar to what someone else did[1]:

> ...I implemented a virtual file system that fetches chunks of the database with HTTP Range requests when SQLite tries to read from the filesystem...

[1]: https://phiresky.github.io/blog/2021/hosting-sqlite-database...
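The mechanism behind that trick is a plain HTTP Range request: ask the server for only the byte window the reader touched. A minimal sketch, assuming a server that honors ranges with 206 Partial Content (the function names are illustrative, not from the linked post):

```python
# Hypothetical sketch: fetch only bytes [offset, offset+length) of a
# remote file, the way a Range-request-backed VFS would.
import urllib.request

def range_header(offset: int, length: int) -> str:
    # RFC 7233 byte ranges are inclusive on both ends.
    return f"bytes={offset}-{offset + length - 1}"

def read_range(url: str, offset: int, length: int) -> bytes:
    """One Range request; the server must reply 206 Partial Content."""
    req = urllib.request.Request(
        url, headers={"Range": range_header(offset, length)})
    with urllib.request.urlopen(req) as resp:
        return resp.read()
```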




It uses S3 as a filesystem. I'm talking about using S3 as a block device. Slightly different approach.


Sounds like a very expensive way to build a filesystem.


Yes, it's called s3backer.


Thanks, that's the thing.



