I’m looking for experiences and opinions on Kubernetes storage.
I want to create a highly available homelab that spans 3 locations, where the pods have a preferred location but can move if necessary.
I’ve looked at LINSTOR, and at SeaweedFS or Garage with JuiceFS on top, but I’m not sure how those options perform across the internet or how they hold up in long-term operation. Is anyone else hosting k3s across the internet in their homelab?
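For the Garage + JuiceFS option specifically, the JuiceFS CSI driver exposes it as an ordinary StorageClass. A minimal sketch, assuming Garage as the S3 backend and Redis as the metadata engine (all names, endpoints, and keys below are placeholders; the secret layout follows the JuiceFS CSI docs):

```yaml
# Secret with the JuiceFS volume definition (all values are examples).
apiVersion: v1
kind: Secret
metadata:
  name: juicefs-secret
  namespace: kube-system
stringData:
  name: homelab-jfs                      # JuiceFS volume name
  metaurl: redis://redis.example:6379/1  # metadata engine (latency-sensitive)
  storage: s3                            # Garage speaks the S3 API
  bucket: https://garage.example:3900/homelab-jfs
  access-key: EXAMPLEKEY
  secret-key: EXAMPLESECRET
---
apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: juicefs-sc
provisioner: csi.juicefs.com
parameters:
  csi.storage.k8s.io/provisioner-secret-name: juicefs-secret
  csi.storage.k8s.io/provisioner-secret-namespace: kube-system
  csi.storage.k8s.io/node-publish-secret-name: juicefs-secret
  csi.storage.k8s.io/node-publish-secret-namespace: kube-system
```

Worth noting that JuiceFS hits the metadata engine on every file operation, so where Redis lives matters as much as where the objects live when sites are linked over the internet.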
I’ve been using Backblaze B2 (via an s3fs-fuse container + bidirectional mount propagation to a host path) and a little bit of Google Drive (via an rclone mount + the same mounting business) within Kubernetes. I only use this for TubeArchivist, which I consider disposable. No way I’m using these “devices” for anything I really care about. I haven’t tried gauging the performance of either, but I can say, anecdotally, that both are fine for TubeArchivist to write to in a reasonable amount of time (the bottleneck is yt-dlp ingesting from YouTube), and playback seems on par with local storage in both the embedded TubeArchivist player and Jellyfin. I’ve had no issues in about a year of use, and overall I feel it’s a decent solution if you need a lot of cheap-ish storage that you’re okay with not trusting.
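In case anyone wants to copy the pattern, the mount-propagation part looks roughly like this; the image name, bucket, and paths are placeholders, and credentials are omitted. The privileged flag is there because FUSE and Bidirectional propagation require it:

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: b2-mounter
spec:
  containers:
  - name: s3fs
    image: example/s3fs-fuse:latest        # placeholder image
    securityContext:
      privileged: true                     # required for FUSE + Bidirectional
    command: ["s3fs", "my-bucket", "/mnt/b2", "-f",
              "-o", "url=https://s3.us-west-004.backblazeb2.com"]
    volumeMounts:
    - name: shared
      mountPath: /mnt/b2
      mountPropagation: Bidirectional      # push the FUSE mount back to the host
  volumes:
  - name: shared
    hostPath:
      path: /mnt/b2                        # host path other pods bind-mount from
      type: DirectoryOrCreate
```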
Longhorn is pretty easy to use. Garage works well too. Ceph is harder to use but provides both block and object storage (S3).
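If it helps, “easy” on the Longhorn side really means the consumer just writes an ordinary PVC against its default StorageClass, something like:

```yaml
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: app-data
spec:
  accessModes: ["ReadWriteOnce"]
  storageClassName: longhorn   # Longhorn’s default StorageClass
  resources:
    requests:
      storage: 10Gi
```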
Ceph (and Longhorn) want “10 Gbps network bandwidth between nodes”, while I’ll have around 1 Gbps between nodes, or even less.
What’s your experience with Garage?
That isn’t how you would normally do it.
You don’t want to try to span locations at the container/hypervisor level. The problem is that there is likely too much latency between the sites, which will screw with things. Instead, set up replicated data stores where it’s necessary (see the sketch below).
What are you trying to accomplish from this?
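To illustrate what replicating at the data layer looks like: one datastore instance per site, each pinned with a nodeSelector, with the application (e.g. Postgres streaming replication) doing the sync rather than the storage layer. Labels, names, and images here are just examples:

```yaml
apiVersion: apps/v1
kind: StatefulSet
metadata:
  name: pg-site-a
spec:
  serviceName: pg-site-a
  replicas: 1
  selector:
    matchLabels: {app: pg, site: a}
  template:
    metadata:
      labels: {app: pg, site: a}
    spec:
      nodeSelector:
        topology.kubernetes.io/zone: site-a   # pin this replica to one location
      containers:
      - name: postgres
        image: postgres:16
        env:
        - name: POSTGRES_PASSWORD
          value: example                      # use a Secret in practice
# A matching pg-site-b StatefulSet runs at the other site; Postgres streaming
# replication keeps the two in sync, not Kubernetes or the storage layer.
```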
The problem is that I want failover to work if a site goes offline, which happens quite a bit with residential ISPs where I live; instead of waiting for the connection to be restored, my idea was that Kubernetes would see the failed node and reschedule the workloads elsewhere.
Most data would be transferred locally (with node affinity), and only on failure would the pods spread out. The remaining problem was storage, which is why I’m here looking for options.
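Concretely, the affinity I mean is roughly this: a preferred (not required) node affinity toward the home site, so the scheduler falls back to the other sites when it’s down. The zone label values are just whatever you tag your sites with:

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: myapp
spec:
  replicas: 1
  selector:
    matchLabels: {app: myapp}
  template:
    metadata:
      labels: {app: myapp}
    spec:
      affinity:
        nodeAffinity:
          preferredDuringSchedulingIgnoredDuringExecution:
          - weight: 100
            preference:
              matchExpressions:
              - key: topology.kubernetes.io/zone
                operator: In
                values: ["site-a"]   # home site; scheduler falls back elsewhere
      containers:
      - name: myapp
        image: nginx:stable          # stand-in workload
```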
That isn’t going to work, unfortunately.
You need very low latency between nodes (something like 10 ms, or preferably less).
I know Ceph would work for this use case, but it’s not a lighthearted choice; it’s kind of an investment with a steep learning curve (at least it was, and still is, for me).
My gut says go multi-cluster (or not) at that point, but treat the remote site as a service and have a local container act as a proxy.
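e.g. a selector-less Service whose Endpoints you point at the other site, so local pods keep talking to a stable in-cluster name; the IP and port below are placeholders:

```yaml
apiVersion: v1
kind: Service
metadata:
  name: remote-db        # local pods connect to remote-db:5432
spec:
  ports:
  - port: 5432
    targetPort: 5432
---
apiVersion: v1
kind: Endpoints
metadata:
  name: remote-db        # must match the Service name
subsets:
- addresses:
  - ip: 203.0.113.10     # placeholder: the other site’s public endpoint
  ports:
  - port: 5432
```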