Learning about object storage #58

consideRatio · 2021-06-10T19:17:46Z

Background

We learned that the NFS storage using the Amazon EFS service is very costly for just a few TB of storage, and that we want to make use of S3 Object Storage instead that is ~10-20 times cheaper.

Object storage learning goals

There may be plenty of things to learn and document about these, but for now I've just created a single issue listing some learning goals about working with object storage in general and object storage on AWS.

Understand how to access our s3 object storage bucket that we reference as the "scratch bucket"
You can use the aws s3 command, for example aws s3 cp <source> target> and you will copy something from one location to another, where a location can be a path on the local file system or it can be a location in object storage such as s3://jmte-scratch/consideratio which is what my SCRATCH_BUCKET environment variable evaluates to, while yours will be s3://jmte-scratch/<your-username>.
Understand sensible practices for bucket to bucket transfers
Understand possibilities of mounting buckets to the file system
Understand costs of S3 object storage
Some technical details: https://aws.amazon.com/s3/pricing/
What kind of s3 bucket storage are we currently allocating in our s3 scratch bucket?
Understand how we can use ephemeral storage in /tmp
I think we can't download more than ~80 GB per node for now, but that we can increase this by updating our machine configuration.

The text was updated successfully, but these errors were encountered:

fperez · 2021-06-10T19:46:07Z

This recent article titled Cloud-Native Repositories for Big Scientific Data may be a useful reference...

consideRatio · 2021-06-14T15:55:38Z

We've received amazing feedback from the Pangeo Cloud ops working group as can be seen in the notes from the meeting 14th June.

https://docs.google.com/document/d/1I-2VNNHoAjjeYvlCezQhFLmiu2OevqGDS5nUAP-6Hfw/edit#

whyjz added the 🏷️ JupyterHub Something related to JupyterHub label Jun 11, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Learning about object storage #58

Learning about object storage #58

consideRatio commented Jun 10, 2021 •

edited by whyjz

Loading

fperez commented Jun 10, 2021

consideRatio commented Jun 14, 2021

Learning about object storage #58

Learning about object storage #58

Comments

consideRatio commented Jun 10, 2021 • edited by whyjz Loading

Background

Object storage learning goals

fperez commented Jun 10, 2021

consideRatio commented Jun 14, 2021

consideRatio commented Jun 10, 2021 •

edited by whyjz

Loading