Your site says it's distributed, but honestly it has no details about how it works. Is it built on top of Sia, Storj, or similar? How does it work? How does one "become a node", etc.?
The fact they seem to refuse to answer WHERE the storage is coming from is honestly pretty concerning. It's been asked multiple times in this thread and they don't seem interested in addressing it.
For all we know the storage is coming from compromised hosts that don't even know they're participating.
Your data is intelligently encrypted on your My Cloud Home and sliced into tiny pieces that only you can put back together. The encrypted data pieces are scattered across our community of users, using small portions of our member’s unused hard drive space. Spreading your data out in this way makes it more safe and secure than it would be if it was all stored in one location.
We are not built on top of Sia or Storj. Our technology has been developed for the past decade and is independent.
Storage comes from services that we offer to end consumers. In exchange for our services they agree to let us use the unused storage on the device, e.g. (https://crowdstorage.com/products/device-backup/). We plan on expanding to many more devices.
Storage also comes from being able to transparently leverage more traditional storage that we purchase from various providers.
We span many regions right now. Primarily APAC (serving customers in Jakarta by way of Singapore and soon Thailand), as well as South America.
How do you handle the problem of low interest? Do you preseed a region with storage servers so you have enough shards to store all the data until you have enough in-market storage endpoints?
I agree that the egress pricing seems a little disingenuous, but the model typically doesn't involve egressing much more than 100%. Will take a closer look when there's time.
We can seamlessly leverage more traditional storage as well as distributed endpoints. So in a way yes we can 'seed' it. We will likely wait until we have enough storage in a region before launching there.
> the egress pricing seems a little disingenuous
I have made small edits to the website to make it clearer that egress is free up to 100% of the data you have stored. This is definitely something we will focus on in the redesign.
It seems the catch is the same as with Wasabi. It says everywhere on the site that egress is free and there's no hidden fees. Except for this FAQ entry [1]:
> Pricing for Polycloud is only $0.004 per GB per month. There are no other fees as long as you do not egress more than 100% of your data during the month. If you do egress more than 100% of the amount of data you have stored then you will only be charged $0.01 per GB for any overages.
Looks like a hidden fee to me. Note that even the Pricing page [2] does not mention it, and the calculator does not allow to enter egress volume for calculation.
That's an interesting observation! Definitely creates weird incentives.
It seems incentives also depend on the nitty gritty details around how you are billed which are not defined very clearly. Granularity and timing, whether you pay for part months of storage etc.
However all games aside it does seem like if your own egress were free (or much cheaper than 0.01) it would be most efficient to send your data directly to where it needed to go, yourself.
Interesting thought thread on our pricing structure... It saves money if you intend to send the backup to 10 places every month. If you only intend to do it once, though, you don't come out ahead: $10 * 3 = $30 versus $1 * 3 + $22.50 = $25.50.
Good point -- the minimum 3-month storage duration eliminates this edge case for one-offs (since 3 * $0.4 > $1).
That said, it almost breaks even after the first month and pays for itself a week into the second month. To eliminate this incentive, I'd suggest setting the cost of egress at (or below) the cost of storage.
It also would make more sense to me as a prospective customer that I'm not paying crazy overage fees, and I'm paying as if I reuploaded the same data and downloaded it an extra time.
> I can't wrap my head around the fact that electrical signals are priced higher than equivalent HDDs.
I think this is standard with cloud storage. Check out this comparison [1], showing 2-6x higher costs for downloads versus monthly storage in B2, AWS, GCP, and Azure.
> They aren't doing 20x replication. They do 20+20? erasure coding, so the overhead is 2x or less.
With "sending that backup to 10 machines" I meant "egress 10x of 250GB" = 2500GB. With a pricing of $0.01 per GB after egress tops 100% of stored data, or 250GB, that's 0.01 * 2250 = $22.5. I'm not talking about replication overhead on their side.
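To make that arithmetic concrete, here's a tiny sketch of the billing formula as described in the FAQ quote earlier in the thread (rates and the 100%-of-stored threshold are taken from that quote; the function name is mine):

```python
def monthly_cost(stored_gb, egress_gb, storage_rate=0.004, overage_rate=0.01):
    """Polycloud-style bill: storage, plus egress beyond 100% of stored data."""
    storage = stored_gb * storage_rate
    billable_egress = max(0.0, egress_gb - stored_gb)  # first 100% is free
    return storage + billable_egress * overage_rate

# Storing 250 GB and egressing 10x of it (2500 GB):
# $1 storage + 2250 GB * $0.01 = $23.50
print(monthly_cost(250, 2500))
```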
The math could work out for me, though: I store a lot of data that gets served by a CDN. It rarely changes and is requested often enough to stay in the cache. I'm already paying for the CDN, so I can't save there, but if I can save on storage, it could be useful.
I'm not disputing the point about whether the pricing is clear or "hidden", just noting that there's a use case here that can make sense.
Wow, that's just outright false advertising (assuming they are actually charging for egress)... Here is everything the pricing page, signup page, and TOS (which reference the first) have to say about paying for egress... I especially like the part where they are calling this hot storage, which certainly implies that it might be accessed more than once a month.
Pricing page [1]
> We have no hidden fees and the monthly calculator to prove it.
> Egress $0.00
> No charge for egress, ops, or retrieval.
> With Polycloud from CrowdStorage, you only pay for the storage you need. And we never penalize you with fees for accessing your data.
> No egress charges
> We keep it simple—access your data when you want it, without being nickel-and-dimed with hidden fees.
> Hot storage for cold storage prices.
> Don’t overpay to get the speed you need. Polycloud delivers quick access you data whenever you need it.
> Egress $0.00
Signup page [2]
> Everything you need to store your data for only $4 per TB/mo and no hidden fees.
Terms of service [3]
> 6.1 Device Backup. In the event that you subscribe to a paid version of the Device Backup Services, we will put you on a recurring payment plan that charges you for the fees set forth at https://app.crowdstorage.com/pricing in advance for each billing cycle. We will charge the payment method you specify at the time of purchase and, if you do not cancel the Services prior to the end of the current billing cycle in accordance with Section 7, you will automatically be charged the then-current fee for the Services at the start of the following billing cycle.
> 6.2 Polycloud. With the Polycloud Services, you pay only for what you use. There are no set-up fees or commitments to begin using the Polycloud Services. At the end of each month, you will be charged for that month’s usage of the Polycloud Services as further set forth at https://polycloud.crowdstorage.com/pricing.
I'm assuming the catch with this is that you are letting your data be stored on some randomer's disks, which means availability is directly tied to the interest of nerds who are loaning out their HDD space for a pittance.
But "random farmers" have almost no bearing on the ACTUAL coffee supply chain. Starbucks doesn't use the local farmers market for their supply chain because it's not reliable. They have contracts in place that are fulfilled through obligation.
If anyone on this site has built a business that relies on wikipedia having accurate information at all times I'd call them crazy too.
In the same way I wouldn't call this "enterprise" as they have plastered all over the site. Using spare capacity on a bunch of random usb drives that users happen to have online gives me no guarantee of uptime. With a 20+20 they're betting that 21 users won't experience an outage at the same time, and that if a large portion experiences outages that they can rebuild faster than users fail.
Without knowing anything about where the users are coming from, or what kind of contract they've agreed to, you're just giving a company your data that has told you: it'll be secure, TRUST ME!
Thanks for that, I hadn't truly grasped the concept of supply and demand until you so clearly summed up macro economics.
To just fill you in: coffee growers grow coffee because the cost of growing it is less than the local wholesale price (most of the time). It is possible to subsist on the profits of growing coffee (depending on where you are).
The price per TB being charged to the consumer is $48 a year, which is significantly less than the initial setup cost to become a data host (Pi + SD card + HDD). That's before we get to the opex of paying for an ISP (plus any bandwidth overages), power, and general maintenance.
That's assuming the company is selling at the price it pays the "hosts". I'm assuming they have some ambition of profitability.
Which leads me back to the original statement: you're reliant on a bunch of randomers, who didn't really think about the economics, subsidising this company.
Erm, you've got it all backwards, coffee farming is less profitable than data hosting.
The money is made on egress, that's the case with Storj and most others. For domestic setup, the data connection is a fixed cost you would have anyway.
Stop assuming everyone is an idiot, people do the math for profitability of these platforms, and switch between them.
They don't charge for egress, its explicit in the front page.
> coffee farming is less profitable than data hosting.
Yes, the money is in roasting. You're missing the essential point: there is a reason why coffee growers don't roast their own raw material, and that's because they cannot get access to the capital for the equipment, let alone access to the customers.
But coffee isn't the point.
Unless there is a sustained incentive to store the data, there will not be anyone willing to host it. The amount they charge is not enough to make it sustainable for people to invest in equipment to host the data, so you'll be reliant on best effort, or worse still, on short-term incentives that are unsustainable, leading to widespread evaporation of storage.
You pay for guaranteed storage, which is why it's expensive: it's a slice of the capex for the equipment, the opex for running and maintenance, plus profit. At the current price, it doesn't cover any of them.
I guess this argument really depends on how much redundancy Polycloud implements. Your data is replicated on 2 random people's computers? Uh oh. Replicated across 10? Probably safe.
There are thousands (millions?) of coffee farmers, so they have no problem.
What happens if say 3 shards of a specific file you need are not online that day? Or never come back online again? I understand the files are copied to multiple locations but we are not talking about a 24/7 datacenter but random computers. People can decide to uninstall the program or never power back on or anything.
Unless it’s copied to basically every computer, I don’t see how someone wouldn’t eventually end up with an unrecoverable file, simply because they are unable to piece it back together from the available computers.
I confess that I haven't gone deep into Storj, but I am running a node for some weeks now and I didn't have to pay anything (besides the operational costs of keeping the server online, of course) to get into it.
You are earning money every month, but they keep a collateral, and payment to you is delayed. If your node combusts one day, that collateral disappears, broadly speaking.
Also, there are something like 32 shards for each piece of data, so it's unlikely 32 computers all go offline in one day.
We now manage a network of over 250,000 NAS devices that are part of our network. So we know how to handle devices going offline without losing data. This approach has been used at scale since 2014.
We are planning on releasing more information about this in the future. Using erasure coding we can spread an object across lots of devices, say 60, and then require only 25 of those 60 to retrieve the object.
We constantly monitor the devices storing data, and if they go offline we refresh the pieces to maintain integrity.
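Assuming independent node failures (a big assumption for consumer devices that may fail in correlated ways, e.g. a regional blackout), a "25 of 60" scheme can be sanity-checked with a binomial back-of-envelope calculation:

```python
from math import comb

def availability(n, k, p):
    """P(at least k of n shards are online), each online independently with prob p."""
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k, n + 1))

# 25-of-60 scheme, assuming each consumer device is online 90% of the time:
print(f"{availability(60, 25, 0.90):.9f}")  # extremely close to 1

# The same scheme degrades fast if devices are mostly offline:
print(f"{availability(60, 25, 0.30):.4f}")
```

The interesting failure mode is the correlated one the independence assumption hides, which is presumably why the pieces get actively refreshed.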
I think they meant it in the sense of being API compatible.
> Polycloud is 100% AWS S3 bit-compatible. If you’re used to using an S3 API, you can access Polycloud using the same API.
S3 has been around for a long time and they had a lot of objects to transition when they upgraded, so I imagine that is why it took a while.
There are other object storage systems that have strong consistency guarantees that came out after S3.
It greatly simplifies things that an object written to S3 is immutable.
On a high level: all writes to your storage use some UUID, and all reads go through consistent metadata storage (pick a modern database). After your write is complete and you are sure it is persisted, do the metadata update and return success. Everyone gets a consistent view of the operation.
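A minimal sketch of that write-then-publish pattern, with in-memory dicts standing in for the immutable blob store and the strongly consistent metadata database (all names are illustrative):

```python
import uuid

blob_store = {}  # stands in for immutable object storage
metadata = {}    # stands in for a strongly consistent metadata DB

def put_object(key: str, data: bytes) -> None:
    # 1. Write the payload under a fresh UUID; the blob itself never changes.
    blob_id = str(uuid.uuid4())
    blob_store[blob_id] = data  # must be durably persisted before step 2
    # 2. Only then swap the metadata pointer. Readers see either the old
    #    version or the new one, never a half-written blob.
    metadata[key] = blob_id

def get_object(key: str) -> bytes:
    return blob_store[metadata[key]]

put_object("photos/cat.jpg", b"v1")
put_object("photos/cat.jpg", b"v2")  # overwrite = new blob + pointer swap
print(get_object("photos/cat.jpg"))
```

Because every write lands under a new UUID, the only step that needs strong consistency is the tiny metadata update, which is what makes the scheme simple.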
The first thing I check on services which advertise as S3-compatible is permissions. As usual - no permissions, no ACLs. So practically it looks like anyone with an access key can wipe out everything you store with them.
Where is the other side, i.e how are people with hard drives recruited? It didn't seem to be on this site but it's important to know otherwise how do we know it's not, for instance, a botnet?
What is the catch here? How can it be cheaper than b2 while essentially offering the same service to the end-user?(geographically distributing data does not directly result in any reduction of costs). It’s okay to say that “we have lower profit margin” than B2, what is it technically that enables this lower cost? , and what are the trade offs made to get there?..
- First, like you indicated, you can take less margin (and believe me, there's still considerable margin).
- You can play with the erasure coding policy: an (n+10)/n scheme means you can tolerate 10 failures, and a higher n makes your storage overhead smaller (which means more margin).
- You can use even fancier storage schemes (e.g. online codes will bring your storage overhead down to something like 3%).
- you can use cheaper hardware.
- ...
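To put numbers on the erasure-coding point above: for an (n+10)/n code, the raw-to-usable storage ratio is (n+10)/n, so raising n shrinks the overhead while keeping the same failure tolerance:

```python
def overhead(n, m=10):
    """Raw storage needed per byte of user data for an (n+m)/n erasure code."""
    return (n + m) / n

for n in (10, 20, 40, 100):
    print(f"{n}+10 scheme: {overhead(n):.2f}x raw storage, tolerates 10 failures")
```

The trade-off is that a larger n means each object is spread across more nodes, so reads and repairs touch more machines.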
Managing a distributed network has a much lower cost structure - no capital investment for hard drives, no electrical cost for running the hard drives, no cooling costs, etc.
We’ve seen these types of systems before and I always wonder how well thought out this is - either on the part of the hosts, the company or the client.
A couple of questions I have right off the bat are:
How do we know how secure this solution is?
Even if it were incredibly well secured - what are the laws around this setup?
If someone stores illegal materials on this system who is responsible? (The person who stored it? The company? The unsuspecting host?)
What happens if the hosts lose interest or it fails commercially? Does the data get lost without warning when hosts start uninstalling the software?
The problems with these systems is almost never the technology - it’s finding a way to negotiate the millions of different implications storing information on other people’s behalf brings.
We've been storing data like this for several years and have kept hundreds of PB and tens of billions of objects without data loss, despite having many nodes in areas affected by large regional outages. One recent example is when Texas was affected by blackouts.
We are working on getting as much content up on the website as we can and telling more about us!
I’d genuinely be interested in seeing that, it would certainly give me more confidence.
Have you looked into the legal aspects of your setup as regards liabilities? It probably doesn’t matter as much from the customer’s perspective as it’s their data so they should know what it contains, but would be worth knowing where this leaves you as a company and/or the network of nodes if some other customer pushed data onto the system that was objectionable.
Precisely. Although we also believe you shouldn't have to worry about going bankrupt if you ever need to restore all your data or want to migrate somewhere else, which is where the free egress of 100% of what you have stored comes from.
Would love to have lower cost S3. Last time I tried one I got burned when they changed their prices and then made it really hard to migrate with super slow outbound bandwidth.
Anyway, my advice is to give people something to de-risk the cost of migration. IDK, maybe an abstraction layer that automatically keeps a backup copy inside S3 Glacier. Or something else...
$4/month for 1TB of storage and 1TB of bandwidth? Just get a 1TB Storage VPS (+8TB BW) from https://www.time4vps.com/storage-vps/ for 3 EURO a month instead and stick Minio on it ;) (Just a happy customer)
> this discount is valid only for the first invoice and it is not reoccurring
So it'll double in price after that. And that's on top of having to round up to the next terabyte or two. But if you want more bandwidth then it looks like it's worth considering.
Side note: It's interesting that above 2TB the deal gets steadily worse.
"We designed this service to reliably hold a huge amount of data. This setup will serve you best if it’s used to store compressed data archives or backups."
"Do you offer backups with this service?
Unfortunately, no. Users have to regularly back up the data themselves. "
That sounds a little contradictory.
You are missing the durability part here, leaving aside the overall operational burden. I am just saying, in case the drive storing your files catches fire or simply fails.
The catch is always in the fine print. The storage is cheap but the savings go away if you actually do anything with the data.
Storage is a foundational service in the cloud. There are huge advantages to having that storage sit adjacent to everything else. Such storage as a service somewhere else doesn’t make a lot of sense in many/most use cases. Now you’re having to pay to move data across comparatively slow networks links to where it’s actually needed. It’s a catchy headline but when putting this to the test in real world scenarios those numbers don’t pan out.
> storage is cheap but the savings go away if you actually do anything with the data.
This is the "data back up" use case, where you store heaps of data, and hope to never need to access it. For this use case the conditions seem excellent.
This is great feedback, we'll take this into consideration. You can always reach out to talk about specifics about your use case and how we can better meet it (there is a Contact Us button near the bottom of the page).
Glacier can get pricey when you store/retrieve your data because of ops, retrieval fees, and egress (if going to the internet). We feel like immediate availability is a compelling advantage of our product.
Best for archive and backup. You can also see how Vivint uses them to store video clips and stream them to customers when needed: https://crowdstorage.com/solutions/
> Yes, Polycloud is GDPR compliant. [...] processed and stored on servers located across the United States [...] certified with the EU-U.S. Privacy Shield Framework.
I don't think this means Polycloud is GDPR compliant. It's my understanding the Privacy Shield Framework was struck down by the EU courts. Might want to be a bit careful here if you are an EU business.
Says it is GDPR compliant - but I don't see it explained how?
You say you would use "Standard Data Protection Clauses" if the data goes outside of the US ... but I don't want my data in the US in the first place, so is this really a US only service?
How would those clauses be implemented?
Does every node have a contractual agreement with you?
How would you know if someone took their computer with them on holiday to another country?
Also, how is the data encrypted? "State of the art encryption" is just marketing fluff :)
>This means that with our 20/40 encoding scheme, a malicious attacker would need to physically access 20 different nodes within our network of almost 300,000 devices.
Which community is providing 300k devices? Is Polycloud building on top of IPFS or Sia?
One killer feature could be some form of ransomware protection. If a sudden entropy change is detected, snapshot the data, then provide the customer with the option to revert changes from that time (up to, say, a week).
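For what it's worth, a naive version of that detection could be a per-file Shannon-entropy check: encrypted (ransomed) data looks close to 8 bits per byte, while most user data doesn't. A rough sketch (the threshold is a guess, and a real system would look at the rate of change across many files, not single files):

```python
import math
import os
from collections import Counter

def shannon_entropy(data: bytes) -> float:
    """Bits per byte; encrypted or well-compressed data approaches 8.0."""
    if not data:
        return 0.0
    n = len(data)
    return -sum(c / n * math.log2(c / n) for c in Counter(data).values())

def looks_encrypted(data: bytes, threshold: float = 7.5) -> bool:
    # A sudden jump of many files past this threshold could trigger
    # the snapshot-and-offer-rollback behaviour suggested above.
    return shannon_entropy(data) > threshold

print(looks_encrypted(b"hello world " * 100))  # plain text: low entropy
print(looks_encrypted(os.urandom(4096)))       # random bytes: high entropy
```

The obvious false positives are already-compressed or already-encrypted user data, which is why snapshot-and-ask beats block-and-alert here.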
If you scroll a bit on the Backblaze B2 pricing page, they have an easy calculator that will tell you exactly what the costs will be.
$0.005/GB/Month for storage
$0.01/GB for egress
The only "hidden" cost I'd say they have is they have a limit on some API calls. For example their b2_download_file_by_name is limited to 2500 calls per day and then $0.004 per 10k calls after that.
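Using the figures quoted above (2,500 free calls per day, then $0.004 per 10k calls), that overage is easy to estimate:

```python
def b2_download_call_overage(calls_per_day, free_calls=2500, rate_per_10k=0.004):
    """Daily overage cost for download-by-name calls, per the caps quoted above."""
    extra = max(0, calls_per_day - free_calls)
    return extra / 10_000 * rate_per_10k

# 102,500 calls/day -> 100,000 over the free cap -> $0.04/day
print(b2_download_call_overage(102_500))
```

Even at heavy call volumes this stays small relative to the per-GB egress charge, which is why it's only "hidden" in a mild sense.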
How do you arrive at twice as expensive as AWS glacier?
AWS S3 Glacier is the same $0.004/GB/month and requires the same minimum of 90 days, but retrieval takes "from 1 minute to 12 hours". They also have large retrieval costs (on top of the regular egress to the internet) and API call charges.
AWS S3 Deep glacier is only $0.00099/GB/month, but requires it to be stored 180 days and also have operation/retrieval fees. "For long-term data archiving that is accessed once or twice in a year and can be restored within 12 hours"
Our website is right now being redesigned and your comments here are helpful to help us know where we need to improve!