This is a big, general problem with CI providers I don't hear talked about enough: because they charge per-minute, they are actively incentivized to run on old hardware, slowing builds and milking more from customers in the process. Doubly so when your CI is hosted by a major cloud provider that would otherwise have to scrap these old machines.
I wish this were only a theoretical concern, a theoretical incentive, but it's not. GitHub Actions is slow, and GitLab suffers from a similar problem: their hosted SaaS runners are on GCP n1-standard-1 machines. The oldest machine type in GCP's fleet, the n1-standard-1 is powered by a variety of dusty old CPUs Google Cloud has no other use for, from Sandy Bridge to Skylake. Sandy Bridge is a 12-year-old CPU.
There are workloads where newer CPUs are dramatically faster (e.g. AVX-512), but in general the difference isn't huge. Most of what the newer CPUs get you is more cores and higher power efficiency, which you don't care about when you're paying per-vCPU. Which vCPU is faster, a ten-year-old Xeon E5-2643 v2 at 3.5GHz or a two-year-old Xeon Platinum 8352V at 2.1GHz? It depends on the workload. Which has more memory bandwidth per core?
But the cloud provider prefers the latter because it has 500% more cores for 50% more power. Which is why the latter still goes for >$2000 and the former is <$15.
> Which vCPU is faster, a ten-year-old Xeon E5-2643 v2 at 3.5GHz or a two-year-old Xeon Platinum 8352V at 2.1GHz? It depends on the workload.
It really does not depend on the workload when the workloads we're talking about are, by and large, bounded to 1 vCPU or less (CI jobs, serverless functions, etc.). Ice Lake cores are substantially faster than Ivy Bridge; the 8352V will be faster in practically any workload we're talking about.
However, I do agree with this take if we're talking about, say, Lambda functions. The reason is that the vast majority of workloads built on Lambda are bounded by IO, not compute, so newer core designs won't yield a meaningful improvement in function execution time. Put another way: is a function executing in 75ms instead of 80ms worth paying 30% more? (I made these numbers up, but it's the illustration that matters.)
CI is a different story. CI runs are only bound by IO for the smallest of projects; downloading that 800MB node:18 base Docker image takes some time, but it can very easily and quickly be dwarfed by everything that happens afterward. This is not a controversial opinion; "the CI is slow" is such a meme of a problem at engineering companies nowadays that you'd think more people would have the sense to look at the common denominator (the CI hosts suck) and not blame themselves (though often there's blame to go around). We've got a project that builds locally on an M2 Pro, docker pull and push included, in something like 40 seconds; the CI takes 4 minutes. It's the crusty CPUs; it's slow networking; it's the "step 1 is finished, wait 10 seconds for the orchestrator to realize it and start step 2".
And I think we, the community, need to be more vocal about this when speaking about platforms that charge by the minute. They are clearly incentivized to leave it shitty. It should even surface in discussions about, for example, the markup of Lambda versus EC2. A 4096MB Lambda function would cost $172/mo if run 24/7, back-to-back. A comparable c6i.large: $62/mo, about a third the price. That's bad enough on the surface, and we need to be cognizant that it's even worse than it initially appears, because Amazon runs Lambda on whatever they have collecting dust in the closet; people still report getting Ivy Bridge and Haswell cores sometimes, in 2023. The better comparison is probably a t2.medium at $33/mo, a 5-6x markup.
This isn't new information; Lambda is crazy expensive, blah blah blah; but I don't hear that dimension brought up enough. Calling back to my previous point: is a function executing in 75ms instead of 80ms worth paying 30% more? Well, we're already paying 5-6x more; the fact that it doesn't execute in 75ms by default is abhorrent. Put another way: if Lambda, and other serverless systems like it such as hosted CI runners, enable cloud providers to keep old hardware around far longer than performance improvements say they should, the markup should not be 500%. We're doing Amazon a favor by using Lambda.
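The cost comparison above is easy to sanity-check. A quick back-of-the-envelope script (prices are my assumption of us-east-1 on-demand rates circa 2023; Lambda request charges and the free tier are ignored):

```python
# Back-of-the-envelope Lambda vs EC2 monthly cost comparison.
# Assumed us-east-1 prices circa 2023; request charges and free tier ignored.
LAMBDA_GB_SECOND = 0.0000166667   # $/GB-second for x86 Lambda
SECONDS_PER_MONTH = 30 * 24 * 3600  # 30-day month

lambda_monthly = 4.0 * LAMBDA_GB_SECOND * SECONDS_PER_MONTH  # 4096 MB = 4 GB
c6i_large_monthly = 0.085 * 730     # $/hr * ~hours per month
t2_medium_monthly = 0.0464 * 730

print(f"lambda 24/7: ${lambda_monthly:.0f}/mo")        # ~ $173
print(f"c6i.large:   ${c6i_large_monthly:.0f}/mo")     # ~ $62
print(f"t2.medium:   ${t2_medium_monthly:.0f}/mo")     # ~ $34
print(f"markup vs t2.medium: {lambda_monthly / t2_medium_monthly:.1f}x")
```

Which lands right in the 5-6x range against the t2.medium, before even accounting for the older silicon Lambda may put you on.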
> It really does not depend on the workload when the workloads we're talking about are, by and large, bounded to 1 vCPU or less (CI jobs, serverless functions, etc.). Ice Lake cores are substantially faster than Ivy Bridge; the 8352V will be faster in practically any workload we're talking about.
If you were comparing e.g. the E5-2667v2 to the Xeon Gold 6334 you would be right, because they have the same number of cores and the 6334 has a higher rather than lower clock speed.
But the newer CPUs support more cores per socket. The E5-2643v2 has 6, the Xeon Platinum 8352V has 36.
To make that fit in the power budget, it has a lower base clock, which eats a huge chunk out of Ice Lake's IPC advantage. Then the newer CPU has around twice as much L3 cache, 54MB vs. 25MB, but that's for six times as many cores. You get 1.5MB/core instead of >4MB/core. It has just over three times the memory bandwidth (8xDDR4-2933 vs. 4xDDR3-1866), but again six times as many cores, so around half as much per core. It can easily be slower despite being newer, even when you're compute bound.
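Those per-core figures can be derived straight from the spec sheets. A rough sketch (peak theoretical DDR bandwidth at 8 bytes per transfer per channel; measured bandwidth will be lower):

```python
# Per-core L3 cache and peak memory bandwidth from the spec-sheet figures above.
# DDR peak bandwidth = transfers/s * 8 bytes, per channel (theoretical ceiling).
GB = 1e9

e5_2643v2_l3_per_core = 25 / 6     # MB of L3 per core, ~4.2
plat_8352v_l3_per_core = 54 / 36   # MB of L3 per core, 1.5

ddr3_1866_per_chan = 1866e6 * 8 / GB   # ~14.9 GB/s per channel
ddr4_2933_per_chan = 2933e6 * 8 / GB   # ~23.5 GB/s per channel

e5_bw_per_core = 4 * ddr3_1866_per_chan / 6      # 4 channels / 6 cores, ~10 GB/s
plat_bw_per_core = 8 * ddr4_2933_per_chan / 36   # 8 channels / 36 cores, ~5.2 GB/s

print(f"L3/core: {e5_2643v2_l3_per_core:.1f} MB vs {plat_8352v_l3_per_core:.1f} MB")
print(f"BW/core: {e5_bw_per_core:.1f} GB/s vs {plat_bw_per_core:.1f} GB/s")
```

So despite being nine generations newer, the 8352V gives each core roughly a third of the cache and half of the memory bandwidth.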
> We've got a project that builds locally on an M2 Pro, docker pull and push included, in something like 40 seconds; the CI takes 4 minutes. It's the crusty CPUs; it's slow networking; it's the "step 1 is finished, wait 10 seconds for the orchestrator to realize it and start step 2".
Inefficient code and slow hardware are two different things. You can have the fastest machine in the world that finishes step 1 in 4ms and still be waiting 10 full seconds if the system is using a timer.
But they're operating in a competitive market. If you want a faster system, patronize a company that provides one. Just don't be surprised if it costs more.
Lambda is good for bursty, typically low-activity applications, where it just wouldn't make sense to have EC2 instances running 24x7. Think of some line-of-business app that gets a couple of requests every minute or so. Maybe once a quarter there will be a spike in usage. Lambda scales up and just handles it. If requests execute in 50ms (unlikely!) or 500ms, it just doesn't matter.
Not quite sure I follow, but: I built an ASP.NET API and deployed it to Lambda, and it cost $2/mo; when it started to get more traffic and the cost got to $20/mo, I moved it to a t4g instance.
When I moved it, I didn’t need to make any code changes :) I just made a systemd file and deployed it.
For this to be true IPC would have to have stagnated for 10 years which is not the case. Look at Agner's instruction tables for different uarchs and compare.
I sure hope not; GitHub is supposed to be mature infrastructure at this point, where most if not all changes going into production are very well tested, and nothing gets deployed and released without multiple people verifying it's correct.
Besides, Microsoft surely has a 24/7 watch over its infrastructure, even on weekends; it's a huge company.
> Besides, Microsoft surely has 24/7 watch of their infrastructure, even on weekends
"watching" with a dedicated team vs "waking up everyone in engg because things are on fire" are two very different things.
Besides, size doesn't work that way. The larger the organization and the more complex the product is, the higher the chance some unexpected interaction will occur. There are processes and automation that can mitigate this, but one can never be completely certain.
What's your pager rotation like? I want to say you have follows-the-sun, and so your on-call shifts are 12-hours long and you swap with a team on the other side of the world from you so you can get said sleep, but I don't want to just assume that.
Why does everyone need to be in the room? I have a groomed backlog and can talk to people async as needed. We also record meetings if you missed them and depending on the context of the meeting or importance, we'll hold timezone friendly meetings for everyone as required.
Less "on" hours then? Even Google has diurnal patterns when there's a lower amount of traffic simply due to the fact that humans are unevenly distributed across the Earth's surface. And Google does code freezes for the holidays where they don't deploy at all.
It seems like BuildJet is competing directly with GitHub on price (GitHub has bigger runners available now, pay per minute), and GitHub will always win because Microsoft owns both GitHub and Azure, so I'm not sure what BuildJet's USP is; I worry they will get commoditized and then lose their market share.
Actuated is hybrid, not self-hosted. We run actuated as a managed service and scheduler, you provide your own compute and run our agent, then it's a very hands-off experience. This comes with support from our team, and extensive documentation.
It's amazing to see a big company throw so much engineering effort at this, while for the majority of CI users, just getting a 2x-faster CI machine achieves the same outcome at much lower cost.
Speaking from experience working on CI at a large company: I'm sure they've "just got a 2x CI machine" about 6 times already. At some point you can't just burn more money and you need to optimise.
Computers are usually way cheaper than people. The difference would just be CapEx vs OpEx. That being said, they must be burning zillions of cycles rebuilding code that hasn't changed by using a monorepo.
I've been thinking about this for a while, and it seems to apply to a lot of things; serving an HTTP request on cheap EC2 instances won't come close to doing it on a dedicated server with great single-thread performance.
So even though you can more easily horizontally scale and handle infinite requests, the latency of each request will be much poorer than if you were just running on better hardware.
FWIW there's a cap to how much perf you can extract from an instance. We use r6 32-vCPU / 64GB RAM machines for our builders; we can't really 2x that again from a price standpoint, lol.
These companies often have thousands of machines in their CI fleets. It really can be cheaper to pay engineers to optimize rather than just buying more or bigger instances.
What makes you think they haven't already done so? If they're running Jenkins and/or Buildkite, they're managing their own runners, so they're not jumping from GitHub Actions runners to 8/16-core machines.
I've heard lots of people talking about Estonia, but can you name any international remote-first startup that became successful and raised money with an Estonian entity? I can't, and this makes me somewhat suspicious :)
I don't know of remote-first companies, but I know of many Estonian companies that have raised a lot of money and been incredibly successful.
To name a few:
- Bolt
- Wise (TransferWise)
- Pipedrive
I have a long history with e-Residency and know many e-Residents (e-Resident since 2015, 3 companies, member of EERICA.ee). Happy to chat about it. Email in bio.
I was an e-resident too but had to become resident in Estonia, because if you are abroad and distribute yourself a dividend (as a sole entrepreneur), your host country will consider your Estonian company local; it then constitutes a permanent establishment, and that becomes extremely complicated.
Of course, but this is the case for almost all jurisdictions of record vs jurisdictions of taxation. Few countries want to allow a company to operate 100% in their borders without extracting some degree of taxation. (In fact, this is basic OECD taxation doctrine.)
My situation is complex, but, generally, the advantages of Estonian registration are found in drastically simplified and lower-cost business registration and processing (vs, say, a GmbH in Germany or Switzerland with high share capital and accounting costs) or ease of operation for digital nomads or fully remote companies. An OÜ isn't for everyone in every life situation, but when it fits, it tends to work really well.
A LLC or C Corp in the US could work just as well (or better), depending on the situation.
Just to clarify: since you are only an e-resident of Estonia, are all your three Estonian companies paying the corporate income tax (CIT) in your personal country of residence?
Since Switzerland is currently the only country in Europe without CFC rules[1], it seems that an Estonian company can only be managed from Estonia or Switzerland without being considered as local for tax purposes in another jurisdiction.
This means my situation pretty much does not apply to anyone else, and the tax residency question is very complex in my case, as well. I won't bore you with details. :)
Sure. But since you have a lot of experience with the Estonian e-residency, maybe you know any real life examples where managing an Estonian company without personally being an Estonian tax resident made sense?
In Germany, it reduces the overhead of owning a company dramatically (the overhead of a German GmbH is quite a bit more than just taxes, including mandatory registrations, etc.).
In Switzerland, it reduces necessary share capital from 25'000CHF to 2,500EUR (which can be deferred).
In the US, it probably doesn't make sense unless your situation is very special.
As a digital nomad, it gives you a clear business home while traveling the world (and running everything online is essential).
Estonian accounting and business management is all electronic, so you can essentially run your business 99% in Estonia and do yearly tax reports in your country of residence, all while reducing the day-to-day complexity of running your business significantly. (No more paper or faxing!)
For non-EU residents, an Estonian entity gives them a clear legal path to marketing and selling in the EU with proper VAT, etc. reporting.
Every one I know has had slightly different reasons while Estonia made sense for them. I'm also purposefully not addressing the intangibles of registering a business in a well-functioning, well-regulated, forward-looking jurisdiction, which is a large component for many of my friends. (We care about Estonia, like its ideals, and want to see it succeed on a global scale.)
I second that. There is a growing number of startups and increasing capital pouring into Estonia (hence the largest inflation rate in Europe).
Wise and Bolt are good examples.
This is a very naive reimplementation of the C# version. I managed to reduce the runtime for the same file from 5.7 seconds to just 800ms:
using var file = File.OpenRead("file.bin");
var counter = 0;
var sw = Stopwatch.StartNew();
var buf = new byte[4096];
while (file.Read(buf, 0, buf.Length) > 0)
{
    foreach (var t in buf)
    {
        if (t == '1')
        {
            counter++;
        }
    }
}
sw.Stop();
Console.WriteLine($"Counted {counter:N0} 1s in {sw.Elapsed.TotalMilliseconds:N4} milliseconds");
This code has a bug that can cause the count to be overreported. The last `file.Read` may only partially fill the buffer, but this code will look for 1s in the entire buffer.
(This bug won't affect the performance comparison, but I was just reminded of how error-prone these kinds of APIs can be vs the PHP/Python route of having the library function just allocate and return a new buffer each time.)
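A minimal fix is to use the byte count that `Stream.Read` returns and only scan that many bytes. A sketch of the corrected loop from the snippet above (timing omitted):

```csharp
using var file = File.OpenRead("file.bin");
var counter = 0;
var buf = new byte[4096];
int n;
// Read returns the number of bytes actually read; only scan that many,
// since the tail of buf may hold stale data from the previous iteration.
while ((n = file.Read(buf, 0, buf.Length)) > 0)
{
    for (var i = 0; i < n; i++)
    {
        if (buf[i] == '1')
        {
            counter++;
        }
    }
}
Console.WriteLine($"Counted {counter:N0} 1s");
```

Note that `Read` is also allowed to return fewer bytes than requested mid-stream (not just at EOF), so the loop must not assume a full buffer on any iteration.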
Most of the latency comes from the network layer. My naive guess is they probably switched from a standard Ethernet setup to an InfiniBand setup to achieve 600µs of total latency.
Unfortunately, "China" holds grudges (in quotes, because I suspect it might not even be the government, but different officials wanting to appear as good citizens to the government, and definitely not Chinese people), so it won't be that easy: Simon will likely need to step down (though he's kinda being diplomatic in only "wanting answers", not putting particular blame on anyone, but he's still pretty adamant).
Witness the NBA-Houston Rockets case, which caused Houston Rockets games to be blacked out for 15 months, even though Morey moved on from Houston late in 2020 after tweeting in support of the Hong Kong protests early in the year.
What did CCTV do? Ban the 76ers' NBA games; the 76ers are Morey's current employer.
https://buildjet.com/for-github-actions/blog/a-performance-r...