msmillie's comments

msmillie · on Nov 19, 2014

In flame graphs heavy iteration tends to give "wider" structures. Recall the stacks aren't ordered on the X axis by time, but by contents.

msmillie · on Sept 3, 2014

Guess who's more important to the newspaper: this one guy who encountered some "you were naiive to not expect this" corruption, or the senior city hall politico? And what did he do, anyway, ask for a seat on the board in return for his help? and that position comes with a salary? We can't run that story, quit wasting our time.

tsunamifury · on Sept 3, 2014

Its more that its never explicitly stated at any point in time or by any individual.

But if you've ever worked at this level, you know the connections made by those brokering deals or in close proximity to dealmaking is often the only way to find success. You need these people to connect you with the wealth or opportunity, and they charge for that connection.

tedunangst · on Sept 3, 2014

"Corruption in city hall" is front page news all the time.

coldtea · on Sept 3, 2014

Which goes to show that exposing it in the "frong page" doesn't do much to stop it.

derefr · on Sept 3, 2014

There is a difference between stopping an instance of a problem, and an entire species of problem. Just because the species of problem (municipal political corruption) persists in various forms, doesn't mean any individual corrupt politician will get off scot-free. They go down all the time, removing the particular barriers they had put in place.

msmillie · on Aug 6, 2014

http://dtrace.org/blogs/bmc/2011/02/08/llquantize/ describes a very similar log binning technique added to dtrace a few years back. In practice, it's been one of the most useful things in the toolkit.

msmillie · on March 24, 2014

The following scenario helps explain: you have two radars. One is a wide-angle, general radar, which sees a missle at some coordinates, travelling at some speed.

The other radar is for target acquisition, has a very narrow spread ("range gate"), and must determine the precise location of the missle given an initial set of coordinates, a velocity, and a timestamp.

In other words, the system is actually aiming at targets differently depending on how long it has been up, tracking so far in front (or behind, I don't recall the details) that it can't acquire the target.

danbruc · on March 24, 2014

You are describing a problem where two clocks drift relative to each other, not a problem where one clock drifts away from the actual time. On the other hand the article gives the impression the failure occurred because the system failed to exactly measure the up-time, not because clocks in different systems or system components drifted relative to each other.

vonmoltke · on March 24, 2014

That is not the impression I got from the article. No logic in the system cares how long it has been up, not directly at least. What matters is drift from its time reference, which is a function of uptime.

Various modules in a complex system like this each have their own clock, which I will refer to generically as a real-time binary counter (RTBC), which the module uses as its event time reference. The RTBC starts at 0 when the module comes up. At some point shortly after coming up the module will check in with its controller, which will send a time-of-day (TOD) message. The module links the TOD message to a particular RTBC tick to create its time reference. At this point the time is free to start drifting relative to the actual wall clock time, until the system is power cycled again.

danbruc · on March 24, 2014

That is exactly what I said - different clocks drifting relative to each other. It is completely irrelevant that their one tenth of a second was not exactly one tenth of a second, what matters is that different clocks in the system had different ideas of one tenth of a second.

darkmighty · on March 24, 2014

You're applying a principle too broadly. Although the laws of physics don't change under a linear expansion of time, they are, for example, sensitive to linear expansion of velocities of missiles only: any non time-linear effect on the velocity is going to impact you -- for example, reynolds numbers for air depend non-linearly on the velocity which may be varying with time. Sure, if you multiply the whole system you would have that compensated by the increase in temperature and pressure which a faster time reference would observe, but it's not simulating the universe, just a limited set of variables.

Also, for obvious reasons of consistency and precision it would be better to keep a standard reference regardless.

danbruc · on March 25, 2014

I do not think of the problem as scaling the time by a factor - although this is the correct description - but as adding a constant offset. I think this is justified because the small drift is not significant during the relative brief period of time a target approaches. The offset builds up over time but only in the parts of the system that did not receive the improved algorithm and therefore these different parts disagree more and more on what the current time is.

vonmoltke · on March 24, 2014

> It is completely irrelevant that their one tenth of a second was not exactly one tenth of a second.

Its very relevant when the module that is off is trying to make telemetry calculations based on target Doppler velocity, which is given with real, ISO standard seconds. There is no clock involved in that. Diverging module clocks amplifies the problem.

Also, the ultimate reference is the true definition of a second. All modules are expected to be using it, as it is used to synchronize modules. It is the clock and at some level a clock that has a faulty definition will be drifting off another clock. Your distinction is irrelevant as far as real-time systems are concerned.

danbruc · on March 24, 2014

You are making a lot of assumption about how time might be used, nut I will ignore that because I have no clue if that is what really happens.

Let me repeat my point clearly. All clocks will drift away from the actual time. All the physics involved and measurements done are not depended on the current time - they will work the same at 14:07 as they do at 23:51 and they will therefore also work the same when the clock of the system drifted away from the actual time and believes it is 12:34 while it is 12:35. Important is only that all parts of the system agree on what the current time is and that the clock does not drift at such an high rate that all measurements and calculations done during a brief period of time become invalid, i.e. the clock should not report that it took two seconds for the incoming missile to travel one kilometer while it took only one second.

And the article gave the impression - at least to me - that the failure was caused because the system believed to be up for 100 hours while it was up for 100 hours and 340 milliseconds longer due to an imperfect representation of one tenth of a second. This makes no sense and is not what caused the failure. The failure was caused - as detailed in the other linked article - because one part of the system believed to be up for 100 hours while another part of the system performed more precise time conversions and knew that it was up for 100 hours and 340 milliseconds and this time difference between two parts of the system caused the failure.

For example one part of the system may have decided that the missile should be launched at 12:00:00.000 and the system responsible for doing so did that according to its clock but because of the time difference it was at 12:00:00.340 according to the clock of the system that made the decision.

krfsm · on March 25, 2014

My interpretation of the article:

Time is kept as an integer, stepped ten times per second. This can be exactly represented as a float, so probably uses the same 24 bit register. For 100 hours this integer would be 3600000, which fits into 24 bits with some room to spare. (But it would give a max uptime of the system of about 466 hours.)

Wide arc radar notes location, velocity, and time from clock above. This output data is still good enough for pinpointing the next position with a precision of about 170 meters (the distance the scud travels in the 0.1 second step of the clock). The precision radar system probably had accounted for this, and had a wide enough beam to handle this case.

Now, when deciding where to point the next precision beam, the radar multiplies the stored time value (exactly 3600000) with 0.1 (which is not represented exactly, but instead is about 0.000000095 less than 0.1) and uses this computed value in further calculations. This floating point value is now 0.34 seconds less than expected. The precision radar, even though it uses the same clock as above, has an incorrect representation of when the last wide arc radar update took place, and this propagates to the prediction of where the scud will be next (which is now off by 0.34 * 1676 ~= 570 meters). Thus, when it points the beam to where it believes the scud will be, the scud is outside the precision radars cone.

Note that both wide arc and precision beam systems have exact knowledge of the current system time at the point of their respective operations. What fails is precision beam's calculation of what wide arc's time reference actually meant.

The ironical part in the article probably refers to some computation using a delta, and if both time references ("then" and "now") have the error the delta will be small and possibly insignificant. However, if "now" is replaced with a more accurate representation of the clock above, only "then" has the big error, and the delta will be just as far off as the incorrect value above.

The exact error propagation depends on the order the calculations are performed in, and there's a whole field (numerical analysis) dedicated to controlling these errors. As developers we gladly ignore the problem even when we shouldn't.

grkvlt · on March 24, 2014

> You are describing a problem where two clocks drift relative to each other, not a problem where one clock drifts away from the actual time

The latter is simply a specific case of the former. The second clock is the one measuring 'actual' time, and the drift is relative to it.

danbruc · on March 25, 2014

Of course, but the point was, that a drift relative to actual time does not cause problems as long as all clocks in the system drift at the same rate while the failure was caused by different clocks in the system drifting relative to each other and therefore with different rates relative to actual time. Therefore I treated actual time as a special clock.

msmillie · on Feb 26, 2014

That someone can innocently and unironically describe Trader Joe's as being "known for low prices" is really the crux of the problem here.

Perhaps you're not familiar with minimum wage?

zenocon · on Feb 26, 2014

http://m.traderjoes.com/fearless-flyer/article.asp?article_i...

http://thebillfold.com/2013/08/price-comparison-battle-safew...

http://www.checkbook.org/interactive/spmkt/other/t/article.c...

> Trader Joe’s received higher overall quality ratings than Dominick’s and Jewel-Osco, and charges similar prices.

I could go on, but I guess you can do your own research. Seriously, their prices are on average, lower than most.

11thEarlOfMar · on Feb 26, 2014

Not sure what you are getting at. Trader's pays very well:

"...full-time crew members can start in the $40,000 to $60,000 a year range."

http://womensenews.org/story/labor/130722/cashier-the-deal-o...

AjithAntony · on Feb 26, 2014

Uneducated, inexperienced, poor people cannot compete for a job that pays that well. It would be trivial to staff the store with much higher quality candidates from farther away. Unless this specific TJ makes an explicit commitment to hire extremely local folks, then there will not be any working there.

300bps · on Feb 26, 2014

My wife and I buy about 20 different items weekly from Trader Joes because it does have the cheapest prices on those 20 items.

Speaking of minimum wage... Trader Joe's pays substantially above it. My niece works there and as a stock clerk she gets paid substantially over minimum wage and has health benefits from them.

paisawalla · on Feb 26, 2014

It's true and what's more, TJ's is not even close to a one-stop grocery store. They have a few general interest items and a bunch of foods specific only/mostly to their store. Only a nutritionally deficient person could shop solely at TJ, for everyone else I know it's supplemental shopping. There is a lower limit to your wealth before you get to the point where you're in the market for supplemental groceries.

*source: I lived above a TJ's in Berkeley for a few years.

fr0sty · on Feb 26, 2014

> Only a nutritionally deficient person could shop solely at TJ.

I shop almost exclusively at Trader Joe's* and am not (to my knowledge at least) nutritionally deficient. I get fruits, vegetables, dairy, meats, eggs, bread/cereal, and the odd snack. What am I missing?

paisawalla · on Feb 26, 2014

In my experience, while it's true that technically they have items for every section of the food pyramid, you're often limited to a very small selection within many of those categories. Maybe your store is better stocked than mine was, but I could only eat there if I had the same few meals repeatedly.

I'll admit to a small bit of hyperbole in calling that nutritional deficiency, I should have better called that a distinct lack of variety.

cm2012 · on Feb 26, 2014

Trader Joe's can get meal costs really low. If you're past the stage on only eating rice, eggs, and chopped veggies, TJs is comparable to any other store. Having lived in relatively poor neighborhoods that are not at all hip, it's not like fresh fruit is super available otherwise.

graylights · on Feb 26, 2014

Trader Joes has low prices but they don't have the dirt cheap junk food that you might find at some stores. But you can eat cheaply at a Trader Joes. You can also eat very expensively if you go into some their specialty stuff.

antidaily · on Feb 26, 2014

They put a Whole Foods (definitely not known for low prices) on a vacant lot in downtown Detroit and it's doing great.

msmillie · on Dec 1, 2013

> If the OP of the patch had gone through and consistently changed all He|She -> Them|They

If they missed any gendered pronouns in the now-landed PR, surely they would be discoverable in the current source:

  - https://github.com/joyent/libuv/search?q=him&type=Code
  - https://github.com/joyent/libuv/search?q=he&type=Code
  - https://github.com/joyent/libuv/search?q=his&type=Code
  - https://github.com/joyent/libuv/search?q=she&type=Code
  - https://github.com/joyent/libuv/search?q=her&type=Code
  - https://github.com/joyent/libuv/search?q=hers&type=Code

To a reasonable approximation, it turns out the author of PR did consistently change gendered language in the project, there just wasn't very much of it to begin with. That wasn't my assumption either (I thought there'd be more such language somewhere), and the biases that created that assumption are surely an interesting "this is how these problems happen" sort of side topic.

Anyway, definitely a lesson in where we choose to direct the benefit of the doubt. In this case, the author of the PR was the one who deserved it, but wasn't the one who received it.

(edit - link formatting)

teacup50 · on Dec 1, 2013

> Anyway, definitely a lesson in where we choose to direct the benefit of the doubt. In this case, the author of the PR was the one who deserved it, but wasn't the one who received it.

"Benefit of the doubt" isn't such a finite resource. If anyone escalating this chain of tomfoolery had applied it, you would have likely found that there was more than enough to go around.

msmillie · on Nov 30, 2013

As a Joyent engineer, I feel qualified to field this one.

tl;dr: you couldn't be more wrong.

The long version requires a crucial observation: this wasn't one simple mistake, made in haste, ignorance, or confusion. This was a deliberate action to revert to exclusionary language. Speculating on the precise motivation strikes me as pointless; it's the action itself that is problematic.

At Joyent, I have room to make mistakes, and enough respect among my colleagues that I can take corrections without misgivings. Deliberate and repeated "mistakes" on my part would have consequences (but surely that's the literal definition of a problem employee).

In case you aren't already one step ahead of me, here is the corollary: I am protected from the deliberate and repeatedly hostile actions of others.

So: the ability (even obligation) to speak your mind, listen to others, and be protected from bullying, exclusionary behaviour?

That, my friend, is what you call security.

teacup50 · on Dec 1, 2013

The fact that you don't see Bryan Cantrill's public shaming as deliberately and extremely hostile is astounding. It was misleading, it was predicated on a straw man that wasn't remotely supported by the evidence, and it was mind-boggling aggressive. The permanent social and business ramifications for everyone involved are substantial, and if Bryan Cantrill were my employee, I'd have him posting an apology or submitting his resignation.

I'm not sure what sort of culture you have there at Joyent, but if it's a monoculture that fully supports this sort of grossly disproportionate public escalation and aggressive behavior from anyone, especially under the company's name, then I'm happy to keep Joyent off the short list of companies I'd consider working for or with.

Surveying the facts, I'm far more inclined to believe that this event simply provided you with a convenient self-righteous excuse for Joyent to stretch the truth enough to hang what you considered to be a troublesome contributor (and competitor) with his own rope.

[edit] On top of all this, I've had to fire a number of people over the years, and always, regardless of how justified it was, I did so with a heavy heart. The fact that Bryan Cantrill would speak to gleefully of firing anyone is, if nothing else, a testament to the myopia of self-righteousness.

jpll · on Dec 1, 2013

speaking of monoculture: https://twitter.com/bcantrill/status/407228751731101696

waterlion · on Dec 1, 2013

Do we know absolutely, for sure, that this is what happened? From the revert, bnoorhuis clearly didn't know that Bert had signed it off. To me it looks like there was a bit of confusion, and there's not enough info to say either way. As I said elsewhere, the error bars on this look far too wide.

Given even the remotest possibility that this was a confusion, such a public and childish telling off seems entirely unreasonable.

I appreciate you are an employee and are qualified to speak as such. But as an outsider (and a feminist), I won't touch Joyent with a bargepole after this.

jpll · on Dec 1, 2013

As far as I understood from Twitter, the commit wasn't originally signed off.

@piscisaureus signed it afterwards and forced pushed it: https://twitter.com/piscisaureus/status/407022200290410496

waterlion · on Dec 1, 2013

What a mess this whole thing is. I'm looking forward to a level-headed explanation of all of this in a few days. It's a pity the witch-hunt happened before there's been meaningful dialogue to find out exactly what the situation is (name-calling and posturing doesn't count as meaningful dialogue).

jpll · on Dec 1, 2013

Except it all seems @bcantrill used @joyent's blog to win battle against an opponent.

http://venturebeat.com/2013/09/18/can-this-startup-steal-nod...

Exclusionary language is the excuse for publicly shaming an individual seen as a pain in the ass

jpll · on Dec 2, 2013

If you look carefully at https://gist.github.com/trevnorris/7729322/#comment-961709

It's clear @bcantrill does all what it takes to keep Ben Noordhuis out. It's so obvious I don't understand why people seem to be so blind

rok3 · on Dec 2, 2013

The hipocrisy in his response is ridiculous.

- says people should be judged by actions not intent.

- writes a post that blows things way out of proportional and implies Ben is a sexist starting with the sensational title.

- then says that Joyent doesn't need to clarify their actions because their intent was to focus on Ben's actions