Tensor core throughput has been inversely proportional to precision bit-width across all generations (i.e., halving precision doubles OPS). 8-bit precision gives the same improvement ratio. A100/H100 didn't support 4-bit, if I remember correctly.
So FP4/INT4 will likely see the same ~30% OPS/W improvement. You could get a separate improvement by reducing precision further, but going to 1-bit for a 4x improvement feels unlikely for now.
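A quick back-of-the-envelope sketch of that inverse scaling rule (my own illustration; the baseline numbers are placeholders, not vendor specs):

    # Assumed rule of thumb: peak tensor-core throughput scales
    # inversely with operand bit-width, so halving precision doubles OPS.
    base_bits, base_tops = 16, 1000  # hypothetical FP16 baseline

    for bits in (16, 8, 4):
        speedup = base_bits / bits
        print(f"{bits:>2}-bit: ~{base_tops * speedup:.0f} TOPS ({speedup:.0f}x the FP16 baseline)")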
> Do we have an example of a real quantum computer doing some kind of a computation that is not easily accessible by the regular computer?
Simulations of condensed matter systems performed on QCs (Google's OTOCs, Quantinuum's Hubbard model) are not easily accessible to a regular computer. There are people working hard on reproducing these results classically, so it's quite likely they'll be simulated eventually. We're at a point where classical computers are still in the race thanks to immense scale and algorithmic progress, but I don't think that will be the case for much longer.
> something useful in real life?
Usefulness is subjective. There are results that are potentially interesting to some people on Earth (as opposed to RCS).
I can't remember virtually anything - this is not a joke - having one's brain be sandblasted by the firehose every day turns memory into a dodgy thing (https://hn.algolia.com/?dateRange=all&page=0&prefix=true&que...). I believe there have been some, though not as many. That's largely a function of the submission feed, i.e. which articles the community submits, upvotes, or flags. All we can do is respond case-by-case, and we try to do that in a principled way. The principles we apply (or try to) have been explained many times and can be found via https://hn.algolia.com/?dateRange=all&page=0&prefix=false&so... and links from those comments.
If you feel like the submission feed and/or the moderation decisions on top of it are biased, all I can tell you is that everyone feels that way, especially on any topic they are passionate about. You needn't look far for examples of commenters complaining that we're suppressing and censoring the Gaza story - there are some in this thread.
What I feel a lot more confident talking about, in terms of balanced moderation, is the comments. We've moderated, warned, and banned many accounts for breaking the site guidelines while posting anti-Israeli (and sometimes even anti-semitic) comments, and we'll continue to do that. That's something we take very seriously, and of course, we do the same the other way round as well.
Thanks for the answer (and for what seems to be unflagging the comment). Having some experience moderating (of course, much smaller) communities, I understand it's impossible to keep everyone satisfied.
I, of course, can't judge the intent or the effort. What I can say is that I read the titles of virtually all 150+ vote submissions, rarely skipping any, and I saw 20+ pro-Palestinian ones and zero pro-Israeli ones. I think that's a fairly objective measure.
At some point I thought it might be intentional, but now I think it's just bias amplification: these submissions are flagged too fast and upvoted too slowly to get anywhere.
Coincidentally, I just used hn.algolia to look up one of your old comments where you describe being sandblasted, and was surprised to find the most recent use of "sandblasted" on HN is by you, linking to an algolia search of you saying "sandblasted".
Thank you sincerely for your sacrifice, Dan. Whenever I have an urge to flame, I picture my impending comment as one more grain of sand speeding towards your cranium, and instead I step away from the keyboard.
It's worth noting that this is the compute-optimal point, i.e., given fixed compute, the optimal ratio is 20:1.
Under the Chinchilla model, a larger model always performs better than a smaller one if both are trained on the same amount of data. I'm not sure whether that holds empirically, but 1-10B is probably a good guess for how large a model trained on 80B tokens should be.
Similarly, smaller models continue to improve beyond the 20:1 ratio, and current models are trained on much more data. You could train a better-performing model using the same compute, but it would be larger, which is not always desirable.
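As a rough sketch of what the 20:1 rule implies (my own illustration, assuming the common approximation that training compute C ≈ 6 * N * D for N parameters and D tokens):

    import math

    def chinchilla_optimal(compute_flops, tokens_per_param=20.0):
        # Training compute is roughly C = 6 * N * D; with the
        # compute-optimal ratio D = tokens_per_param * N, solve for N.
        n_params = math.sqrt(compute_flops / (6.0 * tokens_per_param))
        n_tokens = tokens_per_param * n_params
        return n_params, n_tokens

    n, d = chinchilla_optimal(1e22)  # e.g. ~1e22 FLOPs of training compute
    print(f"~{n / 1e9:.1f}B params, ~{d / 1e9:.0f}B tokens")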
Quantum volume is a good metric, but it's kind of a one-dimensional take. Almost no interesting circuit requires all-to-all connectivity, and superconducting QCs are bad at all-to-all connected circuits, so we can have interesting NISQ experiments without a particularly large QV.
It is not a one-dimensional take... it is a stress test of qubit gate fidelity [across all qubits involved in the circuit], state prep and measurement, lifetime (coherence), memory errors, etc.
Now, I agree that there are other great stress tests of quantum computer systems... but most of the industry agreed that quantum volume was a great metric several years ago. As many companies' systems have been unable to hit a decent QV, companies have pivoted away from QV to other metrics... many of which are half baloney.
> fidelity [across all qubits involved in the circuit]
I don't see a scenario in which the fidelity of 2QGs between two far-away qubits matters. Stress tests should be somehow related to the real tasks the system is intended to solve.
In the case of quantum computers, the tasks are either NISQ circuits or fault-tolerant computation, and in both cases you can run them just fine without applying 2QGs between far-away qubits, which would translate into a large number of swaps.
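To make the swap overhead concrete, here's a rough sketch (my own illustration, assuming a 1D chain with nearest-neighbour coupling and ~3 CNOTs per SWAP):

    def swap_overhead(q1, q2):
        # On a line of qubits with only nearest-neighbour 2-qubit gates,
        # a gate between q1 and q2 first needs the qubits routed next to
        # each other, which takes (distance - 1) SWAPs.
        distance = abs(q1 - q2)
        swaps = max(distance - 1, 0)
        extra_cnots = 3 * swaps  # a SWAP is typically compiled into 3 CNOTs
        return swaps, extra_cnots

    print(swap_overhead(0, 1))   # adjacent qubits: (0, 0)
    print(swap_overhead(0, 10))  # far apart: (9, 27) -- the error budget blows up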
If you're interested in applying Haar-random unitaries, then surely QV is an amazing metric, and systems with all-to-all connectivity are your best shot (coincidentally, Quantinuum keeps publishing their quantum volume results). It's just not that interesting of a task.