The performance graph is deceptive for two reasons: (1) Leaf with CuDNN v3 is a little slower than Torch with CuDNN v3, yet the bar for Leaf is positioned to the left of the one for Torch, and (2) there's a bar for Leaf with CuDNN v4, but not for Torch.
It's good to see alternatives to Torch, Theano, and TensorFlow, but it's important to be honest with the benchmarks so that people can make informed decisions about which framework to use.
And I don't believe the first point counts as deceptive; the bars are ordered by Forward ms, not by the sum of Forward and Backward. In both CuDNN v3 and v4, Leaf is faster than Torch by that metric (25 vs 28 for v4, 31 vs 33 for v3).
Yes, on their site they post Torch CuDNN v4 as faster than Leaf [0]. Seems exciting for an early release.
Can it get much faster than something like Torch? I would think that if CuDNN accounts for most of the computation time, it would be hard to see big improvements. Perhaps go the route of Neon and tune your GPGPU code like crazy [1, 2], or of MXNet and think about distributed computing performance [3].
> Leaf with CuDNN v3 is a little slower than Torch with CuDNN v3, yet the bar for leaf is positioned to the left of the one for Torch
I think that's because they're sorting by forward time rather than forward+backward. That would also explain why in the Alexnet benchmark Tensorflow (cuDNN v4) is to the left of Caffe (cuDNN v3) despite having a much taller bar overall.
That's rarely a problem where MRV (multiple return values) makes sense. Though you could always fix it by using anonymous structs whose fields are both named and positional (similar to Python's namedtuples).
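A minimal sketch of the namedtuple idea mentioned above, with hypothetical names (`DivResult`, `divmod_named`) chosen just for illustration:

```python
from collections import namedtuple

# A lightweight record whose fields are accessible both by name and
# by position -- the "named and positional" property described above.
DivResult = namedtuple("DivResult", ["quotient", "remainder"])

def divmod_named(a, b):
    q, r = divmod(a, b)
    return DivResult(q, r)

res = divmod_named(17, 5)
assert res.quotient == 3 and res.remainder == 2  # access by name
assert res[0] == 3 and res[1] == 2               # access by position
q, r = res                                       # unpacks like a plain tuple
```

The caller can ignore the names entirely and unpack positionally, which is why this avoids the usual fragility of multiple return values.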
I'm guessing that's the point at which the working set exceeds the L1 cache size. You can see a few more subtle dips in the performance graph at later points; these correspond to working set spilling out of the L2 and L3 caches.
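As a back-of-envelope check, assuming typical desktop cache sizes (these numbers are assumptions, not from the article) and 8-byte elements, the dips would be expected at roughly these working-set sizes:

```python
# Estimate where an array's working set spills out of each cache level.
# Cache sizes below are assumed, typical desktop values.
L1, L2, L3 = 32 * 1024, 256 * 1024, 8 * 1024 * 1024  # bytes (assumed)
ELEM = 8  # bytes per element, e.g. a double

for name, size in [("L1", L1), ("L2", L2), ("L3", L3)]:
    print(f"{name}: dip expected near {size // ELEM} elements")
```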
I've had great success with Blaze, despite the fact that it has received little publicity compared to alternatives like Eigen, Armadillo, etc. Blaze is consistently the leader of the pack in benchmarks, and even outperforms Intel MKL on the Xeon E5-2660 (the CPU for which the benchmark results are shown).
I've used Blaze for machine learning applications, where I've relied on the performance of elementwise operations and dense matrix multiplication on a single machine (the results advertised in the benchmark). Eigen has more functionality, but in my experience is not always optimized as well as Blaze. Neither has support for distributed computing, but I believe this is a problem that HPX is trying to address: https://github.com/STEllAR-GROUP/hpx
That's because direct solvers can't scale. If you want to solve a large (distributed over hundreds of nodes) sparse linear algebra problem as fast as possible, decades of research have been poured into efficient techniques (Krylov methods, Multigrid, preconditioners) for solving them iteratively.
Can't scale in a weak, strong, or asymptotic complexity sense? And for what sorts of problems (I assume you're thinking of 2D and 3D PDEs discretized with local basis functions)?
Yes, I'm thinking of discretizations of elliptic 2D/3D PDEs. They don't scale in the weak or strong sense, and they can't achieve O(n log n) asymptotic complexity due to fill-in from Cholesky/LU-style factorizations.
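For contrast, the iterative approach avoids factorization (and hence fill-in) entirely: a Krylov method only needs matrix-vector products. Here's a plain conjugate-gradient sketch in NumPy on the classic 1-D Poisson test matrix; this is the textbook unpreconditioned method, not what you'd use in production (there you'd reach for a preconditioned library solver such as PETSc or scipy.sparse.linalg):

```python
import numpy as np

def conjugate_gradient(A, b, tol=1e-10, max_iter=1000):
    """Solve A x = b for symmetric positive-definite A using only
    matrix-vector products -- no factorization, so no fill-in."""
    x = np.zeros_like(b)
    r = b - A @ x          # initial residual
    p = r.copy()           # initial search direction
    rs = r @ r
    for _ in range(max_iter):
        Ap = A @ p
        alpha = rs / (p @ Ap)
        x += alpha * p
        r -= alpha * Ap
        rs_new = r @ r
        if np.sqrt(rs_new) < tol:
            break
        p = r + (rs_new / rs) * p   # new direction, A-conjugate to the old ones
        rs = rs_new
    return x

# 1-D Poisson matrix (tridiagonal, SPD): a minimal elliptic test case.
n = 100
A = 2 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)
b = np.ones(n)
x = conjugate_gradient(A, b)
assert np.allclose(A @ x, b)
```

In a distributed setting the `A @ p` product is the only communication-heavy step, which is what makes these methods scale where direct factorization doesn't.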
If you want to generate LaTeX from Markdown, you can use Pandoc. Pandoc has various extensions to regular Markdown (including inline math, tables, etc.), so this gives you some flexibility when producing more complicated types of documents. In fact, Pandoc converts from Markdown to LaTeX to PDF when you choose PDF as the output format.
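A minimal sketch of invoking Pandoc from Python (file names here are hypothetical; assumes `pandoc` is on your PATH):

```python
import pathlib
import shutil
import subprocess

# A tiny Markdown file using one of Pandoc's extensions (inline math).
pathlib.Path("notes.md").write_text("# Title\n\nInline math: $e^{i\\pi} = -1$\n")

# Markdown -> standalone LaTeX; swap "-o notes.tex" for "-o notes.pdf"
# to let Pandoc run the Markdown -> LaTeX -> PDF pipeline itself.
cmd = ["pandoc", "notes.md", "--from=markdown", "--to=latex",
       "--standalone", "-o", "notes.tex"]

if shutil.which("pandoc"):
    subprocess.run(cmd, check=True)
else:
    print("pandoc not found; would run:", " ".join(cmd))
```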
This is an important statement and should be upvoted more. Case in point: "the Weierstrass approximation theorem states that every continuous function defined on a closed interval [a, b] can be uniformly approximated as closely as desired by a polynomial function."
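Spelled out formally, the quoted theorem says:

```latex
\forall f \in C[a,b],\; \forall \varepsilon > 0,\; \exists\, p \in \mathbb{R}[x] :\quad
\max_{x \in [a,b]} \lvert f(x) - p(x) \rvert < \varepsilon .
```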
> but a migration of research interest away from neural nets seemed increasingly promising, and today, the migration seems largely complete.
What are you talking about? Deep learning is one of the hottest areas of research today, and a lot of it has to do with neural networks. NN's are the state of the art in several domains. Case in point: http://image-net.org/challenges/LSVRC/2014/results. All of the top entries use convolutional networks; in fact, almost all of the entries do.
The fact that the loss function represented by a neural network can be highly nonconvex is what makes them so effective in the domains in which they are used. See this presentation by Yann LeCun for more info: http://www.cs.nyu.edu/~yann/talks/lecun-20071207-nonconvex.p...
"ML theory has essentially never moved beyond convex models, the same way control theory has not really moved beyond linear systems. Often, the price we pay for insisting on convexity is an unbearable increase in the size of the model, or the scaling properties of the optimization algorithm ... This is not by choice: nonconvex models simply work better.
Have you tried acoustic modeling in speech with a convex loss? ... To learn hierarchical representations (low-level features, mid- level representations, high-level concepts....), we need “deep architectures”. These inevitably lead to non-convex loss functions."
This isn't to say that NN's are going to solve all our problems, but to say that there has been a shift in interest away from NN's is absurd.
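The nonconvexity is easy to exhibit directly: permuting a network's hidden units leaves its function (and loss) unchanged, so any asymmetric minimizer has an equally good "twin", and the midpoint between the two fits worse, which a convex loss would forbid. A toy NumPy sketch (two-unit net and data invented for illustration):

```python
import numpy as np

# Tiny one-hidden-layer net: f(x) = v1*tanh(w1*x) + v2*tanh(w2*x)
def predict(theta, x):
    w1, w2, v1, v2 = theta
    return v1 * np.tanh(w1 * x) + v2 * np.tanh(w2 * x)

def mse(theta, x, y):
    return np.mean((predict(theta, x) - y) ** 2)

x = np.linspace(-2, 2, 50)
theta_a = np.array([1.0, -2.0, 1.0, 0.5])  # one minimizer
y = predict(theta_a, x)                    # data it fits exactly (loss 0)

# Swapping the two hidden units yields a different parameter vector
# that computes the identical function, hence also zero loss.
theta_b = np.array([-2.0, 1.0, 0.5, 1.0])
mid = (theta_a + theta_b) / 2

# A convex loss would require mse(mid) <= (mse(a) + mse(b)) / 2 == 0.
print(mse(theta_a, x, y), mse(theta_b, x, y), mse(mid, x, y))
```

Since the loss at the midpoint is strictly positive while both endpoints are at zero, the loss surface cannot be convex.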
Parent might be living in the recent past. There was a migration away from NNs in the 90s/early 00s, then Hinton and other people brought it back to life...with a vengeance :)
Exactly. The history of NNs is full of ups and downs, and they are becoming increasingly popular again in the form of Deep Learning, thanks to increasing cloud processing power and advancements by Hinton and others. Most of the traditional criticism of NNs relates to shallow nets, but deeper and far more complex structures, like those in animal brains, have not been explored enough.