For me my test for multivariable calculus text is their treatment of the chain rule. If the book says something like “derivative of a composition is the composition of the derivatives” it’s good. If they instead say something involving a sigma and things like ∂f/∂x ∂x/∂t it’s likely bad.
You need to think of a derivative as just a function that takes a function, a point, and returns a linearized function that best approximates the original function at that point.
So basically the chain rule states that if you have two functions F, G composed together and you want to find the derivative (the linearized approximation), you simply compute the linearized function for both F and G, and compose the linearized version afterwards.
But the unfortunate reality is that too many textbooks formulate the chain rule in such a complicated manner that it obscures the simplicity and elegance of the chain rule.
Thank you for the explanations. I agree with you that this more abstract view of the chain rule (and the derivative itself) is superior to the sum-of-products formula one usually sees in a first course in multivariable calculus, but I feel most students have to learn the complicated, technical version first before they can see the beauty of the more abstract one.