Of course cars can move their dozen or so cameras hundreds of times a second. But good luck getting a DNN to process all of that on top of the tasks the system already has to handle, like identifying traffic lights and pedestrians.
The head of Tesla AI has already stated that even with the new Nvidia hardware they struggle to meet the computational requirements.
Those are two largely unrelated problems. Reconstructing geometry from multiple views is a separate (and very well understood) problem from the semantic analysis needed to understand what the shapes mean.
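
To give a sense of how well understood the geometry side is, here is a minimal sketch of textbook linear (DLT) triangulation of a single point from two views. It is not anything a particular carmaker is known to run; the camera matrices `P1` and `P2`, the test point, and the helper names `triangulate` and `project` are all made up for illustration, and real pipelines also have to deal with calibration, matching, and noise.

```python
import numpy as np

def triangulate(P1, P2, x1, x2):
    """Linear (DLT) triangulation of one 3D point from two views.

    P1, P2: 3x4 camera projection matrices.
    x1, x2: (u, v) image coordinates of the same point in each view.
    """
    # Each view contributes two linear constraints on the homogeneous point X.
    A = np.stack([
        x1[0] * P1[2] - P1[0],
        x1[1] * P1[2] - P1[1],
        x2[0] * P2[2] - P2[0],
        x2[1] * P2[2] - P2[1],
    ])
    # The solution is the right singular vector with the smallest singular value.
    _, _, Vt = np.linalg.svd(A)
    X = Vt[-1]
    return X[:3] / X[3]

def project(P, X):
    """Project a 3D point into an image using projection matrix P."""
    x = P @ np.append(X, 1.0)
    return x[:2] / x[2]

# Hypothetical setup: identity intrinsics, second camera shifted 1 unit along x.
P1 = np.hstack([np.eye(3), np.zeros((3, 1))])
P2 = np.hstack([np.eye(3), np.array([[-1.0], [0.0], [0.0]])])

X_true = np.array([0.5, 0.2, 4.0])
X_est = triangulate(P1, P2, project(P1, X_true), project(P2, X_true))
print(X_est)
```

With noise-free inputs the recovered point matches the original up to floating-point error. Note that nothing in this step involves a neural network, which is the point being made: recovering where the shapes are is a different job from deciding what they are.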