Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

One thing computer vision is missing is making a depth map given a 2d image. You can look at a photograph and describe it as a 3D scene. This will be important for many fields.


This problem has already been solved with decent success using deep learning.

See: https://homes.cs.washington.edu/~jxie/pdf/deep3d.pdf


That's a good start. I was thinking you could generate unlimited training data by using a game engine. You'd have the actual 3D model for every single frame.


Yup, the community is on it!

http://www.cv-foundation.org/openaccess/content_cvpr_2016/ht...

http://www.cv-foundation.org/openaccess/content_cvpr_2016/ht...

https://link.springer.com/chapter/10.1007/978-3-319-46475-6_...

And there's more every week... Blender, Unity Engine, Unreal Engine, you name it. (Disclaimer: am author on one of these papers)


I'm aware that certain automotive companies are already doing this.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: