This AI Creates Detailed 3D Renderings from Thousands of Tourist Photos
A team of researchers at Google have come up with a technique that can combine thousands of tourist photos into detailed 3D renderings that take you inside a scene... even if the original photos used vary wildly in terms of lighting or include other problematic elements like people or cars.
The tech is called "NeRF in the Wild" or "NeRF-W" because it takes Google Brain's Neural Radiance Fields (NeRF) technology and applies it to "unstructured and uncontrolled photo collections" like the thousands of tourist photos used to create the demo you see below[1][2], and the samples in the video above[3].
It's basically an advanced, neural network-driven interpolation that manages to include geometric info about the scene while removing 'transient occluders' like people or cars and smoothing out changes in lighting.
[1] demo1.gif (36.75 MiB)
[2] demo2.gif (35.66 MiB)
[3] YouTube video (3m42s).
NeRF in the Wild: Neural Radiance Fields for Unconstrained Photo Collections (arXiv:2008.02268v2 [cs.CV])
(Score: 0) by Anonymous Coward on Sunday August 30 2020, @11:23PM
> Something doesn't quite add up with stories of this sort.
> Then you run the GIMP
Not to knock GIMP but I think you answered your own question.
> I don't understand "convolution". The GIMP's interface just gives you a 5x5 matrix to fill in.
Convolution is literally just matrix multiplication. [gimp.org]
> So I am unsure what the state of the art is on deblurring.
HQ predictive optical flow is some way off. [catalyzex.com]