
With the megapixel race already past the point of noticeable benefit to consumers, it looks like the next camera arms race will be the number of lenses your rig sports -- a team at Stanford is working on a 3D camera that uses 12,616 micro-lenses to generate high quality 3 megapixel images with self-contained "depth maps" that measure the distance to every object in the frame. The system works by focusing each lens above four different overlapping sensor arrays, which work in concert to determine depth -- just like your eyes. Unlike similar systems, the Stanford rig is able to use that data to create a depth map without lasers, prisms, or even complex calibration, which will allow the team to shrink the tech down to compact and cellphone camera size. Once it's ubiquitous, the teams says depth map information can be used to do anything from enhancing facial recognition systems to improving robot vision, but there's still a long way to go -- the team has just started trying to work out how to manufacture the system.
Reader Comments (Page 1 of 1)
Adam @ Mar 19th 2008 7:13PM
Wow, this is really quite neat! Can it model NURBS for me? Huh? Please?
Chuckles McGee @ Mar 19th 2008 7:57PM
All I want to know is "Will it blend?"
Wisam @ Mar 19th 2008 10:52PM
Well, I know what "Will it play Doom?" means but I also keep hearing "Will it blend?" can someone explains that to me?
Adam @ Mar 19th 2008 11:08PM
Check out www.willitblend.com (not sure if I can put websites here but here we go!)
mushrooshi @ Mar 20th 2008 1:30AM
FIRST
I for one welcome our doom playing, blending overlords.
Happy now?
MissUniverse @ Mar 19th 2008 7:16PM
Just imagine ONE DECADE from now - it will probably be a Video Camera
Matthew C @ Mar 20th 2008 1:50AM
OK- I read this and immediately thought of the seen in Minority report where he's playing the 3d-ish movies of his kid at the beach on the hologram thing. If it could shoot at a quick enough framerate- this would be able to shoot video just like that- w/ real depth, but from a single POV. Now, to design a reasonable system for playback...
austin @ Mar 19th 2008 7:20PM
i remember when 3mp with one lens was amazing. now they can do it with 12,616 simultaneously
Aguiluz @ Mar 19th 2008 7:21PM
Focusing this with shaky hands may prove a bit of a problem. Imagine the waiting time needed to focus all those lens.
MikeG @ Mar 20th 2008 3:52AM
That's what I was thinking haha!
mikedman_ak @ Mar 19th 2008 7:37PM
Combine this with a 3 axis inertial sensor and you could generate a point cloud model of a room just by panning the camera around! Cool implications for high-speed 3D modeling.
Adam @ Mar 19th 2008 7:59PM
With technology like this, you don't even need an inertial sensor to generate a point cloud. With the addition of a 3d depth map, it's very easy to identify points to track the 3d movement of the camera.
frank @ Mar 19th 2008 7:42PM
Does anyone have a link showing the output of one of these?
Loomis @ Mar 20th 2008 1:19AM
Frank, this sounds like it is based around plenoptic cameras (or integral photography), which have been around for awhile. If you visit http://graphics.stanford.edu/papers/lfcamera/, they have a good video that shows some of the uses of plenoptics circa 2004-2005, such as the "3D" capture, rotating viewpoints, and digital refocusing.
So, if plenoptic cameras have been around for so long, why is this one making the news? According to the press release, it sounds like this is the first time that the imaging, sensor elements, and electronics have been integrated together onto a dedicated chip -- as opposed to modifying a current camera to do the plenoptic thing.
You might also check out coded aperture photography as another interesting way of encoding 3D information into a single photograph (http://groups.csail.mit.edu/graphics/CodedAperture/, also with cool video), or digital holography as a more quantitative research tool.
Adam @ Mar 19th 2008 7:55PM
What an epic day, first we see that Verizon is going to offer an any device service, and now we've got depth mapping cameras. Bring on the robotics revolution! The next 10 years are going to make the .com boom look quaint.
Netmagix @ Mar 19th 2008 8:01PM
Here we go again....starting from....3mp
fischju @ Mar 19th 2008 8:06PM
The Google maps party van should get some of these.
eng177 @ Mar 24th 2008 11:49PM
"Once it's ubiquitous, the teams says depth map information can be used to do anything enhance facial recognition systems to improving robot vision..."
Where do I start on the grammatical problems? This is why the internet is ruining the english language - when even journalists won't proofread their work.
puxpux @ Mar 19th 2008 9:34PM
Bloggers are considered Journalists these days? Damn this pesky internet and all of it's confusing tubes!
james43062 @ Mar 19th 2008 9:21PM
ah, i believe that English should be capitalized. as should ...Ass Hat!
Hammond X @ Mar 19th 2008 10:54PM
How ironic.
Wisam @ Mar 19th 2008 10:54PM
and without the.
Basel @ Mar 20th 2008 4:11AM
The more pixels are shrunk, the less light each pixel will have to process at a given shutter speed.
The data on surrounding pixels could be aggregated, but the noise per pixel will be very high. Since pixels being digital by nature, low levels of data would be lost unless these pixels are as light sensitive per area compared to how small they are.
This camera will be very hard to use in low-light, or any movement.
ishism @ Mar 20th 2008 1:25AM
Looks like "The Fly" eyes are here.
Anyway, how are we going to see the 3d? On a special screen, through glasses or what?
Atanas Boev @ Mar 20th 2008 4:21AM
And here is the paper.
http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=4114959
[warning, link requires subscription :) ]
ebeasley @ Mar 20th 2008 9:37AM
I'd love bring a video camera based on this tech into work and film the whole place. Then I'd turn it into a Doom map. Fraggin co-workers (and my boss) FTW!