Cornell researchers taught a robot to take Airbnb photos

Humanity has taken its first tentative steps towards a future with robo-realtors.

Former Senior Editor

Wed, Mar 16, 2022, 9:30 AM·4 min read

Aesthetics is what happens when our brains interact with content and go, “ooh pretty, give me more of that please.” Whether it’s a starry night or The Starry Night, the sound of a scenic seashore or the latest single from Megan Thee Stallion, understanding how the sensory experiences that scintillate us most deeply do so has spawned an entire branch of philosophy studying art, in all its forms, as well as how it is devised, produced and consumed. While what constitutes “good” art varies between people as much as what constitutes porn, the appreciation of life’s finer things is an intrinsically human endeavor (sorry, Suda) — or at least it was until we taught computers how to do it too.

The study of computational aesthetics seeks to quantify beauty as expressed in human creative endeavors, essentially using mathematical formulas and machine learning algorithms to appraise a specific piece based on existing criteria, reaching (hopefully) an equivalent opinion to that of a human performing the same inspection. This field was founded in the early 1930s when American mathematician George David Birkhoff devised his theory of aesthetics, M=O/C, where M is the aesthetic measure (think, a numerical score), O is order and C is complexity. Under this metric simple, orderly pieces would be ranked higher — i.e. be more aesthetically pleasing — than complex and chaotic scenes.

German philosopher Max Bense and French engineer Abraham Moles both, and independently, formalized Birkoff’s initial works into a reliable scientific method for gauging aesthetics in the 1950s. By the ’90s, the International Society for Mathematical and Computational Aesthetics had been founded and, over the past 30 years, the field has further evolved, spreading into AI and computer graphics, with an ultimate goal of developing computational systems capable of judging art with the same objectivity and sensitivity as humans, if not superior sensibilities. As such, these computer vision systems have found use in augmenting human appraisers’ judgements and automating rote image analysis similar to what we’re seeing in medical diagnostics, as well as grading video and photographs to help amateur shutterbugs improve their craft.

Recently, a team of researchers from Cornell University took a state of the art computational aesthetic system one step further, enabling the AI to not only determine the most pleasing picture in a given dataset, but capture new, original — and most importantly, good — shots on its own. They’ve dubbed it, AutoPhoto, its study was presented last fall at the International Conference on Intelligent Robots and Systems. This robo-photographer consists of three parts: the image evaluation algorithm, which evaluates a presented image and issues an aesthetic score; a Clearpath Jackal wheeled robot upon which the camera is affixed; and the AutoPhoto algorithm itself, which serves as a sort of firmware, translating the results from the image grading process into drive commands for the physical robot and effectively automating the optimized image capture process.

For its image evaluation algorithm, the Cornell team led by second year Masters student Hadi AlZayer, leveraged an existing learned aesthetic estimation model, which had been trained on a dataset of more than a million human-ranked photographs. AutoPhoto itself was virtually trained on dozens of 3D images of interior room scenes to spot the optimally composed angle before the team attached it to the Jackal.

When let loose in a building on campus, as you can see in the video above, the robot starts off with a slew of bad takes, but as the AutoPhoto algorithm gains its bearings, its shot selection steadily improves until the images rival those of local Zillow listings. On average it took about a dozen iterations to optimize each shot and the whole process takes just a few minutes to complete.

“You can essentially take incremental improvements to the current commands,” AlZayer told Engadget. “You can do it one step at a time, meaning you can formulate it as a reinforcement learning problem.” This way, the algorithm doesn’t have to conform to traditional heuristics like the rule of thirds because it already knows what people will like as it was taught to match the look and feel of the shots it takes with the highest-ranked pictures from its training data, AlZayer explained.

“The most challenging part was the fact there was no existing baseline number we were trying to improve,” AlZayer noted to the Cornell Press. “We had to define the entire process and the problem.”

Looking ahead, AlZayer hopes to adapt the AutoPhoto system for outdoor use, potentially swapping out the terrestrial Jackal for a UAV. “Simulating high quality realistic outdoor scenes is very hard,” AlZayer said, “just because it's harder to perform reconstruction of a controlled scene.” To get around that issue, he and his team are currently investigating whether the AutoPhoto model can be trained on video or still images rather than 3D scenes.

Engadget
The world's leading AI companies pledge to protect the safety of children online
Leading artificial intelligence companies including OpenAI, Microsoft, Google, Meta and others have jointly pledged to prevent their AI tools from being used to exploit children and generate child sexual abuse material (CSAM).
52m ago
Engadget
Tesla previews ride-hailing experience ahead of August robotaxi unveil
Tesla has shown off a preview of an upcoming ride-hailing feature in its app ahead of an August robotaxi unveiling.
1h ago
Engadget
Roland’s mobile podcasting studio gives you a mic and streaming app for $140
Roland has a new on-the-go podcasting setup with an eye-catching price. The company’s Go:Podcast studio includes a USB condenser mic and a companion app that can set up streaming to platforms like YouTube, Twitch and Facebook.
3h ago
Engadget
Ray-Ban Meta smart glasses do the AI thing without a projector or subscription
The Ray-Ban Meta smart glasses have adopted multimodal AI features. This allows the glasses to describe the world around you and translate languages.
5h ago
Engadget
Samsung's Galaxy S24 Ultra is on sale for its lowest price yet at Amazon and Best Buy
Samsung's Galaxy S24 Ultra smartphone is cheaper than ever after a $200 discount.
5h ago
Engadget
Amazon’s updated grocery delivery program has some strings attached
After asserting itself as an overshadowing presence in retail, Amazon is still experimenting with ways to leave a similar mark with grocery shopping. The company’s latest tweak to the service lowers the minimum price for free grocery deliveries to $35.
5h ago
Engadget
8BitDo's Nintendo-style Retro Mechanical Keyboard hits a new low of $70 at Woot
The Nintendo-inspired 8BitDo Retro Mechanical Keyboard is on sale for a new low of $70 at Amazon subsidiary Woot.
5h ago
Engadget
Your old Rock Band guitars now work in Fortnite Festival
Fortnite's Rock Band-style Festival mode now supports Rock Band 4 guitars. Meanwhile, Billie Eilish has joined the game as its latest music icon.
6h ago
Engadget
Elon Musk says it's his turn to have the remote
X just announced a smart TV app for streaming video content. X TV may or may not launch at some point in the near or far future.
6h ago
Engadget
Nobody needs to spend $160 on a gaming mouse, but Razer’s new Viper V3 Pro is excellent anyway
Razer has announced the Viper V3 Pro, its latest premium wireless gaming mouse. Here are our hands-on impressions.
7h ago
Engadget
Apple will host a virtual event on May 7th, ahead of WWDC
Apple isn't waiting until WWDC to make some announcements. The company will hold a virtual event on May 7 and all signs suggest it'll have new iPads to show off.
7h ago
Engadget
The best noise-canceling headphones for 2024
Noise cancellation is a primary feature on most flagship, over-ear headphones. If you're looking to get a pair of cans that can truly block out the world, these are the best noise-canceling headphones you can get today.
9h ago
Engadget
The rebuilt Sonos app focuses on getting you to your tunes faster
Sonos has rebuilt its mobile app from the ground up to make it easier for people to get to their favorite content, regardless of what service it comes from.
9h ago
Engadget
Castlevania fan uncovers new Konami code in 1999 game
A new Konami Code has been discovered hidden inside Castlevania's code, and it changes gameplay so drastically that it aficionados will want to give it a fresh try.
10h ago
Engadget
The best travel gear for graduates
For those recent grads itching to get away after a busy semester, these are the best travel gifts you can get them.
10 months ago
Engadget
Rivian offers (up to) $5,000 discount if you trade in your gas-powered truck
Rivian will give you up to around $5,470 in discount if you trade in an eligible gas-powered truck or SUV when you purchase or lease a qualifying R1 electric vehicle package in the US and Canada.
10h ago
Engadget
The Morning After: Meta teases a limited-edition Quest headset inspired by Xbox
The biggest news stories this morning: Grindr sued for allegedly sharing users’ HIV status and other info with ad companies, What we watched: Bluey’s joyful finales, Amazon halts drone deliveries in California, but kicks off tests in Phoenix
11h ago
Engadget
Metaphor: ReFantazio, a fantasy RPG from the Persona 5 team, comes out in October
Metaphor: ReFantazio is coming out on October 11, and you can pre-order it right now.
12h ago
Engadget
Microsoft's lightweight Phi-3 Mini model can run on smartphones
Microsoft has unveiled its latest light AI model called the Phi-3 Mini designed to run smartphones and other devices.
12h ago
Engadget
Adobe Photoshop's latest beta makes AI-generated images from simple text prompts
Adobe will now let you use AI to generate images directly within Photoshop. That's just the beginning.
13h ago

Cornell researchers taught a robot to take Airbnb photos

Humanity has taken its first tentative steps towards a future with robo-realtors.

Latest Stories

The world's leading AI companies pledge to protect the safety of children online

Tesla previews ride-hailing experience ahead of August robotaxi unveil

Roland’s mobile podcasting studio gives you a mic and streaming app for $140

Ray-Ban Meta smart glasses do the AI thing without a projector or subscription

Samsung's Galaxy S24 Ultra is on sale for its lowest price yet at Amazon and Best Buy

Amazon’s updated grocery delivery program has some strings attached

8BitDo's Nintendo-style Retro Mechanical Keyboard hits a new low of $70 at Woot

Your old Rock Band guitars now work in Fortnite Festival

Elon Musk says it's his turn to have the remote

Nobody needs to spend $160 on a gaming mouse, but Razer’s new Viper V3 Pro is excellent anyway

Apple will host a virtual event on May 7th, ahead of WWDC

The best noise-canceling headphones for 2024

The rebuilt Sonos app focuses on getting you to your tunes faster

Castlevania fan uncovers new Konami code in 1999 game

The best travel gear for graduates

Rivian offers (up to) $5,000 discount if you trade in your gas-powered truck

The Morning After: Meta teases a limited-edition Quest headset inspired by Xbox

Metaphor: ReFantazio, a fantasy RPG from the Persona 5 team, comes out in October

Microsoft's lightweight Phi-3 Mini model can run on smartphones

Adobe Photoshop's latest beta makes AI-generated images from simple text prompts

About

Sections

Contribute

Buying Guides