How 3rd Party apps might integrate with Siri

Senior Writer

Updated Fri, Mar 30, 2012, 3:00 PM·4 min read

Third-party integration into Siri remains at the top of many of our TUAW wish lists. Imagine being able to say "Play something from Queen on Spotify" or even "I want to hear a local police scanner." And Siri replying, "OK, you have two apps that have local police scanners. Do you want ScannerPro or Wunder Radio?"

So why doesn't Siri do that?

Well, first of all, there are no third party APIs. Second, it's a challenging problem to implement. And third, it could open Siri to a lot of potential exploitation (think of an app that opens every time you say "Wake me up tomorrow at 7:00 AM" instead of deferring to the built-in timer).

That's why we sat down and brainstormed how Apple might accomplish all of this safely using technologies already in-use. What follows is our thought experiment of how Apple could add these APIs into the iOS ecosystem and really allow Siri to explode with possibility.

Ace Object Schema. For anyone who thinks I just sneezed while typing that, please let me explain. Ace objects are the assistant command requests used by the underlying iOS frameworks to represent user utterances and their semantic meanings. They offer a context for describing what users have said and what the OS needs to do in response.

The APIs for these are private, but they seem to consist of property dictionaries, similar to property lists used throughout all of Apple's OS X and iOS systems. It wouldn't be hard to declare support for Ace Object commands in an application Info.plist property lists, just as developers now specify what kinds of file types they open and what kind of URL addresses they respond to.

Practicality. If you think of Siri support as a kind of extended URL scheme with a larger vocabulary and some grammatical elements, developers could tie into standard command structures (with full strings files localizations of course, for international deployment).

Leaving the request style of these commands to Apple would limit the kinds of requests initially rolled out to devs but it would maintain the highly flexible way Siri users can communicate with the technology.

There's no reason for devs to have to think of a hundred ways to say "Please play" and "I want to hear". Let Apple handle that -- just as it handled the initial multitasking rollout with a limited task set -- and let devs tie onto it, with the understanding that these items will grow over time and that devs could eventually supply specific localized phonemes that are critical to their tasks.

Handling. Each kind of command would be delineated by reverse domain notation, e.g. com.spotify.client.play-request. When matched to a user utterance, iOS could then launch the app and include the Ace dictionary as a standard payload. Developers are already well-acquainted in responding to external launches through local and remote notifications, through URL requests, through "Open file in" events, and more. Building onto these lets Siri activations use the same APIs and approaches that devs already handle.

Security. I'd imagine that Apple should treat Siri enhancement requests from apps the same way it currently works with in-app purchases. Developers would submit individual requests for each identified command (again, e.g. com.spotify.client.play-request) along with a description of the feature, the Siri specifications -- XML or plist, and so forth. The commands could then be tested directly by review team members or be automated for compliance checks.

In-App Use. What all of this adds up to is an iterative way to grow third party involvement into the standard Siri voice assistant using current technologies. But that's not the end of the story. The suggestions you just read through leave a big hole in the Siri/Dictation story: in-app use of the technology.

For that, hopefully Apple will allow more flexible tie-ins to dictation features outside of the standard keyboard, with app-specific parsing of any results. Imagine a button with the Siri microphone that developers could add directly, no keyboard involved.

I presented a simple dictation-only demonstration of those possibilities late last year. To do so, I had to hack my way into the commands that started and stopped dictation. It would be incredibly easy for Apple to expand that kind of interaction option so that spoken in-app commands were not limited to text-field and text-view entry, but could be used in place of touch driven interaction as well.

Engadget
The best Android phones for 2024
If you're looking for a new Android phone, check out our guide to the best handsets on the market from budget to flagship and everything in between.
Engadget
test-setcion
test-setcion test-setcion test-setcion
Engadget
A bipartisan bill is looking to end Section 230 protections for tech companies
House Energy and Commerce Committee Chair Cathy McMorris Rodgers and ranking member Frank Pallone, Jr. have released a bipartisan draft looking to sunset Section 230 of the Communications Decency Act, because it has "outlived its usefulness."
Engadget
Dyson’s first dedicated hard floor cleaner doesn’t suck
The Wash G1 is the company’s debut hard-floor cleaner, and it swaps suction for high-speed rollers, water and nylon bristles. It’ll go on sale later this year for $700.
Engadget
iPad Air (2024) review: Of course this is the iPad to get
Apple’s updated iPad Air keeps everything it did right before and adds a more powerful chip and a big screen option.
Engadget
iPad Pro (2024) review: So very nice, and so very expensive
Apple’s latest iPad Pro is super powerful as well as incredibly expensive. It’s a delight to use, but is it right for you?
Engadget
Meta’s next hardware project might be AI-infused headphones with cameras
Meta is in the early stages of “exploring” designs for AI-enabled headphones, according to a new report in The Information.
Engadget
What to expect at Google I/O 2024: Gemini, Android 15, WearOS and more details
Google's I/O developer conference is right around the corner. Here's what we're expecting to see, including Android 15 details and a whole bunch of AI news.
Engadget
iOS 17.5 is here with support for web-based app downloads in the EU
Apple has rolled out iOS 17.5, which includes a cross-platform alert system for unwanted Bluetooth trackers that it worked on with Google. Also new are a web-based app distribution option in the EU and a daily word game for Apple News+.
Engadget
OpenAI claims that its free GPT-4o model can talk, laugh, sing and see like a human
The new model accepts any combination of text, audio and images as input and can generate an output in all three formats.
Engadget
Amazon workers become the first to unionize at one of the company's Canadian warehouses
Amazon workers at a Quebec facility have become the first to form a union at one of the company's Canadian warehouses.
Engadget
Google teases new camera-powered AI feature one day ahead of I/O
Google is teasing an intriguing new AI feature one day ahead of its IO developer conference.
Engadget
Apple and Google roll out a cross-platform feature to tackle unwanted Bluetooth trackers
Apple and Google are rolling out a feature that will alert iOS and Android users when an unknown Bluetooth tag appears to be tracking them.
Engadget
Google’s Project Starline video conferencing tech is coming to offices
Google just announced that it has teamed up with HP to bring its futuristic Project Starline video conferencing system to enterprise clients. It even integrates with Google Meet and Zoom.
Engadget
A free PS1 emulator for iPhone is burning up the App Store charts
A free PS1 emulator for iPhone is burning up the App Store charts, currently resting at number six. The app is called Gamma and offers on-screen virtual controls, but allows integration with Bluetooth controllers and keyboards.
Engadget
The best midrange smartphones for 2024
Here's a list of the best midrange smartphones you can buy, as chosen by Engadget editors.
Engadget
The Rogue Prince of Persia is delayed because Hades II is a juggernaut
The Rogue Prince of Persia was supposed to debut in early access on May 14, but Evil Empire and Ubisoft have delayed it to get out of the way of Hades II.
Engadget
The M2 iPad Air is $30 off if you preorder at Amazon
The new M2 iPad Air arrives this week and you can already snap it up for a modest discount if you preorder on Amazon.
Engadget
Pick up this Anker 10,000mAh magnetic power bank for only $32
This Anker 10,000mAh magnetic power bank is on sale via Amazon for $32. That’s a discount of more than 20 percent.
Engadget
Apple's 10th-gen iPad hits a new low of $334
Amazon has marked down Apple's 10th-generation iPad to an all-time low price.

How 3rd Party apps might integrate with Siri

Latest Stories

The best Android phones for 2024

test-setcion

A bipartisan bill is looking to end Section 230 protections for tech companies

Dyson’s first dedicated hard floor cleaner doesn’t suck

iPad Air (2024) review: Of course this is the iPad to get

iPad Pro (2024) review: So very nice, and so very expensive

Meta’s next hardware project might be AI-infused headphones with cameras

What to expect at Google I/O 2024: Gemini, Android 15, WearOS and more details

iOS 17.5 is here with support for web-based app downloads in the EU

OpenAI claims that its free GPT-4o model can talk, laugh, sing and see like a human

Amazon workers become the first to unionize at one of the company's Canadian warehouses

Google teases new camera-powered AI feature one day ahead of I/O

Apple and Google roll out a cross-platform feature to tackle unwanted Bluetooth trackers

Google’s Project Starline video conferencing tech is coming to offices

A free PS1 emulator for iPhone is burning up the App Store charts

The best midrange smartphones for 2024

The Rogue Prince of Persia is delayed because Hades II is a juggernaut

The M2 iPad Air is $30 off if you preorder at Amazon

Pick up this Anker 10,000mAh magnetic power bank for only $32

Apple's 10th-gen iPad hits a new low of $334

About

Sections

Contribute

Buying Guides