WHAT’S IN THIS CHAPTER?
WROX.COM CODE DOWNLOADS FOR THIS CHAPTER
The code downloads for this chapter are found at www.wrox.com/go/androidwearables
on the Download Code tab. The code is in the Chapter 6 download and the files are individually named according to the listing numbers noted throughout the chapter.
[…] the phone is like your brain; it’s controlling your TV or driving your car […]
— Justin Koh, Android Wear developer, Google I/O 2014
Do you remember Dick Tracy, the comic-strip detective who had a two-way wrist radio he used to talk to his boss at the police station? It is as if Android Wear is trying to re-create that vision of the future from almost 70 years ago.
Thirty-six years later, David Hasselhoff, playing the character Michael Knight, talked to his intelligent car, KITT, using a wristwatch on which he would get voice-generated answers from the car. According to some fan sites, the watch Hasselhoff wore was a modification of a Star Wars-branded watch manufactured by Bradley for the U.S. (I recommend that you search for images of both characters and their watches. We could not include any due to copyright restrictions.)
Android Wear offers half the communication capabilities of Dick’s and Michael’s radio watches. You can dictate commands to your Wear device, but it has no sound-playing capabilities. None of the existing Wear-enabled devices currently has any speakers or sound output. This is a forthcoming feature and will make it possible to stream audio via Buetooth toward external speakers.
On the other hand, all of them can be commanded via voice in multiple ways. This chapter introduces different types of voice input. The code listings give you basic information on how to use the different types of voice input. Check the chapter downloads for full code examples.
There are different types of voice interaction. You could give your smartwatch simple commands, and it could recognize a series of predefined commands by comparing the sound input to a table of existing recordings to provide you with immediate feedback.
Or the device could transfer the stream of audio, once digitized, to the smartphone or tablet it is connected to for the other device to operate a voice recognition algorithm. Sending the voice information to, for example, a cloud service for remote storage or treatment would fall into the same category. In essence it is the same case where the smartwatch sends the data to a different system for it to deal with it.
Finally, the smartwatch (or any other Wear-enabled device) could simply use voice interaction to react to a notification coming from the phone or tablet. This interaction pattern is slightly different, because the smartwatch pushes commands or data toward the device it’s paired to. In this case, the phone expects an answer in textual form that will be entered via voice. The screen real estate isn’t optimized for a virtual keyboard.
It’s not that easy to conceptually understand these cases by reading Android’s documentation site. Table 6.1 shows all the possibilities for the use of voice within Android Wear to help you choose the one you need for your project. In this chapter you’ll get to try all of them.
Table 6.1 Types of Voice Interaction
INTERACTION | DESCRIPTION |
Send a command to the host | The Wear device computes a voice recognition operation and sends a command to the host device. In the official documentation, this is called app-provided voice capabilities. |
Send a text to the host | The Wear device captures the audio and pushes it through its codec to produce a stream of bytes of audio that can be sent to the host device. |
Respond to a query from the host | When a notification arrives, it is possible to get a set of predefined answers to which the user can simply answer using voice. |
Respond to a query from the host in a text | When a notification arrives from, for example, an e-mail, it is possible to answer by talking into the Wear device and to get the speech transcribed into a text to send as an answer. |
Execute simple system commands | The Wear device computes a voice recognition operation based on a series of predefined intents. In the official documentation, this is called system-provided voice capabilities. |
Each interaction mode requires a different set of commands on the phone’s Android application package (APK) and the Wear device’s APK to get both programs to work in sync. The only exception is the last case—running commands without a connected device.
It is possible to enhance existing apps using voice input. Apps created prior to the launch of Wear cannot be used with voice commands unless they are remade to include the proper intents on the APK on the phone and the APK on the smartwatch. In other words, if you want your existing app to include voice interaction using Wear, you must do the following:
The Wear SDK has been created to make this integration seamless. Adding voice input takes just a couple keystrokes. But remember that even if the technology allows you to, for example, make every text input field in your application Wear-enabled, you still need to ensure that the interaction pattern makes sense.
The Android Wear platform provides several voice intents that are based on simple user actions. Table 6.2 lists the actions you can use. When you use voice commands, all your sentences need to start with “OK, Google.” This phrase is unique enough for the Wear device not to be triggered randomly.
Table 6.2 Predefined Voice Intents According to the Official Wear Documentation
NAME | SAMPLE PHRASES | INTENT (VALUES OF THE CONSTANT) | ||
Action | Category/MIME-Type | Extras | ||
Call a car/taxi | “OK, Google, get me a taxi.”“OK, Google, call me a car.” | com.google.
android.gms.
actions.RESERVE_
TAXI_RESERVATION |
||
Take a note | “OK, Google, take a note.”“OK, Google, note to self.” | android.intent.
action.SEND |
com.google
|
android.content
A string with note body |
Set alarm | “OK, Google, set an alarm for 8 a.m.”“OK, Google, wake me up at 6 tomorrow.” | android.intent.
action.SET_ALARM |
android.provider
An integer with the hour of the alarm android.provider
An integer with the minute of the alarm (These two extras are optional. Either neither or both are provided.) |
|
Set timer | “OK, Google, set a timer for 10 minutes.” | android.intent.
action.SET_TIMER |
android.provider
An integer in the range of 1 to 86,400 (the number of seconds in 24 hours) representing the length of the timer |
|
Start/Stop a bike ride | “OK, Google, start cycling.”“OK, Google, start my bike ride.”“OK, Google, stop cycling.” | vnd.google.
fitness.TRACK |
vnd.google.
|
actionStatus
A string with the value ActiveActionStatus
when starting and CompletedActionStatus
when stopping |
Start/stop a run | “OK, Google, track my run.”“OK, Google, start running.”“OK, Google, stop running.” | vnd.google.
fitness.TRACK |
vnd.google.
|
actionStatus
A string with the value ActiveActionStatus
when starting and CompletedActionStatus
when stopping |
Start/stop a workout | “OK, Google, start a workout.”“OK, Google, track my workout.”“OK, Google, stop workout.” | vnd.google.
fitness.TRACK |
vnd.google.
|
actionStatus
A string with the value ActiveActionStatus
when starting and CompletedActionStatus
when stopping |
Show heart rate | “OK, Google, what’s my heart rate?”“OK, Google, what’s my bpm?” | vnd.google.
fitness.VIEW |
vnd.google.
|
|
Show step count | “OK, Google, how many steps have I taken?”“OK, Google, what’s my step count?” | vnd.google.
fitness.VIEW |
vnd.google.
|
Try the capabilities of your smartwatch. Note that you cannot do so by using an emulator connected to a phone over the USB. According to different reports, it is not possible to use voice commands from the emulator to trigger events on the phone. This will not work for either system-predefined commands or app-predefined ones.
To launch an app, just talk to your Wear device. Say, “OK, Google, start HelloWorld.” If you still have your first application installed on the device, it should launch.
As hinted earlier, when talking about intents, it is possible to “hijack” an existing intent to launch your own application. You just need to modify the AndroidManifest.xml file to your app so that it is listed as one of the possible actions to take upon the arrival of the intents defined in Table 6.2. Remember that it is possible to hack any intent within Android. Listing 6-1 shows the filter you need to add to the activity you want to launch in your manifest file to get it to be triggered with the call of a timer. For example, you could launch your HelloWorld app when telling your smartwatch “OK, Google, set a timer for 10 minutes.”
Besides trying to take over an existing intent in your smartwatch, you also can use a different way in your wearable’s AndroidManifest.xml file to get the system to call your app when saying “OK, Google, start myApp.” By default your app’s android:name
attribute to the activity tag is .MyActivity
. (If you are looking at the downloadable example for this chapter, it is .MyActivityWearable
.) As shown in Listing 6-2, just add a second activity called StartRunActivity
with the proper intent filter, install the app on your watch, and it should work.
The wearable services within Wear consist of three APIs that help the application software on the phone or tablet easily communicate with the Wear device: the data API, the message API, and the node API. Each takes care of different parts of the information exchange between devices.
All can be called from both the phone app and the wearable one, and all your data can be used on both sides. WearableListenerService
implements all the listeners for the different APIs.
Using voice as an input, in any of the forms explained in Table 6.1, requires the devices to communicate with each other. Therefore, understanding the role of each API is important at this point.
The data API lets you put data on a sort of virtual cache shared among the connected devices. You don’t need to worry about anything but putting the data into storage and taking it from there at either side.
This is convenient when dealing with images, because you don’t have to send them repeatedly, thus saving battery life.
The message API sends byte arrays between the devices. Suppose you are capturing voice on the watch and sending it, after it is digitized, to the phone. As long as you push your audio information as a byte array, you can get direct access to the message API and send the information directly.
The node API checks when nodes enter the connection range. This function can be used to discover whether the phone is present. After a while it should be straightforward to simply put the device to sleep if too much time has passed since the last connection was detected.
In the following section you will write an example in which voice plays a big role. In this case you will start by creating a new project with apps for both a phone and a Wear device. Chapter 5 showed you how to create a new project from scratch involving both a mobile and a Wear device. This chapter gets you a little further. Its aim is getting you up and running with a hybrid development system using a phone and a Wear emulator that talk to each other.
It is possible to run almost any kind of scenario between emulators and real devices, knowing the limitations of launching applications or triggering intents. In these two examples you will experiment with a phone running Android’s version of KitKat (which should work on any later version) and an emulator of a Wear smartwatch. In this way you can send events from one to the other.
Notifications including an action to input text, such as replying to an e-mail, should normally launch an activity on the Wear device to input the text. As mentioned earlier, Wear devices—currently smartwatches—have little space for displaying a keyboard, so you can let users dictate a reply or provide predefined text messages using RemoteInput
.
In this first case you will make a wear and a phone app that communicate with each other. When launched, the phone application sends a request (in the form of a notification) for a voice answer on the Wear device.
This gets the Wear device to show a query for a voice answer onscreen. The user is prompted to talk to the device. The answer is captured by RemoteInput
, and the wear application displays the result onscreen while it sends it back to the phone application. Finally, the phone application shows the result on a text field.
The following sections show how this simple application is made.
Chapter 4 explored the different types of notifications on a Wear device. This time you will use the simplest one, a button running on your phone’s screen, to trigger an event on the smartwatch emulator.
Follow these steps to create your new Android Wear project:
WatchViewStub
as the main layout file. Click Next.
You should now have an Android Studio project that has two application structures within it called mobile and wear.
The gradle file for the mobile app needs to include a reference to a series of dependencies needed for this app bundle to compile properly. Listing 6-3 shows the file with the new code highlighted.
By default the app comes with a simple layout featuring the default icon for apps as well as the classic “Hello world!” message, as shown in Figure 6.7.
A good way to continue is to add a button to send notifications to the Wear device when pressed, as well as to add the classic Wrox logotype to the application. To implement this modification, you need to add some code to the activity_my_phone.xml definition file. Listing 6-4 highlights the changes I decided to implement in this case. I just added code to implement a button with the callback function simpleNotification
. Figure 6.8 shows the result.
To change the logotype, you need to add Wrox’s logotype to the resources folder to a subfolder called drawable-xxhdpi. That will let you access it from within the AndroidManifest.xml file for your phone. Listing 6-5 shows the modifications on the default manifest file to include the new logotype as well as assign a launch mode.
Figure 6.9 shows the app with the new image.
Adding the callback function is not a big deal in this case. Listing 6-6 highlights the additions you need to make to the default source code for it to run.
After this code has been uploaded to a phone, when the button is clicked, the smartwatch simulator should capture the event with its default configuration. In other words, we haven’t yet uploaded an app to the watch to do anything in conjunction with the handheld.
Figure 6.10 shows the notification to the watch on the simulator.
This SimpleNotification app is a good way to play around with the voice capabilities on the watch. It allows you to trigger voice input response requests when the notification is issued. This is very helpful for testing your new Wear-based apps.
Given the previous SimpleNotification example, you will next build an application that requests an action from the user. The idea is that the answer is input via voice. Because the simulator has no voice input capabilities, you will use the keyboard to type your answers.
In this case, you will create an app for your handheld device that will be an improvement to the previous one. Below the button to send the notification, you will add a text box to capture the response from the user.
Figure 6.11 shows this new addition to the app.
When declaring the intent for the Wear device to issue its answer, you will use the RemoteInput
class. This class tells the smartwatch to be ready to reply to the notification using voice as an input. The watch has everything that is needed for it to answer. In other words, you just need to make a phone app and not a Wear app in this case.
The callback function needs to include all the code related to the RemoteInput
call. Listing 6-7 shows this new function added to the SimpleNotification app you worked with earlier. The new notification callback function is called voiceNotification()
, which means I have removed the previous one.
You will notice the three strings defined at the beginning of the class. Those are needed when creating the intent that we will use to capture the data coming back from the smartwatch. An intent has four parameters:
this
in this caseDemandIntentReceiver
Both the extra information and the type of action are user-defined. You need to define a string that will contain the value for those. This is how the broadcast receiver (which we will define next) can filter the data coming in.
We will capture the response from the Wear device by listening to a broadcast it will issue through the intent: new Intent(this, DemandIntentReceiver.class)
. To do this, just add a new class to your project. In the first location you could just log the incoming data to logcat. In a later iteration of the code you could populate the text field on the app with the data coming from the smartwatch. This new class is shown in Listing 6-8.
At this level, we are basically getting everything that comes to the broadcast receiver and just capturing EXTRA_VOICE_REPLY
. It is possible to use the other filters defined in the app to separate the data from other broadcasts that are happening. However, doing so is not necessary at this point. Check the provided code examples for Chapter 6 to see yet another iteration of DemandIntentReceiver.java.
To get the app ready to show the data received from the Wear device on the screen, you need to add a text field to the layout, as shown in Listing 6-9. You will also notice that here you need to change the name to the notification’s callback function on the button’s definition.
The new class has to be added to the manifest file within a receiver tag so that the program launches the receiver the way it should. Listing 6-10 shows the additions to the manifest file.
We have added a new text field to the app. This requires a label. The name should be added as part of the strings.xml file in the resources folder. Listing 6-11 shows the new addition.
Launch the app on your phone in parallel to the simulator. When you click the phone’s notification button, the simulator shows the actual notification inviting you to swipe to the left and reply via voice, as shown in Figure 6.12.
After you swipe, you get a button you can use to reply to the notification, as shown in Figure 6.13.
If you click that icon, the voice input screen appears for you to type in your voice command. Figure 6.14 shows the input screen on the Wear device. Just type something (I assume you are using the emulator, just like I did, and therefore need to type first) and press Enter.
Figure 6.15 shows the screen after you enter the data.
The broadcast receiver does the rest by showing the information you entered on the logcat. If you filter the incoming data with “wrox” as a filter, you should see the following three lines (to simplify, I took out the timestamps):
16199/com.wiley.wrox.chapter6.simplenotification V/wrox﹕ Handheld sent notification
16199/com.[...].simplenotification V/wrox﹕ got data
16199/com.[...].simplenotification V/wrox﹕ User reply from wearable: hola
Finally, you want to show the data you got from the Wear device on the app’s screen. This is done by adding a LocalBroadcastManager
to the main activity. Broadcast receivers exist for short periods of time (they are automatically killed by the operating system after a few seconds), therefore it is not possible to modify the UI on the main activity from them. The technique I recommend using to inform the UI of the arrival of data from the wearable is to add another receiver in the main activity and register it for an intent that will be produced by the DemandIntentReceiver
class after getting valid data from the smartwatch. This implies making a couple of changes to both the DemandIntentReceiver.java
file, but also to the MyActivityPhone.java
file. Listing 6-12 shows how to improve the broadcast receiver to add the received text to the field on the app.
Listing 6-13 includes the additions to make to MyActivityPhone.java for it to register a LocalBroadcastReceiver handler and capture the intents carrying the information coming from the wearable after being proxied by the other Broadcast. This operation might seem redundant, it is possible to make an inner class to the main activity capable of receiving the broadcasts from the wearable. However, this way allows you to easily get the code to grow in the form of new classes contained in external files.
Figure 6.16 shows the app on the phone after the data is retrieved and displayed on the designated text field.
This chapter was an overview of the power of using voice as an input to Wear devices. You can use different types of voice inputs. This chapter used “free speech” input, where you talk to your smartwatch as an answer to a notification sent from the phone.
You went through a step-by-step example to see how you can build your own voice-based apps starting from a default project created by Android Studio.
You also saw how to launch your existing apps using the voice command “OK, Google, YourAppName.”
There is a lot more you can do with voice. I recommend you take a look at the suggested readings as a way to continue your exploration.
In the next chapter you’ll see how to extend the basic voice example created in this chapter to send your commands to the net and back.
https://developer.android.com/training/wearables/notifications/voice-input.html
.