feature article
Subscribe Now

Voice Activation Gets More Efficient

Ambiq Voice-on-SPOT Development Kit Says Talk is Cheap

Science fiction is all about “what if.” What if there were no gravity? What if apes developed an advanced civilization after our own? What if we’re living in a computer simulation? 

What if voice-activated gadgets used a lot less power? Would that change the way people use them, or make them more useful, or open up new applications? What if?

Tiny fabless chip company Ambiq spent a lot of effort to answer that question. As Marc Miller, Ambiq’s Director of Solutions Marketing, describes it, “We think there’s a big untapped opportunity for voice apps.” Right now, people talk to the voice assistant in their phones, and maybe to an Amazon Echo or their car’s navigation system, but that’s about it. There are voice-activated TV remotes, but few people seem to use them. There are even voice-activated refrigerators and microwave ovens, but very few people talk to those. On purpose, I mean. 

Ambiq believes that power is the issue and has leveraged its unusual SPOT (sub-threshold power optimized technology) design and fabrication technique to prove the point. SPOT enables very low-power digital devices, like the company’s Apollo4 MCU family. Now the company is pairing its MCUs with a software development kit for voice-activated applications. 

Current voice-activated gadgets are compromised, says Ambiq, because they’re not very good. Push-to-talk buttons are a necessary evil to reduce power, not to help the user. Early smartphones, cars, and TV remotes all needed a pushbutton wakeup for the voice feature because leaving it on 24/7 used too much power. Changing the power equation would change the usage model. 

A second problem is distance. Most voice-activated devices need to be nearby, and you need to talk directly to them. That’s because it’s hard to separate voice commands from background noise, and it gets exponentially harder with distance. “Near field” devices will work within about a 1-meter range, but a “far field” device with a 3-meter range would be a game changer. No more walking across the room to talk to Alexa. 

Ambiq thinks it’s solved both of these problems with its Apollo-based hardware and software development kit, called Voice-on-SPOT (VoS). It comes bundled with a Vesper analog microphone and software from DSP Concepts, Sensory, and Retune DSP. Miller says analog mics like Vesper’s use about one-tenth the power of digital mics, which have their own (comparatively inefficient) A/D converters. Better to let the low-power Apollo MCU do that, he says. 

To jump to the conclusion, Ambiq says a voice-activated remote control using its technology can run nonstop for well over a year on a pair of AAA batteries. And it can handle far-field (3 meter) activation with just a single microphone, not an array of microphones like other products. As a result, Ambiq thinks this may change the game and make voice activation viable for smartwatches, game controllers, smart TVs, wireless speakers, wearables, and other consumer gizmos. 

As part of its tests, Ambiq conducted weeklong trials in employees’ homes. They would use the voice-activation features (or not) in a real-world setting with pets, children, background noise, and lost remotes under the sofa. The intent was to profile hardware and software duty cycles, measure performance, gauge accuracy, and compare near and far behavior. 

After a week of near-field testing, they found that the system was in its dormant state for about 95% of the total time, or a little under 23 hours per day. In this mode, the microphone is awake but the MCU is stopped. If the mic detects a spike in sound pressure level, it wakes the MCU, which ticks over at 1 MHz in low-power mode to do a first-pass analysis of the signal. This intermediate state averaged only about 15 minutes, or 1%, of each day. 

If that initial analysis decides that someone was speaking, the MCU would accelerate to 18 MHz and begin processing the audio to detect wake words or commands. This third mode totaled about an hour out of each day, or 4% of the total time. 

Far-field testing in the same household changed the percentages slightly. Dormant time fell by about an hour per day, to 90% from 95%, with the intermediate and active duty cycles taking up the slack equally (2% and 8%, respectively). Ambiq’s Miller says the increased ambient noise levels make it harder to detect voices in far-field testing, hence the rise in active-duty time. 

Nevertheless, Ambiq says its system can run for 409 days on two AAA batteries. That estimate is based on 2500mWh from the batteries, a 92% power conversion factor, and the usage model similar to the one it created above. That last assumption turns out to be critical. 

Reinforcing the standard 90/10 rule, the system spends 90% or more of its time in dormant mode, which consumes only 0.59 mWh, or 10%, of the total power budget. The 2% it spends in intermediate mode consumes another 10%, and the final 8% where it’s actively processing audio eats up almost two-thirds (62%) of the total power. The rest goes to Bluetooth BLE communication with a presumed base station, plus miscellaneous hardware I/O. 

In terms of energy expended per unit of time, BLE is the most power-hungry feature by far, with 0.55 mWh (9% of total) over a scant 6 seconds per day. Wireless communication is, by definition, pouring energy into the air. 

That all sounds pretty good. A consumer device that runs for 13 months or so on a pair of cheap batteries while monitoring voice commands continuously 24/7, with no button presses and no particular need to speak directly into a handset or remote, sounds like an improvement. Wearables would probably have a similar usage profile, though without the need for far-field recognition and with smaller batteries. 

It’s hard to say whether market adoption of voice-activated products is limited by battery life or by some other factor of customer acceptance. But Ambiq has a way to remove at least one variable from that equation. 

One thought on “Voice Activation Gets More Efficient”

  1. “And it can handle far-field (3 meter) activation with just a single microphone, not an array of microphones like other products.”

    This is interesting as it sounds like they have a way to separate voices from background sounds. I wish the article had gone more into how they accomplish this. Efficiency is great but a real awesome breakthrough would be the ability to cleanly separate a given voice from other voices and noise. Amazon would drool over it.

Leave a Reply

featured blogs
Apr 25, 2024
Cadence's seven -year partnership with'¯ Team4Tech '¯has given our employees unique opportunities to harness the power of technology and engage in a three -month philanthropic project to improve the livelihood of communities in need. In Fall 2023, this partnership allowed C...
Apr 24, 2024
Learn about maskless electron beam lithography and see how Multibeam's industry-first e-beam semiconductor lithography system leverages Synopsys software.The post Synopsys and Multibeam Accelerate Innovation with First Production-Ready E-Beam Lithography System appeared fir...
Apr 18, 2024
Are you ready for a revolution in robotic technology (as opposed to a robotic revolution, of course)?...

featured video

MaxLinear Integrates Analog & Digital Design in One Chip with Cadence 3D Solvers

Sponsored by Cadence Design Systems

MaxLinear has the unique capability of integrating analog and digital design on the same chip. Because of this, the team developed some interesting technology in the communication space. In the optical infrastructure domain, they created the first fully integrated 5nm CMOS PAM4 DSP. All their products solve critical communication and high-frequency analysis challenges.

Learn more about how MaxLinear is using Cadence’s Clarity 3D Solver and EMX Planar 3D Solver in their design process.

featured paper

Designing Robust 5G Power Amplifiers for the Real World

Sponsored by Keysight

Simulating 5G power amplifier (PA) designs at the component and system levels with authentic modulation and high-fidelity behavioral models increases predictability, lowers risk, and shrinks schedules. Simulation software enables multi-technology layout and multi-domain analysis, evaluating the impacts of 5G PA design choices while delivering accurate results in a single virtual workspace. This application note delves into how authentic modulation enhances predictability and performance in 5G millimeter-wave systems.

Download now to revolutionize your design process.

featured chalk talk

PolarFire® SoC FPGAs: Integrate Linux® in Your Edge Nodes
Sponsored by Mouser Electronics and Microchip
In this episode of Chalk Talk, Amelia Dalton and Diptesh Nandi from Microchip examine the benefits of PolarFire SoC FPGAs for edge computing applications. They explore how the RISC-V-based Architecture, asymmetrical multi-processing, and Linux-based reference solutions make these SoC FPGAs a game changer for edge computing applications.
Feb 6, 2024
10,707 views