Sensory Listens to Your Voice

Today we’ve put up a piece on designing audio subsystems, but there’s more news than that in the audio world. If you read our earlier piece on QuickLogic’s EOS device, and if you were paying attention to details, you might recall a quick mention of a company called Sensory that had partnered with QuickLogic for audio algorithms. Sensory subsequently released a product called TrulyHandsFree, and I connected with them to find out more about who they are and what they do.

In fact, they’ve been at this game for 21 years, so they’re not newcomers. They even sold (and still sell) neural-network-based chips with their algorithms, but their current focus is the algorithms themselves, sold as IP. They have both software and hardware IP (the latter of which featured on the QuickLogic part).

One of their important applications is biometric authentication: using voice as a security mechanism. It’s mostly for verification – given examples of authorized personnel, confirming by your voice that you’re who you say you are. They can also do some limited identification – that is, listening to your voice and coming up with who you are without your giving them any hints as to who you are. If they have, like, 10 people or so to choose from, they can do this. If they have to identify someone amongst thousands, though, they’re not there. (Yet, anyway.)
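The difference between the two modes can be sketched in a few lines. This is a toy illustration, not Sensory's method: the voiceprints are made-up three-number vectors, cosine similarity stands in for whatever scoring the real algorithms use, and the names `ENROLLED`, `verify`, and `identify` are hypothetical.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Made-up "voiceprints" for three enrolled speakers (assumption, for illustration).
ENROLLED = {
    "alice": [0.9, 0.1, 0.3],
    "bob": [0.2, 0.8, 0.5],
    "carol": [0.4, 0.4, 0.9],
}

def verify(claimed_name, sample, threshold=0.95):
    """Verification: one comparison against the claimed speaker's voiceprint."""
    return cosine(ENROLLED[claimed_name], sample) >= threshold

def identify(sample, threshold=0.95):
    """Identification: score every enrolled speaker, return the best match or None."""
    best_name, best_score = max(
        ((name, cosine(voiceprint, sample)) for name, voiceprint in ENROLLED.items()),
        key=lambda pair: pair[1],
    )
    return best_name if best_score >= threshold else None
```

Verification does a single comparison no matter how many people are enrolled; identification has to score everyone, and every added speaker is one more chance for a near miss. That's one plausible reason a pool of ten is manageable while thousands is not.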

They’ve got three levels of product:

  • TrulyHandsFree: This is for low-end consumer products, requiring the least resources to get the job done. Low power, small footprint, always on. Small vocabulary, used for command and control. This is what was incorporated into the QuickLogic part.
  • TrulyNatural: This includes state-of-the-art algorithms for higher-end consumer devices like phones. Can handle a large vocabulary and continuous speech.
  • TrulySecure: This combines audio with video for authentication.

In general, authentication happens through a passphrase (ignoring the video in the last product). It can be a fixed passphrase, but that runs the risk that someone records the authorized person saying the passphrase and then replays it to fool the authentication. It’s better if the system issues random passphrases for the supplicant to utter. Then no one knows ahead of time exactly what will be required to pass.
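Here's a minimal sketch of the random-passphrase idea, under stated assumptions: the word list is invented, and `transcript` and `speaker_matches` are hypothetical inputs that a speech recognizer and a voice matcher would supply. None of this is Sensory's API.

```python
import secrets

# Tiny invented word list; a real system would need a far larger vocabulary.
WORDS = ["amber", "river", "copper", "tiger", "maple", "orbit", "velvet", "signal"]

def issue_challenge(n_words=3):
    """Pick a fresh random phrase for each attempt, so an old recording won't match."""
    return " ".join(secrets.choice(WORDS) for _ in range(n_words))

def check_response(challenge, transcript, speaker_matches):
    """Accept only if the right words were spoken AND the voice matched."""
    same_words = transcript.strip().lower().split() == challenge.split()
    return same_words and speaker_matches
```

With 8 words and 3 words per phrase there are only 512 possible phrases, so this toy list would be far too small in practice. The point is just that both checks have to pass: a replayed recording of a previous phrase gets the voice right but the words wrong.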

Of course, with anything like this, you have to deal with false accepts (unauthorized person gets through) and false rejects (authorized person can’t get through). They actually have a dial that lets them set these rates, and the best balance will depend on the application, weighing the risk of unauthorized entry against the inconvenience (or worse) of not being able to get into your own system. There are no testing standards for this. They always assume that the user has done a reasonable training job, and they then look across a variety of noise and environmental conditions that might affect how the sound is perceived by the algorithms.
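To see how such a dial trades one error for the other, here's a small sketch. It assumes the matcher produces a score where higher means "more likely the authorized speaker"; the scores below are made up for illustration, and `error_rates` is a hypothetical helper, not anything from Sensory.

```python
def error_rates(genuine_scores, impostor_scores, threshold):
    """Return (false_accept_rate, false_reject_rate) at a given threshold."""
    false_accepts = sum(score >= threshold for score in impostor_scores)
    false_rejects = sum(score < threshold for score in genuine_scores)
    return (false_accepts / len(impostor_scores),
            false_rejects / len(genuine_scores))

# Invented match scores: attempts by the authorized user vs. by other people.
genuine = [0.91, 0.85, 0.78, 0.95, 0.66, 0.88, 0.73, 0.97]
impostor = [0.32, 0.55, 0.61, 0.48, 0.70, 0.25, 0.81, 0.44]

for threshold in (0.5, 0.7, 0.9):
    far, frr = error_rates(genuine, impostor, threshold)
    print(f"threshold={threshold:.1f}  FAR={far:.3f}  FRR={frr:.3f}")
```

On this toy data, raising the threshold from 0.5 to 0.9 drives false accepts from half of the impostor attempts down to none, while false rejects climb from none to five of eight genuine attempts. Where to park the dial is the application's call.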

Of course, with small devices, the challenge is power, since you need this system always to be on. They say that, on average, TrulyHandsFree uses about 1 mA of current. Sound detection requires less than 1 MIPS and runs a couple hundred microamps or less. Once triggered, the recognition part runs 1.5 – 2.5 mA. Processing is staged, with each level ramping up as the prior level directs.
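For a feel of how staged processing shapes the average, here's a back-of-envelope model. It's a simplification and not Sensory's accounting: it assumes just two stages, takes 0.2 mA for detection and 2.0 mA (the midpoint of the quoted range) for recognition, and the fractions of time spent recognizing are made up.

```python
def average_current_ma(detect_ma, recognize_ma, recognize_fraction):
    """Time-weighted average current of a two-stage system.

    recognize_fraction is the share of time the recognizer is awake;
    the rest of the time only the sound detector is running.
    """
    return (1 - recognize_fraction) * detect_ma + recognize_fraction * recognize_ma

DETECT_MA = 0.2      # "a couple hundred microamps" from the text
RECOGNIZE_MA = 2.0   # midpoint of the quoted 1.5 to 2.5 mA range

for fraction in (0.05, 0.25, 0.50):
    average = average_current_ma(DETECT_MA, RECOGNIZE_MA, fraction)
    print(f"recognizer awake {fraction:.0%} of the time -> {average:.2f} mA average")
```

The less often the cheap first stage has to wake the expensive one, the closer the average sits to the detection floor.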

[Image: speech graphic, courtesy Sensory]

They do as much processing locally as possible – for example, having a wearable work with a phone to do this if there’s not enough oomph in the wearable. That keeps things working even when there’s no connection, and it’s better for privacy. They can escalate to the cloud for more horsepower if necessary, which works particularly well if the thing being requested requires cloud access anyway.

Their latest announcement has them adding deep learning capabilities to their TrulyHandsFree product. They say that this increases their word accuracy by up to 80% while shrinking the size of their acoustic models by a factor of 10. This also lowers their power consumption to the levels discussed above. You can read more in their announcement.
