STM32 MCUs pair with Sensory’s VoiceHub technology to streamline development of voice-based user interfaces on wearables, IoT, and smart-home applications
STMicroelectronics, a global semiconductor leader serving customers across the spectrum of electronics applications, and Sensory Inc., a leading provider of embedded speech recognition technology and an ST Authorized Partner, have announced a collaboration that will enable the STM32 microcontroller (MCU) user community to develop and prototype intuitive voice-based user interfaces for a wide range of smart embedded products.
The joint effort pairs ST’s STM32 hardware and software with Sensory’s voice-control technologies, including the new VoiceHub online portal, which supports the creation of embedded speech-recognition models using custom wake words, voice-control command sets, and large natural-language grammars in almost twenty languages and dialects.
The solution is based on an STM32Cube software extension package and runs on a high-performance STM32H7 MCU, taking advantage of its architecture, internal Flash, SRAM, and high CPU speed. This combination plays a key role in increasing voice-control accuracy and minimizing command-recognition time. Hosting the voice application and speech models in the generous on-chip memory of the high-performance STM32 MCUs further improves system integration and ease of use, and lowers cost of ownership.
“This collaboration sets to jump-start the development of embedded-voice user interfaces, adding friction-free command control and custom wake word to any device, from wearables to smart-home appliances,” said Ricardo De Sa Earp, Executive Vice President, General-Purpose Microcontroller Sub-Group Vice President, STMicroelectronics. “The unique combination of ST and Sensory technologies will enable the STM32 user community to deploy ‘Voice AI on the edge’ without any programming, data-science, or machine-learning expertise, for free in prototypes and with favorable licensing terms in production.”
“Sensory designed our VoiceHub so developers could quickly and painlessly create custom speech-recognition models. However, after creating a custom model, integrating the model onto hardware and moving to licensing terms were the next hurdles that needed to be cleared,” said Todd Mozer, CEO, Sensory. “This world-class collaboration with ST creates a complete software, hardware, and licensing package for embedded speech recognition across the STM32 family and makes adding Voice UIs simple.”
ST’s new software package dedicated to voice-user interfaces is available at:
https://www.st.com/en/embedded-software/x-cube-localvui
STM32 Local voice user interface expansion package
Description
X-CUBE-LocalVUI implements local voice recognition user interfaces based on audio capture and speech recognition. It integrates the Sensory TrulyHandsfree™ (THF) and TrulyNatural™ (TNL) software, each with VoiceHub-generated vocabulary. The audio capture is based on STM32 peripherals and middleware, and the package shows how to capture audio from the board microphone through the serial audio interface (SAI). The default implementation of the speech recognition is a home automation solution, but it can be adapted to any other UI a user may need. The speech recognition benefits from the VoiceHub web tool, which generates new vocabulary models and software that run locally on the STM32 microcontroller. X-CUBE-LocalVUI provides an implementation on an STM32H747I-DISCO Discovery kit. It can be ported to other STM32 microcontrollers and boards with audio features. THF, TNL, and VoiceHub are products from Sensory, an authorized STMicroelectronics partner.
All features
- Vocabulary model built with Sensory VoiceHub
- Sensory TrulyHandsfree™ (THF) software component
- Sensory TrulyNatural™ (TNL) software component
- Cloudless voice user interface (UI)
- Audio capture from board microphone
- Integration of automatic speech recognition (ASR) software
- Capability to build and integrate a customized voice UI vocabulary model
- Support for wake word detection only, or wake word and command mode
- Detected command logged on Virtual COM port
- Possibility to connect the board as a USB device to record the microphone audio capture
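The two recognition modes listed above (wake word only, or wake word followed by a command) can be pictured as a small state machine. The following is a minimal sketch in plain C, not the X-CUBE-LocalVUI implementation: every name in it (`vui_ctx_t`, `vui_handle_event`, the event and mode enums) is hypothetical, the recognizer events are assumed to come from the Sensory libraries whose API is not described here, and the Virtual COM port is replaced by `printf`.

```c
#include <assert.h>
#include <stddef.h>
#include <stdio.h>

/* Hypothetical recognizer events. In a real build these would be derived
 * from the Sensory THF/TNL libraries; their API is not shown in this text. */
typedef enum { EV_NONE, EV_WAKE_WORD, EV_COMMAND, EV_TIMEOUT } vui_event_t;

/* The two modes named in the feature list. */
typedef enum { MODE_WAKE_WORD_ONLY, MODE_WAKE_WORD_AND_COMMAND } vui_mode_t;

typedef enum { STATE_WAIT_WAKE_WORD, STATE_WAIT_COMMAND } vui_state_t;

typedef struct {
    vui_mode_t  mode;
    vui_state_t state;
    int         log_count;     /* number of lines logged so far */
    char        last_log[64];  /* last line "sent" to the Virtual COM port */
} vui_ctx_t;

void vui_init(vui_ctx_t *ctx, vui_mode_t mode)
{
    assert(ctx != NULL);
    ctx->mode = mode;
    ctx->state = STATE_WAIT_WAKE_WORD;
    ctx->log_count = 0;
    ctx->last_log[0] = '\0';
}

/* Stand-in for a write to the Virtual COM port (assumption: a real port
 * would use a UART or USB CDC transmit call instead of printf). */
static void vui_log(vui_ctx_t *ctx, const char *prefix, const char *text)
{
    snprintf(ctx->last_log, sizeof ctx->last_log, "%s%s",
             prefix, text ? text : "");
    ctx->log_count++;
    printf("%s\n", ctx->last_log);
}

/* Advance the state machine by one recognizer event. */
void vui_handle_event(vui_ctx_t *ctx, vui_event_t ev, const char *text)
{
    assert(ctx != NULL);
    switch (ctx->state) {
    case STATE_WAIT_WAKE_WORD:
        /* Commands spoken before the wake word are ignored. */
        if (ev == EV_WAKE_WORD) {
            vui_log(ctx, "WAKE: ", text);
            if (ctx->mode == MODE_WAKE_WORD_AND_COMMAND) {
                ctx->state = STATE_WAIT_COMMAND;
            }
        }
        break;
    case STATE_WAIT_COMMAND:
        if (ev == EV_COMMAND) {
            vui_log(ctx, "CMD: ", text);
            ctx->state = STATE_WAIT_WAKE_WORD;
        } else if (ev == EV_TIMEOUT) {
            /* No command heard in time: go back to listening. */
            ctx->state = STATE_WAIT_WAKE_WORD;
        }
        break;
    }
}
```

In this sketch, wake-word-only mode never leaves the first state, so each detection is simply logged. In the combined mode, a command is accepted only in the window after a wake word, and the timeout event (an assumption of this sketch) returns the interface to listening.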
Video demonstration:
http://mpvideo.qpic.cn/0bc3pqaacaaavqaljdwxuzqva7gdaf6aaaia.f10002.mp4?
http://mpvideo.qpic.cn/0bc3haadmaaajaaaf3wau5qvaogdgy4aanqa.f10002.mp4?
---
About Sensory
Sensory Inc. creates a safer and superior UX through vision and voice technologies. Sensory’s technologies are widely deployed in consumer electronics applications including mobile phones, automotive, wearables, toys, IoT, PCs, medical products and various home electronics. Sensory’s product line includes TrulyHandsfree voice control, TrulySecure biometric authentication, and TrulyNatural large vocabulary natural language embedded speech recognition. Sensory’s technologies have shipped in over three billion units of leading consumer products.
About STMicroelectronics
At ST, we are 48,000 creators and makers of semiconductor technologies mastering the semiconductor supply chain with state-of-the-art manufacturing facilities. An integrated device manufacturer, we work with more than 200,000 customers and thousands of partners to design and build products, solutions, and ecosystems that address their challenges and opportunities, and the need to support a more sustainable world. Our technologies enable smarter mobility, more efficient power and energy management, and the wide-scale deployment of the Internet of Things and connectivity. ST is committed to becoming carbon neutral by 2027. Further information can be found at www.st.com.