industry news
Subscribe Now

Alif Semiconductor elevates generative AI at the edge with new support for ExecuTorch Runtime in its Ensemble MCUs

  • The Ensemble E4/E6/E8 are the industry’s first MCUs to provide hardware acceleration for transformer networks, enabling local generative AI inference on edge and endpoint devices
  • Together with Arm, Alif demonstrated at the PyTorch Conference generative AI models compiled with the ExecuTorch Runtime running on the Ensemble E8 

San Francisco, CA, October 30, 2025 – Alif Semiconductor®, the leading global supplier of secure, connected, power efficient Artificial Intelligence and Machine Learning (AI/ML) microcontrollers (MCUs) and fusion processors, today announced that developers can now use the ExecuTorch Runtime, a quantization extension of the popular PyTorch ML framework, for AI applications built to run on its Ensemble E4/E6/E8 series of MCUs and fusion processors. 

The ExecuTorch extension enables developers of AI models aimed at resource-constrained hardware targets such as MCUs to use the PyTorch framework to create lightweight models which support low-latency, high-accuracy inference. Since the Ensemble E4, E6 and E8 products include the Arm® Ethos™-U85 NPU, which supports transformer-based ML networks, developers can now use PyTorch and the ExecuTorch Runtime to build generative AI applications to run locally in battery-powered endpoint devices such as smart glasses, human-to-computer interfacing, healthcare and diagnostics, robotics, transportation, toys and education, smart homes, and smart city equipment. 

At the PyTorch Conference (Moscone West, San Francisco CA, 22-23 October), Arm demonstrated examples of generative AI applications compiled with ExecuTorch and running on Alif’s Ensemble E8 fusion processor, including:  

  • A small language model running on the Ensemble E8, capable of generating stories suitable for children in response to visual prompts, and extendable to verbal prompts.
  • Real-time on-device speech-to-text models, suitable for integration into wearables such as smart glasses. The demo performs real-time transcription of speech to enable live captioning. 

Reza Kazerounian, President of Alif Semiconductor, said: “Alif’s continuous innovation keeps us at the forefront of edge AI, now extending into the era of generative AI. With native support for the ExecuTorch Runtime on our Ensemble microcontrollers, we’re expanding what’s possible for AI at the MCU level. For embedded device manufacturers building next-generation intelligent products, Alif remains the trusted leader in bringing AI to the edge.”

Paul Williamson, Senior Vice President and General Manager, IoT Business at Arm  said: “Generative AI at the edge is enabling a new class of intelligent, battery-powered devices that can understand and respond in real time. Using the ExecuTorch framework, developers can efficiently deploy PyTorch models on Alif’s Ensemble MCUs for low-power, on-device inference. This collaboration demonstrates how a unified ecosystem can scale intelligence across the edge, accelerating real-world AI innovation.”

The Alif DK-E8 development board used in the PyTorch Conference demonstrations, which supports development on the entire E4/E6/E8 series, is available to purchase now. 

To learn more about the Ensemble E4, E6, and E8 GenAI MCUs, visit alifsemi.com or email contact@alifsemi.com.

About Alif Semiconductor: 

Alif Semiconductor is the industry-leading supplier of next-generation secure AI/ML-enabled 32-bit microcontrollers. Since 2019, Alif’s expanding offering of microcontrollers and fusion processors has been revolutionizing the way developers can create broad, scalable, and connected AI-enabled embedded applications that are genuinely power efficient. Alif Semiconductor is the only choice for power efficient microcontrollers that can handle heavy AI and ML workloads for battery-operated IoT devices. For more information go to  https://alifsemi.com/.

Leave a Reply

featured blogs
Mar 9, 2026
What happens to our digital history when the world's biggest archive of retro video games disappears?...

featured video

Cadence Chiplets Solutions | Helping you realize your chiplet ambitions

Sponsored by Cadence Design Systems

In this webinar, David Glasco, VP of Compute Solutions at Cadence, discusses how Cadence enables customers to transition from traditional monolithic SoC architectures to modular, scalable chiplet-based solutions, essential for meeting the growing demands of physical AI applications and high-performance computing.

Read eBook: Helping You Realize Your Chiplet Ambitions

featured chalk talk

Global Coverage With NTN
In this episode of Chalk Talk, Paul Fadlovich from TE Connectivity and Martin Lesund from Nordic Semiconductor and Amelia Dalton explore the what, why and how of NTN technology. They also explore the role that antennas play in satellite communication systems, and how Nordic Semiconductor’s nRF9151 System-in-Package and TE Connectivity’s broad range of antenna solutions can jump start your next global IoT design.
Feb 19, 2026
24,711 views