feature article
Subscribe Now

ARMing a New Generation

Altera Announces Processor Architecture for Gen X

The future of FPGA-based processors is coming into focus. Altera just announced that the processor architecture for their upcoming “Generation 10” Stratix FPGA family will be the ARM Cortex-A53. Pretty clear, eh? OK, that’s it. End of article. Move on along. 

What? Still have questions? What does it all mean?

To review, Altera has announced that their next Stratix family (Generation 10) will be built on Intel’s 14nm Tri-Gate (FinFET) process. As we have discussed before, Altera is currently in a race with archrival Xilinx, whose first FinFET FPGAs will be riding in on TSMC’s 16nm FinFET process. Which horse is faster? Intel is widely believed to have superior process technology and has already been shipping 22nm FinFET-based devices. Those points go to Intel. TSMC, on the other hand, has vastly more experience as a merchant fab and has announced that they are working closely with Xilinx to accelerate their FinFET program, in a blitz whose marketing name is “FinFAST.”

At this point, therefore, it is unclear who will be shipping first, (and, except for bragging rights between the two companies, probably few people care.) It is likely that we will not see production devices from either company before 2015, so we are definitely in “future” mode here. It is also unclear how the performance attributes of the two companies’ offerings will stack up. Altera has shown more of their hand thus far, and their predictions are impressive – up to four million LUT-4 equivalent 1GHz programmable fabric, 56Gbps SerDes, better power efficiency, tons-o-RAM – and a high-powered processing subsystem in the SoC version. What’s the processing subsystem look like? That’s why we are gathered here today. 

There was speculation that the architecture might be other-than-ARM since the manufacturer is none-other-than-Intel. As far as we know, Intel hasn’t historically been too keen on manufacturing competing processor architectures. However, two other, more important market forces are at work in this situation. First, Altera has made a huge commitment to the ARM architecture with their current-generation SoC FPGAs. Getting their customers committed to the ARM/FPGA architecture and then jumping ship and forcing them to migrate after only one generation would be a major inconvenience, and it would be a big black eye for Altera. It would have been very unlikely that Altera would have inked the Intel deal knowing that they couldn’t continue their ARM commitment.

Second, Intel is obviously trying to make a go at it in the merchant fab business. If the company had a hard-and-fast policy of never manufacturing a chip with an ARM architecture on board, they’d be severely limiting their market. While Intel has already been building FPGAs for both Tabula and Achronix, getting Altera in their stable is a whole ‘nuther deal. Putting aside petty concerns about processor architecture is a small price to pay for better street cred in the merchant fab business.

So, this week, Altera announced that they are planning a quad-core, 64-bit ARM Cortex-A53 as the SoC engine inside Stratix 10. Given the performance specs of Intel’s 14nm Tri-Gate process, this is likely to be a tiger of a processor. The Cortex-A53 is the architecture used in the “LITTLE” end of ARM’s 64-bit big.LITTLE configuration pair, with the Cortex-A57 as the “big” part. The A53 has a simple, in-order, 8-stage pipeline (compared with the A57’s complex, out-of-order, multi-issue pipeline.) Altera’s choice of the A53 comes down to power. The A53 delivers a crazy level of performance with very high power efficiency. Altera says the A53 will deliver up to 6 times the performance of current generation SoC FPGA processors. While the A57 could deliver more performance, it would be at a severe power penalty. In an SoC FPGA, when you want the big performance, you use hardware acceleration in the FPGA fabric, so the A53 will be able to deliver a lot of processing at a very low power, and it will dovetail nicely with super-fast, hardware-accelerated algorithms that will virtually sip power as well.

The A53 can also run in 32-bit mode – and, in that mode, it will be code-compatible with the A9 processor in Altera’s current SoC FPGAs. That should allow a smooth migration path for current customers, with the option to kick things up to 64-bit power at the cost of a modest software port. It isn’t clear how many customers have developed/deployed solutions based on the Altera/Cortex-A9 so we don’t know how widely this compatibility will be used.

With this much processing power on board, we are seeing a philosophical departure from the FPGA norm. Previous generation FPGA SoCs feel like connectivity devices with a helpful processor on board. With this level of capability, however, what we really have is a true heterogeneous computing device – a high-powered processor with impressive acceleration capabilities in programmable logic fabric. Boosting those capabilities will be Altera’s already-announced hard-wired floating point DSP cores which, when combined with the ARM subsystem, should give these new devices a crazy amount of processing capability.

These days, power efficiency is even more important than computing power. Many systems are form-factor constrained and are already at the limit of the amount of heat that can be extracted or the amount of power that can be supplied. That means increases in computing power have to be accompanied by offsetting improvements in power efficiency. This metric is where these new SoC FPGAs should really shine.

The tricky part of harnessing all that power and efficiency, however, is the programming. Most design teams don’t have the wherewithal to do the complex software/hardware development that would be required to take full advantage of this kind of device using a traditional HDL-based approach for designing the hardware acceleration bits. Altera’s go-to answer for this problem is their OpenCL support, which promises to allow software developers to use the language – popular for GPU applications – to write algorithms that can be accelerated with FPGA fabric. The degree to which this approach is able to take advantage of the capabilities of the upcoming Stratix 10 FPGA SoC has yet to be seen, of course, but Altera has already gained considerable experience with the OpenCL approach on their current-generation devices.

There are still a large number of unknowns about the next generation of FPGA technology, but if the disclosures we’ve seen so far are any indication – 2014/2015 will be a wild ride in programmable logic land.

One thought on “ARMing a New Generation”

  1. Altera announced that their upcoming Stratix 10 SoC FPGAs will feature a quad-core ARM Cortex-A53 processor architecture. What do you see as the killer app for this Intel-fabbed ARM-based heterogeneous computing platform? How will people write software for it?

Leave a Reply

featured blogs
Sep 20, 2021
The Cadence Scholarship Program is the flagship CSR program of Cadence India, introduced five years ago. Many meritorious students from under-served sections of society drop out of the education... [[ Click on the title to access the full blog on the Cadence Community site. ...
Sep 18, 2021
Projects with a steampunk look-and-feel incorporate retro-futuristic technology and aesthetics inspired by 19th-century industrial steam-powered machinery....
Sep 15, 2021
Learn how chiplets form the basis of multi-die HPC processor architectures, fueling modern HPC applications and scaling performance & power beyond Moore's Law. The post What's Driving the Demand for Chiplets? appeared first on From Silicon To Software....
Aug 5, 2021
Megh Computing's Video Analytics Solution (VAS) portfolio implements a flexible and scalable video analytics pipeline consisting of the following elements: Video Ingestion Video Transformation Object Detection and Inference Video Analytics Visualization   Because Megh's ...

featured video

Accurate Full-System Thermal 3D Analysis

Sponsored by Cadence Design Systems

Designing electronics for the data center challenges designers to minimize and dissipate heat. Electrothermal co-simulation requires system components to be accurately modeled and analyzed. Learn about a true 3D solution that offers full system scalability with 3D analysis accuracy for the entire chip, package, board, and enclosure.

Click here for more information about Celsius Thermal Solver

featured paper

Configure the charge and discharge current separately in a reversible buck/boost regulator

Sponsored by Maxim Integrated (now part of Analog Devices)

The design of a front-end converter can be made less complicated when minimal extra current overhead is required for charging the supercapacitor. This application note explains how to configure the reversible buck/boost converter to achieve a lighter impact on the system during the charging phase. Setting the charge current requirement to the minimum amount keeps the discharge current availability intact.

Click to read more

featured chalk talk

Thunderbolt Technology Overview

Sponsored by Mouser Electronics and Intel

Thunderbolt is the closest thing we’ve got to universal interconnect between a wide variety of devices and systems. With a universal USB-C connector, it can do video, power, data communication - all at scalable rates with smart adjustment. In this episode of Chalk Talk, Amelia Dalton chats with Sandeep Vedanthi of Intel about the latest in Thunderbolt technology - Thunderbolt 4, which brings a number of benefits over previous versions.

Click here for more information about Intel 8000 series Thunderbolt™ 4 Controllers