feature article
Subscribe Now

ARMing a New Generation

Altera Announces Processor Architecture for Gen X

The future of FPGA-based processors is coming into focus. Altera just announced that the processor architecture for their upcoming “Generation 10” Stratix FPGA family will be the ARM Cortex-A53. Pretty clear, eh? OK, that’s it. End of article. Move on along. 

What? Still have questions? What does it all mean?

To review, Altera has announced that their next Stratix family (Generation 10) will be built on Intel’s 14nm Tri-Gate (FinFET) process. As we have discussed before, Altera is currently in a race with archrival Xilinx, whose first FinFET FPGAs will be riding in on TSMC’s 16nm FinFET process. Which horse is faster? Intel is widely believed to have superior process technology and has already been shipping 22nm FinFET-based devices. Those points go to Intel. TSMC, on the other hand, has vastly more experience as a merchant fab and has announced that they are working closely with Xilinx to accelerate their FinFET program, in a blitz whose marketing name is “FinFAST.”

At this point, therefore, it is unclear who will be shipping first, (and, except for bragging rights between the two companies, probably few people care.) It is likely that we will not see production devices from either company before 2015, so we are definitely in “future” mode here. It is also unclear how the performance attributes of the two companies’ offerings will stack up. Altera has shown more of their hand thus far, and their predictions are impressive – up to four million LUT-4 equivalent 1GHz programmable fabric, 56Gbps SerDes, better power efficiency, tons-o-RAM – and a high-powered processing subsystem in the SoC version. What’s the processing subsystem look like? That’s why we are gathered here today. 

There was speculation that the architecture might be other-than-ARM since the manufacturer is none-other-than-Intel. As far as we know, Intel hasn’t historically been too keen on manufacturing competing processor architectures. However, two other, more important market forces are at work in this situation. First, Altera has made a huge commitment to the ARM architecture with their current-generation SoC FPGAs. Getting their customers committed to the ARM/FPGA architecture and then jumping ship and forcing them to migrate after only one generation would be a major inconvenience, and it would be a big black eye for Altera. It would have been very unlikely that Altera would have inked the Intel deal knowing that they couldn’t continue their ARM commitment.

Second, Intel is obviously trying to make a go at it in the merchant fab business. If the company had a hard-and-fast policy of never manufacturing a chip with an ARM architecture on board, they’d be severely limiting their market. While Intel has already been building FPGAs for both Tabula and Achronix, getting Altera in their stable is a whole ‘nuther deal. Putting aside petty concerns about processor architecture is a small price to pay for better street cred in the merchant fab business.

So, this week, Altera announced that they are planning a quad-core, 64-bit ARM Cortex-A53 as the SoC engine inside Stratix 10. Given the performance specs of Intel’s 14nm Tri-Gate process, this is likely to be a tiger of a processor. The Cortex-A53 is the architecture used in the “LITTLE” end of ARM’s 64-bit big.LITTLE configuration pair, with the Cortex-A57 as the “big” part. The A53 has a simple, in-order, 8-stage pipeline (compared with the A57’s complex, out-of-order, multi-issue pipeline.) Altera’s choice of the A53 comes down to power. The A53 delivers a crazy level of performance with very high power efficiency. Altera says the A53 will deliver up to 6 times the performance of current generation SoC FPGA processors. While the A57 could deliver more performance, it would be at a severe power penalty. In an SoC FPGA, when you want the big performance, you use hardware acceleration in the FPGA fabric, so the A53 will be able to deliver a lot of processing at a very low power, and it will dovetail nicely with super-fast, hardware-accelerated algorithms that will virtually sip power as well.

The A53 can also run in 32-bit mode – and, in that mode, it will be code-compatible with the A9 processor in Altera’s current SoC FPGAs. That should allow a smooth migration path for current customers, with the option to kick things up to 64-bit power at the cost of a modest software port. It isn’t clear how many customers have developed/deployed solutions based on the Altera/Cortex-A9 so we don’t know how widely this compatibility will be used.

With this much processing power on board, we are seeing a philosophical departure from the FPGA norm. Previous generation FPGA SoCs feel like connectivity devices with a helpful processor on board. With this level of capability, however, what we really have is a true heterogeneous computing device – a high-powered processor with impressive acceleration capabilities in programmable logic fabric. Boosting those capabilities will be Altera’s already-announced hard-wired floating point DSP cores which, when combined with the ARM subsystem, should give these new devices a crazy amount of processing capability.

These days, power efficiency is even more important than computing power. Many systems are form-factor constrained and are already at the limit of the amount of heat that can be extracted or the amount of power that can be supplied. That means increases in computing power have to be accompanied by offsetting improvements in power efficiency. This metric is where these new SoC FPGAs should really shine.

The tricky part of harnessing all that power and efficiency, however, is the programming. Most design teams don’t have the wherewithal to do the complex software/hardware development that would be required to take full advantage of this kind of device using a traditional HDL-based approach for designing the hardware acceleration bits. Altera’s go-to answer for this problem is their OpenCL support, which promises to allow software developers to use the language – popular for GPU applications – to write algorithms that can be accelerated with FPGA fabric. The degree to which this approach is able to take advantage of the capabilities of the upcoming Stratix 10 FPGA SoC has yet to be seen, of course, but Altera has already gained considerable experience with the OpenCL approach on their current-generation devices.

There are still a large number of unknowns about the next generation of FPGA technology, but if the disclosures we’ve seen so far are any indication – 2014/2015 will be a wild ride in programmable logic land.

One thought on “ARMing a New Generation”

  1. Altera announced that their upcoming Stratix 10 SoC FPGAs will feature a quad-core ARM Cortex-A53 processor architecture. What do you see as the killer app for this Intel-fabbed ARM-based heterogeneous computing platform? How will people write software for it?

Leave a Reply

featured blogs
May 5, 2021
New 5G infrastructure is powering smart city projects worldwide; explore the importance of IoT security for smart city solutions in public safety & logistics. The post How 5G Networks Will Accelerate Development of Smart Cities appeared first on From Silicon To Software...
May 4, 2021
What a difference a year can make! Oh, we're not referring to that virus that… The post Realize Live + U2U: Side by Side appeared first on Design with Calibre....
May 3, 2021
As a NASA flight enthusiast, the idea of unmanned aerial vehicle systems (also known as drones) sounds like a lot of fun. A good example of how fun drones can be is through drone racing'¦yes you read that right'¦ drone racing! However, apart from how fun they can be, drones...
May 2, 2021
https://youtu.be/1HEd6JCriCQ Made in Groveland CA (camera Carey Guo) Monday: Package Assembly Design Kits Tuesday: Rapid Adoption of the Arm Server-Class Processors Wednesday: Arm V9A Thursday:... [[ Click on the title to access the full blog on the Cadence Community site. ]...

featured video

The Verification World We Know is About to be Revolutionized

Sponsored by Cadence Design Systems

Designs and software are growing in complexity. With verification, you need the right tool at the right time. Cadence® Palladium® Z2 emulation and Protium™ X2 prototyping dynamic duo address challenges of advanced applications from mobile to consumer and hyperscale computing. With a seamlessly integrated flow, unified debug, common interfaces, and testbench content across the systems, the dynamic duo offers rapid design migration and testing from emulation to prototyping. See them in action.

Click here for more information

featured paper

Compact. Precise. Connected. Increase productivity with intelligent edge computing.

Sponsored by Texas Instruments

Smart devices in factories and buildings are getting smaller and more capable, with enhanced real-time control, robust connectivity, and configurable web services. Read about new processor technology that is unleashing the true potential of Industry 5.0 and the Internet of Things.

Click here to read more

featured chalk talk

AC Protection & Motor Control in HVAC Systems

Sponsored by Mouser Electronics and Littelfuse

The design of HVAC systems poses unique challenges for things like motor control and circuit protection. System performance and reliability are critical, and those come in part from choosing the right components for the job. In this episode of Chalk Talk, Amelia Dalton chats with Ryan Sheahen of Littelfuse about choosing the right components for your next HVAC design.

Click here for more information about Littelfuse AC Protection & Motor Control in HVAC Solutions