feature article
Subscribe Now

Intel Acquires Omnitek

FPGA AI Race Heats Up

Following on the recent announcement of the new Agilex FPGA family, Intel announced they are acquiring Omnitek – a developer of video and vision acceleration IP for FPGAs. As the market for FPGA-powered acceleration heats up, one of the killer applications is video and machine vision. FPGAs are extremely well suited to video analytics, as the combination of acceleration capabilities and flexible handling of high bit-rate data streams, plus the adaptability to various application-specific parameters is a near ideal fit.

Omnitek is a UK-based company with about 40 employees that has been in business for twenty years. They focus on IP and services for the design of video and vision systems, based on FPGAs and SoCs. They specialize in high-performance video/vision and AI/machine-learning for markets including medical, broadcast, professional AV, automotive, government homeland security, aerospace/defense, industrial/scientific, consumer, and test & measurement.  The company also manufactures video test & measurement equipment to complement their IP and services offerings.

Omnitek is being rolled into the programmable systems group (PSG, which is the former Altera team) at Intel. The combination makes sense, as video and vision acceleration with machine learning is clearly one of the key battlegrounds for Intel PSG in their ongoing rivalry with Xilinx. Xilinx has made strides in repositioning itself to go head-to-head with Intel in the data center acceleration market with their “ACAP” FPGA offering, their Alveo accelerator cards, and a rich portfolio of IP. Xilinx also bolstered their offering with the acquisition of DeePhi Tech, a  Beijing-based start-up with capabilities in machine learning built around Xilinx devices.

For Intel, the Omnitek acquisition is a strategic move, strengthening the company’s own machine-learning portfolio. while effectively blocking Xilinx from taking full advantage of Omnitek’s offering. Omnitek has developed over 220 FPGA IP cores, as well as accompanying software for video-related applications. In the battle to capture the rapidly growing intelligent video acceleration market, FPGA companies will need to engage customers who do not have experienced teams of FPGA designers, so ready-baked optimized IP, reference designs, software, and services will make the advantages of FPGAs available to a much larger set of customers and applications.

At almost the same time, Omnitek announced the availability a new IP offering for convolutional neural networks (CNNs), “delivering world-leading performance per watt at full FP32 accuracy” with the Intel Arria 10 GX FPGA. The Omnitek Deep Learning Processing Unit (DPU) achieves 135 GOPS/W at 32-bit floating point accuracy when running the VGG-16 CNN in an Arria 10 GX 1150. The design “employs a novel mathematical framework combining low-precision fixed point maths with floating point maths to achieve this very high compute density with zero loss of accuracy.”

The DPU is scalable across both Intel’s Arria 10 GX and Stratix 10 GX devices, and it can be tuned for either low cost or high performance for either embedded or data center use. Omnitek says the DPU is “fully software programmable in C/C++ or Python using standard frameworks such as TensorFlow, enabling it to be configured for a wide range of standard CNN models including GoogLeNet, ResNet-50 and VGG-16 as well as custom models.”

With this acquisition, Intel gets a strong boost in IP, but, more importantly a big influx of talent – engineers already expert in creating applications based on Intel FPGAs. Intel’s strategy in FPGA appears to be to deploy as many ready-made applications as possible into high-growth markets, and to have as many of those solutions as possible already bundled into Intel hardware solutions. This takes advantage of the company’s incredible breadth of technology as well as capitalizing on their dominance in the data center. This approach creates a formidable barrier to companies such as NVidia and Xilinx, who aim to make a living selling into what amounts to an Intel-owned ecosystem. The fortification of Intel’s FPGA offering – in hardware, IP, and software – raises the bar for what those competitors will have to do in order to get a foothold in the acceleration part of those systems.

Looking at just the AI acceleration market in the data center, NVidia has created a decent business selling GPUs as acceleration engines for both training and inference use in data centers. The problem with GPU-based solutions is that they do well in the acceleration department but don’t provide much benefit in the performance-per-watt department. FPGA-based accelerators are enormously more power efficient, but they have traditionally had a steep development curve to get them programmed optimally.

It appears that Intel is taking a multi-pronged approach to squeeze NVidia out of this space. First, Intel has significantly improved the performance of their Xeon processors for AI inferencing tasks. With something like a 30x inferencing upgrade announced in their recent release, the baseline for applications that would even need acceleration is raised significantly. After all, if you’ve already got Xeon-based servers sitting there and they can handle your particular AI task, why go for acceleration at all?

Then, for those who still need acceleration, there are many competitors in the market, including NVidia with their GPUs, companies such as Xilinx and Achronix with third-party FPGA solutions, and Intel, themselves, with their own range of accelerators including FPGAs from PSG. In many situations, Intel and their OEMs are packaging FPGAs into servers, cards, and even the same packages as Xeon processors. With Intel’s FPGA hardware already sitting in the system, the argument for bringing in third-party accelerators is made even more difficult.

Regardless of the competitive landscape, several things are clear. First, FPGAs will play a much larger role than ever in the enormous opportunities emerging in and out of the data center with the explosion of data generated by the latest generations of cameras, sensors, and assorted IoT devices. The challenge of processing and moving all that data is a perfect match for FPGAs, but FPGA companies must be creative in their approach to reducing the learning curve for engineering teams wanting to take advantage of their capabilities. By pre-engineering IP, reference designs, software, and entire applications – in addition to radically upgrading development tool suites – FPGA suppliers can enormously expand their customer base and the number of systems in which they are deployed. We are clearly at the doorstep of the biggest opportunity for FPGA market growth in history, but that growth will not occur without some serious innovation in the ecosystem. It will be interesting to watch.

Leave a Reply

featured blogs
May 16, 2021
https://youtu.be/_wup2MSTVks Made on Communication Hill, San Jose (camera Carey Guo) Monday: Intel eASIC: Linley and DARPA Tuesday: Please Excuse the Mesh: CFD and Pointwise Wednesday: Linley:... [[ Click on the title to access the full blog on the Cadence Community site. ]]...
May 13, 2021
Samtec will attend the PCI-SIG Virtual Developers Conference on Tuesday, May 25th through Wednesday, May 26th, 2021. This is a free event for the 800+ member companies that develop and bring to market new products utilizing PCI Express technology. Attendee Registration is sti...
May 13, 2021
Our new IC design tool, PrimeSim Continuum, enables the next generation of hyper-convergent IC designs. Learn more from eeNews, Electronic Design & EE Times. The post Synopsys Makes Headlines with PrimeSim Continuum, an Innovative Circuit Simulation Solution appeared fi...
May 13, 2021
By Calibre Design Staff Prior to the availability of extreme ultraviolet (EUV) lithography, multi-patterning provided… The post A SAMPle of what you need to know about SAMP technology appeared first on Design with Calibre....

featured video

Industry’s First USB4 Silicon Success

Sponsored by Synopsys

USB4 offers up to 40Gbps speeds for incredibly fast connections. Join Synopsys to see the first demonstration of USB4 IP in silicon, along with real TX eyes for DesignWare USB4, DisplayPort, and USB 3.x IP.

Click here for more information about DesignWare USB4 IP

featured paper

Optimizing an OpenCL AI Kernel for the data center using Silexica’s SLX FPGA

Sponsored by Silexica

AI applications are increasingly contributing to FPGAs being used as co-processors in data centers. Silexica's newest application note shows how SLX FPGA accelerates an AI-related face detection design example, leveraging the bottom-up flow of Xilinx’s Vitis 2020.2 and Alveo U280 accelerator card.

Click to read

Featured Chalk Talk

Intel NUC Elements

Sponsored by Mouser Electronics and Intel

Intel Next Unit of Computing (NUC) compute elements are small-form-factor barebone computer kits and components that are perfect for a wide variety of system designs. In this episode of Chalk Talk, Amelia Dalton chats with Kristin Brown of Intel System Product Group about pre-engineered solutions from Intel that can provide the appropriate level of computing power for your next design, with a minimal amount of development effort from your engineering team.

Click here for more information about Intel NUC 8 Compute Element (U-Series)