
Deci Unveils DeciLM-7B: A Leap Forward in Language Model Performance and Inference Cost Efficiency

DeciLM-7B sets new performance standards in the large language model (LLM) space, outperforming notable open-source models such as Llama 2 7B and Mistral 7B.

TEL AVIV, Israel, December 12, 2023 — Deci, the deep learning company harnessing AI to build AI, today unveiled the latest addition to its suite of innovative generative AI models: DeciLM-7B, a 7-billion-parameter large language model. Building upon the success of its predecessor, DeciLM-6B, DeciLM-7B sets new benchmarks in the large language model (LLM) space, outperforming prominent open-source models such as Llama 2 7B and Mistral 7B in both accuracy and efficiency.

DeciLM-7B stands out for its unmatched performance, surpassing open-source language models of up to 13 billion parameters in both accuracy and speed while requiring less compute. It achieves a 1.83x and 2.39x increase in throughput over Mistral 7B and Llama 2 7B, respectively, translating to significantly faster processing than competing models. Its compact design makes it ideal for cost-effective GPUs, striking a balance between affordability and high-end performance.

The remarkable performance of DeciLM-7B can be further accelerated when it is used in tandem with Infery-LLM, the world’s fastest inference engine, designed to deliver high-throughput, low-latency, and cost-effective inference on widely available GPUs. This powerful duo sets a new standard in throughput, achieving speeds 4.4 times greater than Mistral 7B served with vLLM, without sacrificing quality. Leveraging DeciLM-7B in conjunction with Infery-LLM enables teams to drastically reduce their LLM compute expenses while benefiting from faster inference. This integration facilitates the efficient scaling of generative AI workloads and supports the transition to more cost-effective hardware.

This synergy enables the efficient serving of multiple clients simultaneously without excessive compute costs or latency issues. This is especially crucial in sectors such as telecommunications, online retail, and cloud services, where the ability to respond to a massive influx of concurrent customer inquiries in real time can significantly enhance user experience and operational efficiency.

Licensed under Apache 2.0, DeciLM-7B is available for use and deployment anywhere, including in local setups, enabling teams to fine-tune it for specific industry applications without compromising data security or privacy. Its versatility allows teams to easily tailor it to unique use cases across a wide range of business applications, including content creation, translation, conversation modeling, data categorization, summarization, sentiment analysis, and chatbot development, among others. When fine-tuned on specific datasets, DeciLM-7B can deliver quality comparable to that of much larger models such as GPT-3.5 at approximately 97% lower cost and with faster inference.
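For teams that want to try the model in a local setup, the following is a minimal sketch of loading and prompting DeciLM-7B with the Hugging Face transformers library. The model identifier "Deci/DeciLM-7B" and the trust_remote_code requirement are assumptions based on how such models are typically distributed, not details stated in this announcement.

```python
# Minimal sketch: loading DeciLM-7B for local text generation with Hugging Face transformers.
# Assumptions: the model is published on the Hugging Face Hub under "Deci/DeciLM-7B"
# and ships a custom architecture that requires trust_remote_code=True.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Deci/DeciLM-7B"  # assumed Hugging Face model identifier

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,   # half precision to fit on a single widely available GPU
    device_map="auto",            # place weights on the available GPU(s); requires accelerate
    trust_remote_code=True,       # assumed: model code is bundled with the checkpoint
)

prompt = "Summarize the key benefits of efficient LLM inference:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

The same checkpoint could then serve as the starting point for fine-tuning on a team's own dataset for the use cases listed above.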

“With the increasing use of Generative AI in various business sectors, there’s a growing demand for models that are not only highly performant but also operationally cost efficient,” said Yonatan Geifman, CEO and co-founder of Deci. “Our latest innovation, DeciLM-7B, combined with Infery-LLM, is a game-changer in this regard. It’s adaptable to diverse settings, including on-premise solutions, and its exceptional inference efficiency makes high-quality large language models more accessible to a wider range of users.”

DeciLM-7B’s cost-effectiveness and reduced computational demand make advanced AI technologies more accessible to businesses of all sizes, fostering innovation and driving digital transformation across various sectors. With DeciLM-7B, companies can now leverage the full potential of AI without the prohibitive costs or complexities previously associated with high-end language models.

Deci’s introduction of DeciLM-7B builds on its track record of innovative and efficient generative AI models, including DeciLM-6B, DeciCoder 1B, and DeciDiffusion 1.0. Like its other models, DeciLM-7B was generated with Deci’s cutting-edge Automated Neural Architecture Construction (AutoNAC) engine, the most advanced Neural Architecture Search (NAS)-based technology on the market, with a focus on efficiency.

To learn more about Deci’s lineup of state-of-the-art models and generative AI offerings, visit www.deci.ai.

About Deci

Deci enables deep learning to live up to its true potential by using AI to build better AI. With the company’s deep learning development platform, AI developers can build, optimize, and deploy faster and more accurate models for any environment, including cloud, edge, and mobile, allowing them to revolutionize industries with innovative products. The platform is powered by Deci’s proprietary Automated Neural Architecture Construction (AutoNAC) technology, which automatically generates and optimizes deep learning model architectures, allowing teams to accelerate inference performance, shorten development time, and reduce computing costs. Founded in 2019, Deci’s team of deep learning engineers and scientists is dedicated to eliminating production-related bottlenecks across the AI lifecycle.
