Menu

Search

  |   Business

Menu

  |   Business

Search

Nvidia to Launch New AI Inference Processor to Boost OpenAI Performance

Nvidia to Launch New AI Inference Processor to Boost OpenAI Performance. Source: Martijn Boer, Public domain, via Wikimedia Commons

Nvidia is preparing to introduce a new processor aimed at accelerating AI inference computing, a critical component in delivering faster and more efficient artificial intelligence systems for companies like OpenAI. According to a Wall Street Journal report citing sources familiar with the matter, the new AI hardware platform will be officially unveiled at Nvidia’s upcoming GTC developer conference in San Jose next month.

The new system is specifically designed for inference computing, the process that enables AI models such as ChatGPT to generate responses to user queries in real time. As demand for generative AI applications continues to surge, improving inference speed and efficiency has become a top priority for AI developers and cloud service providers.

The report indicates that Nvidia’s upcoming platform will incorporate a chip designed by AI startup Groq. This collaboration highlights Nvidia’s strategy to strengthen its position in the AI chip market while addressing customer concerns about performance bottlenecks. Reuters noted that it could not independently verify the Wall Street Journal’s claims, and both Nvidia and OpenAI have not issued official comments.

Earlier this month, Reuters reported that OpenAI has expressed concerns about the speed of Nvidia’s existing hardware, particularly for tasks such as software development and AI-to-software communication. Sources said OpenAI is seeking next-generation hardware that could eventually support roughly 10% of its inference computing needs.

OpenAI has reportedly explored partnerships with chip startups including Cerebras and Groq to secure faster inference solutions. However, Nvidia signed a $20 billion licensing agreement with Groq, effectively halting OpenAI’s independent discussions with the startup.

In addition, Nvidia previously announced plans to invest up to $100 billion in OpenAI as part of a broader agreement that granted the chipmaker an equity stake while providing OpenAI with capital to purchase advanced AI chips. The move underscores Nvidia’s growing influence in the AI infrastructure and semiconductor market.

  • Market Data
Close

Welcome to EconoTimes

Sign up for daily updates for the most important
stories unfolding in the global economy.