Startups and traditional rivals alike are pitching more inference-friendly chips as Nvidia focuses on meeting the huge demand from big tech companies for its higher-end hardware. But the same ...
Nvidia is aiming to dramatically accelerate and optimize the deployment of generative AI large language models (LLMs) with a new approach to delivering models for rapid inference. At Nvidia GTC today, ...
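To make "delivering models for rapid inference" concrete, here is a minimal sketch of querying a model served behind an OpenAI-compatible HTTP endpoint, a common packaging pattern for inference services. The endpoint URL, port, and model name are illustrative assumptions, not details from the announcement.

```python
# Minimal sketch: querying an LLM served behind an OpenAI-compatible
# HTTP endpoint, a common packaging for inference services.
# The URL and model name below are illustrative assumptions.
import json
import urllib.request

ENDPOINT = "http://localhost:8000/v1/chat/completions"  # assumed local deployment

payload = {
    "model": "example-llm",  # hypothetical model identifier
    "messages": [{"role": "user", "content": "Summarize GTC in one sentence."}],
    "max_tokens": 64,
}

req = urllib.request.Request(
    ENDPOINT,
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    body = json.load(resp)
    # Response shape assumed to follow the OpenAI chat-completions format.
    print(body["choices"][0]["message"]["content"])
```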
The major cloud builders and their hyperscaler brethren – in many cases, one company acts as both a cloud and a hyperscaler – have made their technology choices when it comes to deploying AI ...
Jim Fan is one of Nvidia's senior AI researchers. The shift could mean many orders of magnitude more compute and energy needed for inference to handle the improved reasoning in the OpenAI ...
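To see why reasoning-style inference could demand orders of magnitude more compute, a back-of-the-envelope sketch helps: generation cost for a dense transformer is commonly approximated as ~2·N FLOPs per token for an N-parameter model, so compute scales roughly linearly with tokens generated. The parameter count and token budgets below are illustrative assumptions.

```python
# Back-of-the-envelope sketch: inference compute vs. tokens generated.
# Uses the common ~2*N FLOPs-per-token approximation for a dense
# N-parameter transformer forward pass; all numbers are illustrative.

PARAMS = 70e9  # assumed 70B-parameter dense model

def inference_flops(tokens_generated: int, params: float = PARAMS) -> float:
    """Approximate forward-pass FLOPs to generate the given tokens."""
    return 2 * params * tokens_generated

direct_answer = inference_flops(200)      # short, one-shot reply
long_reasoning = inference_flops(20_000)  # extended chain-of-thought

print(f"direct answer:  {direct_answer:.2e} FLOPs")
print(f"long reasoning: {long_reasoning:.2e} FLOPs")
print(f"ratio: {long_reasoning / direct_answer:.0f}x")
```

Under these assumed numbers, a long reasoning trace alone is ~100x the compute of a short reply; sampling and ranking many candidate chains would multiply that further, which is where multiple-orders-of-magnitude estimates come from.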