Cache Memory Microprocessor System Design

Nvidia shrinks LLM memory 20x without changing model weights

Nvidia's KV Cache Transform Coding (KVTC) compresses LLM key-value cache by 20x without model changes, cutting GPU memory costs and time-to-first-token by up to 8x for multi-turn AI applications.

EDN

Last-level cache has become a critical SoC design element

LLC, positioned between external memory and internal subsystems, stores frequently accessed data close to compute resources.

14d

SOCAMM2 Is The Memory Standard AI Is Looking For

AI infrastructure can't evolve as fast as model innovation. Memory architecture is one of the few levers capable of ...

The Manila Times

Supermicro Reveals DCBBS® with New NVIDIA Vera Rubin NVL72, HGX Rubin NVL8, and Vera CPU Systems, Designed to Accelerate Customer Time-to-Market

Supermicro's NVIDIA Vera Rubin NVL72 and HGX Rubin NVL8 systems are built on the DCBBS liquid-cooling stack, targeting up to ...

Marvell Launches Next-generation CXL Switch, Enabling Memory Pooling to Break Through the AI "Memory Wall"

Marvell Technology, Inc. (NASDAQ: MRVL), a leader in data infrastructure semiconductor solutions, today announced Marvell® ...

Intel: Left Out Of Nvidia's GTC CPU Roadmap, Left Behind In AI

Intel faces mounting execution risks as Nvidia's GTC 2026 announcements deepen competitive threats in CPU-based AI compute.

New memory architecture targets AI inference bottlenecks

Lightbits Labs Ltd. today is introducing a new architecture aimed at addressing one of the most stubborn bottlenecks in large ...

1don MSN

New Nvidia AI chip design raises questions over HBM demand: Report

Seoul [South Korea], March 16 (ANI): Nvidia may unveil a new artificial intelligence inference chip architecture built around on-chip static random access memory, or SRAM, at the Nvidia GTC 2026 ...

Synopsys Reveals Agentic AI To Speed Next-Gen Chip And System Designs

At its Synopsys Converge event currently underway in Santa Clara, the company announced an array of tools and initiatives to ...

19h

Nvidia BlueField-4 STX adds a context memory layer to storage to close the agentic AI throughput gap

Nvidia's BlueField-4 STX reference architecture inserts a dedicated context memory layer between GPUs and traditional storage ...

10d

Intel Panther Lake-H High-Res Die Shots Reveal 18A CPU Design

Its Core Ultra 200V "Lunar Lake" processors offered a great blend of CPU compute, GPU horsepower, and excellent power efficiency, and the latest Core Ultra 300 "Panther Lake" chips continue that trend ...

Electronic Specifier

What new products did NVIDIA announce at GTC?

NVIDIA has used its latest GTC keynote to lay out a vision for the future of AI infrastructure, unveiling the new Vera Rubin ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results