Nvidia's KV Cache Transform Coding (KVTC) compresses the LLM key-value (KV) cache by 20x without model changes, cutting GPU memory costs and reducing time-to-first-token by up to 8x for multi-turn AI applications.
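The announcement does not detail KVTC's algorithm, but the general idea of transform coding (project data onto a basis that concentrates its energy, then quantize the coefficients) can be sketched generically. The sketch below is a hypothetical illustration using PCA-style SVD and int8 quantization on a synthetic low-rank KV tensor, not NVIDIA's actual method:

```python
import numpy as np

# Hypothetical transform-coding sketch, NOT NVIDIA's KVTC pipeline:
# project cached key/value vectors onto an orthonormal basis (SVD),
# keep the leading components, and quantize them to int8.

rng = np.random.default_rng(0)

# Synthetic KV cache: 1024 cached tokens, head dimension 128, with low
# intrinsic rank so a transform can concentrate energy in few components.
tokens, dim, rank = 1024, 128, 16
kv = rng.standard_normal((tokens, rank)) @ rng.standard_normal((rank, dim))

# Transform step: orthonormal basis from the SVD of the cache itself.
_, _, vt = np.linalg.svd(kv, full_matrices=False)
keep = 16                        # retained transform components
coeffs = kv @ vt[:keep].T        # (tokens, keep) coefficients

# Coding step: uniform int8 quantization of the coefficients.
scale = np.abs(coeffs).max() / 127.0
q = np.round(coeffs / scale).astype(np.int8)

# Decode: dequantize, then invert the transform.
kv_hat = (q.astype(np.float64) * scale) @ vt[:keep]

# Compare stored bytes (quantized coeffs + basis + scale) to the original.
ratio = kv.nbytes / (q.nbytes + vt[:keep].nbytes + 8)
err = np.linalg.norm(kv - kv_hat) / np.linalg.norm(kv)
print(f"compression ratio ~{ratio:.1f}x, relative error {err:.4f}")
```

On this synthetic low-rank input the sketch reaches a large compression ratio with small reconstruction error; real attention caches are not exactly low-rank, which is why a production scheme needs a carefully chosen transform and rate control.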
Intel faces mounting execution risks as Nvidia's GTC 2026 announcements deepen competitive threats in CPU-based AI compute. Intel's limited role in Nvidia's Vera CPU roadmap and delays in its custom ...
Marvell Technology, Inc. (NASDAQ: MRVL), a leader in data infrastructure semiconductor solutions, today announced Marvell® ...
(NASDAQ: SMCI), a Total IT Solution Provider for Cloud Computing, AI/ML, Storage, and 5G/Edge, today unveiled its upcoming system portfolio powered by the NVIDIA Vera Rubin platform. As data centers ...
Deal summary: Gaming enthusiasts can secure $550 off the Alienware Aurora desktop featuring NVIDIA’s RTX 5060 Ti graphics card and Intel’s latest Core Ultra 7 processor. The combination of 32GB DDR5 ...
NVIDIA has used its latest GTC keynote to lay out a vision for the future of AI infrastructure, unveiling the new Vera Rubin ...
This release targets developers building long-context applications or real-time reasoning agents, and those seeking to reduce GPU costs in high-volume production environments.
New, industry-first hardware-assisted test automation capabilities enable faster, earlier detection of cache-coherency and subsystem-level bugs for maximum coverage ...
“Vera Rubin is a generational leap — seven breakthrough chips, five racks, one giant supercomputer,” said NVIDIA CEO Jensen Huang.
NVIDIA today announced the NVIDIA Vera Rubin platform is opening the next frontier of agentic AI, with seven new chips now in full production. They are: the NVIDIA Vera CPU, NVIDIA Rubin GPU, NVIDIA ...
(NASDAQ: SMCI), a Total IT Solution Provider for AI, Cloud, Storage, and 5G/Edge, today unveiled one of the industry's first context memory (CMX) storage servers as part of NVIDIA STX reference ...
The platform combines CPUs, GPUs, networking, interconnect, and data processing technologies into a unified system for large-scale AI workloads.