Fintech Magazine June 2026 | Page 2

Webinar

Enterp

Enterprise-Scale AI

How Inference to Overcome is Memory t Bound the

How

Memory to Overcome

Wall the Memory Wall

Enterprise-Scale AI Inference is Memo

Key Benefits for Attendeees

• Learn how the“ memory wall” limits GPU performance and increases AI inference cost

• Discover strategies to improve generative AI inference efficiency and reduce idle GPU compute time

• Explore how Penguin Solutions’ MemoryAI™ KV Cache Server can boost AI performance by up to 8X

• Understand how disaggregated memory architecture can scale AI workloads while reducing total cost of ownership( TCO) by up to 39 %

Tuesday, June 9, 2026 9:00 AM PST