Product marketers have long relied on NIST for clarity and consistency — but with new frameworks emerging for AI, it's time to ask whether these guidelines go far enough in prioritizing fairness, safety, and accuracy.
At GTC 2025, a discussion between Deloitte and VAST showed how their partnership is scaling enterprise AI with secure, auditable infrastructure—bringing business value for next-gen, agentic AI adoption.
Verge.io’s George Crump shares how a unified infrastructure approach is driving efficiency, performance, and AI-readiness — without the legacy bloat.
At GTC 2025, Nebius and VAST shared how their collaboration delivers high-performance, scalable AI infrastructure for enterprise workloads—making cloud AI more usable and accessible.
MLPerf Inference 5.0 signals the rise of large language models, with Llama 2 70B surpassing ResNet-50 in submissions and driving next-gen AI performance across compute platforms.
MemryX, a provider of edge AI acceleration hardware, recently closed its latest round of funding, serving as a potential bellwether for the next growth edge in AI compute.
Two new genAI tests (Llama 3.1 8B, Flux.1) align with production stacks as multi-node results climb. NVIDIA posts many fastest times; University of Florida, Wiwynn, and Datacrunch expand the ecosystem.
Allyson Klein talks with author and Google/Intel alum Wanjiku Kamau on moving past AI skepticism, learning fast, and using new tools with intention—so readers start where they are and explore AI with hope.
AI racks are blowing past air’s limits. Here’s a frank framework for when cold plate still wins, when it fails, and how to plan the pivot to immersion—without stranding today’s investments.
FinTech expert Anusha Nerella shares insights on staying ahead of fraud, navigating regulation, and building collaborative teams to scale responsible AI across the financial services sector.
Nat-sec cyber innovator Sean Grimaldi compares the cyber war to the War on Drugs: incentives persist; adversaries adapt. What if we shifted the goal from zero breaches to rapid detection, containment, and recovery?
Under CEO Lisa Spelman, Cornelis turns constraints into a competitive weapon, delivering speed, precision, and a customer-obsessed purpose in the high-stakes AI infrastructure arena.
Runpod head of engineering Brennen Smith joins a Data Insights episode to unpack GPU-dense clouds, hidden storage bottlenecks, and a “universal orchestrator” for long-running AI agents at scale.
From CPU orchestration to scaling efficiency in networks, leaders reveal how to assess your use case, leverage existing infrastructure, and productize AI instead of just experimenting.
From the OCP Global Summit, hear why 50% GPU utilization is a “civilization-level” problem, and why open standards are key to unlocking underutilized compute capacity.
In the Arena: Allyson Klein with Axelera CMO Alexis Crowell on inference-first AI silicon, a customer-driven SDK, and what recent tapeouts reveal about the roadmap.
In this episode of Data Insights, host Allyson Klein and co-host Jeniece Wnorowski sit down with Dr. Rohith Vangalla of Optum to discuss the future of AI in healthcare.
From OCP Summit, Metrum AI CEO Steen Graham unpacks multi-agent infrastructure, SSD-accelerated RAG, and the memory-to-storage shift—plus a 2026 roadmap to boost GPU utilization, uptime, and time-to-value.