
In highly collaborative industries like media and entertainment, time isn’t just money—it’s opportunities. Giving your animators, designers, and visual effects artists more time means they have more space to coordinate and develop better creative outcomes. And when you have hundreds of collaborators, saving each one just a few minutes every hour can exponentially increase the amount of time spent on creative endeavors instead of, for example, waiting for software to load.
I recently had the opportunity to explore how storage innovation is enhancing collaborative workflows in the media and entertainment industry with Alex Timbs, Senior Business Development Manager of Media and Entertainment at Dell Technologies, and Scott Shadley, Leadership Marketing Director at Solidigm. During our Data Insights episode, it became clear that changes in content production workflows from pre-production to final edits are causing a fundamental shift in how storage supports content creation, moving flash storage from “nice to have” to essential for modern production pipelines.
Alex brought a unique perspective to our conversation, having spent 15 and a half years at Animal Logic (now Netflix Animation) before joining Dell. His experience as the company scaled from 80 to over 1,000 people globally provided compelling real-world context for understanding storage evolution in creative environments.
Alex saw firsthand the “serendipitous performance improvements” that emerge when organizations transition to flash storage and save minutes that add up to hours of freed-up creative time, witnessing gains that went far beyond what traditional metrics might predict. At Dell, he’s worked with customers who achieved this as well. He cited a recent Dell film studio customer who achieved 100x performance improvements—not 100% gains, but literally 100 times faster workflows.
The need for faster storage has been recently accelerated by AI and real-time workloads, which demand rapid filling and flushing of video random access memory (VRAM) on graphics processing units (GPUs). Where 24GB VRAM used to be sufficient, today’s workloads often demand 96GB or more. To keep these GPUs fed, VRAM must be filled and flushed at extreme speeds, making high-performance flash storage no longer a luxury, but an absolute necessity.
Scott emphasized how storage has transformed from an afterthought to a critical performance enabler. The concurrent access patterns required by modern workflows—where multiple users need simultaneous access to large files alongside their associated metadata—can only be efficiently handled by flash technology. Existing spinning HDD storage simply cannot deliver the random access performance required for today’s collaborative, high-resolution content creation environments.
Dell’s AI Factory serves as a robust foundation for media and entertainment organizations striving to lead amid surging data growth, new content formats, and adoption of AI-powered workflows. The platform uniquely combines validated, full-stack solutions, enabling companies to start small and scale incrementally, directly addressing the sector’s dual mandates of technological advancement and financial discipline.
At its core, Dell AI Factory leverages the PowerScale family: from the cost-effective F210, optimized for studio or departmental use, to the high-density, high-performance F910 designed for the most demanding enterprise-scale operations. This architecture empowers customers to only pay for what they need today, with the confidence they can scale both performance and capacity linearly as their needs evolve, eliminating the risks of overprovisioning or stranded investment.
The result is a unified platform that streamlines collaborative workflows (including editing, visual effects, and broadcast), consolidates data silos, and supports both on-premises and multi-cloud deployment, all with high security and efficiency. Multiple industry-leading media organizations already rely on PowerScale for everything from 4K/8K post-production to real-time virtual production and generative AI–driven analytics. Dell’s integrated data reduction, metadata solutions, and cyber protection further drive down operational costs, while the modular “grow as you go” model enables ongoing financial prudence. This makes the AI Factory a trusted partner: future-ready, validated by top global brands, backed by deep ISV partnerships, and proven to accelerate creative delivery while protecting the bottom line.
The edge computing dimension adds another layer of complexity and opportunity. A modern film production might have 10 cameras that are capable of capturing resolutions up to 17K, and the crew will want to start working with that immediately. Alex described in-camera visual effects (ICVFX) scenarios where directors give real-time creative feedback, viewing final-quality visual effects directly on on-set monitors. This surge in edge computing for ICVFX pushes the need for high-performance storage that can operate in demanding production environments, all while delivering the rock-solid reliability that tight shooting schedules require.
Interestingly, Alex compared today’s transformation to the shift from analog film to digital photography. Just as digital cameras delivered instant feedback and removed the high cost of mistakes tied to film processing, modern workflows in content production combine real-time creative feedback with minimal risk. This immediacy allows teams to iterate more often, experiment more boldly, and ultimately achieve stronger creative outcomes by removing traditional bottlenecks.
Solidigm’s collaborative approach resonates strongly with this philosophy. Rather than pushing customers toward the highest-performance solutions regardless of need, Scott described how their solutions lab and upcoming AI lab allow customers to test workloads before making commitments. This “try-before-you-buy” model helps organizations right-size their storage investments while ensuring they can achieve their performance objectives.
Looking ahead, both experts see storage demands continuing to accelerate. Organizations working in 4K today need to prepare for native 8K workflows tomorrow, requiring storage architectures that can scale both performance and capacity over multi-year timeframes.
The convergence of AI, real-time workflows, and edge computing is fundamentally reshaping storage requirements across industries, with media and entertainment serving as the proving ground for technologies that will eventually transform other verticals. As Alex noted, the future belongs to organizations that can make the most informed real-time decisions possible, and that capability fundamentally depends on having the right storage foundation in place. Dell and Solidigm’s partnership demonstrates how thoughtful collaboration can deliver solutions that scale from individual creators to global production companies.
For more insights on Dell’s storage solutions for media and entertainment, visit their website www.delltechnologies.com/powerscale or connect with Alex Timbs on LinkedIn. Learn more about Solidigm’s AI-focused storage solutions at solidigm.com/ai or reach out via LinkedIn to Scott Shadley.

The use of AI in health care promises a remarkable transformation. For an industry facing chronic staffing shortages against increasing demand, the potential for always-on support for care providers and an ability to move toward proactive, predictive care systems would literally save lives. My recent discussion with Dr. Rohith Vangalla, lead software engineer at Optum, revealed how AI has the potential to reshape everything from infrastructure architecture to clinical workflows, and why privacy-first design has become the cornerstone of scalable health care AI.
During our conversation ahead of the upcoming AI Infrastructure Summit, Rohith shared insights from his unique background, which includes backend development, aviation (he’s also a licensed helicopter pilot), and academic research. These diverse experiences have shaped his perspective that AI must focus on creating tools that make health care “smarter, faster, and more human-centric.”
The regulatory landscape in health care presents unique challenges that many industries don’t face. As Rohith emphasized, “A bad model doesn’t just mean poor performance. It literally costs lives.” Rather than viewing regulations as obstacles, he sees them as essential safety rails that prevent innovation from going off track. The real danger, he argued, lies in under-regulation that could allow biased or opaque models into clinical care, leading to misdiagnosis and eroding trust in health care systems entirely.
With trust acting as a crucial cornerstone in health care AI delivery, privacy-first architecture has emerged as an essential element to new solutions. Rohith highlighted how federated learning enables hospitals and rural clinics to train shared models without moving patient data off their servers, maintaining local data control while harnessing collective intelligence. When combined with zero-trust frameworks that verify every access request, and confidential computing that keeps data encrypted even during processing, these technologies create infrastructure that doesn’t sacrifice privacy for performance.
The conversation revealed how these architectural strategies are opening doors for international collaboration that wouldn’t have been possible otherwise. Rather than slowing down innovation, privacy-first design is actually accelerating it by enabling secure data sharing across previously isolated health care systems.
Real-world impact is already visible across multiple health care domains. AI can highlight tiny anomalies on X-rays that experienced radiologists might miss, reducing diagnostic errors and accelerating treatment. Voice-enabled documentation frees physicians to spend more time connecting with patients. And on the operational side, AI-powered call centers could route patients to appropriate specialists in seconds, eliminating anxiety-inducing hold times.
Looking ahead, Rohith identified the most exciting frontier as the shift from reactive to proactive care. Predictive analytics can now identify early risk factors for conditions like heart failure or sepsis before symptoms appear, enabling clinicians to intervene before patients require emergency care. This capability becomes even more powerful when considering underserved areas. A rural clinic without a cardiologist, for example, could leverage AI-powered tools to support general practitioners in making critical diagnoses.
The infrastructure evolution supporting these advances focuses on efficiency and accessibility rather than raw computational power. Technologies for fine-tuning large language models (LLMs) enable organizations with limited resources to customize powerful models without enormous infrastructure investments. Rohith shared an compelling example from a university hackathon where students created a lightweight AI system that could run directly in ambulances, helping paramedics triage patients en route based on vitals and symptoms without requiring cloud connectivity.
At his upcoming AI Infrastructure Summit presentation, Rohith will address the critical balance between privacy and performance when choosing cloud, on-premises, or hybrid deployments. While cloud offerings provide speed, scalability, and cost optimization, on-premises solutions offer greater control and data residency, which are crucial factors in health care. Hybrid architectures often hit the sweet spot by keeping sensitive data local while offloading heavy compute workloads to the cloud.
The TechArena Take
Rohith’s vision for health care AI represents an industry focused on practical, ethical solutions that prioritize patient outcomes. His emphasis on privacy-first architecture, responsible AI development, and proactive care models demonstrates how thoughtful engineering combined with regulatory compliance can drive meaningful innovation.
The future Rohith envisions, where AI serves as a non-judgmental health and wellness companion working in the background to ensure people feel seen, supported, and safe, reflects the true potential of AI in health care. It’s not about the technology itself, but about how that technology can bridge gaps in access, improve care quality, and ultimately save lives through early intervention and predictive insights.
Connect with Rohith on LinkedIn to continue the conversation about AI infrastructure in health care. Learn more about Optum’s AI initiatives at the Optum Marketplace, where you can find the latest articles and trials on health care AI innovation.

Surveying 250 IT pros, we found 29% already run SSDs beyond performance tiers, 81% would migrate when TCO wins, and storage innovation is a top lever to free power and space across the data center.

From Intel’s layoffs to stealth automation, AI is reshaping work at a pace that outstrips human adaptation—driving record stress, uneven gains, and a scramble to reskill before the next downturn hits.

In the not-so-distant past, data center storage was somewhat of an afterthought. You needed a place to gather data; you needed it to be reliable; and you needed it to be economical. And that’s pretty much where the conversation ended. Now in the era of AI workloads, storage is taking center stage for the critical role it plays in data activation. Having the right storage solutions in the right place provides the flexibility, efficiency, and security to feed AI at scale.
I recently had the opportunity to explore this transformation with Saif Aly, senior product marketing manager at Dell, and Scott Shadley, leadership marketing director at Solidigm, to explore how enterprise storage requirements are evolving in response to AI-driven workloads and data-intensive applications. During our TechArena Data Insights episode, it became clear that storage has evolved to the critical foundation enabling AI success.
The AI workload revolution has created unprecedented demands on storage infrastructure. As Saif explained, these workloads require sustained throughput, low latency, and massive scale simultaneously. The challenge extends beyond simple performance. Enterprises face data fragmentation across edge, core, and cloud environments, creating operational complexity that can lead to vendor lock-ins and underutilized graphics processing unit (GPU) resources.
Dell’s response centers on their AI Data Platform, built on the principle that modern storage must support the entire data lifecycle. The PowerScale platform serves as the foundation, delivering what Saif described as unmatched performance improvements: 220% faster data ingestion and 99% faster data retrieval compared to previous generations. The introduction of MetadataIQ further accelerates search and querying capabilities, directly supporting AI workload requirements.
Scott emphasized how customer conversations have evolved beyond traditional capacity discussions to focus on “time to first data”—how quickly organizations can access information when they need it. In AI application workloads, different data types require varying levels of accessibility and performance characteristics. The challenge lies in understanding what data needs to sit directly adjacent to GPUs versus what can be retrieved from more distant storage tiers.
The discussion revealed how inference workloads, particularly retrieval-augmented generation (RAG) architectures, create unique storage demands. These systems require large datasets to be readily accessible for real-time referencing while simultaneously managing active data processing next to compute resources. Success depends on optimizing the balance between high-performance local storage and efficient data movement from archive locations.
While flash storage dominates high-performance applications, both experts acknowledged that hard disk drives (HDDs) retain value for cold and warm datasets. The key insight: not all data is equal, and successful architectures blend flash-based solid-state drives (SSDs) and HDD storage within unified namespaces to balance performance and cost considerations.
The conversation highlighted remarkable capacity evolution, with Saif recounting his amazement at holding Solidigm’s 122 TB drive, a device containing massive data volumes in a small form factor. This density revolution, progressing from 30 TB to 60 TB to 122 TB drives just in the last year, enables dramatic improvements in rack space efficiency, power consumption, and cooling costs while maintaining the throughput AI workloads demand.
Scott connected this capacity evolution to practical customer needs, explaining how optimization now focuses on the right bandwidth, density, and time-to-data characteristics rather than simply maximum speed. As storage capacity per device increases, the focus shifts to infrastructure optimization that delivers customer value through improved total cost of ownership and operational efficiency.
Real-world impact emerged through customer examples Saif shared. Kennedy Miller Mitchell, the studio behind the Mad Max franchise, used PowerScale to enable pre-visualization of entire scenes before filming. That capability allows directors to iterate creatively and make real-time decisions. Subaru leveraged the platform to manage exponentially growing data volumes, handling 1,000 times more files than previously possible and directly improving their AI-driven driver-assistance technology accuracy.
Looking ahead, both experts see storage demands continuing to accelerate, driven by AI’s exponential data growth and evolving workload requirements. As Saif noted, “the data explosion is not going to stop,” with AI both consuming and creating massive amounts of data. The distributed nature of modern computing—spanning edge, core, and cloud environments—requires storage solutions that provide consistent experiences and seamless data mobility across all locations.
The TechArena Take
The convergence of AI workloads, massive data growth, and distributed computing architectures is fundamentally reshaping enterprise storage from a cost center to a strategic enabler. Dell and Solidigm’s partnership demonstrates how thoughtful collaboration can deliver solutions that scale from individual creators to global enterprises while addressing the critical balance between performance, capacity, and cost efficiency. As storage continues to assert its place as a foundation of modern workloads, organizations that invest in flexible, high-performance architectures today will be best positioned to capitalize on tomorrow’s AI-driven opportunities.
For more insights on Dell’s enterprise storage solutions, visit Dell.com/PowerScale or connect with Saif Aly on LinkedIn. Learn more about Solidigm’s AI-focused storage innovations at solidigm.com/AI or reach out via LinkedIn to Scott Shadley.

Allyson Klein and Robert Blum of Lightwave Logic unpack how electro-optic polymers, paired with silicon photonics, lower power and boost density on the road to AI-fueled 400G-per-lane optics—with a 2027 volume ramp in sight.

TechArena’s flagship interview series earned a 2025 Stevie® Award in the International Business Awards® annual contest, recognizing authentic, executive-level conversations on AI, data centers, edge, and sustainability.
The In the Arena podcast took home recognition in the highly competitive “Shows – Technology” category. The win acknowledges a simple idea that has guided the series from day one: put real innovators in the spotlight and make complex technology understandable for decision-makers. It’s a gratifying milestone for a show that has quietly grown into a go-to forum for leaders shaping the infrastructure of the AI era.
Now in its 22nd year, the IBAs are widely regarded as one of the world’s premier business awards programs, drawing thousands of entries from organizations of all sizes and across industries. Winners are selected by the average scores of more than 250 global executives, with this year’s judging taking place from May through July.
In the Arena is hosted by industry veteran Allyson Klein—TechArena’s founder and principal—and produced by the TechArena editorial team. Across three seasons, the show has welcomed executives and founders from companies representing more than $9 trillion in market capitalization—including Microsoft, NVIDIA, Google, and Meta—alongside standout startups pushing the edges of compute, storage, networking, and energy. The conversations are deliberately accessible without sacrificing depth, aiming to surface the real decisions behind AI deployments, data center architectures, and sustainability initiatives.
Judges called out the podcast’s high production quality, editorial clarity, and guest caliber, noting that the series “bridges executive insight with emerging innovation.” Several praised the show’s consistent focus on AI and sustainability, and highlighted its effort to elevate underrepresented voices in tech leadership—an intentional part of the booking strategy since the series’ inception.
For TechArena, the recognition is as much about community as it is about content.
“This recognition celebrates the innovators who’ve delivered insights across our 176 episodes and our audience, which values the conversation as technology accelerates in the era of AI,” Klein said.
Beyond the accolades, the show’s impact shows up in the themes it consistently returns to: the operational realities of AI at scale; the fast-changing power and cooling profiles behind GPU clusters; the role of high-capacity flash and data orchestration in breaking bottlenecks; the promise (and limits) of optical I/O; the emergence of agentic workflows in semiconductor design; and the practical steps organizations can take to reduce environmental impact while improving performance. The result is a library designed for CTOs, architects, and product leaders who need both strategic direction and applied lessons, not just highlight reels.
The win also comes at a moment when enterprise leaders are hungry for grounded dialogue. AI adoption has accelerated, but success still hinges on fundamentals: data placement, latency, reliability, energy, and cost. In the Arena aims to be a steady companion through that transition—candid, technical when it matters, and always human.
Looking ahead, the team is lining up episodes leading into upcoming conferences including Yotta, AI Infra, OCP Global and SC’25.
To everyone who’s listened, shared an episode, or joined us behind the mic: thank you. And to the judges—thank you for recognizing a show built on curiosity, clarity, and respect for the people doing the work.

From federated learning and zero-trust to confidential computing, Dr. Rohith Vangalla shares a practitioner’s playbook for explainable, scalable AI that moves healthcare from reactive to proactive.

For every headline celebrating agentic AI’s potential to revolutionize business, there’s a data privacy lawsuit quietly working its way through the courts—a reminder that innovation has outpaced consent infrastructure. This week, Permission announced Permission Agent, a system designed to broker high-quality human data with verifiable consent and contributor rewards.
“AI is only as good as the data it’s trained on, and the best data comes directly from people—with their permission,” said Charlie Silver, CEO of Permission. “Permission Agent is the missing bridge between individuals and AI systems, enabling direct, compliant, and mutually beneficial data exchange at scale.”
Permission Agent operates as a persistent, identity-tied “digital mini-me.” It collects only user-approved signals (e.g., intent, preferences, context) and attaches usage rights and consent metadata to each record. Buyers receive structured datasets with provenance and audit trails so they can prove lawful basis and honor revocation. Contributors are compensated in $ASK, which Permission has made omnichain via LayerZero’s OFT standard, allowing movement across supported chains without wrapped tokens.
Enterprises racing into agentic architectures are discovering their identity and governance foundations weren’t built for autonomous actors. Without machine-readable consent and revocation, agents can overstep policies and contracts, raising legal exposure as author and media cases proceed and as regulations (from GDPR to the EU AI Act) tighten expectations for transparency and consent.
Permission isn’t the first to reward data contributors, but it’s one of the few aiming squarely at consented human signals for AI rather than a single vertical. Where Brave compensates attention and projects like Hivemapper/DIMO target maps and vehicle telematics, Permission’s pitch is a portable, auditable consent layer and marketplace that personalization teams and AI builders can safely use.
Permission Agent is in early access for enterprises and individual contributors. AI organizations can request sample datasets; consumers can join the waitlist.
For enterprises considering purchasing permissioned data for AI, here are some due diligence suggestions:
Agentic AI doesn’t scale without machine-consumable consent. Permission’s move is notable for putting the individual at the center of a permissioned data supply chain aimed at training and personalization—and for bundling compensation and auditability from the start.
Two execution risks will determine impact:
Adoption density: Quality requires sustained, diverse contributors and buyer demand—plus SDKs that make capture, revocation, and downstream enforcement trivial across RAG, fine-tuning, and personalization stacks.
Enforceable revocation: Recording consent isn’t enough. Buyers will expect deletions to propagate to feature stores, embeddings, caches, and logs—fast.
Bottom line: If Permission proves dataset quality, scalable revocation, and low-friction integration, it could become the default vendor for “human signals with receipts” in agentic pipelines—good for builders, and overdue for the people whose data fuels the system.
Want to learn more? Check out www.permission.ai/.

Verizon Business’ Nikhil Tyagi shares insights on scaling AI at the edge—from small language models and multimodal experiences to infrastructure challenges and adaptive inference. Want to learn more about the AI Infra Summit and see Nikhil Tyagi live? Find out more here.

The data center industry faces an ongoing challenge: how to securely reuse storage devices when decommissioning them without compromising data integrity. My recent Great Debate with Amber Huffman and Jeff Andersen from Google revealed not just the scope of this challenge, but how the Open Compute Project’s Layered Open-Source Cryptographic Key-management (OCP L.O.C.K.) initiative could reshape how the industry approaches storage security and sustainability.
During our discussion, Amber and Jeff painted a picture of an industry with a dilemma. Everyone would like to get longer lives out of storage devices. But to date, there hasn’t been a solution that still meets the top priority of protecting users’ data. As Amber explained, in the second-hand market, even encrypted drives face potential threats from nation-state actors with substantial resources, and evolving technology could eventually break older encryption algorithms.
In current practices, organizations physically destroy drives or perform time-consuming multi-pass overwrites. Destruction, while secure, creates significant operational inefficiencies and environmental waste; overwrites take a long time and are failure prone.
OCP L.O.C.K., currently at 0.85 specification and available for review, is a new alternative, a comprehensive project to deliver an open implementation at CHIPS Alliance that provides encryption key management services to storage drives and hosts. It builds on the established Caliptra open-source hardware root of trust, also an implementation at CHIPS Alliance, and will be integrated into Caliptra 2.1.
OCP L.O.C.K. improves on traditional data security methods in several ways. At its core, OCP L.O.C.K. ensures that only trusted, verified components can access the encryption keys that protect data on drives. The system creates multiple layers of key management. So when a drive is provisioned with OCP L.O.C.K., cloud service providers can trust that data remains inaccessible without the proper access credentials. And when that drive needs to be decommissioned, OCP L.O.C.K. attests that process has been completed successfully.
Jeff’s and Amber’s insights into the technical architecture revealed the sophistication of this approach. OCP L.O.C.K. introduces multi-party authorization, requiring multiple keys rather than a single password to access drive data. This creates layers of protection that persist even if a drive is physically stolen. The implementation leverages Caliptra’s open-source foundation, allowing the security community to harden these systems through collaborative development.
The possibilities created by OCP L.O.C.K. have broad implications for the storage ecosystem. Amber emphasized how OCP L.O.C.K. could transform the value chain, enabling hyperscalers and businesses to sell decommissioned drives in secondary markets rather than destroying them. This represents a significant shift toward sustainability without compromising security — achieving what she called “the best of both worlds.”
The technical roadmap they outlined demonstrates the project’s maturity and industry backing. With partners including Microsoft, Samsung, Kioxia, and Solidigm, along with contributions from other industry partners, OCP L.O.C.K. aims to be a standard implementation rather than a niche solution. Work with standards bodies like Trusted Computing Group (TCG) has already yielded results, with the OCP L.O.C.K.-inspired Multiparty Authorization specification having now been published. These efforts, along with targeting Federal Information Processing Standards (FIPS) compliance, show the thorough approach being taken to ensure widespread adoption.
Looking toward the future, both speakers highlighted how OCP L.O.C.K. represents a broader trend in hardware security. Amber’s observation about the shift from open-source software to open-source hardware particularly resonated, suggesting we’re entering an era where foundational security components that aren’t differentiators become collaborative, community-driven efforts rather than proprietary implementations.
The implications for post-quantum security were equally compelling. Jeff’s discussion of hybrid cryptographic approaches, combining established elliptic curve methods with new post-quantum algorithms like ML-KEM (module-lattice-based key-encapsulation mechanism), demonstrates how OCP L.O.C.K. is designed not just for today’s threats but for the quantum computing era ahead.
What’s the TechArena take? The OCP L.O.C.K. initiative represents more than just another security standard: it’s a reimagining of how the industry approaches storage lifecycle management. By combining robust security with environmental responsibility, Google and its partners are creating a framework that could serve as a model for other infrastructure components. As data centers continue to scale and sustainability becomes increasingly critical, initiatives like OCP L.O.C.K. show how collaborative open-source development can address both security and environmental challenges simultaneously.
The project’s emphasis on implementation rather than just specification, combined with its open-source approach and broad backing, suggests we’re looking at a future where secure storage decommissioning becomes as routine and trustworthy as Amber hopes — ultimately “boring” in the best possible way.
Connect with Amber Huffman and Jeff Andersen on LinkedIn to follow their continued work on storage security and open compute initiatives. The OCP L.O.C.K. 0.85 specification is available for download today, with the 1.0 release targeted for later this year ahead of the OCP October global summit.

The Ai4 conference kicked off today at the MGM Grand in Las Vegas, drawing an expected 8,000+ attendees for three days of keynotes, 600+ speakers across 50 tracks, and a 250-plus-vendor expo aimed squarely at applied, enterprise AI. Organizers touted a sharpened focus on agentic systems, governance, and real-world deployments.
Opening day set the tone with two back-to-back arena sessions: a fireside chat on AI’s impact on classrooms and childhood with AFT president Randi Weingarten, followed by Tengyu Ma, Chief AI Scientist at MongoDB, on “RAG in 2025: State of the Art and the Road Forward.”
Ai4’s program stretches across 19 stages and 50 themed tracks, including Generative AI, Agentic Systems, AI Policy, a reimagined AI Research Summit, and a “Beginner’s Summit” to onboard first-timers. Exhibit-hall action will formally begin Tuesday morning and run alongside daily receptions.
Tuesday’s opening main stage will feature Shirin Ghaffery of Bloomberg News moderating a fireside chat with Geoffrey Hinton, who is known as the visionary godfather of AI. The talk is titled "AI, Ethics, and the Future of Humanity." The event will highlight a slate of sector tracks, live demos across the show floor, and a social-impact spotlight with Colossal Biosciences’ Ben Lamm discussing de-extinction and AI in a fireside format.
Wednesday’s keynotes will lean into the future of perception and agents. Fei-Fei Li will dive into world models, spatial intelligence, and human-centered AI, followed by a Cisco fireside where President & CPO Jeetu Patel will explore what it will take to safely catalyze an “agentic AI revolution.”
If you’re attending Ai4 to evaluate products, the expo floor is positioned as more than a showcase. Expect hands-on demos spanning AI applications & agents, data platforms, security/governance, and cloud infrastructure—plus brand activations.
With launches and briefings from governance vendors and platform players, expect concrete approaches to policy, observability, and controls for autonomous workflows.
Ma’s keynote framed how retrieval augments, rather than competes with, fine-tuning and long-context—important for enterprises balancing cost, latency, and source-of-truth requirements.
With Fei-Fei Li headlining, spatial intelligence and world modeling will continue to pick up steam as the path beyond text-only systems—especially for robotics and simulation-heavy industries.
Ai4’s program reads like a referendum on agentic AI at enterprise scale. Three practical threads stand out for IT and cloud architects:

The enterprise AI landscape is undergoing a fundamental transformation. While organizations have focused heavily on graphics processing unit (GPU) compute power and model sophistication, a critical infrastructure component has emerged as the new performance differentiator: storage. The Supermicro Open Storage Summit, running from August 12 to 28 with online sessions from leading solutions providers, promises to reveal how innovative storage strategies are delivering breakthrough performance improvements that could reshape your AI deployment economics.
As organizations scale from AI experimentation to production deployment, they’re discovering that inference workloads demand different storage characteristics than training pipelines. The data tells a compelling story: enterprises deploying solid state drive (SSD) storage solutions are seeing 10x to 20x throughput improvements, 4,000x input-output per second (IOPS) scaling improvements, and up to 40% total cost of ownership (TCO) reductions compared with traditional storage solutions.
These aren’t theoretical gains. Real-world implementations for retrieval-augmented generation (RAG) workloads have demonstrated that storage optimization with SSDs can deliver 70% increases in queries per second while simultaneously reducing memory footprint by 50%. For enterprises struggling with the economics of AI deployment, these performance multipliers represent an opportunity to maximize return on investment.
The Supermicro Open Storage Summit expands on these opportunities with two must-attend sessions that tackle the most pressing storage considerations facing enterprise AI deployments today.
Storage to Enable Inference at Scale (August 19, 10:00 AM PT) brings together industry leaders from Solidigm, Supermicro, NVIDIA, Cloudian, and Hammerspace to explore how new storage protocols and distributed inference frameworks are enabling large-scale inference processing. This session will reveal how organizations are moving beyond traditional storage approaches to deploy validated infrastructure optimized for GPUs that unlocks real-time performance at scale.
Enterprise AI Using RAG (August 27, 10:00 AM PT) dives deep into RAG, one of the most critical enterprise AI use cases. With experts from Solidigm, Supermicro, NVIDIA, VAST Data, Graid Technology, and Voltage Park, this session addresses how enterprises can operationalize generative AI securely and efficiently while maintaining proximity to their most valuable data assets.
One of the most compelling insights emerging from enterprise AI deployments challenges conventional storage wisdom. Solidigm’s recent breakthrough work, which will be discussed in the upcoming sessions, demonstrates that strategically offloading data from memory to high-performance SSDs doesn’t just reduce costs: it actually improves performance in many scenarios.
The company’s innovative approach involves moving model weights and RAG database components from expensive distributed random-access memory (DRAM) to optimized SSDs, achieving better performance at lower cost. In one demonstration involving a 100 million vector dataset, this approach delivered 57% less DRAM usage while maintaining or even improving query performance. The economic implications are huge as enterprises can run complex models on GPUs that would otherwise lack sufficient onboard memory.
The storage optimization story extends far beyond raw performance metrics. In the upcoming sessions, Solidigm will also discuss how cutting-edge storage solutions are demonstrating dramatic improvements in TCO across the entire infrastructure stack.
Take a practical example, a 50-petabyte dataset deployment with 12 NVIDIA H100 systems. Traditional HDD-based approaches require nine racks consuming 54 kilowatts. Deploy high-density 122TB SSDs, and that footprint shrinks to a single rack with up to 90% power reduction and 50% increase in available GPU footprint.
These efficiency gains matter more than ever as enterprises grapple with data center space constraints, cooling challenges, and escalating power costs.
Organizations that leverage cutting-edge storage optimization strategies are positioning themselves for sustainable competitive advantage. While competitors struggle with infrastructure costs and performance limitations, early adopters are achieving superior AI outcomes at lower total cost of ownership.
The ability to deploy more sophisticated models, process larger datasets, and deliver faster inference responses directly translates to better customer experiences and operational efficiency.
The window for competitive advantage is narrowing rapidly. As these storage optimization techniques become mainstream, the organizations that implement them first will establish performance and cost advantages that become increasingly difficult for competitors to match.
The Supermicro Open Storage Summit provides an opportunity to learn directly from teams of industry leaders who are defining the future of AI infrastructure. With sessions featuring experts representing all layers of the stack, you’ll gain access to the collective expertise of the companies driving AI infrastructure innovation. The summit’s focus on real-world implementations, demonstrated performance improvements, and practical deployment strategies makes it essential viewing for any organization serious about scaling AI effectively.
Don’t let storage bottlenecks limit your AI ambitions. Register below today and discover how strategic storage optimization can transform your enterprise AI performance while dramatically improving your deployment economics.
Storage to Enable Inference at Scale | August 19, 10:00 AM PT
Enterprise AI Using RAG | August 27, 10:00 AM PT

I still remember walking through the bustling electronics markets of Tokyo and Hong Kong with Jim Pappas, marveling at the incredible diversity of devices and form factors surrounding us. As my manager early in my Intel career, Jim had a unique way of opening my eyes to the bigger picture—showing how the foundational work we were doing in semiconductor and standards development was sparking creativity and innovation across the globe.
This week, as the Future of Memory and Storage (FMS) conference concluded in Santa Clara, I was thrilled to see Jim receive the event’s most prestigious recognition—the 2025 Lifetime Achievement Award. As someone who witnessed firsthand his passion for the vibrant innovation that happens through standards delivery of technology, watching the industry honor his decades of foundational contributions felt like a full-circle moment.
Jim has had a tremendous impact on the industry throughout the trajectory of his career. Just imagine a world without a USB standard for a second...or without PCI and all its variants. These technologies that Jim and his collaborators helped define have formed a foundation for compute innovation. I was recently at a Tech Field Day event where I listened to the delegates sharing their Jim Pappas stories, how each of them had been impacted by his leadership over the years. One thing I was lucky enough to know is that beyond his technical brilliance, Jim’s approach to management was transformational. For me personally, he enabled me to take risks, provided incredible autonomy, and helped accelerate my career and set me up for success. To FMS: Well done on this recognition. To Jim: thanks for all that you did to propel a young Allyson forward. Your impact informs my work every day.
The conference that honored Jim’s legacy also marked a pivotal moment for the memory and storage industry, as AI dominated the conversations in Santa Clara. In fact, FMS 2025 featured AI in more than 60% of its keynote presentations and expert panel sessions—a clear signal that AI workloads are fundamentally reshaping the entire storage industry.
“Artificial intelligence is no longer just part of the conversation—it is the conversation,” said Tom Coughlin, Conference Chair of FMS. The three-day event brought together industry leaders to explore how innovations in DRAM, NAND, CXL, and computational storage are revolutionizing AI inference and training at scale.
Jim Pappas’s recognition with the Lifetime Achievement Award represents more than individual accomplishment—it celebrates the foundational standards work that enables the entire industry to thrive. His journey began in 1991 with establishing the PCI-SIG and PCI standard, work that later evolved into defining the PCI Express (PCI-e) standard that remains foundational to computing platforms today.
After joining Intel in 1994, Jim collaborated with seven companies—including IBM and Microsoft—to create the hugely successful USB standard. His recent contributions include launching SNIA’s Persistent Memory Technology Initiative, chairing the Compute Express Link (CXL) Consortium, driving formation of Universal Chiplet Interface Express (UCIe), and serving as President of Ultra Accelerator Link (UAL) for AI scale-up architectures.
“For over 30 years, Jim Pappas has played a pivotal role in creating a number of the most important standards and industry organizations which have been critical in the dramatic growth of the memory and storage industries,” Coughlin noted during the award presentation.
Having worked directly with Jim, I can attest that his influence extends far beyond technical standards. He understands that true innovation happens through collaboration and ecosystem building—principles that shaped not just technologies like USB and PCI, but entire generations of technologists who learned from his example.
FMS 2025 concluded with the presentation of its Best of Show Awards, with the program receiving a record-breaking number of nominations in its 19th year. The awards showcased the breadth of innovation across memory and storage technologies, particularly highlighting solutions addressing AI infrastructure challenges.
Celebrating Women in Technology
AMD’s Rita Gupta earned the 2025 SuperWomen of FMS Award, sponsored by Hammerspace and Pure Storage. As a Fellow in AMD’s Server System Architecture team and CXL End-End Architect, Gupta leads advanced CXL memory system architectures for current and next-generation EPYC platforms including Genoa, Turin, and Venice.
Her impact extends beyond AMD as co-chair of the CXL Consortium Memory Systems Workgroup, where she’s helped shape industry standards by authoring JEDEC CMC01 and JESD325 specifications and contributing significantly to CXL 2.0 and 3.0 development. Gupta’s work exemplifies the collaborative, standards-driven approach that Jim Pappas championed throughout his career.
The Best of Show Awards highlighted breakthrough innovations across multiple categories:
Additional notable winners included Micron for 1-Gamma Node LPDDR5X LPDRAM, Samsung for PM1763 16-Channel PCIe Gen6 SSD, KIOXIA for LC9 Series 245.76 TB SSD with BiCS FLASH generation 8 Memory, and Western Digital for Advanced Rare Earth Material Capture Program.
A significant theme throughout FMS 2025 was the critical role of industry standards in enabling the AI revolution. Multiple awards recognized standards organizations, including the CXL Consortium for CXL 3.X Specifications, UALink Consortium for UALink 200G 1.0 Specification, and various SNIA technical work groups.
These standards ensure interoperability and scalability as the industry races to meet exponentially growing AI workload demands. The emphasis on standardization reflects industry maturation and recognition that collaborative approaches—the kind Jim pioneered with USB and PCI—remain essential for addressing complex technical challenges.
As FMS 2025 concluded, it became clear that the memory and storage industry has firmly embraced its role as a foundation for the AI revolution. The convergence of advanced memory technologies, innovative storage architectures, and industry-wide standardization efforts positions the sector for continued rapid growth as delivery of data across an AI pipeline becomes pervasively critical to organizations.
The record-breaking attendance and award nominations demonstrate the vitality and innovation driving the industry forward. With AI workloads continuing to evolve and scale, the memory and storage ecosystem will remain at the forefront of enabling next-generation computing capabilities.
Reflecting on Jim Pappas’ recognition and the broader innovations showcased at FMS 2025, I’m reminded of those walks through Tokyo electronics markets years ago. Jim’s vision of how foundational standards could spark global creativity and innovation has proven remarkably prescient. Today’s AI revolution builds directly on the collaborative, standards-driven approaches he championed—USB ports powering development workstations, PCI Express connecting GPUs and accelerators, and newer standards like CXL enabling the memory architectures that make modern AI possible. With so much grappling on how standards keep pace for AI innovation, I think its prescient to remember how we all benefit from this collaborative innovation and not be swayed by a need for stovepiped custom designs for bespoke deployments.
My final takeaway is how memory and storage are climbing into the center of the AI conversation. Without innovation in this space, costly GPUs can spend time in idle waiting for data delivery and costing organizations wasted cycles and opportunity. FMS is part of this solution. As Coughlin noted, “FMS is the place where the entire ecosystem meets to solve these challenges head-on.” The 2025 event proved that this industry, built on foundations laid by pioneers like Jim Pappas, continues to rise and meet each new technological moment with collaborative innovation and unflinching determination.

Dell and Solidigm leaders explore how modern storage—flash, SSDs, and flexible architectures—enables AI, accelerates performance, and helps enterprises manage data across edge to cloud.

Fragmented approaches to security and IT solutions have frustrated the private and public sector for decades, creating a need for costly integrations while still leaving vulnerabilities. My recent Data Insights interview with Jeniece Wnorowski, director of industry expert programs at Solidigm, and Bora Güzey, senior IT consultant at sayTEC, revealed how organizations are finally solving this problem as they demand unified, security-first solutions that eliminate the complexity of these legacy approaches to IT architectures.
During our conversation, Bora provided an in-depth look at how sayTEC is pioneering sovereign IT infrastructure by fundamentally reimagining how security, access, and storage work together. This transformation begins with recognizing that traditional IT models treat these critical components as separate, siloed systems — an approach that increases risk, cost, and administrative overhead while leaving organizations vulnerable to evolving cyber threats.
Bora highlighted a key differentiator that sets sayTEC apart from conventional solutions: their holistic approach to IT security. sayTEC has built a unified platform where access control, data protection, and system performance are integrated from the ground up. This security-first architecture is based on zero trust and includes built-in regulatory compliance. The combination ensures that protection isn’t an add-on but is embedded across every layer of the system, delivering what Bora described as “military-grade security without compromising performance or cost efficiency.”
The company’s hyperconverged infrastructure (HCI) platform combines compute, S3 object storage, backup, and secure remote access into a single integrated system. Thanks to partnerships with companies like Solidigm and Virtuozzo, sayTEC can deliver impressive performance metrics — S3 storage speeds of up to 150 gigabytes per second and seamless scaling up to 200 petabytes, all with zero downtime.
As enterprises grapple with increasingly sophisticated cyber threats, Bora addressed how sayTEC’s zero trust architecture goes beyond basic implementations. Their sayTRUST VPSC (Virtual Private Secure Communication) technology actively monitors the full communication path, blocking unauthorized traffic before it even enters the tunnel. The system deploys pre-tunnel verification, token-based access control, and layered encryption, including perfect forward secrecy, to create what they call a “darknet environment” for secure communications.
For sovereign data handling — a critical concern for government and enterprise customers dealing with sensitive information — sayTEC’s systems ensure full control over where and how data is stored and accessed. This resonates particularly strongly with organizations dealing with critical infrastructure or sensitive personal data, where sovereignty and adaptability are paramount.
One of the most impressive aspects of sayTEC’s solution is their promise of dynamic scaling without system downtime. Bora explained how their modular architecture allows customers to start with as few as three nodes and scale up to hundreds without interrupting operations. This is achieved through distributed workloads, erasure coding for redundancy, and live data migration capabilities.
For organizations facing rapid growth or stringent regulatory demands, this means no painful transitions or migrations. They can grow their infrastructure in real time while maintaining full compliance, business continuity, and budget predictability.
Bora also emphasized the importance of strategic partnerships in delivering exceptional value to customers. The company’s research and development collaboration with Solidigm enables them to leverage high-performance NVMe drives that dramatically reduce latency while optimizing energy efficiency. These partnerships have allowed sayTEC to reduce infrastructure costs by over 50%, accelerate deployment times, and offer return on investment often within just 12 months.
sayTEC’s solutions are particularly well-suited for sectors where security, compliance, and scalability are non-negotiable. The company has seen strong demand in finance, public sector, defense, and health care — industries that deal with sensitive data and face constant regulatory scrutiny. In addition, their simplified deployment model and competitive cost structure are increasingly attracting medium-sized enterprises looking for secure, future-proof IT systems without requiring large in-house expertise.
Looking ahead, Bora outlined an ambitious roadmap that includes hyperconverged infrastructure with GPU computing for AI and machine learning workloads, enhanced zero trust for mobile environments, privileged access management integration, and plans to double S3 storage acceleration to 300 gigabytes per second. The company also plans to expand compute power support to 256 cores per node and scaling up to one petabyte per node.
In the rapidly evolving landscape of enterprise IT security, sayTEC’s approach represents a significant departure from traditional fragmented architectures. By delivering a truly unified, security-first platform that combines infrastructure, access, and storage into a single system, they’re addressing fundamental challenges that have plagued enterprise IT for decades.
The company’s focus on plug-and-play systems that simplify complexity while delivering military-grade security positions them well for the growing demand for sovereign IT solutions, particularly in Europe, where data sovereignty regulations are becoming increasingly stringent.
Check out sayTEC’s full range of solutions at www.saytec.eu. To connect with Bora and learn more about their sovereign IT infrastructure approach, you can reach out via LinkedIn or email for direct inquiries and demo opportunities.

MLCommons today announced results for its MLPerf Storage v2.0 benchmark, setting new records with over 200 performance results from 26 organizations. The results provide a trove of new data for AI trainers looking to make informed storage decisions and avoid bottlenecks in machine learning (ML) workloads.
The dramatic surge in participation compared to the v1.0 benchmark signals how critical storage has become for AI training systems as they scale to billions of parameters and clusters reach hundreds of thousands of accelerators. Companies ranging from tech giants to specialized storage providers submitted results, representing seven different countries in what officials called unprecedented global engagement.
“The MLPerf Storage benchmark has set new records for an MLPerf benchmark, both for the number of organizations participating and the total number of submissions,” said David Kanter, Head of MLPerf at MLCommons. “The AI community clearly sees the importance of our work in publishing accurate, reliable, unbiased performance data on storage systems, and it has stepped up globally to be a part of it.”
A total of 26 organizations submitted results: Alluxio, Argonne National Lab, DDN, ExponTech, FarmGPU, H3C, Hammerspace, HPE, JNIST/Huawei, Juicedata, Kingston, KIOXIA, Lightbits Labs, MangoBoost, Micron, Nutanix, Oracle, Quanta Computer, Samsung, Sandisk, Simplyblock, TTA, UBIX, IBM, WDC, and YanRong.
The MLPerf Storage benchmarks focus on testing a storage system’s ability to keep pace with accelerators, either graphics processing units (GPUs) or application-specific integrated circuits (ASICs). Among other metrics, the suite measures if the storage system can maintain accelerator utilization levels above 90% across different ML workloads.
The v2.0 results reveal storage systems now simultaneously support roughly twice the number of accelerators compared to the previous benchmark round, a critical improvement as training clusters continue to grow to meet demand.
The suite evaluates how well storage systems handle the data demands of actual AI training without requiring organizations to run full training jobs. The benchmarks work by simulating the “think time” of accelerators, the processing periods when they’re computing rather than reading or writing data. This approach generates realistic storage access patterns while testing whether storage systems can maintain the required performance levels to keep accelerators fed with data across different system configurations.
The v2.0 suite carries over three core workloads from v1.0 that represent common AI applications: 3D U-Net for medical image segmentation, ResNet-50 for image classification, and parameter prediction for scientific computing in cosmology.
The v2.0 suite introduces new tests to meet a harsh mathematical reality of AI training: in a 100,000-accelerator cluster running at full utilization for extended periods, failures can occur every 30 minutes. In a theoretical million-accelerator system, that’s a failure every three minutes.
The new checkpointing tests address this challenge head-on. Regular checkpoints—saved snapshots of training progress—are essential to mitigate the effects of accelerators failing. To optimize the use of these checkpoints, however, AI trainers require accurate data on the scale and performance of storage systems. The MLPerf Storage v2.0 checkpoint provides that data.
More information on checkpointing and the design of the benchmarks can be found in a blog post by Wes Vaske, a member of the MLPerf Storage working group.
The submissions showcase remarkable diversity in approaches to high-performance AI storage. The v2.0 results include 6 local storage solutions, 2 systems using in-storage accelerators, 13 software-defined solutions, 12 block systems, 16 on-premises shared storage solutions, and 2 object stores.
This technical variety reflects what MLPerf Storage working group co-chair Oana Balmau called innovation driven by necessity. “Everything is scaling up: models, parameters, training datasets, clusters, and accelerators,” she said. “It’s no surprise to see that storage system providers are innovating to support ever larger scale systems.”
Enterprise storage leaders demonstrated significant advances in supporting massive AI training clusters.
DDN’s AI400X3 appliance achieved over 110 GiB/s sustained read throughput while supporting up to 640 simulated H100 GPUs on ResNet-50, representing a 2x performance improvement over the previous generation.
HPE submitted results for its Cray Supercomputing Storage Systems E2000. The E2000 more than doubles I/O performance compared to previous generations and powers six of the world’s fastest top 10 supercomputers, demonstrating proven scalability at unprecedented computational scales.
IBM showcased real-world performance with its Storage Scale system, which delivered 656.7 GiB/s read bandwidth for the massive Llama 3 1T model—equivalent to loading the entire trillion-parameter model in approximately 23 seconds—while simultaneously supporting mixed production workloads.
Quanta Cloud Technology (QCT) demonstrated the effectiveness of thoughtful system design through its QuantaGrid D54X-1U server platform, testing configurations with both Solidigm D7-PS1010 NVMe SSDs for low-latency metadata operations and D5-P5336 NVMe SSDs for high-capacity streaming read throughput.
When you’re running million-dollar training jobs that can fail every few minutes, storage is mission-critical infrastructure. The overall improvement in the number of accelerators that storage systems can support and record participation numbers reveal an ecosystem that’s taking storage seriously as a potential bottleneck to AI training efficiency.
We’re also excited to see the diversity of approaches represented in these results. Six different storage architectures, spanning everything from local NVMe to object stores, suggests there’s no single “right” answer yet. The industry is still experimenting, which means significant performance gains are likely still on the table. We’ll be watching for those gains in the next benchmark round.
The complete MLPerf Storage v2.0 results are available at MLCommons.org.

As the AI PC market moves from hype to real deployment, MLCommons has released a critical piece of infrastructure: MLPerf Client v1.0, the first benchmark specifically designed to measure the performance of large language models (LLMs) on PCs and client-class systems.
The release marks a major milestone in the effort to bring standardized, transparent AI performance metrics to the fast-emerging AI PC market, ML Commons officials said.
It’s a move that couldn’t be more timely. From developers building AI-first applications to enterprises deploying productivity tools powered by on-device inference, there’s a growing need for standardized, vendor-neutral performance metrics that reflect real-world usage. MLPerf Client v1.0 delivers just that.
MLPerf Client v1.0 introduces a broader and deeper evaluation suite than its predecessor. Here’s what stands out.
Expanded LLM support:
New prompt categories:
Wider hardware support:
Benchmarking made easy:
With participation from AMD, Intel, Microsoft, NVIDIA, Qualcomm, and top PC OEMs, this version represents one of the broadest industry collaborations yet in the AI PC space.

The AI PC conversation just got real. MLPerf Client v1.0 gives the industry a common language to talk about performance—not just raw inference speed, but usability across context lengths, structured prompts, and compute environments that look more like real end-user conditions.
It’s especially important in an ecosystem full of proprietary benchmarks and marketing-led performance claims. For OEMs and chipmakers racing to stake out territory in the AI PC era, this is a reality check.
But the bigger picture is this: AI workloads are going local. And that means we need tools that reflect how AI is actually used on devices with power, memory, and thermal constraints. MLPerf Client v1.0 answers that call with open, standardized, and scriptable benchmarks—all the ingredients needed to build trust across the ecosystem.
As AI PC adoption ramps, expect MLPerf Client to play a foundational role—not just in performance reviews, but in how next-gen silicon, SDKs, and even software experiences are shaped.
Download MLPerf Client v1.0: mlcommons.org/benchmarks/client.

Dell and Solidigm explore how flash storage is transforming creative pipelines—from real-time rendering to AI-enhanced production—enabling faster workflows and better business outcomes.

I recently sat down with Solidigm’s Jeniece Wnorowski and Mohan Potheri, principal solutions architect at Hypertec, to unpack how immersion cooling is reshaping data-center economics for AI and high-performance computing (HPC). During our discussion, it became clear that the biggest constraint on AI progress isn’t silicon — it’s keeping that silicon cool. Hypertec, founded in 1984 and now shipping over 100,000 servers a year to customers in more than 80 countries, has spent four decades learning how to squeeze more compute into less space without breaking the power budget, an experience that set the stage for our conversation.
Mohan painted a sobering picture of an industry straining under the weight of its own momentum. AI, HPC, and edge-computing workloads have pushed power and cooling demand to record highs just as sustainability-focused goals demand lower energy footprints. Operators face a conflicting mandate: deploy clusters faster than ever, but do so with tighter efficiency targets and, in many sites, within real-estate footprints that can’t grow any further. Space-constrained facilities must find ways to condense more compute while still meeting aggressive thermal budgets, all without blowing out capital or operating expenses. These pressures, he said, turn traditional air-cooled data centers into bottlenecks the moment racks tip into multi-kilowatt territory.
Hypertec’s answer is to start with liquid rather than retrofit for it. The company’s single-phase "immersion-born" servers live permanently in dielectric fluid, eliminating fans and chillers and cutting cooling power by roughly 50% while driving site-level power usage effectiveness (PUE) down to about 1.03.
Because every component is designed for submersion from day one, the servers avoid material-compatibility problems that plague air-cooled hardware dipped into tanks after the fact, and they let central processing units (CPUs) and graphics processing units (GPUs) sustain 90-95% of peak clocks instead of throttling under heat. A 10-megawatt deployment that would normally sprawl across 100,000 square feet collapses into roughly a tenth of that footprint, and Hypertec’s field data shows hardware lasting up to 60% longer thanks to the vibration-free, contaminant-free bath.
Tanks roll in pre-assembled, set up in under 10 minutes, and fill with fluid in less than half an hour, giving operators a shortcut from loading dock to AI production. Add immersion-ready storage nodes that put as much as two petabytes beside the compute they feed, plus 800 Gigabit-per-second networking, and Hypertec delivers a dense, sustainable, and rapidly deployable platform that sidesteps the very constraints throttling its air-cooled peers.
Before we wrapped, Mohan shifted the spotlight to storage—the quiet partner that can still slow an otherwise cutting-edge system. He explained that if data can’t reach the processors quickly, even the fastest GPUs and CPUs end up waiting. To avoid that pinch-point, Hypertec extends its immersion approach to storage as well, placing dense drive enclosures in the same fluid bath and on the same high-throughput fabric as the compute nodes. By treating cooling, compute, and data as one integrated stack, the company keeps every component working in sync and lays a cleaner path to future scale.
What’s the TechArena take? Together, these solutions make a compelling argument: immersion isn’t a niche experiment but a practical response to AI’s insatiable appetite for watts, racks, and real estate. Hypertec’s immersion-born solutions show how vendors can rethink server design to meet that challenge head-on—reducing energy, shrinking footprints, extending equipment life, and freeing budgets to buy more compute instead of more chillers.
Listen to the full conversation here, to learn how immersion cooling is quickly moving from “interesting” to inevitable.

Earlier this month, European automotive original equipment manufacturer (OEM) leaders BMW, Mercedes Benz, and VW announced an agreement to collaborate in the development of an open-source shared software platform for electric vehicles (EVs). This unlikely collection of competitors has decided to join forces to stave off the heated competition from the Chinese automotive EV OEMs.
In a recent interview updating my predictions for the automotive market for 2025, I highlighted the point that BYD, a Chinese automotive OEM, today ships more EVs annually than Tesla. It’s not just BYD’s momentum alone that has the German OEMs concerned; the Chinese EV market is exploding as evidenced by the more than 100 new EVs that were introduced by various Chinese OEMs at this year’s Shanghai International automotive show. In short, China is dominating the “new energy” vehicle market segment, leading to the German OEMs taking drastic measures to stave off the stiff competition.
In principle, collaboration among the major OEMs to address a global competitive threat makes sense. In practice, achieving this objective can prove to be tricky, if not impossible. While some of the key underpinnings for success are in place—such as the collaboration focusing on areas that don’t establish a vehicle’s brand or differentiated value—as recently as two years ago, 10 of the major Japanese automotive OEMs formed a similar consortium with a similar set of objectives and constraints, only to disband this effort after just six months.
The 10 “J OEMs,” named so in recognition of the 10 Japanese automotive OEMs that banded together to address global competition, collectively came to the realization that establishing a common hardware platform was not tenable because of the very different market segments the different OEMs addressed, which spanned from very low-end to very high-end.
While one of the key motivations of the software defined vehicle (SDV) is to abstract away the underlying hardware such that high-end vs. low-end hardware platforms appear similar in nature, it’s not always clear what software features and capabilities establish brand identity and differentiation—especially given the nascent nature of this evolving market.
To that end, the smartphone is often cited as a good illustration of the concept of a software defined platform, a good parallel to the SDV. Today’s smartphone operating system, which addresses a wide range of underlying hardware combinations, raises the question—what are the critical differences between an Apple iPhone and a Samsung Galaxy? Is it the software, or is it the hardware? I believe the answer is yes….it’s both. So, developing a universal software platform that provides significant industry momentum while allowing different OEMs to retain differentiation and brand identity will prove to be a bridge too far, unfortunately, is my prediction.
It’s been said that the best way to determine if a strategy will succeed is to execute that strategy. In this case, however, there have already been multiple industry-wide collaborations that have spectacularly failed as the industry grapples with this new world order of self-driving and new energy cars with an ever-increasing number of new entrants and business models that stand to challenge, if not eliminate, the long-term incumbents.
That said, there are also parallel efforts to the recently announced initiative by the German OEMs, including the SOAFEE SIG (Scalable Open Architecture for Embedded Edge special interest group), an industry-led initiative focused on defining a new open-standards-based architecture for SDVs. Similarly, a key goal of SOAFEE is to enable software to be developed and deployed across different hardware platforms, simplifying development and reducing the need for platform-specific code.
SOAFEE is a collaborative effort involving worldwide automakers, semiconductor companies, software providers, and cloud technology leaders and has been established since 2021. It’s unclear how the efforts of SOAFEE compare to those of the recently announced German OEMs, but again, time will tell.
And to muddy the waters just a little more, the Autonomous Vehicle Computing Consortium (AVCC), which also comprises many industry-wide automotive OEMs and solutions providers, is also focused on establishing open-source solutions to help accelerate the development of and deployment of SDVs. How these all differ from one another is a task left for the reader.
As the title of this blog states—the lamb lays down with the lion to avoid being eaten by the wolf. The competition is fierce, and there are too many irons in too many fires with significant R&D investments that will ultimately lead to spectacular losses. It’s a brave new world in the automotive industry, and while the industry seems to recognize as much, there don’t appear to be many clear or sound strategies to navigate this evolving landscape.

As AI continues its meteoric rise, the technologies enabling that growth – compute, memory, networking, and chip architecture – are being stretched to their limits. While the pace of innovation is accelerating to address these limits, the application of AI in the workflows continues to innovate as we move from reinforcement learning to generative AI and now, AgenticAI. During Gen AI Week, I had the distinct honor of moderating a panel of silicon industry heavyweights to explore how the next wave of chip design is evolving to meet the challenges of large-scale AI deployment.
Our discussion underscored a common theme: the age of agentic AI is upon us and it will fundamentally reshape how chips are architected, manufactured, and even conceived.
Joining me on stage were:

The panel kicked off with a look at the current state of chip design and why AI is creating unprecedented pressure on silicon teams. As AI models double in size nearly every year, chipmakers must accelerate the speed and the intelligence of their design processes.
Kelvin Low of Samsung Foundry pointed to the growing complexity of IP subsystems, which are now often pre-optimized for AI workloads. He noted that the industry is moving beyond traditional chipmaking, focusing instead on delivering pre-optimized IP subsystems and full-stack solutions tailored to specific AI workloads.
John Koeter echoed that sentiment, highlighting how today’s hyperscalers are pushing for 5–10x improvements in silicon performance, at a time when Moore’s Law and Dennard scaling are plateauing. He emphasized that the semiconductor industry is at an inflection point, where traditional scaling is no longer enough and entirely new approaches, like multi-die design and agentic AI, are needed. We need to re-engineer design workflows from the ground up.
While GPUs have dominated headlines, the panelists emphasized that AI infrastructure relies on a broader constellation of compute and memory technologies. Brennan outlined two parallel trends: going big with monolithic training chips and going out by scaling across multiple smaller units.
He also introduced the idea of compute density versus capacity, especially when it comes to high-bandwidth memory (HBM). He stressed that while HBM plays a key role in performance, it also introduces significant power challenges – accessing these memory stacks alone can consume hundreds of watts.
Low explained how the industry is working on custom HBM stacks optimized for specific workloads, with next-gen configurations offering lower power and higher integration thanks to basedies utilizing a logic-process instead of a DRAM process.
“Power is everything,” he said. “We just do not have enough power to fit into the data center. So wherever possible to reduce power, we can do that.”
As AI clusters grow from 20,000 to 100,000+ compute nodes, network infrastructure is becoming a primary design constraint.
Harish Bharadwaj explained that AI workloads are pushing data movement beyond traditional thresholds. He noted that AI cluster-level bandwidth is growing up to 10x in a single year, driving the need for far more scalable and efficient network infrastructure.
John Koeter added that networking is no longer just a back-end concern; it has become a critical co-architect of overall system performance. Koeter expanded on the evolution of standards.
“The time between standards used to be three to four years, and that’s been accelerating to 2 years to 18 months,” he said. “And the question is, ‘Why?’ And that’s across the board – memory interface, PCI Express...even good old USB. The interface standards are accelerating. And the reason is because you can pack an enormous amount of compute units onto a chip, but you have to be able to transfer data on and off that chip very, very efficiently.”
One of the most exciting – and existential – topics of the panel was the rise of agentic AI, or the use of autonomous software agents in chip design workflows.
Koeter explained that AI is transforming not just what engineers design, but how they design. He described a future where networks of autonomous agents assist with key stages of chip development – prompting teams to completely rethink and rebuild traditional engineering workflows from the ground up.
From macro placement to RTL generation, panelists said agentic AI is beginning to automate and optimize historically manual tasks. Brennan noted that although silicon engineering lacks the vast open-source data available in software, AI tools are already producing meaningful speedups.
“What used to take weeks now takes hours,” Bharadwaj said.
Still, the panelists agreed: AI won’t replace chip designers, but designers who use AI will replace those who don’t.
Agentic AI is also reshaping how teams are structured and trained.
Brennan pointed to workforce challenges, citing predictions that the industry could be short a million engineers in agentic AI by 2030.
“The cool kids are no longer going into silicon,” he said. “They’re going into algorithms and software.”
The panelists called for a shift in training and team structure, with junior engineers gaining AI-augmented capabilities once reserved for veterans. But challenges remain – particularly around proprietary data security and best practices that haven’t caught up with the tech.
When asked how teams are quantifying productivity gains, Bharadwaj was clear: it’s about pace. He noted that companies are under intense pressure to launch new xPUs annually, and that technologies like agentic computing may play a crucial role in helping the industry keep pace.
Koeter offered a final perspective.
“I tell my team all the time… there’s two types of design engineers in the future: ones that lean in and embrace agentic AI with all their hearts, and dodos and dinosaurs,” he said. “I’m like, don’t be a dodo. You gotta lean in.”
AI is no longer just a workload. It’s a force reshaping the silicon landscape. From custom memory to co-architected networks, and agentic workflows to workforce transformation, this panel revealed the full-stack rethink underway as the industry races toward a trillion-dollar AI economy.

The edge computing landscape stands at an intersection of practical necessity and AI transformation. My recent Fireside Chat with Hunter Golden, senior product manager at OnLogic, revealed just how different the reality of what is needed is from the hype. As organizations grapple with deploying AI at the edge, Hunter reveals how smart sizing edge investments will get the best return.
During our discussion, Hunter explained that OnLogic has more than two decades of experience in industrial and edge computing, long before AI became the driving force. OnLogic’s computers have lived running applications behind the scenes in our daily lives — from amusement park kiosks to flight information screens to robots working in warehouses and even harvesting crops. But the onset of AI and an increase in automation opportunities has fundamentally shifted the compute density requirements at the edge while the physical footprint remains largely static.
Hunter emphasized a critical misconception plaguing enterprises looking to deploy AI at the edge: the belief that AI requires massive cloud infrastructure or discrete GPUs. As he explained, “both training and inference can easily occur at the edge” with lower-than-expected compute requirements, noting that even his “not very powerful laptop” could run DeepSeek.
We explored the balance between performance, power, and cost that defines successful edge AI deployment, and how hardware selection and the workload objective are completely intertwined. For example, for computer vision, sizing up the workload includes understanding the number of video streams, resolution requirements, model size, and target frame rates. Once that is understood, organizations can spec appropriate hardware rather than defaulting to expensive, overpowered solutions.
The conversation also highlighted three key advantages of edge AI deployments that can get overlooked in cloud-focused discussions:
Achieving lower latency, with benefits that are immediate and measurable in edge deployments
Maintaining data sovereignty, which is critical in medical applications and other use cases where it’s critical to own your own data
Bypassing network reliability concerns, with edge deployments allowing applications to continue to function even if a network goes down
Hunter’s insights into IT modernization revealed a sector dealing with diverse transformation paths. Some companies are just connecting programmable logic controller (PLC) data to operational technology networks, while others are deploying autonomous mobile robots for material handling. The key is understanding both short-term objectives and long-term roadmaps so you can spec the right hardware and don’t have to rip and replace later on.
Looking toward future infrastructure needs, Hunter underlined the importance of guaranteed lifecycles and scalable architectures. OnLogic’s commitment to five-year lifecycles from launch addresses a common pain point where prototype hardware becomes unavailable by deployment time. The company’s commitment to life cycle transparency when embarking on multi-year projects with customers helps enterprises know they’ll have the right hardware when they get to deployment.
What’s the TechArena take? As organizations like OnLogic continue to balance innovation with practical constraints, we’re witnessing the emergence of edge AI that prioritizes efficiency, reliability, and cost-effectiveness without sacrificing the transformative potential of AI solutions. The real breakthrough is in the thoughtful matching of workload requirements to appropriate infrastructure, supported by partners who understand both the technical challenges and the business realities of edge deployment.
Listen to the full Fireside Chat for more from our conversation. Connect with Hunter Golden on LinkedIn and explore OnLogic’s Ultimate Edge Server Selection Checklist here.

The semiconductor industry is facing rapid changes and major shifts. One of them, previously announced, is finalized as of today: Synopsys, a leader in electronic design automation (EDA), has acquired Ansys, a giant in simulation and analysis software. The blockbuster deal, valued at approximately $35 billion in cash and stock, aims to create an undisputed leader in “silicon to systems” design solutions.
The acquisition brings together two titans of the engineering software world. Synopsys’ foundational tools for chip design will be combined with Ansys’ broad portfolio that simulates how those chips and entire products will perform in the real world. This fusion is designed to address the soaring complexity driven by AI, widespread silicon proliferation, and software-defined systems.
Synopsys’ President and CEO Sassine Ghazi released a video message regarding the acquisition today, calling it an “exiting day” for Synopsys employees, customers and engineering innovators everywhere.”
“We have completed the acquisition of Ansys,” he said in a blog post, “…a transaction that combines leaders in silicon design, IP, and simulation and analysis to create the leader in engineering solutions from silicon to systems.
“Together, we will maximize the capabilities of engineering teams broadly, enabling them to rapidly innovate AI-powered products.”
The move was a “logical next step” to the seven-year partnership between the companies, Ghazi said.
Ajei Gopal, President and CEO of Ansys, echoed the sentiment, stating, “This transformative combination brings together each company’s highly complementary capabilities to meet the evolving needs of today’s engineers and give them unprecedented insight into the performance of their products.”
Synopsys’ acquisition of Ansys is more than just a massive financial transaction; it’s a bold declaration about the future of engineering and product design. The traditional walls between chip design, software development, and physical system analysis are crumbling, and Synopsys is betting the house on owning the entire, integrated workflow. In an era where AI-powered smart devices are becoming ubiquitous, the ability to create a “digital twin” — a perfect virtual replica of a product that can be tested before it’s built — is no longer a luxury, it’s a necessity.
This move is a direct challenge to competitors like Cadence and Siemens EDA. By creating a one-stop-shop for engineering everything from the transistor to the final system, Synopsys is aiming to build a deeply entrenched platform that is difficult to displace. It’s a classic vertical integration play for the digital age, locking down the foundational blueprint of modern technology.
The ultimate test, however, will be execution. Integrating two massive companies with distinct cultures and complex software portfolios is a monumental task – though the companies’ deal website addresses this point – saying the cultures are complementary cultures of innovation, with the formal acquisition building on eight years of strategic partnership to “drive the fusion of electronics and physics, augmented with AI.”
The promise of a “seamlessly integrated” platform is powerful, but delivering on it will be the true measure of success. The race to own the end-to-end design chain is on, and Synopsys just made a decisive, multi-billion-dollar move.

Allyson Klein and Jeniece Wnorowski welcome Mohan Potheri of Hypertec to explore how immersion cooling slashes energy use, shrinks data-center footprints, and powers sustainable, high-density AI, HPC, and edge solutions on this Data Insights episode. Find the audio-only podcast here.