
I recently had the delightful opportunity to moderate a fireside chat with technologists from Ansys and Rohde & Schwarz about how the convergence of simulation, test and measurement is fundamentally changing how 5G and 6G radio systems are developed and validated.
The conversation centered on a groundbreaking collaboration that enables developers to virtually replicate any installation site, bringing real-world RF environments directly into the laboratory for early, reliable validation. This applies not only to outdoor urban, rural, or mobile networks but also to indoor and mixed outdoor-indoor installations for private networks in locations like factories, warehouses, and hangars.
The chat featured Shawn Carpenter, Ansys Program Director for 5G/6G and Space, Andreas Roessler, Rohde & Schwarz Technology Manager, and Jayraj Nair, Ansys Field CTO – high-tech.
Shawn discussed how dramatically antenna design has evolved, reflecting back on how single band antennas were used in the development of antennas for 2G and 3G systems.
“Today, as we unwrap the new spectrum allocations for 5G and explore millimeter wave, there's a wide number of channels and spectrum that we have to accommodate,” he said.
The complexity doesn't stop at frequency bands. Modern 5G/6G systems must handle multiband operations, manage thermal characteristics that can detune antennas, and incorporate sophisticated spatial diversity techniques. As Jayraj described, we’ve reached an era in which validating wireless systems is akin to reengineering a plane while it's midair with paying customers aboard.
Ansys and Rohde & Schwarz bring together two traditionally separate worlds: simulation and test and measurement. Their solution creates highly accurate virtual environments – digital twins of real cities – complete with five-centimeter resolution models that capture everything from street furniture to window frames and trees.
Here's where it gets interesting: the system can model electromagnetic wave propagation through these virtual environments in real time, capturing the complex interactions that occur as signals bounce off buildings, reflect from surfaces, and encounter moving objects. These channel characteristics are then fed into Rohde & Schwarz's signal generators, creating authentic RF conditions that real devices can be tested against in the lab.
“You could do a virtual representation of where you want to deploy a digital twin, use the Ansys tool to do the channel modeling, put it into test measurement equipment, and optimize machine learning algorithms for that particular channel representation,” Andreas said.
What resonates with me most is how this innovation addresses a fundamental challenge in modern technology development: the growing complexity of systems paired with shrinking validation timelines.
By enabling comprehensive testing in controlled laboratory environments, this approach could accelerate time-to-market while improving reliability. Companies using similar simulation-driven approaches have realized up to 3x acceleration in development time and cost reductions of up to 60%.
The technology also opens new possibilities for regulatory compliance and public safety validation. Shawn mentioned exploring how base station signals might interact with aircraft radar altimeters – critical safety research that can be conducted safely in simulation before any real-world testing.
The most exciting aspect of this development isn't just what it enables today, but what it makes possible for tomorrow. Andreas hinted at a future where 6G networks could continuously optimize themselves.
“You could collect data in your network, take that data and retrain that default model and do a site-specific adaptation,” he said.
Imagine networks that automatically adapt their signal processing algorithms based on changing environments, all validated through digital twin technology before implementation.
As I reflect on this conversation, I'm reminded of how often the most transformative innovations come from combining existing technologies in novel ways. Ansys and Rohde & Schwarz’ marriage of high-fidelity simulation with hardware testing represents a breakthrough that could fundamentally change how we develop, validate, and deploy wireless systems.
The implications extend far beyond telecommunications. Any industry deploying complex RF systems – from automotive radar to IoT networks – could benefit from this approach. As we stand on the brink of the 6G era, with its promise of supporting everything from autonomous vehicles to immersive reality applications, having the tools to validate these systems thoroughly before deployment becomes essential.
The future of wireless technology isn't just about faster speeds or lower latency – it's about creating systems that adapt to our ever-changing world.

Earlier this year, I shared two stories that signaled a profound shift underway in the world of silicon design.
In March, during Synopsys’ annual user group conference , the company laid out a bold roadmap for agentic AI: a vision in which autonomous AI agents assist human engineers and become co-designers of the most complex compute systems on Earth. Weeks later, at the TSMC Technology Symposium, Synopsys announced a set of certified AI-driven design flows for the A16 and N2P nodes, tightening the loop between angstrom-era process technology and AI-native tools.
These developments underscore that AI isn’t just changing how we design chips – it’s changing who the designers are.
That message came into sharp focus during a recent panel I moderated between leaders at Microsoft, Arm, Marvell, Sandisk, and NYU. Held in conjunction the Design Automation Conference, the panel featured an early model multi-agent RTL design demo – code-based and powered by Synopsys tools that are in the proof-of-concept phase. But what struck me most wasn’t the code. It was the conversation that followed, centered around three questions that will shape engineering leadership in the agentic era:
1. What happens when every engineer becomes a manager of agents, from both a technology and leadership perspective?
2. What does it mean when a junior designer skips straight to system-level orchestration?
3. How do we reimagine engineering teams when a 10-person squad can operate at the velocity of 100 engineers today?
Synopsys and Microsoft kicked off the panel with a prototype demo using early models of the multi-agent platform in testing, showcasing a fully autonomous flow that generated, validated, fixed, and revalidated RTL for a complex product design. Utilizing real code with Synopsys tools in the back end, this example demonstrated how capabilities come together.
This accessibility speaks to a major inflection point for engineers and the drawing card of a packed house for the executive discussion. And while the demo ran autonomously, the team emphasized the importance of human-in-the-loop integration in real-world deployments. The agents are being designed to collaborate with engineers to help move faster to market.
That collaborative theme echoed throughout the panel and each panelist stressed that human engineers will still hold the baton for silicon delivery. Bill Chappell, CTO of Microsoft’s strategic missions and technology, offered one of the most striking observations of the night on this topic.
“Everybody is now a senior dev – because you now have 100,000 virtual workers working for you, and you have to have that instinct to know when things are going wrong and be able to sign off on that,” he said. “And so, the ability to manage all of the things that are going to be able to be done is going to be the hardest thing.”
It’s a compelling redefinition of engineering. In the past, career progression often meant expanding from focus on one element of a chip to multi-sub-system and then full chip architecture. In the agentic age, it might mean graduating from writing simple instructions to orchestrating teams of specialized AI collaborators across complex designs.
Aman Joshi, vice president of design enablement and automation at Sandisk, explained it this way:
“Our...post-production test people always get this data that is very old. They're like, ‘Hey, your RTL doesn't match the documentation,’ and (in testing these early models), you can actually dive deep into the RTL and extract the information,” he said. “So you’re finding lots of very useful cases in that sense. So very productive, and also not only productive, very accurate, and also catching some of these problems.”
In practice, that means that AI has the potential to accelerate verification, improve documentation, and even reduce onboarding time for junior engineers. But it also demands a new kind of vigilance.
“It's very tempting today, with all these agentic things, you have an agent that...parses a…report, figures out the critical path, then generates the histogram, puts it into a slide, (and) sends it out in an email,” said Soumya Banerjee, senior vice president of ASIC design, CAD and methodology at Marvell Semiconductor. “But the worry there is, if the engineers stop thinking about those reports and don't look at it, what are they going to miss? And I don't think we are at that robustness level today to sign off on it.”
This comes with a key conclusion: the integration of agentic tools must transform how engineering leaders build organizations and train skillsets for newer in career staffers. Panelists from Microsoft and Arm emphasized a shift from centralized Centers of Excellence to cross-functional teams in which every engineer is expected to prototype, validate, and own more of the stack.
“There's a foundational shift in the shape of teams,” said Microsoft’s Chappell. “The PM role has foundationally changed.”
This shift demands both technical upskilling and a cultural willingness to evolve. Several panelists described senior engineers who’ve gone from writing every line of CAD code to overseeing the generation and validation of that code in real time as they’ve been testing these tools. They pointed to the fact that agentic automation redefines engineering jobs in a way that many engineers may not be prepared for because they are used to writing code themselves.
Panelists expressed clear concerns about skill atrophy, loss of engineering intuition, and the risk of over-automation. But the consensus was clear: organizations that prepare their teams for orchestration – not just execution – will be the ones that thrive and scale their design delivery.
As often happens when engineers congregate, the conversation shifted to how to measure the productivity gains delivered by agentic AI on engineering teams over time. While several companies projected 20–30% productivity gains, some leaders warned of “agentic sandbagging,” in which team members could underreport impact to protect future headcount. It’s also a question of how leaders use their engineering talent to reach further vs. simply reduce staff size.
“I will say it's a true cultural test for a company,” Chappell said. “Given (a projected) 30% more productivity across the board, what do you do with that? If you reduce your workforce, that's admitting that you don't know how to start new things. How well you can actually get into new fields and start new areas is going to be a true test.”
Others agreed that AI is not a replacement for the workforce, but a scaling mechanism. Teams will need to deliver more customized silicon, with smaller, more nimble teams, and ultimately customers benefit with more choice of solutions in the market.
“...More and more, we’re seeing in the marketplace that people want...a custom solution to their needs, and chip organizations will not scale if everything becomes custom,” said Kevork Kechichian, executive vice president of solutions engineering at Arm. “You can make that customization almost incremental on R&D teams and chip teams. That's where I see the value coming in, where you deliver something to a partner that seems custom to them, but you're benefiting from the scaling and all the tools that you put in place.”
Synopsys’ roadmap targets L1 capabilities by late 2025 and early access to L2/L3 capabilities – such as autonomous static analysis agents, e.g. Lint agents – also by year's end.
These tools aren’t just changing how chips are built. They’re changing how engineering is taught, led, and imagined.
“Curiosity and confidence is the only thing that matters in the education process,” Chappell said. “That is what we need to be teaching. You don't really care what you’re learning – it's how you learn. You own the system. The system doesn't own you.”
This panel delivered more than a status check. It gave us a metric for readiness – both technical and organizational. Agentic AI is moving from whiteboard to workflow. Engineers are becoming orchestrators. And leaders are being called to reimagine how teams learn, structure, and scale. The braintrust on the panel, and in the room, reflected how urgent and important this topic is to the silicon arena. It also served as a case study for broader implications across job categories, one that I hope is treated with the same amount of forethought as exhibited by these engineering leaders.
From my vantage point, this is the most exciting and consequential moment in engineering since the rise of EDA. And like all meaningful revolutions, it’s not about the tools – it’s about the people, the trust we build, and the futures we’re willing to imagine. I suggested that we hold another panel next year at DAC to gauge progress, and I can’t wait to hear how engineering teams advance with these powerful tools.
As I said onstage: It’s time to go invent the future.

The AI landscape is evolving at breakneck speed, and my recent Fireside Chat with Sanford’s Daniel Wu revealed just how transformative this moment truly is. As we prepare for the AI Infra Summit, where Daniel will deliver a keynote, his insights illuminate an industry balancing unprecedented innovation with the critical need for trust and responsible deployment.
During our discussion, Daniel painted a picture of Stanford’s AI Professional Program that mirrors the broader democratization of AI knowledge. What began in 2019 as a single technical course has expanded into seven comprehensive courses serving physicians, executives, teachers, and product managers alongside software engineers – reflecting AI’s expanding reach across every sector.
Daniel emphasized four major trends reshaping the AI landscape. First is agentic AI, which he called “the clear star of the moment.” We’re witnessing a shift toward autonomous systems capable of reasoning, planning, and executing complex tasks. Markets and Markets projects the agentic AI market will grow from $13.8 billion this year to over $140 billion by 2032, a 40% compound annual growth rate.
The second trend, embodied AI, represents the physical manifestation of these intelligent systems. Companies like Tesla with Optimus and Figure AI are developing humanoid robots for warehouses, factories, and homes. Daniel noted that 2025 is positioned as the first year of mass production for industrial robots, with the World Economic Forum suggesting billions could be operating globally by 2040.
Supporting these advances is multimodal AI, which enables systems to process text, images, audio, and video simultaneously. This capability is critical for AI to operate in real-world complexity, with the market expected to leap from $2.5 billion in 2025 to over $42 billion by 2034.
Perhaps most importantly, Daniel highlighted the trend toward trustworthy AI. A KPMG study revealed that 44% of US workers admit to using AI improperly at work, while only 41% are willing to trust AI systems. As Daniel said, “Building trust and building robust, ethical and reliable systems is not just about a trend. It’s an absolute necessity for any of the technology to realize its true potential. That’s also a core part of what I'm passionate about, and what I will be touching on in my keynote this year.”
When we explored AI’s most impactful applications, Daniel identified four transformative areas where AI is accelerating discovery at unprecedented scales.
As we approach the AI Infra Summit, Daniel expressed excitement about three key areas: state-of-the-art advancements across the entire AI tech stack, creative applications beyond the tech industry, and infrastructure for trustworthy AI. The infrastructure needs for agentic AI present unique challenges, requiring ultra-low latency for real-time decision making, new memory architectures for long-term context, and complex orchestration frameworks.
When asked about the industry’s most critical challenge, Daniel was unequivocal: it’s not technical, but human. Building trust and confidence at scale is the single most important hurdle. A recent Edelman Trust Barometer report found that 56% of people are skeptical of business AI use. To overcome this hurdle, Daniel suggests a three-part approach involving people, process, and mindset.
For people, we need massive investment in AI literacy and continuous learning. For process, we need collaborative benchmarking for responsible AI, similar to the National Institute of Standards and Technology (NIST) AI Risk Management Framework. For mindset, we need leaders who cultivate cultures of experimentation and humble continuous improvement.
Daniel’s vision for the future is remarkably optimistic. Rather than dystopian scenarios, he envisions AI as a great equalizer. But this future isn’t inevitable; it must be built intentionally through governance, safety alignment, fairness, and human oversight.
What’s the TechArena take? Daniel’s insights reveal the critical inflection point AI has reached. The technical capabilities are advancing rapidly, but the real challenge lies in building the trust, frameworks, and human capacity needed to realize AI’s transformative potential responsibly. As we move toward the AI Infra Summit, the conversations about infrastructure won’t just be about compute power and storage – they’ll be about building the foundations for the vision of the future, one where AI amplifies human creativity rather than replacing it.
Check out the full Fireside Chat. To connect with Daniel, find him on LinkedIn.

I had the delightful opportunity to sit down with Kamesh Darisipudi, growth director at Together AI, to discuss how the company is scaling in one of the most competitive and dynamic markets in tech.

Founded by a group of AI researchers and systems engineers, Together AI is building what it calls the “AI Acceleration Cloud” — a full-stack platform combining open-source models, high-performance compute, and developer-friendly APIs to power every stage of the generative AI lifecycle. With marquee projects like RedPajama, FlashAttention, and the recent acquisition of Refuel.ai, Together AI is quickly emerging as a go-to partner for enterprises and developers seeking both flexibility and performance.
In our conversation, Kamesh shared his perspective on growth, go-to-market strategy, and what it really takes to scale infrastructure in the era of open source and AI-native applications.
At Together AI, growth and marketing is a multi-dimensional concept that includes all of the above. We're operating at the intersection of product-led growth and enterprise sales, which means we need to think about growth in terms of both scale and depth.
On one side, we’re driving adoption through self-serve experiences, model usage, and community engagement. On the other, we’re building relationships and expanding accounts through a more traditional sales motion. We also support a mix of developer, consumer, and enterprise users, so growth means something slightly different across each segment.
Ultimately, growth at Together AI means launching new programs, expanding usage, accelerating time-to-value, and owning the key metrics that tie those activities back to long-term business outcomes like revenue, retention, and market leadership. It’s about moving fast while building in a way that compounds over time.
Together AI is positioned as the “AI Acceleration Cloud” - a comprehensive, full-stack solution that supports customers at every stage of their AI journey.
Whether you're just beginning to experiment with models or deploying mission-critical applications at scale, we provide the hardware, compute, tools, and flexibility needed to move fast and scale confidently. Unlike point solutions, our integrated stack bridges infrastructure, models, and deployment into a single cohesive platform.
Our open-source initiatives are key drivers of both innovation and adoption. Projects like RedPajama and FlashAttention help us earn credibility and visibility within the research and developer communities. They create a flywheel of engagement – developers build with our models, researchers publish on our innovations, and enterprises see a trusted platform backed by cutting-edge work.
Our in-house research team, which includes multiple professor-founders (ex. Chris Re, Percy Liang etc.), plays a central role in sustaining this momentum and reinforcing Together AI as a thought leader in the space.
By integrating Refuel.ai’s specialized models and orchestration capabilities into the Together AI Platform, we’re not only removing one of the biggest roadblocks in AI development – dealing with unstructured, messy data – but also enabling our customers to use their data with greater speed, accuracy, and scale.
The acquisition marks a significant step forward in our mission to accelerate the development of production-grade AI applications.
We view the AI infrastructure landscape as an interconnected ecosystem, not a zero-sum game. Strategic collaboration with hyperscalers, GPU vendors, and on-prem partners is critical to our go-to-market and scaling efforts. These relationships allow us to optimize resource availability, expand global reach, and tailor deployments to meet diverse customer requirements. Whether it's securing GPU supply or integrating with existing enterprise infra, we work hand-in-hand with partners to maximize performance and value for our customers.
Flexible Deployment Options: You can choose between serverless API endpoints (pay-as-you-go) and dedicated endpoints (reserved capacity with per-minute billing), allowing you to scale your deployment based on your traffic demands.
Horizontal and Vertical Scaling: Together AI offers flexible scaling options to ensure your deployment can handle traffic spikes and growth.
Optimized Inference Engine: Together AI's inference engine is designed for speed and efficiency, enabling fast processing of even complex AI tasks and large-scale deployments.
Top-of-funnel and brand marketing remain underutilized in AI infrastructure. Many teams focus heavily on bottom-of-funnel channels because they offer direct attribution and measurable ROI. But in a category as new and complex as AI, long-term growth depends just as much on trust and education as it does on performance marketing.
Even with all the attention around AI, we're still in the early days of true enterprise adoption. Buyers are often navigating unfamiliar technology, and they want to work with partners they trust to guide them through that process. A strong brand helps convey that trust. It positions a company as credible, forward-looking, and capable of supporting customers over the long term.
Investing in brand isn't just about visibility – it's about creating a durable advantage in a market where confidence and clarity matter as much as features and price.
I really admire the team at Ramp and how they’ve managed to transform a traditionally “unsexy” category like corporate cards and payments into one of the most memorable and innovative brands of this decade. They take creative risks, run thoughtful experiments, and aren’t afraid to challenge the status quo. Their approach is confident, original, and highly effective.
What stands out to me is their view of go-to-market as a continuous journey rather than a one-time transaction. That perspective closely aligns with our business at Together AI, where consumption is a core part of the model. It's not just about selling credits or access to GPUs. It’s about ensuring customers are actively using and gaining value from the platform over time. Ramp’s ability to blend product, brand, and lifecycle marketing has been a real source of inspiration for how I think about growth in our own space
Together AI is your full-stack AI platform – from GPU clusters to fine-tuned models to scalable inference endpoints. Whether you're building with open-source, customizing your own models, or deploying at scale, we accelerate every step of your generative AI journey with speed, flexibility, and reliability.
Check out Together AI: www.together.ai
*Kamesh's comments should not be considered official company statements.

As part of our 2025 predictions report in January, TechArena Principal Allyson Klein predicted we’d see our first major gen AI scandal this year – and just as we’re checking in on those predictions mid-year, it seems Allyson’s crystal ball served up some winners.
We sat down with Allyson to discuss how her predictions have played out.
Allyson: I haven’t seen the breakout yet from the corporate world that I’ve predicted, but we are only halfway through the year. The most notable scandal that comes to mind is Elon Musk’s Grok platform spewing pro-Nazi propaganda, including love for Hitler. That was not on my dance card for this year, but likely should have been, given the broader macro environment.
*Shortly after we posted this update on 2025 predictions, the year’s most notable gen AI scandal grew in scope. Linda Yaccarino announced on July 9 that she is stepping down from her role as CEO of X, just 24 hours after Grok began creating antisemitic comments praising Adolf Hitler.
Allyson: We're seeing massive disruption in the silicon market as companies race to capture the AI accelerator TAM. AMD snapped up silicon startup Enosemi, (as well as data center infrastructure provider ZT Systems, AI software optimization startup Brium, and the engineering employees of AI inference chip developer Untether AI). Qualcomm is expanding into data center infrastructure with its acquisition of Alphawave Semi. This trend will only accelerate as pressure mounts to reduce reliance on NVIDIA GPUs.
Allyson: We’ve all witnessed the market disruptions sparked by tariff negotiations and the ongoing ambiguity surrounding the Trump administration’s unresolved trade agreements. The market appears to be settling into a new normal — one where businesses factor tariff costs into baseline planning and build in buffers to navigate trade uncertainty. If this administration has taught us anything, it’s that this story is far from over. As the dust continues to settle, the computing industry — and its complex international supply chains — will remain front and center.
Allyson: I didn’t expect to see massive adoption of agentic computing in 2025, but I did expect the zeitgeist to shift — and it has. Within AI circles, agents have become the topic du jour, and terms like vibe coding are quickly becoming part of the common vernacular. We’re already seeing early adoption in sectors like financial services, healthcare diagnostics, and semiconductor design, with a slow, but steady, uptick expected in the second half of the year.
One big question now emerging: How must infrastructure evolve to support agentic memory? Or are we facing a new technical challenge altogether — agent amnesia?
Allyson: I was pleasantly surprised by the introduction of spatially aware hearing glasses from Nuance Audio — but frankly, I’m still anticipating major breakthroughs in the wearables market from this space. We recently wrapped an interview with Verizon, whose team is deeply invested in this area. They emphasized the importance of running small models directly on-device to unlock the full potential of next-gen wearables.
Allyson: Two announcements really stood out to me. First, Microsoft’s quantum breakthrough — successfully creating a new state of matter involving Majorana fermions. Who saw that coming? Second, DeepMind’s unveiling of AlphaGenome, which we covered on TechArena, and its role in decoding the mysterious "dark matter" of DNA. Both mark major milestones, and I can’t wait to see what comes next.
Allyson: I think the slow progress of enterprise adoption has thwarted my enterprise prediction. Look at IT organizations being cautious!

Platform9 and Commvault are teaming up to bring a new level of confidence to enterprises building modern private clouds.
Announced today, the partnership integrates Commvault’s cyber resilience and data protection solutions directly into Platform9’s Private Cloud Director – a fully-managed private cloud control plane designed to deliver the enterprise features of VMware without the operational complexity or lock-in.
This integration offers a unified, enterprise-ready solution for data protection across virtual machines and Kubernetes environments. For Platform9 customers, it’s a strategic evolution: combining application-consistent backup and agentless VM protection with a future-forward platform designed for multi-site recovery, cloud-native security, and long-term flexibility.
As the industry navigates seismic shifts in virtualization, particularly following the Broadcom-VMware acquisition, enterprises are rethinking long-term infrastructure strategies. Platform9 has positioned itself as a strong alternative, promising all the benefits of private cloud without the complexity or cost of legacy virtualization stacks.
But with that shift comes heightened scrutiny around one core requirement: resilient data protection. The joint Platform9–Commvault solution addresses that need head-on. Customers gain:
This isn’t just about backup – it’s about building secure, scalable private cloud environments that are resilient by design.
This partnership is a clear signal that the private cloud renaissance is here, and it’s maturing rapidly.
Platform9 is making an aggressive play to become the de facto choice for enterprises exiting legacy platforms. By layering in Commvault’s industry-standard data protection, they’ve checked off one of the last big “must-have” boxes for CIOs evaluating their next virtualization strategy.
From a strategic standpoint, this announcement also reflects a broader market shift: cloud-native platforms can no longer ignore traditional enterprise requirements like compliance-grade recovery, granular backup, and cross-site failover. Enterprises want simplicity and flexibility, but they won’t compromise on resilience.
If Platform9 can continue to deliver a VMware-equivalent experience while expanding into hybrid cloud and container-native services, it has a real shot at capturing mid-market and large enterprise customers searching for a safe harbor.

This summer, we’re checking in on our TechArena predictions for 2025 to see how they are holding up.
For today, TechArena correspondent Deanna Oothoudt sat down with automotive industry expert Robert Bielby to discuss what he got right in his predictions and what’s taken him by surprise.
Robert: Indeed, as predicted, the automotive semiconductor industry is now entering the trough of disillusionment as defined by the Gartner Hype Cycle, where overheated expectations are met with harsh market realities. It was only just a few years ago when it was vogue for semiconductor companies to be “all in” when it came to their commitment to the automotive market — citing this market as the critical growth driver and a key component of their diversification strategy.
Beyond Intel’s highly visible departure, 2025 has already seen several smaller semiconductor companies, primarily start-ups, previously focused on edge AI automotive applications, either fail, or redirect the focus away from automotive to other markets as the reality that the headwinds inherent to participate in the market are too significant, especially so for start-ups. I expect there will be a continued shakeout where more companies will fail, or they will be acquired by larger automotive semiconductor companies where the acquisition will address a gap in their portfolio. The acquisition of Kinara by NXP in February 2025 is a good illustration of this trend.
Robert: Currently, high-end vehicles from Mercedes, BMW, and Geely contain 8K resolution displays. And while 8K is currently considered somewhat of a novelty, we can expect to see a greater presence of 8K resolution displays as the 8K infrastructure continues to build out — including cameras, displays, and display electronics. In fact, we can expect to see 8K resolution displays begin to phase out 4K displays in a manner similar to how 4K is currently phasing out 1K.
8K resolution provides a meaningfully improved immersive experience over 4K, especially at close distances. One only needs to get up close to older televisions that were based upon the cathode-ray tube (CRT) to appreciate how the picture was made up of relatively large lighted dots on the screen. The move from 1K to 4K resolutions continues to reduce the size of those dots, which are not as noticeable when viewed at a distance, but are very noticeable when viewed from up close. Several of the leading automotive semiconductor companies, including Qualcomm’s Snapdragon Ride Elite and a leading Tier 1, have announced support for 8K resolution displays — so safe to say — watch this space (in 8K!). There’s more to come.
Robert: While it is going to be difficult, if not impossible, to predict how tariffs may or may not affect the Chinese automotive landscape, what is clear is that the China EV market as a whole is evolving at a pace that is significantly faster than other geographies, especially when it comes to EVs and vehicles with leading advanced driver-assistance system (ADAS) capabilities. In just shipments alone, BYD currently ships significantly more BEVs (battery-based EVs) than Tesla, the previous leader, by a factor of more than 1.5 times. In Q1 2025, BYD shipped 607,000 vehicles vs. 384,000 vehicles for Tesla for that same period.
At the Shanghai International Automotive Industry Association held this April, 163 new vehicles were debuted from both established and up-and-coming Chinese automakers. Over 70% of those vehicles were based on “new energy” (battery and hybrid) technologies. Chinese brands also dominated the “vehicle intelligence” launches, with 97 new models focused on this area. In short, China is quickly evolving into the leading automotive trendsetter and will correspondingly receive greater focus and attention on a global level.
Key factors leading to China’s overall success are the general lack of the need to support legacy architectures and a primary focus on EV versus ICE (internal combustion engine) technologies, which requires lower research and development (R&D) investments while supporting a faster time to a minimum viable product (MVP), where upgrades and fixes are readily addressed via over-the-air software updates.
Robert: Slow growth, the slow adoption of EVs, and increasing competition from China is having a strong impact on the German automotive market — resulting in the announcement of significant layoffs and restructuring across the German original equipment manufacturers (OEMs) and Tier 1s. While it was expected that the growth of the Chinese market would come at some market share loss from other geographies, in general it was not anticipated that the German market, as a whole, would see the level of impact that it is currently undergoing.
I would predict that we will see many of the custom application-specific integrated circuit (ASIC) programs that have been funded by both OEMs and Tier 1s put on the shelves in favor of adopting application-specific standard products (ASSPs) from the more traditional, long-term, committed automotive ASSP suppliers. As R&D funding dries up across OEMs and Tier 1s, the viability to develop a custom solution becomes increasingly out of reach, especially given the high silicon R&D costs, software development costs, and exorbitant costs associated with AI training. “Off the shelf,” “full stack” solutions that ultimately still support differentiation via software will become more attractive to OEMs than custom silicon alternatives.
Tier 1s, however, will feel the squeeze as the differentiated value they will be able to deliver will be reduced when compared to delivering a custom solution, relegating them mostly to a position where they are seen as automotive-compliant contract manufacturers. This could lead to further restructuring or consolidation of the automotive Tier 1s.

Stanford’s Daniel Wu unpacks AI democratization — exploring agentic & embodied AI, multi-modal models, and trustworthy systems. Learn more at Daniel’s AI Infra Summit 2025 live presentation.

The financial services sector stands at a pivotal moment in AI adoption, and my recent conversation with Anusha Nerella, Financial Industry Leader and Forbes Tech Council Member Leader, illuminated just how transformative this journey is. As we gear up for AI Infra Summit in September, where Anusha will be speaking, her insights reveal a sector that’s moving thoughtfully but decisively into AI implementation.
During our discussion, Anusha painted a picture of an industry still in its “explorative phase,” but one that’s laying crucial groundwork for AI integration. Many financial institutions are taking a measured approach — introducing enterprise-level AI licenses and co-piloting tools to reduce manual efforts while maintaining the stringent security and compliance standards that define the sector. This isn’t about rushing to implement the latest AI trends; it’s about strategic, sustainable transformation.
Anusha emphasized the importance of local large language models (LLMs) in the FinTech industry. When data sensitivity and regulatory compliance are paramount, the ability to deploy AI without internet dependencies isn’t just convenient: it’s essential. As she explained, financial institutions deal with “petabytes of data and billions and trillions of dollars in trades” every minute, making localized AI deployment a critical capability for handling complex, sensitive data streams.
We also explored agentic computing, where Anusha highlighted a fundamental shift from reactive to proactive AI systems. In financial services, this represents a significant leap — moving from AI that simply processes data to agents that can make context-based decisions and learn from outcomes. Yet she was careful to emphasize the boundaries: these systems must operate within carefully defined parameters, a reflection of the industry’s need for controlled, auditable AI behavior.
Perhaps most revealing were the challenges Anusha outlined around deployment, which broke down into three main areas. First, domain expertise is a critical hurdle. Financial AI agents need to understand the intricate rules and regulations that govern financial operations. Second, integration with legacy systems, a reality for most established financial institutions, adds another layer of complexity. Finally, the agents must not only perform accurately but be able to explain decisions. The need for transparency isn’t just a nice-to-have in this sector; it’s a regulatory requirement.
Looking toward infrastructure needs, Anusha underlined the importance of scalability and resilience. The financial sector’s stringent requirements—real-time inference, high throughput, and unwavering compliance—demand infrastructure that can perform at scale while maintaining the security and reliability standards clients expect.
As we approach the AI Infra Summit, Anusha expressed particular excitement about discussions around LLM observability and agentic orchestration. Her enthusiasm for learning about responsible scaling and regulatory compliance in agentic systems reflects the broader industry’s need for frameworks that enable innovation while maintaining the strict controls financial services require.
What’s the TechArena take? Anusha’s insights reveal a sector that’s approaching AI transformation with the same rigor it applies to managing trillions in assets. The foundation being laid represents a sustainable path to AI adoption that could serve as a model for other highly regulated industries. As financial institutions continue to balance innovation with responsibility, we’re witnessing the emergence of AI deployment frameworks that prioritize trust, transparency, and compliance without sacrificing the transformative potential of AI solutions.
Connect with Anusha on LinkedIn and through her contributions to Forbes Technology Council, where she continues to share insights on responsible AI adoption in financial services. Her upcoming session at AI Infra Summit promises to delve deeper into the critical considerations that will shape the future of financial technology.
Listen in to the full podcast.

SayTEC redefines IT with a zero trust, hyper-convergedplatform delivering sovereign cloud, seamless scalability, and military-gradesecurity for critical industries.

AI hyperscaler CoreWeave announced it will acquire data center operator Core Scientific in an all-stock transaction valued at approximately $9 billion, cementing a bold move to take greater ownership of the physical infrastructure that powers its expanding AI and HPC workloads.
The acquisition, set to close in Q4 2025, would transfer ownership of 1.3 GW of gross power across Core Scientific’s U.S. data center footprint to CoreWeave – with the potential for 1 GW+ of future expansion. Under the deal, Core Scientific shareholders will receive 0.1235 shares of CoreWeave Class A stock for each share they hold, ultimately accounting for less than 10% of the combined company’s ownership.
“This acquisition accelerates our strategy to deploy AI and HPC workloads at scale,” said Michael Intrator, CEO of CoreWeave, in a press release. “Owning this foundational layer of our platform will enhance our performance and expertise as we continue helping customers unleash AI’s full potential.”
With crypto-mining still accounting for a portion of Core Scientific’s active workload, CoreWeave’s leadership hinted at medium-term repurposing plans, signaling that the AI wave has become the dominant monetization path for high-density infrastructure.
“Together with CoreWeave, we will be well-positioned to accelerate the availability of world-class infrastructure for companies innovating with AI,” said Adam Sullivan, CEO of Core Scientific.
CoreWeave’s acquisition of Core Scientific is more than a real estate play – it’s a strategic realignment of the AI infrastructure stack, as companies that once rented compute now seek to own the grid that underpins it.
By pulling data center operations in-house, CoreWeave moves closer to hyperscaler parity with players like AWS and Microsoft, while maintaining its edge in delivering GPU-rich, AI-native compute services. The deal also marks a turning point in the crypto-to-AI transition, as underutilized mining assets are reimagined as AI infrastructure.
This is a classic verticalization play with a 21st-century twist: Instead of chasing scale through more compute alone, CoreWeave is locking down power, land, and efficiency – the real constraints in a generative AI economy.
In a landscape defined by power-hungry workloads, supply chain bottlenecks, and geopolitical uncertainty, control is king. And CoreWeave just bought itself a throne.

Anusha Nerella, financial industry leader and Forbes Tech Council member leader, explores AI-driven FinTech infrastructure — scalability, governance and agentic computing. Interested in finding out more about the AI Infra Summit and seeing Anusha Nerella live? Find out more here.

This summer, we’re checking back in on our TechArena predictions for 2025 to see how they are holding up. We’re starting with Vernon Turner’s environmental, social, and governance (ESG) predictions for multinational corporations (MNCs). Based on his performance, so far, we’re giving our predictions high marks.
Status: CONFIRMED
Turner warned about regulatory confusion stifling ESG software investment, innovation, and strategic flexibility, and it seems to have happened. In a survey of 125 large MNCs, 80% of respondents reported they were adjusting their ESG strategies in 2025 and 75% said they expected the shifts would “slow down” decarbonization efforts.
Status: ACCELERATING
New projections show that this prediction is right on track. AI adoption in ESG is now expected to grow at 28.2% compound annual growth rate through 2034. Companies are still facing regulatory frameworks demanding ESG data transparency and compliance, and AI offers a tantalizing path for automating data collection and reporting.
Status: IN PROGRESS
While the US regulatory scene is chaotic, MNCs answer to other regulators as well. The EU is working to rewrite its Corporate Sustainability Reporting Directive to make it less burdensome, but companies still need to prepare for reporting requirements in some form. That makes the hunt for ESG data scientists and reporting solutions urgent.
While regulatory chaos creates uncertainty, it’s also creating real opportunities in ESG tech infrastructure. Companies need robust, automated systems more than ever to navigate this fragmented landscape. We expect to see more tech investments in this area, even as ESG investment overall slows down, as we head into the back half of 2025.

The glowing green letters on the black screen seemed almost mystical to little Allyson Klein when she spotted her first Apple computer at her friend’s house in the 1970s.
Growing up in Silicon Valley during the semiconductor boom, Allyson’s house was already a playground for the emerging digital age, with every gaming console imaginable having found its way to their living room, courtesy of her father's work with the toy industry.
“At the beginning, there was this weird blend where gaming and computing were introducing electronics into the household,” she recalled. “My first computer was a Commodore 64, and with it, you had to learn BASIC because not much software had been written yet.”
For someone who would spend decades of her career translating complex technology into human stories, Allyson's early experiences with Atari and other gaming consoles shaped her appreciation for a world where silicon and software would reshape everything.
As founder and principal of TechArena, Allyson now leads a tech marketing agency and media platform that's carved out a unique space in the tech media landscape. But her path from Silicon Valley kid to tech industry storyteller wasn't linear; it curved its way through 22 years at Intel, a stint leading global marketing and communications at Micron, and pioneering technology podcasts along the way.
Allyson's father, an international marketing executive, brought marketing strategy discussions to the dinner table each night. Her mother worked as a nurse for semiconductor companies, bringing home stories about the intricate chemical processes used to create computer chips and what they could do to the human body.
“I developed a fascination with marketing strategy and with the process of semiconductor creation at a very young age,” she said.
The University of Oregon gave Allyson a foundation in marketing, management, and international studies – a combination that reflected her father's influence and her own intuitive understanding that technology's real power lay in its human applications. But like many business students, she found the theoretical aspects less compelling than the real-world applications she'd encounter later.
It wasn't until she was working on her MBA in Portland, surrounded by Intel employees sharing stories of their work, that she realized she wanted to be more than an observer of the tech revolution.
“Their stories about foundational work and industries being reshaped by technology convinced me I wanted to be part of that,” she said.
Allyson's Intel career began in the late 1990s, during one of the most transformative periods in computing history.
One of her most significant mentors early in her career was Jim Pappas, who was instrumental in creating USB, PCI, and countless other industry standards. Pappas didn't just teach Allyson about technology – he showed her how foundational innovations could spark creativity across entire global ecosystems.
Allyson found herself at the center of what she now calls one of the most disruptive forces in tech over the last 20 years: the rise of industry-standard data centers and the creation of cloud computing.
“We take what that technology has done for granted, but it really transformed the world,” she said. The pandemic would later prove her point dramatically – with cloud infrastructure keeping children in school, delivering products to doorsteps, and providing the connectivity tools that held society together during lockdown.
Allyson viewed her role at Intel as much more than marketing individual products; she was helping build ecosystems – creating initiatives that brought together companies delivering complementary technologies, crafting foundational messaging that would shape entire industry narratives, and telling stories that would help customers understand what Intel was building and why it mattered.
One of her most successful projects was the Open Data Center Alliance, formed with enterprise technology leaders from around the globe to document their requirements for cloud computing.
“What that taught me was a deep respect for what it takes to run IT operations,” she said. “Understanding the challenges they face day-to-day and what they think about in terms of workloads and workflows across very complex enterprise environments.”
In 2009, Allyson’s boss approached her with a challenge.
“There's this new thing called social media,” he said. “Go figure it out.”
Allyson researched and worked with external agencies to come up with recommendations for how Intel would engage in social media, and one of her two resulting actions was to start a podcast.
Allyson had an insight that would prove prescient: the best conversations about technology weren't happening in conference rooms or marketing presentations – they were happening in Intel's cafeterias, where she would sit for hours talking with engineers, asking them to explain their latest innovations.
Chip Chat launched as a weekly show and would eventually run for 754 episodes, reaching over 20 million listeners and winning numerous industry awards. The podcast gave Allyson “a profound appreciation for the role of inquiry in driving narrative.”
“People love to talk about what they've done, but they need that prompt to give them permission to share,” she noted.
After Intel, Allyson took on the role of leading global marketing and communications at Micron, the world's fourth-largest semiconductor manufacturer. The position offered her first opportunity to oversee corporate and internal communications, managing everything from COVID-19 messaging to responses to the Black Lives Matter movement to technology evolution and the CHIPS Act.
But by 2022, Klein found herself at a crossroads. She had proven she could own the message inside major corporations, but something was missing.
“I missed creating content. I missed telling stories,” she said. “I wasn't getting the opportunity to take pen to paper or sit in front of a mic anymore, and those things gave me joy.”
The idea for TechArena emerged from Allyson's realization that she might be more inspired working as strategic counsel across multiple companies rather than owning the narrative within a single organization. But it was also born from her unique perspective on an industry she'd lived inside for decades.
“Most tech journalists don't have the background of living inside companies,” she said. “At TechArena, we understand the shorthand and what might actually be going on because we've lived in that environment so long.”
TechArena launched as both a content platform and an agency, allowing Allyson to demonstrate her team's “mad skills” while building a business around strategic marketing counsel. The platform has featured more than $9 trillion in market cap worth of companies, as well as 84 founders and CEOs of small tech startups who've shared their stories.
Allyson’s approach to content differs markedly from traditional tech journalism. Every piece includes the “TechArena take” – an opinion based on insider knowledge. The writing style is deliberately less formal than typical industry publications because, as she puts it, “people are human and they want to enjoy the content they're consuming.”
Perhaps Allyson's most unconventional belief is her optimism about technology's impact on human jobs. While many worry about AI replacing human workers, Allyson draws on her experience with previous technological disruptions.
“When cloud computing and virtualization emerged, we thought consolidating workloads 20-to-1 would collapse the server market,” she recalled. “We worried about this constantly at Intel.”
Instead, more applications were built, more uses for technology emerged, and IT departments only grew larger.
What emerges from Allyson's story is a career built on a fundamental insight: technology's real power lies not in its technical specifications, but in its human applications. Whether creating narratives at Intel, such as, “We move, store, and process the world's data,” building ecosystems across the industry, leading massive global organizations, or launching podcasts that gave engineers permission to share their passion, she has consistently focused on the human element in technological advancement.
“The center of any marketing and communications program is the message and the audience,” she said when asked about her unique ability to work across both disciplines. “Understanding the unique challenges each field solves with that message and audience defines both their synergies and differences.”
As Allyson looks to the future of TechArena, her vision remains rooted in this human-centric approach. She envisions a team creating content with multiple voices, richer client collaborations, and a brand with deep meaning to its audience.
“I'm more fascinated with technology today than I've ever been,” she said, describing recent interviews on agentic AI's role in silicon development and AI-driven simulation for 5G and 6G antenna testing. “The geekier it gets, the more excited I become.”
For someone who started with gaming consoles in her childhood living room, Allyson still finds wonder at the intersection of human creativity and technological possibility – committed to telling the stories that help the rest of us understand why innovation matters.

The legal industry saw another major milestone in its AI transformation journey this week as Clio, a legal practice management company, announced its acquisition of AI-powered legal research platform vLex for $1 billion in cash and stock. The deal promises to bring together technologies spanning the management of law firms and the practice of law into a single, unified platform.
“Through this acquisition we are laying the foundation for the first and only cloud-based, AI-powered platform that seamlessly connects the business and practice of law,” Clio’s CEO and Founder Jack Newton said in a blog post announcing the acquisition. “It’s a moment that reflects not only the scale of what we’re building, but the scale of what’s possible and represents a bold step toward building a new category of legal technology.”
The scale of the deal is reflected not only in dollars, but in the reach of the two companies’ systems. Clio’s practice management software is used by over 200,000 law firms worldwide; vLex’s global legal intelligence platform, known for its built-in AI assistant Vincent and propriety database including more than a billion legal documents, serves more than 2.8 million registered users.
“With the most comprehensive global legal library and firm insights, Clio and vLex are uniquely positioned to reshape the mechanics of legal work and redefine the trajectory of the profession,” CEO and Co-Founder of vLex Lluis Faus said.
The acquisition announcement comes rapidly after another strategic alliance recently announced between Harvey AI, which offers a ChatGPT-based AI platform for tasks like legal research and contract analysis, and LexisNexis, which is one of the main competitors to vLex. In that case, the two companies announced that LexisNexis database and AI capabilities would be integrated into Harvey to create new workflows and “a powerful new experience for Harvey customers.”
Clio’s acquisition of vLex is subject to standard regulatory approvals. More information about the capabilities the combined platforms may offer is planned to be shared at ClioCon this October in Boston.
These partnerships reflect the legal industry’s rapid transformation as AI reshapes how lawyers conduct research, draft documents, and manage cases and their practices. They also point to an urgent necessity: an arms race for the best data in the domain.
Great data is crucial to building the best AI models and platforms. In an industry where model performance can be the difference between freedom and liberty, or which way a billion-dollar judgment goes, putting a price tag on “the best” is an expensive prospect.
Two of the three dominant sources of data that currently exist — vLex, LexisNexis, and a legal database owned by Thomson Reuters — have now been claimed. What remains to be seen is what will happen to the legal tech service providers who haven’t been able to get in on this gold rush.
We’ll be watching to see where Thomson Reuters lands. We’ll also be watching for consolidation waves in other industries with similarly concentrated, high-value data repositories. We suspect the scrambles to control the information that powers AI systems have only just begun.

After nearly 18 months of regulatory friction, Hewlett Packard Enterprise (HPE) has cleared the final hurdle in its $14 billion acquisition of Juniper Networks. On June 28, 2025, the U.S. Department of Justice announced it had reached a settlement with HPE, ending an antitrust challenge that had threatened to derail the high-stakes deal just weeks before trial.
The settlement, marked by two key concessions, removes a major overhang on the merger and allows HPE to move forward with a strategic acquisition designed to strengthen its position in the AI-native networking era.
As enterprises re-architect for AI workloads, the network has moved from a supporting role to a foundational pillar of modern infrastructure. The HPE–Juniper combination reflects this shift – where intelligence, performance, and adaptability must now live inside the network itself.
To address competitive concerns, HPE will divest its Instant On campus and branch WLAN business, including all IP, R&D, and customer relationships. The company will have 180 days to identify and secure a DOJ-approved buyer.
Additionally, the parties agreed to a technology licensing requirement: Juniper’s AI Ops for Mist source code – critical to its WLAN optimization and automation – must be made available through a non-exclusive, perpetual license via a competitive auction. This includes optional transitional support and even personnel transfers to jump-start competition.
These terms are designed to preserve competitive dynamics in enterprise wireless networking while still allowing HPE to advance the merger.
HPE CEO Antonio Neri has long framed the Juniper acquisition as more than just consolidation. In his words, the combined entity offers a “modern network architecture alternative” that’s optimized for AI workloads across cloud, enterprise, and service provider environments.
By bringing Juniper’s AI Ops capabilities, Mist AI portfolio, and networking silicon innovation under HPE’s umbrella, alongside Aruba’s access and security tools, the company is building what it sees as a differentiated, software-defined stack for the next wave of infrastructure.
With AI workloads reshaping data flows, automation, and telemetry requirements across all network layers, HPE’s endgame is to own the central nervous system of modern infrastructure. The network is no longer the highway for data – it’s the platform where real-time insight, optimization, and control happen. In this vision, the network isn't adjacent to AI infrastructure; it is AI infrastructure.
Channel partners have called the deal a “game changer” for HPE, particularly with its expanded reach across cloud and telco verticals. However, some observers remain skeptical of the DOJ’s assertion that the combined HPE-Juniper entity and Cisco would control over 70% of the U.S. WLAN market – a figure that may not fully account for disruptive players and emerging architectures in this fast-moving sector.
The DOJ’s settlement may have closed the chapter on legal opposition, but the real test is what HPE does next. With regulatory clearance in hand, the company now carries both the strategic potential and the execution burden of realizing its AI-native networking vision.
The divestiture of Instant On and the licensing of Juniper’s Mist AI source code underscore a deeper truth: AI-era infrastructure will be won not just by owning the stack, but by how open, adaptable, and customer-centric that stack proves to be.
If HPE delivers on its promise of real-time automation, silicon-level optimization, and intelligent edge-to-core integration, all without locking customers into a monolithic experience, it has a chance to challenge Cisco and reset the expectations for what enterprise networking should be.
The next moves – from roadmap integrations to go-to-market strategy – will determine whether this is just a large deal or a defining moment in the re-centering of the network as AI’s operating backbone.

With all of the buzz from the business world about how AI will affect how we work and live, it can be hard to remember the truly mind-blowing variety of potential applications for the technology. This week, Google DeepMind (whose AlphaFold won the 2024 Nobel Prize in Chemistry) broke through the noise with a stunning reminder when they unveiled AlphaGenome.
More than 20 years ago, the Human Genome Project succeeded in sequencing the 3.1 billion genetic letters that provide the DNA instructions to create a human being. But the end of that journey (like many scientific journeys) offered more questions than answers. Chief among them: What does 98% of our DNA actually do?
About 2% of our DNA was determined to be dedicated to making proteins. The rest, “non-coding,” DNA was dubbed by some to be “junk” and by those more curious, to be “dark matter” and the new frontier of genetic exploration.
Over time, scientists have come to understand that these non-coding sequences still affect protein activity. They determine whether or not certain genes are expressed (a process called regulation), which can affect if a gene that commonly drives cancer or heart disease, for example, is turned “on.” But no single DNA stretch has only one job, either. So understanding the effect of a mutation in a stretch of DNA remains an incredibly complex problem.
Two decades later, enter AlphaGenome: an AI model that predicts how genetic mutations affect gene regulation across this non-coding DNA. The tool processes up to one million DNA letters simultaneously and scores variant effects within seconds. Its model architecture is a hybrid neural network leveraging a mix of convolutional layers and transformers.
The AI model achieves state-of-the-art performance, outperforming specialized tools on 22 of 24 sequence evaluations. Unlike previous models that could handle long sequences but weren’t sensitive to single-letter changes in those sequences, AlphaGenome maintains this level of precision across the long DNA stretches.
“It’s a milestone for the field,” said Dr. Caleb Lareau of Memorial Sloan Kettering Cancer Center. “For the first time, we have a single model that unifies long-range context, base-level precision, and state-of-the-art performance across a whole spectrum of genomic tasks.”
AlphaGenome could help researchers pinpoint disease causes more precisely, guide the design of synthetic DNA for specific regulatory functions, and accelerate genome understanding by mapping crucial functional elements. Its applications today are in research, not personal genome prediction for individuals.
AlphaGenome launches through a non-commercial API with plans for full model release and commercial licensing.
While headlines continue to focus on AI’s disruption of industries and job markets, AlphaGenome represents something more profound: AI’s potential to help us unlock enduring mysteries. This isn’t about automating spreadsheets or generating marketing copy. As we improve our understanding of what 98% of our DNA actually does, we’re not just advancing medicine or research. We’re fundamentally expanding our comprehension of life. Google DeepMind’s announcement this week reminded us that AI’s most transformative power may not lie in reshaping how we work, but in revealing who we are.

In case you missed it, AI's impact on workforce reductions arrived in force last week. An open letter from Andy Jassy declared that future layoffs will re-shape Amazon's workforce and re-tool skill sets of human workers to better complement AI productivity. At Microsoft, this re-shape has already begun as the co-pilot is taking over controls of one of the world's largest companies. According to the World Economic Fund, 41% of employers worldwide intend to reduce their workforce because of AI in the next five years across industries, so these latest tech moves are expected if not perhaps arriving earlier than some foresaw. This bullish perspective is rampant in boardrooms despite the less than stellar productivity gains realized by generative AI to date. According to the Federal Reserve, gen AI has increased US productivity by an anemic 1.1% in an economy that is arguably ahead of the adoption curve. And while function level reviews show more rapid advancement, most notably a 14% increase in productivity in customer support, with a notable 35% increase in entry-level roles within the field, according to the National Bureau of Economic Research, other fields are falling...flat, leaving many to query if gen AI, at least in the 2025 era, is CEO weak sauce for workforce reduction actions.
Enter Intel, who Friday announced that they would outsource their marketing functions to Accenture to run with, yes, you guessed it, generative AI. The news caught wildfire across the marketing landscape for its bold prognostication that the marketing practice is ready for handoff holistically. It caught my attention because I know the size, scale, and complexity of the historic Intel marketing engine well, having overseen the global data center and edge marketing practices at the company. While no one is going to argue that Intel's marketing is anything but a shadow of what it once was, the company did sell over 250 million estimated CPUs in 2024 and is still a global presence operating manufacturing sites across the world. They engage in many market segments with a complex partner ecosystem across hardware, software, and service providers. Even with a falling market position, the marketing of microprocessors is not for the faint of heart, with complex value propositions, better together narratives, and sales and support motions to deliver.
So, what to make of this move? Ironically, this week, TechArena started unpacking our Who's Who in the AI Zoo article series, discussing the current state of classes of AI toolsets from a marketing perspective. I explained in my LinkedIn post introducing the initial article that we've leaned into gen AI for everything we do as a marketing organization since day one. And while our productivity is amazing with these tools, we are still very much in an era where AI is an accelerant to marketing delivery akin to starting your day with a jumbo-sized Red Bull. I've been delighted by how this has especially supported less tenured staff and how it's opened the door for more efficient delivery of services to our clients.
Where are the gaps? Having worked in engineering cultures for the bulk of my career, I'm very aware of non-marketing professionals thinking that marketing is simply easy. After all, it's just words and pictures, right? While the landscape is changing every day, savvy human marketers are still uniquely poised to craft strategy, to choose messaging and narrative, and to nurture relationships across complex industries. And while Intel and Accenture may very well shock the world with outstanding marketing results, I'm viewing this latest move as yet another signal that Intel is waving a white flag for any quick return to industry leadership, and the marketing move is a rounding error towards Lip-Bu Tan's targeted 20%+ layoffs at the company.
The debate over the future of work will only heat up in the coming months as more companies make bold bets on gen AI integration into the workforce, and, at TechArena, we will be diving deep into the conversation across business functions and, of course, the leading edge of tech innovation.

Today, AI-native data platform company WEKA launched NeuralMesh, a software-defined storage system designed to power AI applications requiring real-time responses. The mesh-based architecture delivers performance for AI workloads and scales from petabytes to exabytes while becoming more resilient as it grows.
WEKA built NeuralMesh specifically for enterprise AI and agentic AI systems that demand data access in microseconds, not milliseconds. It offers a fully containerized, mesh-based architecture to seamlessly connect data, storage, compute, and AI services.
“AI innovation continues to evolve at a blistering pace,” says Liran Zvibel, WEKA’s cofounder and CEO. “Across our customer base, we are seeing petascale customer environments growing to exabyte scale at an incomprehensible rate. The future is exascale. Regardless of where you are in your AI journey today, your data architecture must be able to adapt and scale to support this inevitability or risk falling behind.”
Traditional storage architectures slow down and become fragile as AI environments expand, forcing organizations to add costly compute and memory resources to keep up with the demands of their AI workloads. In contrast, NeuralMesh’s software-defined microservices-based architecture allows its performance to improve rather than degrade as data volumes reach exabyte scale. The company says it is the “world’s only intelligent, adaptive storage system” purpose-built for accelerating graphical processing units, tensor processing units, and AI workloads.

WEKA highlights five key breakthrough capabilities delivered by NeuralMesh:
The solution offers benefits for AI companies looking to train models faster and deploy agents that respond quickly, hypserscale and neocloud service providers looking to expand service to more customers without also expanding infrastructure, and for enterprises that need to deploy and scale AI-ready infrastructure without overwhelming complexity.
NeuralMesh represents over a decade of development supported by more than 140 patents. WEKA hails it as a “revolutionary leap” in its data infrastructure solutions, which began with offering a parallel file system for high-performance computing and machine to and evolved to pioneering the high-performance AI data platform category in 2021.
NeuralMesh launches in limited release for enterprise and large-scale AI deployments, with general availability planned for fall 2025.
Following closely on last week’s announcement between WEKA and Nebius to deliver a GPU-as-a-Service (GPUaaS) platform, this news further demonstrates how WEKA’s commitment to innovation is solidifying its position as the high-performance storage layer for AI.
The scale here, with microsecond response times and exabyte-scale capability, is remarkable—and necessary. NeuralMesh positions enterprises to harness the full potential of agentic AI systems where split-second decisions drive competitive advantage.
With its existing product, customers have already claimed impressive results. A case study with Stability AI highlighted that by using WEKA, Stability AI achieved 93% GPU utilization during AI model training and increased their cloud storage capacity while reducing costs by 80%. We’ll be watching to see the results NeuralMesh can drive as a technological leap forward.

Antillion is a UK-based technology company focused on edge outcomes that brings together diverse talent to deliver innovative, user-centric solutions through collaborative partnerships with customers.
The company emphasizes research and development, maintaining cutting-edge capabilities at their London-area facility equipped with CNC machines, 3D printers, and quality assurance systems, while their team of engineers and system designers create visually stunning, functional products that blend modern design with advanced technology. Antillion's mission centers on simplifying complexity and addressing genuine user needs through products that provide immediate positive experiences, supported by comprehensive training, expert on-site assistance, and a commitment to exceptional craftsmanship that drives both trust and user satisfaction.
We caught up with Alistair Bradbrook, Antillion's founder and COO, to learn how they utilize storage technologies to bring tactical data center performance to the most challenging edge environments.
At Antillion, we build for customers operating where real-world demands push technology to its limits. These are environments that are harsh, unpredictable, and often disconnected — where you can’t count on infrastructure, power, or even a network signal.
The challenges? They’re complex and relentless.
First, there’s the environment itself. We’re talking about kit that needs to survive inside armoured vehicles filled with dust or bolted to poles in sub-zero Arctic winds. These aren’t lab conditions — this is real-world brutality where traditional IT just can’t cope.
Then there’s SWaP – size, weight, and power. We design platforms that can be carried in a rucksack or strapped inside a vehicle. Performance is still expected — but within incredibly tight physical and power constraints. That’s a huge design challenge.
Connectivity’s another big one. Our customers often operate in degraded or disconnected networks. There’s no cloud to rely on. So, everything — from data ingestion to processing and decision-making — needs to happen at the point of contact. Locally. Instantly.
The operational tempo is relentless too. When you’re in a mission-critical situation, latency isn’t an inconvenience — it’s a risk. Insight has to be real-time. You’re talking about sensor data flowing into compute, through analysis, and into an actionable decision — in seconds or less.
And lastly, usability under pressure. These systems aren’t being deployed by sysadmins — it could be soldiers, field engineers, or emergency responders. There’s no time for training manuals. It just needs to work — fast, reliably, and intuitively. That’s what we focus on delivering.
We’ve designed our PACE platforms from the ground up to bring data centre-grade capability to the edge — without compromising on performance, durability, or usability.
For us, it always starts with form and function. We don’t design for static data centre environments — we design for rooftops, vehicle interiors, trenches, and backpacks. Half-width, short-depth, modular hardware that’s built to fit real-world environments. That’s the basis of the PACE design language.
On the compute side, we’re integrating serious horsepower. We’re talking AMD EPYC, Intel Xeon, and even ARM — plus accelerators from NVIDIA and others — packed into ruggedized, sealed, and IP-rated platforms that can be mounted in a vehicle or carried by hand. These systems aren’t just tough — they’re powerful.
Storage is a big part of the picture, too. Solidigm’s EDSFF SSDs have been a game-changer for us. We now offer systems that support the 122TB D5-P5336 from Solidigm — and they’re holding up brilliantly in high-vibration, high-temperature scenarios. Whether it’s a wearable system or something mounted, we can keep massive volumes of data local, reliable, and fast.
Then there’s the design philosophy. We obsess over usability — intuitive deployment, straightforward servicing, and no-nonsense operation. These systems have to work for people in the field, under pressure. The goal is always the same: the tech disappears, the mission stays front and centre.
Whether it’s the ultra-portable PACE AIR or the fully ruggedized PACE FRONTIER, we’re not just making edge compute possible — we’re making it powerful, deployable, and trusted in the toughest environments.
It’s hard to overstate the impact high-density SSDs — especially Solidigm’s 15.36TB, E1.S and 122TB, E1.L — have had on what we can achieve at the edge.
Before, storage was a compromise. If you wanted a compact system, you had to accept limited capacity. Not anymore. Now, even our smallest PACE units — like the A211 — can handle massive mission datasets: multi-stream 4K ISR, full platform telemetry, and raw AI training data. And they can do it right where the data’s generated.
The NVMe performance is a huge enabler. We’re not waiting for data to move — we’re running real-time analytics, AI inference, and sensor fusion right there in the field. That’s crucial when you’re working in denied or degraded networks where the cloud just isn’t an option.
The efficiency is game-changing too — more capacity per watt, per millimetre, and per kilogram. That means smaller, lighter platforms that don’t sacrifice on performance or endurance — exactly what’s needed in vehicle, wearable, or airborne deployments.
From a reliability standpoint, Solidigm’s been rock solid. Across hundreds of deployed drives in some truly hostile environments, we’ve seen zero failures. That kind of trust is critical in military and security deployments — we don’t get second chances in the field.
Fewer drives mean fewer cables, faster builds, and simpler logistics. For our customers, that translates directly into reduced operational burden, easier maintenance, and faster time to deployment.
To put it simply: these drives are what let us bring data centre-class storage to the edge — and make it rugged, mobile, and mission-ready.
We don’t build traditional data centres — our philosophy is all about disaggregation and decentralisation. We’re taking compute out into the world, wherever the mission demands it. High-capacity SSDs are critical to making that model both sustainable and efficient.
For starters, SSDs give us a much better performance-per-watt ratio than spinning discs. That means more compute and more storage for less power — which is essential when you're relying on batteries or field generators. In remote, mobile deployments, every watt counts.
They also run cooler. That sounds simple, but it makes a huge difference in our sealed, rugged systems like PACE Frontier. We don’t have the luxury of big fans or data centre HVAC — SSDs let us keep things thermally efficient without adding complexity or extra energy overhead.
Another big factor is data movement — or rather, the lack of it. Because we can store and process petabytes locally on the edge, we’re not constantly pushing data back to central infrastructure. That dramatically reduces energy consumption, especially across constrained or expensive networks.
There’s also the sustainability of the platforms themselves. SSD durability helps extend hardware life. Combine that with our Evergreen program — where we upgrade and refresh existing systems instead of replacing them — and you’re looking at a far longer lifecycle. That means less waste, fewer shipments, and a smaller overall footprint.
It’s not just about energy efficiency — it’s about operational sustainability. We’re building systems that last longer, use less, and deliver more — wherever they’re deployed.
QLC drives have been a game-changer for what our customers can achieve in the field. They’ve opened up entirely new mission profiles — especially for defense, security, and industrial applications — by enabling us to deliver massive storage and lightning-fast performance in incredibly compact, rugged formats.
We’re now running AI and analytics right on the edge, on systems the size of a lunchbox. Clients are deploying models for things like object detection, anomaly spotting, and pattern recognition — and doing it in real time, exactly where the data’s being generated. There’s no need to wait for upload or connectivity — the insight happens there and then.
That’s crucial in DIL environments (Disconnected, Intermittent, or Limited networks). With these QLC drives, the data stays local and accessible even when comms are down. It’s not just about speed; it’s about continuity and control. For our customers, that kind of autonomy is often mission-critical.
What’s more, QLC drive density means we can scale up without scaling out. Using Solidigm’s E1.S or E1.L modules, our customers can multiply their storage without changing the physical footprint — same chassis, same power draw, just more capability. That’s especially important when size and weight are tightly constrained.
This tech also helps us move faster. Build and provisioning times are down by up to 30%, which gets systems into the field quicker. In operational terms, that can be the difference between acting now and reacting too late.
And perhaps most exciting — we’re enabling entirely new types of missions. Cybersecurity at the edge, autonomous platforms, predictive maintenance using AI — these just weren’t feasible before. Now they are, thanks to the performance and resilience these QLC drives bring.
As we expand the PACE portfolio with the latest high-core-count processors, greater memory, and high-capacity Solidigm storage, we’re developing more powerful and mission-specific platforms for the far edge. Each one is true to our design-first ethos and built to deliver more compute, more capability, and more outcomes wherever the mission takes them.

We sat down with David Lim, senior director of marketing for Hypertec, to learn more about infrastructure that is purpose-built for AI and HPC workloads.
Founded in 1984, Hypertec is an award-winning global technology provider offering a wide range of cutting-edge products and services with a strong emphasis on sustainability. Trusted by industry leaders, Hypertec serves clients in over 80 countries worldwide. The company has earned international recognition for its sustainability leadership and innovative manufacturing practices.
At the beginning of this collaboration is a pretty simple idea: the demands on data centers are growing fast, from AI training and real-time analytics to ultra-low latency use cases like streaming and remote surgery. So we teamed up with Solidigm to show how you can handle that kind of pressure with infrastructure that’s built for it.
We're bringing our immersion-born TRIDENT servers, which are designed from day one to run submerged in liquid for better cooling and higher density. Paired with Solidigm’s SSDs, we’re showing what it looks like when compute and data access move faster, run cooler, and scale smarter. whether it’s a massive AI cluster or it’s deploying compute in a hospital or telecom edge site.
The magic really happens when you combine immersion cooling with fast, reliable storage. Immersion lets us push performance limits. We’re running CPUs and GPUs at peak power without worrying about throttling or overheating. That’s critical for workloads that don’t stop like training large AI models or running inference in real time at the edge.
Now, add Solidigm’s SSDs, and you’ve got the speed to feed those compute engines. Whether it's a 4K video being streamed from a CDN node, or a radiology scan being pulled up instantly in a hospital, fast I/O makes all the difference. The system doesn’t just run, it flies, and it keeps doing so consistently under load.
Data centers are under immense pressure as AI and high-performance computing (HPC) workloads grow exponentially. These applications demand significant computational power, leading to increased energy consumption and heat generation. Traditional air-cooling methods are struggling to keep up, especially as power densities rise. For instance, average power densities have more than doubled in just two years, reaching 17 kilowatts (kW) per rack, and are expected to rise to as high as 30 kW by 2027.
Moreover, the massive data volumes processed by AI applications require storage solutions that can handle high throughput with low latency. Traditional storage systems often become bottlenecks, hindering overall system performance. Additionally, the increasing energy consumption raises concerns about the environmental impact of data centers. Projections suggest that data centers could consume up to 9% of the United States' electricity by 2030, more than twice their current usage.
To address these challenges, Hypertec and Solidigm have collaborated to develop integrated solutions. The more efficient heat dissipation allows higher power densities, enabling more compute resources in the same physical space while reducing reliance on traditional air-cooling systems. Solidigm's SSDs are designed for high throughput and low latency, addressing data bottlenecks in AI applications. Their high-capacity SSDs enable data centers to reduce the number of physical drives, decrease footprint, reduce power consumption, and simplify maintenance. Together, these technologies offer a scalable, energy-efficient, and high-performance infrastructure solution tailored for the demands of modern AI and HPC workloads.
It’s one thing to build fast infrastructure, it’s another to build smart, efficient infrastructure. That’s where we’re focused. Immersion cooling is incredibly efficient by removing air-cooling, we cut out a huge portion of the power bill. And Solidigm’s SSDs pull their weight too, with lower power draw and high capacity, so we can do more with fewer drives.
The result? You get the performance you’re looking for with lower carbon cost. And for customers in healthcare, finance, telecom who are all under pressure to hit sustainability goals this isn’t a nice-to-have: it’s table stakes.
This is just the beginning. We’re already aligning today around what’s next: support for Gen5 and CXL, AI at the edge, liquid-cooled storage all the building blocks of future-ready infrastructure.
Think about what’s coming: AI models that run in real time on the edge of a 5G network. Robotic surgeries assisted by AI, where latency is measured in milliseconds. High-frequency trading platforms that need zero delay. These are not sci-fi anymore, they’re live today. And we’re building the compute backbone that makes them possible, scalable, and sustainable.

With over $500 billion forecasted in the AI accelerator market, all eyes were focused in San Jose this week to hear from Lisa Su and her team on how AMD was progressing on its strategy to take the leadership mantle for AI infrastructure delivery.
The theme of the day? Open Innovation. With a keynote filled with heady announcements, the throughline weaved across discussions showcased that AMD fully intends to be both the partner and customer choice for collaborative innovation as the world’s data centers pivot to broadscale deployments of generative and agentic AI at scale. And while many feel that competitive NVIDIA platforms have a stronghold grip on the market, one thing I kept thinking about as Lisa discussed the commanding progress of her team is that she knows more than a little about relentless pursuit of a Goliath and is pretty good at playing the giant slayer. Let’s break down the news of the day.
Data center-scale performance will define success with AI system proliferation, and AMD is the only company offering a full suite of CPU, GPU, and network solutions needed to deliver AI clusters. Today, AMD delivered its Instinct MI355 to the market, the 4th generation architecture built on 3nm process and packing 185 billion transistors. What does that mean for real-world performance? With up to 35X gen-over-gen performance, AMD has closed a lot of the gap vs. NVIDIA B200, delivering an average of 3.5X performance uplift across training and inference. Some examples of competitive head-to-head metrics showed performance parity with different Llama configurations running pre-training workloads and 1.1X+ performance in equivalent fine-tuning environments. While it’s too early and frankly naïve to declare AMD the performance winner, these gains certainly will open doors to deeper collaboration opportunities with customers with this generation of products.
Deeper customer collaborations are essential for the enormous sunshine of what comes next: Helios. In 2026, AMD will unify their entire silicon portfolio to deliver Helios, an AI-optimized rack-scale system that integrates next-generation Venice EPYC processors, next-generation Instinct MI400 GPUs, and next-generation Pensando Vulcano DPUs, tapping UAL and Ultra Ethernet connectivity. This behemoth will be delivered as an OCP-compliant, open-standards based solution, which is expected to turn a lot of heads given claimed performance parity with NVIDIA Vera Rubin across GPU domain, scale-up bandwidth, FP4/FP8 FLOPS, and a 1.5x advantage in HBM4 memory capacity, memory bandwidth, and scale-out networking. At the heart of this configuration, of course, is the upcoming Instinct MI400 GPU, which did a bit of stage stealing from the MI355 introduction today as Lisa claimed a targeted 10X performance improvement gen over gen. Even in the era of 2x Moore’s Law progression, this is a stunning performance improvement target that places the competition on its heels.
Powering the AI software stack behind Helios is ROCm, AMD’s open compute platform, which provides the common programming foundation across the Instinct GPU lineup. With support for major AI frameworks and optimized libraries, ROCm enables portability, scale, and high performance across AMD hardware – cementing AMD’s strategy of combining open software with open systems to meet enterprise and hyperscaler needs.
While the Instinct MI355 introduction and promise of Helios commanded attention, the message of open innovation was delivered in every word from AMD and partner executives, reflecting a strong customer desire for choice in AI solution alternatives. This open innovation starts with advancement of the ROCm software platform, AMD’s alternative to CUDA with ROCm 7 introduced today. Historically, software has not been AMD’s strong suit, and it’s obvious that the company is investing to change this through internal development and acquisition of talent including a high-profile addition of Lamini, who was on hand to share how a full suite of developer training will soon be delivered to accelerate developer activation of the software in AI development.
Open was also supported through AMD’s strong commitment to the Open Compute Project Foundation and leadership within the Universal Accelerator Link (UAL) and Ultra Ethernet standards efforts, with Ultra Ethernet hitting 1.0 this week. This commitment to standards-based networking and significant keynote time delivered to standards-based advancement underscored the importance of network scale to AI compute delivery as well as a key differentiator from competition who relies on proprietary solutions. It was fantastic to see the industry support of Astera Labs and Marvell highlighted as leading examples of a vibrant networking industry assembling to deliver the next generation of AI data center connectivity, and I expect to see a lot more about solution delivery from a host of vendors in the coming year as standards mature and customers begin deploying these standards based solutions.
AMD placed a central spotlight on developers in attendance and its developer sessions as it continues to advance ROCm. To help tell this story, leaders from (FILL IN) emerged to share the progress of advancing AI together. This demonstrated that real developer cycles are being spent today on optimizing on AMD Instinct GPU-based infrastructure, and there is a groundswell of activation in this space. With the announcement of a new developer cloud and free access to all developer attendees, AMD is providing the access and tools required to support community advancement. While CUDA has a tremendous lead in this space, AMD is taking the right steps to at least get developers to take their tools and platforms for a test drive, and while doing so gain traction with developer loyalty.
With AI becoming a central aspect of geo-political policy, and as nation states race for AI sovereignty and supremacy, Lisa shared how important AI advancement for all was to realize the true vision of this historic technology. Tariq Amin, CEO of Humain, a leading AI operator in Saudi Arabia, came onstage to share his vision for AI advancement in the kingdom, noting that Saudi Arabia was a young nation full of innovation potential. The collaboration announced earlier this spring will bring AI platforms featuring AMD Instinct GPUs, EPYC CPUs, Pensando DPUs, RyzenAI and ROCm software to Humain data centers to deliver 500 megawatts of compute capacity over the next 5 years. This reflects a joint commitment to democratize AI access and foster innovation through scalable compute.
One thing that was striking about this event was the customer-centricity in everything that the AMD team designed. Through walk-ons with OpenAI CEO Sam Altman, Meta, Oracle Cloud, Cohere and more, Lisa and team continually discussed how deep collaborations with customers directly fuel design targets for AMD silicon advancement. This laser focus played out in the progress of collaboration advancement from science project sized deployments of first gen Instinct GPU clusters to scale deployments running significant customer workloads today. It also reflects in the open innovation focus with a notion that many voices, centered on customer requirements, will define ultimate AI advancement – and underscores the danger of so much global innovation in the hands of a single company.
Altman's appearance was particularly notable as he announced that OpenAI will be using AMD's upcoming MI400 chips, telling the audience, “It's gonna be an amazing thing” – a significant endorsement that lends considerable credibility to AMD's ambitious 10x performance improvement targets for the MI400.
So what’s the TechArena take? While NVIDIA certainly holds pole position on both market share and AI zeitgeist, the industry is collectively hungry for alternatives and worried that a single vendor will squeeze industry opportunity from this important moment. Frankly, the TAM for AI accelerators is too vast not to expect competitive alternatives to earn a segment of upcoming deployments, and AMD has put together the portfolio and partnerships to be that leading competitor. What’s striking to me is the advancement that the company is delivering gen-over-gen to bridge the gap and deliver a credible alternative to market. Investments and leadership in UAL and Ultra Ethernet will pay off as InfiniBand, a proprietary solution that was developed decades ago, is showing some tarnish.
I want to see more from ROCm software advancement and developer traction to get true solution parity with CUDA-fueled solutions, and AMD has work to deepen relationships with this essential element of the ecosystem. But even given that, the advancement is truly eye-opening, welcome, and inspiring us to anticipate what comes next. Well done, AMD.

Functional safety, as described by the ISO 26262 specification, and covered in multiple blog posts, is something that has only recently been supported in low-power double data rate (LPDDR) 5 automotive memory. It is probably safe to assume that going forward, functional safety will also be supported for storage devices. Make no mistake, both memory and storage devices are complex devices and have many different safety elements. Without safety mechanisms to detect and flag a failure in these elements, unpredictable and perhaps catastrophic results can occur.
A simple example of an otherwise undetectable error would be the case of a failed address decoder. While the host device believes that it is either reading from or writing to a specific memory location, a failed address decoder can lead to extreme data corruption, resulting in unpredictable system-level behavior because of writing or retrieving data from the wrong location. A safety mechanism that can detect an addressing failure and provide a failure flag allows for the system to take the appropriate action, ranging from disengaging the advanced driver assistance system (ADAS) to deliberately crippling the vehicle. The point, again, is that the adoption of state-of-the-art technologies is being driven by the automotive industry.
A relative newcomer to the memory market, HBM (high bandwidth memory), is also finding its way into the automobile as multi-modal generative AI is actively being employed to implement context-aware navigation. This class of navigation takes ADAS well beyond recognizing a street sign, pedestrian, cyclist, etc., while addressing basics such as lane keeping or auto emergency braking.
Context-aware navigation relies upon a class of neural networks referred to as large language models (LLM), which demand extreme levels of compute performance. Ultimately, through the real-time understanding of the environment, more intelligent driving decisions and behaviors can be made, mimicking those of the human driver. Examples of this include pulling over to the side of the road when an emergency vehicle is approaching with lights and siren engaged or cautiously entering an intersection or roadway when there is a lot of traffic, pedestrian or otherwise. When I was first learning how to drive, this was referred to as “defensive driving,” which basically is all about anticipating how a scenario might unfold and either acting or being prepared to act in case that’s what’s required to avoid an accident.
Multi-modal generative AI refers to the fact that, in addition to supporting text data sets, the LLM can also support other input data sources – most notably, video and even audio. Currently LLMs are all the rage as they can predict the next word in a sentence with reasonable degrees of accuracy – a concept that we are all becoming familiar with as AI continues to expand its reach into just about every facet of our lives. (As I write this, Microsoft Word is trying to predict which words I am going to type – nothing like someone telling you how to think!)
Multi-modal generative AI, when applied to ADAS, can predict possible scenarios and act accordingly – replicating the equivalent of “defensive driving.” Equally as important, ADAS that employs generative AI communicates directly to the driver exactly which actions are going to be taken and the rationale for those actions. This extended communication leads to increased driver and passenger confidence in the operation of the ADAS system.
With an appreciation of what multi-modal generative AI brings to the table, it should become apparent that there is an extreme amount of compute performance required to address context-aware navigation. For the past several decades, the bottleneck for compute performance has been, and continues to be, memory bandwidth, not the performance of the CPU or the AI offload engine. This has given relatively recent rise to the introduction of HBM, which is an “in-package” memory solution. “In package” means that these devices are not available in discrete packages and need to be tightly integrated into a common single package alongside the AI or CPU compute engine.
The latest-generation HBM 3E offers up to 1 terabyte per second of memory bandwidth that is derived from 1024 I/O pins that operate at multi-gigabit signaling rates. Generative AI and LLMs are hot, driving oversubscribed, insatiable demand for HBM to the point where it has been publicly stated that all HBM capacity has been sold out through 2025. Here again, the use of LLMs in the auto is driving the use of state-of-the-art, oversubscribed, HBM.
Not only has the automotive industry progressed to the point where it is now seen as the main driver of memory and storage, but the importance of these technologies has also moved from being in the back seat of the car to the front seat in terms of their importance in realizing the vehicle of today and the future.

A recurring theme I have raised over the past dozen blogs is that the automobile has transformed considerably over the course of the past few decades, from employing mature semiconductor technologies to the current state where the automotive market not only employs state-of-the-art technologies, but now drives development for semiconductor technologies.
This point is well illustrated by the memory and storage industry. Historically, the memory industry recognized the personal computer, then networking and communications applications, and finally smart phones as driving innovation. However, at last year’s JEDEC meeting (a forum to ensure alignment regarding memory standards to guarantee interoperability), representatives from the major memory and storage companies agreed that automotive applications have taken a front seat in terms of importance as a lead technology driver.
It’s no wonder why: today’s automobile employs leading-edge semiconductor devices and electrical/electronic (E/E) architectures, with some of the highest compute performance across the board. And it does so in an environment that is more stringent and unforgiving than that of the data center or smartphone. For any semiconductor device to be given practical consideration in an automotive application, there is a litany of quality specifications (AEC Q100, TS 16949, etc.) that must be supported in addition to guaranteed operation at extended temperatures that range from -40° C up to 125° C depending on the physical location of the semiconductor device. Memory and storage devices are no exception to these requirements.
Automotive is affecting all memory and storage types. In memory, the influence ranges from state-of-the-art HBM (high bandwidth memory), which is heavily embraced in the data center for generative AI, to low-power double data rate (LPDDR) and double data rate (DDR) memories; in storage devices, device types affected include universal flash storage (UFS) and solid-state drives (SSDs). These acronyms reflect the potpourri of different memory and storage technologies in the market: all of them can be found in today’s or tomorrow’s automobile. I’ll discuss many examples below and in a second blog post shortly to follow.
As the automotive industry accelerates towards the broad deployment of the software defined vehicle (SDV), there is an explosive growth in the number of lines of software used to control the vehicle. Today’s high-end vehicle contains well over 100 million lines of code, which is expected to grow to 1 billion lines of code by 2030. This alone is driving large storage requirements in the vehicle. Writing to semiconductor-based storage (flash) is a destructive process with a limited number of write cycles before a given re-written region of storage needs to be retired. The technique of tracking total storage write cycles and reallocating the written data to “fresh storage” is referred to as wear leveling.
Wear leveling ensures that the stored data is not lost due to writing to a storage cell that no longer supports data retention. The expired storage cells are retired and managed via an internal map created by the internal storage controller that tracks available vs. retired storage locations based on tracking write cycles to a given cell. With such a large code footprint, reaching up to 1 billion lines of code, it is not unreasonable to expect very frequent over the air (OTA) updates of that code over the 10-year lifespan of the vehicle. This leads to the need for a significantly larger storage footprint to accommodate regular updates or write cycles given the fixed number of write cycles that flash storage can support. Additionally, there is a need for at least twice the required storage to accommodate roll-back in case an OTA update was unsuccessful.
Suffice to say, security is of paramount importance, leading to the introduction of the ISO 21434 standard, which was expressly defined to address cyber security for automotive applications. This standard focuses on design methodologies used to design, test, and verify the design of memory and storage to guard against malicious attack. SDVs are designed expressly with OTA in mind, so state-of-the art security is paramount. An emerging branch of cybersecurity referred to as post-quantum computing (PQC) threatens to disrupt the current security landscape in just a few short years. This too will need to be given careful consideration to avoid malicious attacks and will again require state-of-the-art technologies in the vehicle down the road.
Continuing to look at the trajectory of automotive semiconductor storage technologies, because the SDV is driving a centralized architecture, the aggregation of all storage in one singular location leads to a more efficient means to support OTA updates in addition to increased overall board area and efficiency. An example of such efficiency that can be realized through centralized storage is illustrated by sharing such maps between advanced driver assistance systems (ADAS) and in-vehicle infotainment (IVI). ADAS uses map data to navigate the roadways to get to the desired location, whereas the IVI system uses map data to display to the driver and passenger exactly where the vehicle is on the given roadway or to enter the desired location.
Centralized storage devices such as single root I/O virtualized (SR-IOV) SSDs allow for access to a common storage area from multiple different sources while providing hardware isolation, preventing user-downloaded applications from affecting mission critical code. Here again, SR-IOV represents some of the state of the art in storage technologies that are being employed in today’s automobile.
In part two, I’ll discuss how two more use cases—functional safety and context-aware navigation powered by multi-modal generative AI—are driving further innovation in memory and storage technologies.

The infrastructure to support AI workloads is evolving as rapidly as AI workloads are growing.
In a strategic partnership announced today, WEKA and Nebius are meeting that challenge head-on – delivering a GPU-as-a-Service (GPUaaS) platform that brings ultra-high performance, scalability, and simplicity to the AI infrastructure market.
The solution integrates WEKA’s AI-native data platform with Nebius’ full-stack AI cloud, offering customers an infrastructure backbone purpose-built to handle the unique demands of AI model training and inference at scale. The partnership gives insight into how the next generation of AI cloud infrastructure will look.
Enterprises training cutting-edge models often face infrastructure constraints in four areas: compute, memory, storage, and data management. These friction points stall innovation and increase time to value. The WEKA-Nebius collaboration addresses these limitations by delivering a cloud-native solution with microsecond latency, high throughput, and seamless scalability from petabytes to exabytes of data.
At the heart of the solution is Nebius’ GPU-rich AI Cloud, a purpose-built platform designed from the ground up for AI/ML workloads. Nebius blends proprietary cloud software, in-house hardware design, and developer-first tooling to deliver a streamlined environment for model builders, from startups to research institutions.
To fuel its premium tier, Nebius selected WEKA’s data platform, citing its consistent performance across mixed I/O workloads, robust metadata handling, and multitenancy capabilities – must-haves for large-scale AI environments.
“WEKA exceeded every expectation and requirement we had,” said Danila Shtan, CTO at Nebius. “It delivers outstanding throughput, IOPS, and low latency while managing mixed read/write workloads at scale.”
One of the first deployments of this integrated solution is already in action at a leading research institution. The organization selected Nebius to power its large-scale experimentation and AI model development and brought in WEKA to meet storage performance and manageability needs. The result? A multi-thousand-GPU cluster backed by 2PB of WEKA storage – delivering a fully managed, high-performance environment tailored for rigorous AI research.
Key features like user and directory quotas were critical in customizing the platform to the institution’s operational demands. And by pairing Nebius’ scalable compute with WEKA’s ultra-fast storage layer, the deployment ensures minimal bottlenecks and maximum utilization, accelerating time to insights.
This partnership exemplifies a key trend in enterprise AI: the rise of neoclouds. Unlike general-purpose hyperscalers, neocloud providers like Nebius offer tailored platforms for AI development, focusing on performance, control, and flexibility. These environments are quickly becoming the go-to solution for enterprises that want to move fast without compromising on power.
Meanwhile, WEKA continues to cement its position as the high-performance storage layer for AI, enabling faster training and smarter infrastructure utilization. In environments where every millisecond counts, the ability to reduce latency, improve GPU utilization, and eliminate data silos can be the difference between leadership and lag.
“Together, Nebius and WEKA are redefining what's possible when high-performance storage meets AI-first infrastructure,” said Liran Zvibel, WEKA CEO. “It’s a unified solution that is a catalyst for enterprise AI and agentic AI innovation.”
The WEKA-Nebius solution is a compelling model for what’s next: AI-native infrastructure as a service, where every layer of the stack is designed to accelerate AI.
Learn how Nebius and WEKA are powering next-gen AI infrastructure.