The Ultimate Guide to AI Infrastructure
A curated Kiwi edition of TechDay news, analysis, interviews, reviews, job moves, and related resources for AI Infrastructure.
What to know about AI Infrastructure
AI Infrastructure explores the hardware, software, and systems that make modern artificial intelligence possible. This tag covers everything from compute and storage architectures to networking, data pipelines, and observability stacks that keep AI workloads reliable and efficient.
Stories here dig into practical questions: how to design scalable training and inference clusters, choose between GPUs and emerging accelerators, manage feature stores, and orchestrate distributed workloads. You’ll find discussions of MLOps practices, cost optimization, performance tuning, and the trade-offs behind different infrastructure patterns.
Whether you’re building a new AI platform or evolving an existing stack, this tag helps you understand the components, constraints, and design decisions that sit underneath AI products. Reading these pieces will give you concrete examples, architectural patterns, and lessons learned that you can apply to your own systems.
Kiwi AI Infrastructure News
Regional stories with direct local relevance
AI could drive advances that solve the problems it brings, computer scientist suggests
Higher electricity demand from artificial intelligence could be eased if AI speeds up the development of more efficient solar panels, batteries and chips.
NIWA partners with VAST Data to upgrade climate data systems
The National Institute of Water and Atmospheric Research in New Zealand has chosen VAST Data's platform to enhance its climate data management capabilities.
Analyst Insights
Research and market analysis connected to AI Infrastructure
New AI era defined by agents, rising costs and maturity gaps
The path to autonomous operations: Why observability is the reliability layer for AI
RAMaggedon: Why the memory crisis is a digital inclusion crisis
AI drives data centre power demand surge in Australia
Rafay & Argentum AI strike software orchestration deal
Featured News
Expert Columns
Interviews
Interviews and video coverage from the network
Recent AI Infrastructure News
AMD chips power 191 supercomputers as rankings shift
Energy-efficient computing is tilting towards AMD, which now powers 191 ranked systems and four of the world's 10 fastest supercomputers.
F5 & Equinix join forces on enterprise AI security
The tie-up gives enterprises a single policy layer to curb data leaks and compliance risks as AI workloads spread across clouds and models.
Envoy AI Gateway reaches 1.0 for production AI use
Enterprises can now route AI traffic with open-source governance and observability as Envoy AI Gateway reaches version 1.0.
CMC Invest launches AI tool for portfolio insights
Retail investors will get ranked, source-cited insights on holdings across shares, ETFs and crypto as CMC Invest rolls out CMC Intelligence.
Dell launches PowerEdge XE8812 for AI supercomputing
Data centres and research labs could fit larger AI models and simulations in memory, with Dell's new system scaling to 144 GPUs per rack.
Platform9 launches partner plan for VMware migrants
Cloud providers facing the end of VMware's CSP programme in 2027 can now tap migration tools and new pricing to protect margins.
IBM study finds executives struggle with AI sovereignty
Most executives lack visibility over AI suppliers and infrastructure, leaving core operations exposed to outages, compliance risks and vendor lock-in.
Cast AI integrates MiniMax M3 into Kimchi Coding agent
Developers using Kimchi can now route tasks to MiniMax M3, cutting costs and keeping code inside controlled enterprise environments.
Glean adopts Nile network service to speed AI growth
Network speeds jumped and support tickets nearly vanished after the rollout, easing pressure on a lean IT team as AI use expands.
Taboola opens DeeperDive ads to AI chatbot providers
AI chatbot firms can now sell adverts against user queries, as Taboola extends DeeperDive's monetisation system beyond publishers.
Equinix & Cisco expand secure AI factory in Singapore
Singapore businesses can now deploy secure AI systems in private data centres, easing sovereignty concerns as demand rises across regulated sectors.
Cast AI adds MiniMax M3 to Kimchi Coding as default model
Businesses can now route coding jobs to a lower-cost open-weight model as Cast AI makes Kimchi Coding the first autonomous agent to offer MiniMax M3.
Databricks launches open-source Omnigent for AI agents
The open-source release gives enterprises a single control layer for fragmented AI agent tools, with governance and cost controls built in.
Cyera raises USD $600 million at USD $12 billion valuation
The funding values the cybersecurity group at USD $12 billion as enterprises race to secure data exposed to AI tools and agents.
Companies turn to EnterpriseDB for AI data control
Banks and retailers are adopting the platform as AI projects mature, with data sovereignty now shaping budgets, risk and infrastructure choices.
Linux Foundation launches DocLang group for AI documents
It aims to solve a key enterprise AI problem by standardising how software reads PDFs, Word files and images without losing layout or meaning.
Parallel Works adds AI governance & token budgeting
Rising AI costs and weaker oversight are pushing enterprises to demand tighter controls as token use spreads across clouds and in-house models.
DE-CIX chief warns orbital data centres need networks
Reliability, not raw compute, may decide whether orbital AI data centres can work, as DE-CIX says links to Earth remain the bigger hurdle.
Direct-to-chip coolants market set for rapid growth
Liquid cooling is gaining ground as AI data centres outgrow air systems, with the market forecast to hit USD $1.3 billion by 2032.
HPE expands AI factory platform with new NVIDIA integrations
Security and governance tools are being added as enterprises push agentic AI from pilots into live production systems.