
Research
AI Agents
The model is commoditising. The ontology is consolidating. The question every European CIO should now ask is: whose ontology?

Article by
Dr. Anoj Winston Gladius
·
In the same five-week window between mid-May and mid-June 2026, two developments unfolded in parallel that resolve, when you set them side by side, to a single architectural conclusion. On 13 June, Z.aireleased GLM-5.2 on Hugging Face under the MIT licence — a 753-billion-parameter Mixture-of-Experts model with a one-million-token context window that scored 62.1 on SWE-bench Pro and 81.0 on Terminal-Bench 2.1, beating GPT-5.5 and sitting within one percentage point of Claude Opus 4.8 on FrontierSWE, at roughly one-sixth the cost. The release was explicitly framed by Z.ai as a response to the US government's recall of Claude Fable 5 the previous day; the company's positioning was that open weights are the structural answer to "the geopolitical restriction of AI access." [¹] Across the same five weeks, four of the largest enterprise software vendors on the planet — SAP, Microsoft, Google, and Palantir — made the ontology and the knowledge graph their headline architectural commitment. SAP Sapphire unveiled the SAP Knowledge Graph as "the most significant technical development from the event," anchoring the new SAP Business AI Platform around a context-build-governance three-layer architecture borrowed almost verbatim from Palantir's published architecture. Microsoft Build expanded Microsoft Fabric IQ as the structured business data layer of Microsoft IQ. Google Cloud Next had earlier launched the Knowledge Catalog and Enterprise Knowledge Graph. Palantir, with a decade's head start on the Ontology, was positioned as an SAP Endorsed App for migration and a co-integration target across the new vendor stack. Four vendors. Four ontologies. Same bet. These are not two unrelated stories. They are the same story. As the model layer commoditises, the semantic layer becomes the new moat — and the four largest enterprise software vendors are racing to own it before European enterprises notice that the next vendor lock-in cycle has already started.
This is the tenth piece in a series I have been writing for neuland.ai. The thread that runs through all of them is the same: in enterprise AI, the value, the risk and the moat sit in the layer above and around the model, not in the model itself. [²] Earlier pieces have unpacked that argument across control plane and execution surfaces, model drift and Multi-LLM observability, model topology, compliance as a system property, agent security and the lethal trifecta, MCP protocol governance, the fast-follower workhorse thesis, the new enterprise data silos created by SAP and Microsoft's AI gateways, and the flexibility-as-architecture argument that crystallised after the Fable 5 recall. [²] This piece extends the argument one more layer. The next moat is the ontology. And the European enterprises that do not architect for it now will find themselves, twelve months from now, paying per-action rents on a semantic layer that encodes their own business knowledge inside a vendor's product.
The model just commoditised in public
It is worth spending a moment on what GLM-5.2 actually demonstrates, because the headline numbers can obscure the structural conclusion.
Z.ai's release is a 753-billion-parameter Mixture-of-Experts model with approximately 40 billion active parameters per token, distributed under a permissive MIT licence with no regional usage restrictions. The model ships with FP8 variants and is supported across mature open-source serving stacks including vLLM, SGLang, xLLM and Ktransformers. It introduces an architectural optimisation called IndexShare that reuses one lightweight indexer across every four sparse-attention layers, cutting per-token FLOPs by 2.9× at the full one-million-token context length while maintaining accuracy. The benchmark numbers are independently verifiable: 62.1 on SWE-bench Pro, 81.0 on Terminal-Bench 2.1, 74.4 on FrontierSWE — the strongest open-source model across all three long-horizon coding benchmarks, and within striking distance of Claude Opus 4.8 (85.0 on Terminal-Bench, ~75.0 on FrontierSWE). Hosted API pricing sits at roughly £1.10 per million input tokens, against frontier proprietary pricing at roughly six times that. [³]
The cleanest read on what this means structurally comes from looking at the time gap. Anthropic Opus 4.5 — the frontier coding-agent benchmark setter — shipped on 24 November 2025. GLM-5.2 shipped on 16 June 2026. The gap is 204 days. In the previous piece in this series on the workhorse strategy I argued for fast-follower deployment of open-weight models on the basis that the closed-open capability gap had compressed to a window most enterprise workloads could tolerate. GLM-5.2 is the empirical demonstration. The gap is now approximately seven months at the absolute frontier of coding capability, less for the broader workload distribution that actually accounts for the majority of enterprise AI traffic, and shrinking with each release cycle.
There is a caveat that European enterprises need to plan around. Z.ai's hosted API is subject to China's National Intelligence Law, which compels disclosure to the People's Republic on request — a procurement risk profile materially different from the MIT-licensed self-hosted deployment. The same model, on the customer's own infrastructure, carries none of that exposure. [⁴] This is the same architectural distinction this series has been making for nine pieces: the protocol or the weights are not the jurisdiction; the deployment is.
The structural conclusion is straightforward and the major enterprise software vendors have all reached it independently. If frontier-class capability is now downloadable under MIT licence, with mature serving stacks, on hardware in the secondary GPU market at acceptable economics, then the model layer is no longer where the vendor margin can live. The margin has to move somewhere. The vendors have decided where.
The four-vendor ontology race
Across the five-week window between SAP Sapphire and Microsoft Build, four of the largest enterprise software vendors on the planet have made the ontology — variously called Knowledge Graph, semantic layer, Foundry Ontology, or Knowledge Catalog — their central architectural commitment for the agentic era.
SAP announced the SAP Knowledge Graph at Sapphire as the foundational layer of the new SAP Business AI Platform, structured as a three-layer architecture — context, build, governance — that the platform's independent analysts noted as borrowed almost verbatim from Palantir Foundry's published architecture. The "single semantical data layer" vocabulary is directly transferred. Joule Studio 2.0 is now model-agnostic, supporting Anthropic, Cohere, Mistral and others as underlying reasoning engines — an explicit concession that SAP will not compete at the model layer. The commercial gate, critically, has shifted: SAP is no longer monetising primarily on software licensing or on the model. It is monetising on AI Units — a consumption meter applied per agent action against a budget pool, drawn down each time an agent executes inside the SAP environment. Free design-time access to Joule Studio runs through end-2026; the monetisation happens at execution, on a meter, against the ontology that the customer's business processes have been encoded into. [⁵]
Microsoft has positioned Microsoft Fabric IQ as the structured business data layer inside the broader Microsoft IQ context architecture announced at Build 2026, sitting alongside Work IQ (from M365 Signals) and the new Web IQ. Frontier Tuning, the privately previewed Microsoft offering, allows agents to learn how a customer's business operates within Microsoft's compliance boundaries. The Microsoft Agent Framework 1.0 reached general availability. All of this routes through Microsoft 365 tenants, on Microsoft Foundry, on Microsoft Azure infrastructure. [⁶]
Google entered the race at Google Cloud Next '26 with the Knowledge Catalog and Enterprise Knowledge Graph, joining Microsoft Fabric IQ and Palantir Foundry as a third major hyperscaler bet on the ontology as the central enterprise AI architecture. [⁷]
Palantir has been operating in this space for over a decade. Foundry's Ontology — independently described as "not just a schema, but a model of every real-world entity, the logic governing decisions around them, and the actions that can be triggered as a result" — was running across intelligence agencies, healthcare systems, manufacturers and supply chains while the BI industry was still perfecting LookML. Palantir AIP for migration is now an SAP Endorsed App, with an SAP Solution Extension planned for Q3 2026. Apollo, Palantir's continuous-delivery platform, runs Foundry and AIP across multi-cloud, on-premises, private network and fully air-gapped deployments — a deployment topology flexibility the cloud-native ontology competitors do not yet match. The Sapphire 2026 partnership signing means Palantir is now positioned not as a competitor to SAP's ontology, but as the migration accelerator into it. [⁸]
The technical argument all four vendors are making is genuinely correct. Large language models do not know your business. Without a knowledge layer that grounds agent reasoning in the actual entities, relationships, decision rules and permitted actions of your specific organisation, even the best model produces generic outputs that fail at the precision and accountability bar that mission-critical enterprise workflows require. This is not vendor marketing. It is the central technical insight that has driven Palantir's growth across regulated industries for a decade, that has produced the drug discoveries and clinical decision support systems that semantic-web research enabled, and that distinguishes "AI that can answer your question" from "AI that can act inside your business." [⁹]
The technical insight is correct. The architectural conclusion the vendors want you to draw from it is the part European enterprises now have to think very carefully about.
What the lock-in mechanism actually is
Here is the mechanism, in plain language, because it is going to define the next two years of enterprise AI procurement.
Every business has a finite set of entities, relationships, rules and actions that constitute how the business actually operates. Customers, suppliers, contracts, assets, employees, products, orders, inventory, exceptions, approval thresholds, audit trails — the things a serious agent needs to reason about to take action that means anything inside the company. Encoding all of that into a formal ontology is the work the AI agent layer requires to be productive. It is also expensive, slow, and customer-specific. Once it has been done — once the company's operational architecture has been embedded into a particular vendor's ontology product — the cost of moving it elsewhere is no longer measured in software switching fees. It is measured in the loss of the institutional self-knowledge that the platform has now absorbed.
The independent analysis that landed after SAP Sapphire put the dynamic most sharply: "once an enterprise embeds its operational architecture into the Ontology, the cost of ripping it out is not measured in software fees. It is measured in the loss of the company's own self-knowledge encoded in the platform." [¹⁰]
This is structurally a more profound lock-in than the SaaS-era one. The SaaS lock-in was about the data the customer put into the vendor's database. The ontology lock-in is about the meaning the customer's organisation has built up over years — the semantic relationships, the decision logic, the rules and exceptions and edge cases that constitute institutional knowledge. The vendor that owns the ontology in which all of that has been encoded becomes structurally embedded in a way that no rip-and-replace migration can clean. The monetisation gate sits on top: SAP's AI Units per agent action; Microsoft's tenant-bound metering through Agent 365; Palantir's per-execution consumption; ServiceNow's Action Fabric tax per outside agent. The model is free or model-agnostic. The ontology is the meter.
And the jurisdictional dimension is the same one this series has been writing about for ten pieces. SAP's Knowledge Graph lives inside SAP's Business AI Platform on SAP's cloud regions. Microsoft Fabric IQ lives inside Microsoft 365 tenants on Azure. Google Enterprise Knowledge Graph lives on Google Cloud. Palantir Foundry can be air-gapped, but only on Palantir's deployment architecture. None of these is a sovereign substrate the European customer owns. They are venues into which the customer is being invited to embed their institutional knowledge — under deployment topologies, pricing models and contractual continuity terms that the vendor controls.
Why this is the wrong architecture for European enterprises
The argument I have been making across this series resolves, at the ontology layer, to a sharper version of the same conclusion. The semantic layer is too important to live inside any single vendor's product. It is the layer where the company's own knowledge lives. If sovereignty as a procurement principle means anything in the AI context — and the Fable 5 recall, the SAP API restrictions, the Microsoft Agent 365 routing, and the EU Digital Omnibus political pressure all argue that it now does — then the ontology is the layer where it matters most.
The architectural answer follows from the diagnosis. A European enterprise AI stack designed to survive the next vendor consolidation cycle has to host its own ontology, on infrastructure it owns or controls, accessible to whatever model is best at each moment, and portable across the four hyperscaler-owned ontology products rather than embedded in any single one. The ontology should be a property of the enterprise. The enterprise should not be a property of the ontology.
Concretely, the ontology layer needs the same five conditions I argued for at the model layer in the previous piece. First, it has to be customer-owned, deployed inside the customer's HUB, accessible to multiple model providers without re-encoding. Second, it has to be hybrid by construction — capable of connecting to SAP Knowledge Graph, Microsoft Fabric IQ, Google Enterprise Knowledge Graph, Palantir Foundry, or any standards-based ontology source as inbound integration points, while materialising the resulting semantic layer into a customer-owned substrate. The vendor ontologies become sources to draw from, not venues to live inside. Third, it has to be sovereign in deployment — on-premises, EU-jurisdictional sovereign cloud (STACKIT, IONOS, T Cloud Public, Plus Server, Hetzner, Aleph Alpha PhariaAI), or hyperscaler regions only where the specific workload justifies it. Fourth, it has to support migration without re-integration — when a model changes, when a vendor ontology product changes pricing, when a regulatory boundary shifts, the customer's encoded semantic layer stays intact and only the routing updates. Fifth, the same orchestration plane that governs the model layer and the tool layer governs the ontology layer too: same identity, same RBAC, same audit, same policy, same jurisdictional routing, same capability abstraction. [¹¹]
This is the architectural extension that the next twelve months of European enterprise AI procurement will, in my view, be defined by.
Where neuland.ai stands
The neuland.ai HUB is built to be the enterprise AI orchestration and management platform that hosts the customer's sovereign ontology alongside the rest of the stack we have described across this series. The HUB sits above the multi-tier model layer described in the workhorses piece, above the heterogeneous execution surfaces described in the protocol governance piece, above the multi-modal ingestion and retrieval substrate described in the silos piece, and above the customer's own knowledge layer. All of it deployed on customer-controlled substrate — on-premises, EU-jurisdictional sovereign cloud, or hyperscaler region where the workload justifies. All of it governed through a single identity, RBAC, audit and policy plane. The ontology is the customer's. The orchestration is hyperscaler-independent. The substrate is sovereign. [¹²]
Our research team has been engaged in exactly this work for months. The thesis articulated above — that as the model commoditises the semantic layer becomes the moat, and the customer's ontology should live inside the customer's orchestration platform rather than inside a vendor's product — pre-dates the major vendor announcements that have now made the architectural argument public. Field experience with customer engagements in regulated DACH industrial settings, including extended structured-and-unstructured knowledge work on file-server-scale enterprise corpora with formal ontology layers, has shaped the way we think about the architecture and informed what the research team is building. I am being deliberately careful about the level of detail here, because some of what makes our approach architecturally distinctive is not appropriate to publish on a blog. But the public-facing shape of the design is worth being precise about: customer-owned ontology, deployed inside the customer's HUB, connecting to vendor ontology systems through documented integration paths but materialising into a substrate the customer controls. Multiple model providers — frontier and workhorse, proprietary and open-weight — reasoning over the same customer-controlled semantic layer. Migration without re-integration. Hyperscaler-independent. [¹³]
Personal take
When the model layer commoditises — and GLM-5.2 is now the proof, sitting at MIT-licensed open weights within one percentage point of Claude Opus 4.8 on the hardest publicly available coding benchmark — the vendor margin has to move. SAP, Microsoft, Google and Palantir have all decided, independently and within a five-week window, that the place it moves to is the ontology layer. Whoever owns the semantic layer in which the customer's business has been encoded owns the customer in the post-commodity-model era. The technical insight is correct. The architectural conclusion the vendors want their customers to draw from it is the one European enterprises now have to think very carefully about, because every previous piece in this series has shown what happens when European enterprises adopt the vendor's preferred architecture without architecting for what comes next.
We will use frontier proprietary models — from Anthropic, OpenAI, Google, Microsoft — as long as the deployment topology permits and the jurisdictional posture is acceptable. We will use European workhorses — Mistral Large 3, Aleph Alpha Pharia 7B, LightOn for long-context document workloads — where they are the right tool. We will use open-weight Chinese-origin models like GLM-5.2 on customer-controlled infrastructure where the MIT licence permits and the National Intelligence Law exposure is avoided by self-hosting. We will integrate to SAP Knowledge Graph, Microsoft Fabric IQ, Google Enterprise Knowledge Graph and Palantir Foundry where the customer's data source systems require it. None of those vendor ontologies, however, becomes the operating control plane of the customer's AI strategy. The customer's ontology lives in the customer's HUB, on the customer's substrate, governed by the customer.
This is the work in front of us, and the work we have been doing.
A brief note on the regulatory backdrop, since it continues to develop. The 7 May 2026 EU Digital Omnibus agreement postponed the high-risk Annex III obligations from 2 August 2026 to 2 December 2027, and Annex I obligations to 2 August 2028. [¹⁴] GPAI enforcement powers under Chapter V remain on the original 2 August 2026 schedule. The Fable 5 recall has, if anything, intensified European political pressure for sovereign AI infrastructure, and the ontology race adds a procurement dimension to that political reading that did not exist six months ago.
The model is commoditising. The ontology is consolidating. The question every European CIO should now ask, and answer with intent rather than by default, is whose.
¹ Z.ai (formerly Zhipu AI), GLM-5.2 release, 13–16 June 2026. Hugging Face: zai-org/GLM-5.2. MIT licence, no regional usage restrictions. Z.ai company history: Tsinghua University spinoff founded 2019, IPO on Hong Kong Stock Exchange (HKEX: 02513) on 8 January 2026. Z.ai company statement on the GLM-5.2 release framing it as a response to "geopolitical restriction of AI access": see The New Stack, VentureBeat and TechTimes coverage, June 2026.
² Series articles at neuland.ai/insights. Previous pieces have addressed: control panels and execution surfaces; model drift, Multi-LLM strategy and observability; model topology and hyperscaler independence; compliance as a system property; agent security and the lethal trifecta; MCP protocol governance; the fast-follower workhorse thesis with sovereign deployment; the new enterprise data silos created by SAP / Microsoft / ServiceNow / Salesforce AI gateways; and the flexibility-as-architecture argument crystallised by the Fable 5 recall.
³ GLM-5.2 architecture and benchmarks: 753-billion-parameter Mixture-of-Experts; ~40 billion active parameters per token; 1-million-token context window (stable, up to ~131,072 output tokens per response); IndexShare architectural optimisation reuses one lightweight indexer across every four sparse-attention layers, reducing per-token FLOPs by 2.9× at 1M context length; improved MTP layer for speculative decoding raises acceptance length up to 20%. Benchmarks: 62.1 on SWE-bench Pro (vs GLM-5.1 at 58.4); 81.0 on Terminal-Bench 2.1 (vs GLM-5.1 at 62.0; vs Opus 4.8 at 85.0); 74.4 on FrontierSWE (within ~1% of Opus 4.8); ranks 1st in open-weight category on Artificial Analysis Intelligence Index v4.1 (score 51 vs MiniMax-M3 44, DeepSeek V4 Pro 44, Kimi K2.6 43); 1st on Design Arena single-round HTML web design leaderboard, non-agent category. Pricing: ~$1.40 per million input tokens via Z.ai API; ~1/6th of comparable closed frontier API pricing. Serving stack support: vLLM, SGLang, xLLM, Ktransformers. Sources: Z.ai official model card (docs.z.ai/guides/llm/glm-5.2); VentureBeat 16 June 2026; The New Stack June 2026; AIToolsReview June 2026; LLM-Stats GLM-5.2 page.
⁴ National Intelligence Law of the People's Republic of China, applicable to Z.ai-hosted API operations. Risk profile applies to the hosted API only and does not apply to MIT-licensed self-hosted deployment of the GLM-5.2 weights. US House of Representatives formal inquiry, May 2026, into cybersecurity risks posed by PRC-origin AI models in critical infrastructure, naming Zhipu AI (Z.ai) alongside DeepSeek, MiniMax and ByteDance.
⁵ SAP Sapphire 2026, Orlando, 12–14 May 2026. SAP Knowledge Graph announcement; SAP Business AI Platform unified architecture (context-build-governance three-layer structure); Joule Studio 2.0 (model-agnostic, supporting Anthropic, Cohere, Mistral and others); SAP Autonomous Suite. Independent analysis on the architectural parallels with Palantir Foundry: Mario Defelipe, Medium, May 2026. Independent technical analysis on SAP Knowledge Graph as the "most significant technical development from the event": K2 University, May 2026. AI Units consumption-meter commercial structure: SAP Sapphire 2026 announcements; SAPinsider coverage, June 2026.
⁶ Microsoft Build 2026, San Francisco, 2–3 June 2026. Satya Nadella keynote: Microsoft IQ (Work IQ + Fabric IQ + Web IQ) as context layer architecture; Frontier Tuning (private preview); Microsoft Agent Framework 1.0 (GA); Microsoft Foundry continued maturation; Copilot Autopilots in M365 tenant; Microsoft Execution Containers; Project Solara (Microsoft + Qualcomm chip-to-cloud agent platform). Agent 365 tooling-server gateway: endpoints at agent365.svc.cloud.microsoft/agents/tenants/{tenant_id}/servers/.
⁷ Google Cloud Next '26, April 2026: Knowledge Catalog and Enterprise Knowledge Graph announcements. See independent analysis: Pankaj Kumar, "Google vs Microsoft vs Palantir: The Enterprise Ontology Race," Medium, May 2026.
⁸ Palantir Foundry Ontology: production deployments across intelligence agencies, healthcare and life sciences, manufacturing and supply chain since approximately 2012; Foundry Apollo continuous-delivery platform supports multi-cloud, on-premises, private-network and fully air-gapped deployments. Palantir AIP for SAP migration: SAP Endorsed App as of Sapphire 2026, with SAP Solution Extension planned for Q3 2026. See SAPinsider, "Palantir Foundry, AIP & Apollo in the SAP Enterprise: What Every CIO Needs to Know After Sapphire 2026," June 2026.
⁹ For the underlying technical argument on why ontology and knowledge graph architectures are structurally necessary for enterprise AI agent reasoning — distinct from semantic layers built for BI dashboarding — see: Context and Chaos, "Ontologies, Context Graphs, and Semantic Layers: What AI Actually Needs in 2026," January 2026. For the broader European enterprise knowledge graph platform landscape: d.AP, Neo4j, eccenca, Stardog, GraphAware, Palantir Foundry — see digetiers-dap.com, "Best Enterprise Knowledge Graph Platforms in 2026."
¹⁰ Mario Defelipe, Medium analysis of SAP Sapphire 2026 announcements, May 2026.
¹¹ For the underlying sovereign cloud landscape referenced — STACKIT (Schwarz Digits), IONOS, T Cloud Public (Deutsche Telekom), Plus Server, Hetzner, and Aleph Alpha PhariaAI for fully sovereign deployment — see the previous piece in this series on workhorse deployment topology.
¹² neuland.ai HUB capabilities referenced: identity / RBAC / audit trail / tool-call governance / capability abstraction / Multi-LLM routing / cost-aware and jurisdictional routing / hyperscaler-independent deployment (on-premises, EU-jurisdictional sovereign cloud, hyperscaler region as required) / multi-modal ingestion at petabyte-scale non-functional target / columnar storage backend unifying vector, full-text and columnar representations / in-cluster GPU-based embedding, reranking and OCR serving / customer-owned ontology and knowledge-graph layer deployed inside the orchestration plane. neuland.ai AG retains responsibility for content quality and clean delivery of results.
¹³ Field experience reference: ontology-grounded knowledge-tool architectures developed for DACH industrial customers, including file-server-scale enterprise corpora with formal ontology layers. Research-team work on customer-owned ontology and knowledge-graph capability for the HUB has been continuous through Q1 and Q2 2026.
¹⁴ Council of the EU and European Parliament provisional political agreement on the Digital Omnibus on AI, 7 May 2026. Annex III high-risk obligations postponed from 2 August 2026 to 2 December 2027 (16-month delay); Annex I obligations postponed to 2 August 2028 (12-month delay); Article 50(2) watermarking moved to 2 December 2026. GPAI enforcement powers under Chapter V remain on the original 2 August 2026 schedule.