Google Cloud Publishes the Open Knowledge Format for AI Agents
In June 2026, Google Cloud published the Open Knowledge Format (OKF), a draft open specification for representing curated organisational knowledge in a portable, vendor-neutral format that both humans and AI agents can read. It formalises the “LLM-wiki” pattern, inspired by Andrej Karpathy’s LLM Wiki concept, into a documented convention.
OKF is not a search or SEO standard. Google scopes it to a specific problem: the institutional knowledge that organisations need their AI agents to use is scattered across metadata catalogs, wikis, code comments, and people’s heads. OKF gives that knowledge a common representation so agents can consume it without a proprietary platform.
What is the Open Knowledge Format?
OKF v0.1 represents knowledge as a directory of Markdown files with YAML frontmatter. Each file describes one thing (a dataset, a table, a metric, a definition) and links to related files with ordinary Markdown links, producing a graph-like structure of curated context.
The design is deliberately minimal:
- Just Markdown that renders in any editor or on GitHub
- Just files that ship as a tarball or live in a git repository
- Just YAML frontmatter with queryable fields such as
type,title,description,resource,tags, andtimestamp
Only the type field is required. There is no SDK, no new runtime, and no platform dependency, which is the point: a knowledge bundle written by one producer can be read by a different consumer’s agent without translation.
A bundle is just a folder. A sales knowledge base might look like this:
sales/
├── index.md
├── datasets/
│ ├── index.md
│ └── orders_db.md
├── tables/
│ ├── index.md
│ ├── orders.md
│ └── customers.md
└── metrics/
├── index.md
└── weekly_active_users.md
Each file pairs structured frontmatter with a human-readable body, and links to related files with ordinary Markdown links:
---
type: BigQuery Table
title: Orders
description: One row per completed customer order.
resource: https://console.cloud.google.com/bigquery?p=acme&d=sales&t=orders
tags: [sales, revenue]
timestamp: 2026-05-28T14:30:00Z
---
# Schema
| Column | Type | Description |
|--------|------|-------------|
| `order_id` | STRING | Globally unique order identifier. |
| `customer_id` | STRING | FK to [customers](/tables/customers.md). |
An agent can query the frontmatter as structured fields (find every file where type is BigQuery Table, or filter by tags) while still reading the Markdown body for context. The same file renders cleanly in any editor or on GitHub.
What did Google ship alongside the spec?
Google updated its Knowledge Catalog to ingest OKF and serve it to agents, and published reference tooling: an enrichment agent that auto-generates OKF documents from BigQuery datasets, a static HTML visualiser for browsing a bundle, and three sample bundles built on the GA4 e-commerce, Stack Overflow, and Bitcoin public datasets.
Google is explicit that OKF v0.1 is “a starting point, not a finished standard,” and expects the format to evolve as more producers and consumers emerge and as the ecosystem learns what knowledge representations agents actually need in practice.
Could OKF work for websites?
This is where OKF becomes interesting for publishers, though the framing comes from the SEO community rather than from Google. The format is structurally close to conventions already debated for the agentic web: a folder of Markdown files with frontmatter is not far from llms.txt or Markdown versions of pages served to agents. Commentators including Suganthan Mohanadasan and No Hacks have asked whether a site could publish an OKF bundle as an external, agent-readable knowledge layer, and there is already a request to support the format in the Hugo static site generator.
The case for caution is the same one that applies across agent-readiness. Google built OKF for internal data shared with an organisation’s own agents; nothing in the announcement positions it as a web-search or AI-citation signal, and no AI platform has confirmed it reads externally published OKF bundles at inference time. Treating it as a ranking or citation lever would repeat the mistake made with llms.txt: assuming that because a format is easy to publish and intuitively useful, agents must be consuming it.
What this means
For content and editorial sites, OKF changes nothing today. It is internal-knowledge infrastructure, not a visibility signal, and the website application is speculative extrapolation by practitioners, not a Google proposal.
It is worth understanding for two reasons. First, the pattern matters: OKF is another data point in a clear direction of travel, where Markdown-with-frontmatter is becoming the lingua franca for handing curated knowledge to AI agents, the same shape as llms.txt, agents.md, and content negotiation. Second, if an external use case does emerge, the cost of producing an OKF bundle from existing structured content is low. Until a major agent platform confirms it reads external bundles, OKF belongs in the watch column alongside the other draft conventions of the agentic web, valued for where it points rather than for any confirmed citation impact.
Sources
- How the Open Knowledge Format can improve data sharing — Google Cloud Blog
- Google Cloud Announces The Open Knowledge Format — Search Engine Journal
- Open Knowledge Format (OKF): Google’s New Markdown Format for AI Agents — Suganthan Mohanadasan
- Google’s Open Knowledge Format Could Work For Websites, Too — No Hacks
More news
-
Cloudflare and Browser Makers Announce PACT Bot-Verification Protocol
Cloudflare, Google, Microsoft, Mozilla and Shopify are developing PACT, a privacy-first token protocol to separate humans and authorised agents from abusive bots.
-
Google, Microsoft and Hugging Face Publish the Agentic Resource Discovery Spec
ARD is a draft open spec for publishing and discovering AI capabilities via an ai-catalog.json file. Backed by Google, Microsoft and Hugging Face, but adoption is unconfirmed.
-
Study: AI Overviews Cite Self-Serving Listicles but Recommend Competitors
Lily Ray's June 2026 analysis found Google's AI Overviews cite brands' own 'best' listicles while recommending competitors around 69% of the time.