The Block Protocol: Bringing Structure to the Web Without the Headache

The Web's Missing Structure

Since the 1990s, the web has primarily served as a platform for publishing human-readable documents. Most of these documents are written in HTML, which offers only basic structural cues—like indicating a paragraph or emphasizing a word. Then CSS steps in to add visual flair, such as making text tiny and gray, which might look trendy but can alienate older readers who struggle to read it. That's about as far as "structure" goes on the web today.

The Block Protocol: Bringing Structure to the Web Without the Headache — Source: www.joelonsoftware.com

Consider a simple example: mentioning a book on a web page. You might write something like:

Goodnight Moon by Margaret Wise Brown, illustrated by Clement Hurd, Harper & Brothers, 1947, ISBN 0-06-443017-0

To a human, this clearly describes a book. But a naive computer program sees little more than bold text. It has no way of knowing that this is a book—with a title, author, illustrator, publisher, year, and ISBN. This lack of machine-readable meaning limits what the web can do.

The Long-Awaited Semantic Web

As early as 1999, Tim Berners-Lee envisioned a smarter web. In his book Weaving the Web, he wrote:

“I have a dream for the Web [in which computers] become capable of analyzing all the data on the Web – the content, links, and transactions between people and computers. A ‘Semantic Web’, which makes this possible, has yet to emerge, but when it does, the day-to-day mechanisms of trade, bureaucracy and our daily lives will be handled by machines talking to machines. The ‘intelligent agents’ people have touted for ages will finally materialize.”

The Semantic Web promised to add structured data to web pages, making them computer-readable. To publish a book with such markup, you would typically visit schema.org, look up their definition of a book, and then use a format like RDF or JSON-LD to embed additional annotations in your HTML. This would tell machines: “Hey! This is a book!”

The Challenge of Adding Markup

In theory, that sounds great. In practice, it’s hard. Most people publish a blog post, make it look good, and then move on. Adding semantic markup feels like extra homework—complex, time-consuming, and often unrewarded. Unless a computer is already consuming your data, there’s little incentive to invest the mental energy. As a result, very little semantic markup exists in the wild today. The dream of the Semantic Web has largely remained unrealized.

A New Approach: The Block Protocol

We believe this situation needs to change. Human progress depends on making information readily accessible—not just to people, but to AI assistants and traditional computer programs. The key insight: people will only add semantic markup if doing so is easy and valuable. That’s where the Block Protocol comes in.

The Block Protocol reimagines how structured data is created and consumed on the web. Instead of requiring you to manually embed RDF or JSON-LD, it treats blocks of content as self-describing units. Each block knows its own type (e.g., “book,” “event,” “person”) and can automatically expose its data in a machine-readable format. Writers and developers can use pre-built blocks that handle the complexity behind the scenes, making semantic markup as simple as dragging a component onto a page.

For example, with the Block Protocol, mentioning a book becomes as straightforward as selecting a “Book” block, filling in fields like title, author, and ISBN, and having the block output both human-readable HTML and machine-readable structured data. No need to learn schema.org vocabulary or JSON-LD syntax—the block does it for you.

Why This Matters for the Future

The implications are profound. When structured data becomes ubiquitous, intelligent agents can finally operate as Berners-Lee envisioned. Search engines can deliver richer results. AI assistants can answer complex questions by pulling data from multiple sources. Businesses can automate transactions without human intervention. And all of this can happen without burdening content creators.

The Block Protocol is open and extensible, allowing anyone to define new block types. It’s designed to work alongside existing web standards, not replace them. By lowering the barrier to adding semantic markup, it has the potential to turn the web into a truly structured ecosystem—one block at a time.

We’re excited about the progress being made. The protocol is already being adopted by platforms and publishers who see the value in a more connected, machine-friendly web. If you’re a developer, consider contributing to the protocol. If you’re a content creator, look for tools that support it. Together, we can make the Semantic Web a reality—finally.

Tags:

The Block Protocol: Bringing Structure to the Web Without the Headache

The Web's Missing Structure

The Long-Awaited Semantic Web

The Challenge of Adding Markup

A New Approach: The Block Protocol

Why This Matters for the Future

Related Articles

Recommended

Discover More