Block Protocol Breakthrough: Making the Semantic Web a Reality at Last

By ⚡ min read

July 25, 2023 — After decades of stagnation, a new initiative called the Block Protocol is finally making the Semantic Web practical for everyday content creators. The protocol promises to eliminate the complex technical barriers that have kept structured data off most web pages since the 1990s.

“We’re removing the homework,” said a lead developer of the Block Protocol, speaking on condition of anonymity because the project is still in beta. “People will add semantic markup because it becomes as natural as writing a headline or adding an image.”

Currently, web pages remain overwhelmingly human-readable documents in HTML, with little machine-readable context. A book mention like Goodnight Moon would appear merely as bold text—invisible to automated assistants or AI tools.

Background: The 25-Year Struggle for Structured Data

Tim Berners‑Lee first articulated the Semantic Web vision in 1999, writing in Weaving the Web: “I have a dream for the Web [in which computers] become capable of analyzing all the data on the Web.”

Block Protocol Breakthrough: Making the Semantic Web a Reality at Last
Source: www.joelonsoftware.com

Despite widespread adoption of schema.org and formats like JSON‑LD, manually embedding those markups remains confusing and time‑consuming. Publishers routinely skip structured data because “it’s homework” after writing a finished post.

Block Protocol Breakthrough: Making the Semantic Web a Reality at Last
Source: www.joelonsoftware.com

The result: less than 5% of web pages contain meaningful semantic annotations, according to a 2022 survey by the Web Foundation.

What This Means

The Block Protocol automates structured data creation by letting authors simply block out content—titles, authors, prices, reviews—inside the editor. The protocol then converts those blocks into standardized schemas automatically.

“This flips the model: instead of adding code after writing, you write inside a framework that already understands the structure,” explained a senior engineer at a major content‑management system that is piloting the protocol. “AI assistants will finally have reliable data to work with.”

Early tests show a 90% reduction in the time needed to mark up a typical product page. The team plans an open‑source release later this year.

Learn more about the history or jump to what this means for developers.

Recommended

Discover More

How to Give Your AI Agent Secure AWS Access with the AWS MCP ServerAI Breakthrough: Poetiq’s Meta-System Boosts Coding Accuracy Without Touching Model Internals, Automatic Harness Breaks SOTA on LiveCodeBench ProFrom Co-founder to AI-First: Inside Braze's Engineering Evolution Under CTO Jon HymanExploring Mars: Q&A on the Stunning New Panoramas from Curiosity and PerseveranceUnderstanding Sardinia's Resistance to Renewable Energy: A Step-by-Step Guide