Skip to content

Article Schema

The Article schema is designed for aggregating news articles, blog posts, and other written content.

FieldTypeRequiredIdentityDescription
titlestringYesYes (1)Article title
sourcestringYesYes (2)Publication or website name
urlstringNoNoLink to the full article
published_atstringNoNoPublication date
summarystringNoNoArticle summary or excerpt
authorstringNoNoAuthor name
tagsstring[]NoNoTopics or categories

The identity is computed from title + source in that order. This handles cases where the same article might appear on multiple aggregator sites but should be deduplicated based on the original source.

{
"title": "The Future of Serverless Computing",
"source": "TechCrunch",
"url": "https://techcrunch.com/2026/01/20/serverless-future",
"published_at": "2026-01-20",
"summary": "A deep dive into where serverless is heading in 2026...",
"author": "Jane Smith",
"tags": ["serverless", "cloud", "infrastructure"]
}
  • Niche news aggregators
  • Industry-specific content feeds
  • Research and monitoring tools
  • Content curation platforms
{
"container": "article.post",
"fields": {
"title": {
"selector": "h1.article-title",
"type": "text"
},
"source": {
"selector": ".publication-name",
"type": "text"
},
"url": {
"selector": "a.read-more",
"type": "attribute",
"attribute": "href"
},
"published_at": {
"selector": "time",
"type": "attribute",
"attribute": "datetime"
},
"summary": {
"selector": ".excerpt",
"type": "text"
},
"author": {
"selector": ".author-name",
"type": "text"
}
}
}