Items
Items are the actual data records collected from your sources. Each time a source runs, it extracts items and stores them in your aggregator.
What is an Item?
An item is a single data record that:
- Belongs to an aggregator
- Came from a specific source
- Conforms to a specific schema version
- Has an identity hash (for deduplication)
- Is immutable (never updated, only inserted)
Item Structure
When you fetch items via the API, each item includes:

```json
{
  "id": "clx1abc123",
  "source_id": "clx2def456",
  "schema_version": 2,
  "created_at": "2026-01-20T10:30:00Z",
  "data": {
    "title": "Senior DevOps Engineer",
    "company": "Acme Corp",
    "location": "Remote",
    "url": "https://acme.com/jobs/123",
    "tags": ["devops", "kubernetes", "aws"]
  }
}
```

| Field | Description |
|---|---|
| `id` | Unique item identifier |
| `source_id` | Which source produced this item |
| `schema_version` | Schema version the item conforms to |
| `created_at` | When the item was first collected |
| `data` | The actual extracted data |
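On the client side, the fields in the table can be modeled as a small typed record. The following is a minimal sketch, not an official client: the `Item` class and `parse_item` helper are hypothetical names, and it assumes each item arrives as a JSON object shaped like the example above.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Any, Dict


@dataclass(frozen=True)  # frozen mirrors the fact that items are immutable
class Item:
    id: str
    source_id: str
    schema_version: int
    created_at: datetime
    data: Dict[str, Any]


def parse_item(raw: Dict[str, Any]) -> Item:
    """Convert one decoded JSON item into an Item (hypothetical helper)."""
    return Item(
        id=raw["id"],
        source_id=raw["source_id"],
        schema_version=int(raw["schema_version"]),
        # fromisoformat only accepts a trailing "Z" from Python 3.11 on,
        # so rewrite it as an explicit UTC offset first.
        created_at=datetime.fromisoformat(raw["created_at"].replace("Z", "+00:00")),
        data=raw["data"],
    )
```

Keeping `data` as a plain dictionary avoids hard-coding one schema, since its shape depends on the schema version the item conforms to.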
Append-Only Storage
Items are never updated, only inserted. This means:
- The `created_at` timestamp represents when we first saw this item
- You can reliably query “what’s new since X” using the `since` parameter
- Historical data is preserved
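Because `created_at` never changes, a client can poll incrementally by remembering the newest timestamp it has seen and passing it back as `since`. The sketch below only builds the request URL and advances the cursor. The helper names (`items_url`, `next_cursor`) are hypothetical, and whether `since` is inclusive or exclusive is not stated here, so a real client may need to tolerate seeing the boundary item twice.

```python
from datetime import datetime
from typing import Dict, List, Optional
from urllib.parse import urlencode

BASE_URL = "https://api.fetchosaurus.com/api/v1"


def parse_timestamp(value: str) -> datetime:
    """Parse an ISO 8601 timestamp such as "2026-01-20T10:30:00Z"."""
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def items_url(aggregator_id: str, since: Optional[str] = None) -> str:
    """Build the items URL, optionally restricted with the `since` parameter."""
    url = f"{BASE_URL}/aggregators/{aggregator_id}/items"
    if since is not None:
        # Leave ":" unescaped so the timestamp stays readable in the URL.
        url += "?" + urlencode({"since": since}, safe=":")
    return url


def next_cursor(items: List[Dict], previous: Optional[str] = None) -> Optional[str]:
    """Return the newest `created_at` seen so far, to send as the next `since`."""
    candidates = [item["created_at"] for item in items]
    if previous is not None:
        candidates.append(previous)
    return max(candidates, key=parse_timestamp, default=None)
```

A polling loop would call `items_url(aggregator_id, since=cursor)`, fetch the result with any HTTP client, and then set `cursor = next_cursor(items, cursor)`.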
Deduplication
Items are deduplicated based on their identity fields, and the deduplication happens at query time rather than at write time. If a source extracts an item with the same identity hash as an existing item, the new item is still stored (append-only), but you can use the `dedupe` query parameter to filter duplicates out of the results.
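To illustrate the idea, the sketch below deduplicates a list of items on the client. It is not the service's actual algorithm: the hashing scheme (SHA-256 over a canonical JSON encoding), the choice of identity fields, and the rule of keeping the first occurrence are all assumptions made for the example.

```python
import hashlib
import json
from typing import Dict, List


def identity_hash(data: Dict, identity_fields: List[str]) -> str:
    """Hash only the identity fields, in a stable key order (illustrative scheme)."""
    identity = {field: data.get(field) for field in identity_fields}
    canonical = json.dumps(identity, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def dedupe(items: List[Dict], identity_fields: List[str]) -> List[Dict]:
    """Keep the first item seen for each identity hash, preserving input order."""
    seen = set()
    unique = []
    for item in items:
        key = identity_hash(item["data"], identity_fields)
        if key not in seen:
            seen.add(key)
            unique.append(item)
    return unique
```

For example, with `identity_fields=["url"]`, two stored copies of the same job posting collapse to one entry even if a non-identity field such as `title` differs between them.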
Storage Limits
The number of items you can store depends on your plan:
| Plan | Items |
|---|---|
| Free | 1,000 |
| Starter | 25,000 |
| Pro | 100,000 |
As you approach and reach your limit:
- You’ll be notified at 80% capacity
- Sources pause automatically at 100%
- Delete old items or upgrade to resume
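The thresholds above reduce to a simple check. The following sketch uses the plan limits from the table; the function name and the status labels (`"ok"`, `"warning"`, `"paused"`) are made up for illustration and are not values returned by the API.

```python
# Item limits per plan, taken from the table above.
PLAN_LIMITS = {"free": 1_000, "starter": 25_000, "pro": 100_000}


def storage_status(plan: str, item_count: int) -> str:
    """Classify storage usage against the 80% and 100% thresholds."""
    limit = PLAN_LIMITS[plan]
    if item_count >= limit:
        return "paused"  # sources pause automatically at 100%
    # Integer arithmetic avoids floating-point rounding at the 80% boundary.
    if item_count * 100 >= limit * 80:
        return "warning"  # notification threshold
    return "ok"
```

On the Free plan, 800 stored items is the first count that crosses the 80% threshold, and 1,000 is the point at which sources pause.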
Querying Items
Fetch items via the API with various filters:

```bash
# Get all items
curl "https://api.fetchosaurus.com/api/v1/aggregators/{id}/items"

# Get items since a timestamp
curl "https://api.fetchosaurus.com/api/v1/aggregators/{id}/items?since=2026-01-20T00:00:00Z"

# Get items from a specific source
curl "https://api.fetchosaurus.com/api/v1/aggregators/{id}/items?source_id=clx2def456"

# Get deduplicated items
curl "https://api.fetchosaurus.com/api/v1/aggregators/{id}/items?dedupe=true"
```

Related Concepts
- Aggregators - Items belong to aggregators
- Sources - Sources produce items
- API Reference - Full items API documentation