Data Model
Version 2.2 by Robert Schaub on 2025/12/11 21:34
Data Model
This page describes the current data model for FactHarbor.
Core Data Model Refinements
The system relies on the following versioned core entities:
- CLAIM_CLUSTER
- ``ClusterID`` (PK), ``EmbeddingVectorRef``, ``Theme``
- Groups related claims into topical clusters.
- CLAIM / CLAIM_VERSION
- ``CLAIM`` is the long‑lived anchor for a real‑world claim.
- ``CLAIM_VERSION`` is an immutable snapshot of wording + basic metadata.
- Verdicts are NOT attached to ClaimVersion but to Scenario.
- SCENARIO / SCENARIO_VERSION
- ``SCENARIO`` represents a stable interpretive context for a claim.
- ``SCENARIO_VERSION`` is an immutable snapshot of that context (definitions, assumptions, boundaries).
- Verdicts are attached to SCENARIO, with verdict history in VERDICT_VERSION.
- EVIDENCE / EVIDENCE_VERSION
- ``EVIDENCE`` is the logical source (report, article, dataset…).
- ``EVIDENCE_VERSION`` is the extracted/processed snapshot (summary, reliability, etc.).
- VERDICT / VERDICT_VERSION
- ``VERDICT`` represents “this scenario is evaluated for this claim.”
- ``VERDICT_VERSION`` is an immutable snapshot of a concrete evaluation (likelihood, confidence, reasoning, timestamp).
- SCENARIO_EVIDENCE_VERSION_LINK
- Connects ``ScenarioVersion`` ↔ ``EvidenceVersion`` (many‑to‑many).
- Fields: Relevance, Direction (SUPPORTS / CONTRADICTS / NEUTRAL).
- Rule: The link always targets VERSIONED entities, never the base tables.