Data Model

Version 2.2 by Robert Schaub on 2025/12/11 21:34

Data Model

This page describes the current data model for FactHarbor.

Core Data Model Refinements

The system relies on the following versioned core entities:

  • CLAIM_CLUSTER
    • ``ClusterID`` (PK), ``EmbeddingVectorRef``, ``Theme``
    • Groups related claims into topical clusters.
  • CLAIM / CLAIM_VERSION
    • ``CLAIM`` is the long‑lived anchor for a real‑world claim.
    • ``CLAIM_VERSION`` is an immutable snapshot of wording + basic metadata.
    • Verdicts are NOT attached to ClaimVersion but to Scenario.
  • SCENARIO / SCENARIO_VERSION
    • ``SCENARIO`` represents a stable interpretive context for a claim.
    • ``SCENARIO_VERSION`` is an immutable snapshot of that context (definitions, assumptions, boundaries).
    • Verdicts are attached to SCENARIO, with verdict history in VERDICT_VERSION.
  • EVIDENCE / EVIDENCE_VERSION
    • ``EVIDENCE`` is the logical source (report, article, dataset…).
    • ``EVIDENCE_VERSION`` is the extracted/processed snapshot (summary, reliability, etc.).
  • VERDICT / VERDICT_VERSION
    • ``VERDICT`` represents “this scenario is evaluated for this claim.”
    • ``VERDICT_VERSION`` is an immutable snapshot of a concrete evaluation (likelihood, confidence, reasoning, timestamp).
  • SCENARIO_EVIDENCE_VERSION_LINK
    • Connects ``ScenarioVersion`` ↔ ``EvidenceVersion`` (many‑to‑many).
    • Fields: Relevance, Direction (SUPPORTS / CONTRADICTS / NEUTRAL).
    • Rule: The link always targets VERSIONED entities, never the base tables.

Core Data Model ERD