System Architecture

Last modified by Robert Schaub on 2025/12/24 20:35

System Architecture

FactHarbor system architecture including POC simplifications and full system design


1. Architecture Evolution

FactHarbor will be built in phases, with architecture complexity increasing as we validate core capabilities.

1.1 POC1 Architecture (Simplified)

Goal: Validate core AI capability with minimal complexity

Warning

You are not allowed to create this diagram.

Components:

  • Single AKEL API call (Claude Sonnet 4.5)
  • Gates 1 & 4 (claim validation, verdict confidence)
  • Basic UI display
  • Manual quality tracking

Data Storage: Minimal (stateless or simple SQLite)


1.2 POC2 Architecture (Enhanced)

Goal: Add complete quality framework and evidence deduplication

Warning

You are not allowed to create this diagram.

New Components:

  • Scenario generation
  • Evidence deduplication system
  • Gates 2 & 3 (evidence relevance, scenario coherence)
  • Quality metrics dashboard

Data Storage: Enhanced (claims, scenarios, evidence, metrics)


1.3 Full System Architecture (V1.0+)

Goal: Production-ready multi-component system

Warning

You are not allowed to create this diagram.

Production Components:

  • Multi-component AKEL pipeline
  • Review workflow system
  • Audit sampling framework
  • Federation architecture
  • Full data model (PostgreSQL + Redis + S3)

2. Quality Gate Architecture

2.1 Quality Gate System

Purpose: Prevent low-quality/hallucinated content from publication

Warning

You are not allowed to create this diagram.


2.2 Gate Implementation by Phase

GatePOC1POC2Beta 0V1.0
Gate 1: Claim Validation✅ Basic✅ Enhanced✅ Enhanced✅ Hardened
Gate 2: Evidence Relevance✅ Implemented✅ Enhanced✅ Hardened
Gate 3: Scenario Coherence✅ Implemented✅ Enhanced✅ Hardened
Gate 4: Verdict Confidence✅ Basic✅ Enhanced✅ Enhanced✅ Hardened

Hardening means: Thresholds validated, edge cases handled, <5% failure rate


3. Data Architecture

3.1 POC Data Model (Simplified)

Storage: SQLite or minimal database

Entities:

  • Articles (input text/URL)
  • Claims (extracted from articles)
  • Verdicts (per claim)
  • Quality metrics (aggregated)

No complex relationships, versioning, or scenarios


3.2 Full System Data Model (V1.0+)

Storage: PostgreSQL (primary), Redis (cache), S3 (documents)

Core Entities:

  • Claims (with versions, clustering)
  • Scenarios (interpretations of claims)
  • Evidence (deduplicated, provenance tracked)
  • Verdicts (per scenario, versioned)
  • Reviews (human oversight)
  • Quality metrics (per component, aggregated)

Complex relationships, full audit trail, federation support


4. Component Architecture

4.1 AKEL Orchestrator

POC: Single API call  
Full System: Multi-component orchestration

Responsibilities:

  • Route input through component pipeline
  • Manage component state
  • Handle errors and retries
  • Coordinate quality gates
  • Trigger review workflows

4.2 Quality Gate Validator

All Phases: Present but evolving complexity

Responsibilities:

  • Execute all configured gates
  • Aggregate gate results
  • Make publication decisions
  • Generate explanatory messages
  • Log quality metrics

4.3 Review Queue Manager

POC: Not present  
Beta 0+: Optional  
V1.0: Full implementation

Responsibilities:

  • Route low-confidence verdicts to review
  • Manage reviewer assignments
  • Track review status
  • Implement audit sampling
  • Generate review metrics

5. Architecture Decisions

5.1 Why Single AKEL Call for POC?

Rationale:

  • Validates core capability fastest
  • Simplest to implement and test
  • Fail-fast if AI fundamentally can't do task
  • Learn prompt engineering before architecting components
  • Reduce moving parts during initial testing

Trade-off: Less granular control, harder to optimize individual steps


5.2 Why Add Components in V1.0?

Rationale:

  • Better error handling per step
  • Independent optimization of each component
  • Easier to add new capabilities
  • Better observability and debugging
  • Supports federation (multiple FactHarbor instances)

Trade-off: More complexity, more to maintain


6. Related Pages


Document Status: ✅ Architecture Specified (POC1, POC2, Full System)  
Version: V0.9.70