System Architecture
System Architecture
FactHarbor system architecture including POC simplifications and full system design
-
1. Architecture Evolution
FactHarbor will be built in phases, with architecture complexity increasing as we validate core capabilities.
1.1 POC1 Architecture (Simplified)
Goal: Validate core AI capability with minimal complexity
Components:
- Single AKEL API call (Claude Sonnet 4.5)
- Gates 1 & 4 (claim validation, verdict confidence)
- Basic UI display
- Manual quality tracking
Data Storage: Minimal (stateless or simple SQLite)
-
1.2 POC2 Architecture (Enhanced)
Goal: Add complete quality framework and evidence deduplication
New Components:
- Scenario generation
- Evidence deduplication system
- Gates 2 & 3 (evidence relevance, scenario coherence)
- Quality metrics dashboard
Data Storage: Enhanced (claims, scenarios, evidence, metrics)
-
1.3 Full System Architecture (V1.0+)
Goal: Production-ready multi-component system
Production Components:
- Multi-component AKEL pipeline
- Review workflow system
- Audit sampling framework
- Federation architecture
- Full data model (PostgreSQL + Redis + S3)
-
2. Quality Gate Architecture
2.1 Quality Gate System
Purpose: Prevent low-quality/hallucinated content from publication
-
2.2 Gate Implementation by Phase
| Gate | POC1 | POC2 | Beta 0 | V1.0 |
|---|---|---|---|---|
| Gate 1: Claim Validation | ✅ Basic | ✅ Enhanced | ✅ Enhanced | ✅ Hardened |
| Gate 2: Evidence Relevance | ❌ | ✅ Implemented | ✅ Enhanced | ✅ Hardened |
| Gate 3: Scenario Coherence | ❌ | ✅ Implemented | ✅ Enhanced | ✅ Hardened |
| Gate 4: Verdict Confidence | ✅ Basic | ✅ Enhanced | ✅ Enhanced | ✅ Hardened |
Hardening means: Thresholds validated, edge cases handled, <5% failure rate
-
3. Data Architecture
3.1 POC Data Model (Simplified)
Storage: SQLite or minimal database
Entities:
- Articles (input text/URL)
- Claims (extracted from articles)
- Verdicts (per claim)
- Quality metrics (aggregated)
No complex relationships, versioning, or scenarios
-
3.2 Full System Data Model (V1.0+)
Storage: PostgreSQL (primary), Redis (cache), S3 (documents)
Core Entities:
- Claims (with versions, clustering)
- Scenarios (interpretations of claims)
- Evidence (deduplicated, provenance tracked)
- Verdicts (per scenario, versioned)
- Reviews (human oversight)
- Quality metrics (per component, aggregated)
Complex relationships, full audit trail, federation support
-
4. Component Architecture
4.1 AKEL Orchestrator
POC: Single API call
Full System: Multi-component orchestration
Responsibilities:
- Route input through component pipeline
- Manage component state
- Handle errors and retries
- Coordinate quality gates
- Trigger review workflows
-
4.2 Quality Gate Validator
All Phases: Present but evolving complexity
Responsibilities:
- Execute all configured gates
- Aggregate gate results
- Make publication decisions
- Generate explanatory messages
- Log quality metrics
-
4.3 Review Queue Manager
POC: Not present
Beta 0+: Optional
V1.0: Full implementation
Responsibilities:
- Route low-confidence verdicts to review
- Manage reviewer assignments
- Track review status
- Implement audit sampling
- Generate review metrics
-
5. Architecture Decisions
5.1 Why Single AKEL Call for POC?
Rationale:
- Validates core capability fastest
- Simplest to implement and test
- Fail-fast if AI fundamentally can't do task
- Learn prompt engineering before architecting components
- Reduce moving parts during initial testing
Trade-off: Less granular control, harder to optimize individual steps
-
5.2 Why Add Components in V1.0?
Rationale:
- Better error handling per step
- Independent optimization of each component
- Easier to add new capabilities
- Better observability and debugging
- Supports federation (multiple FactHarbor instances)
Trade-off: More complexity, more to maintain
-
6. Related Pages
- Requirements - System requirements
- Design - UI/UX design
- Roadmap - Implementation phases
- POC1 - POC1 details
- POC2 - POC2 details
-
Document Status: ✅ Architecture Specified (POC1, POC2, Full System)
Version: V0.9.70