Changes for page POC1 API & Schemas Specification

Last modified by Robert Schaub on 2025/12/24 18:26

From 2.1 to 1.1 From 3.1 to 2.2

From version 2.2

edited by Robert Schaub
on 2025/12/24 16:28

Change comment: There is no comment for this version

To version 2.1

edited by Robert Schaub
on 2025/12/24 13:58

Change comment: Imported from XAR

Raw
Rendered

Summary

Page properties (2 modified, 0 added, 0 removed)

Details

Page properties

Title

@@ -1,1 +1,1 @@
--POC1 API & Schemas Specification
++POC1 API & Schemas Specification v0.4.1

Content

@@ -1,25 +1,44 @@
--= POC1 API & Schemas Specification =
++# FactHarbor POC1 — API & Schemas Specification
------
++**Version:** 0.4.1 (POC1 - 3-Stage Caching Architecture)
++**Namespace:** FactHarbor.*
++**Syntax:** xWiki 2.1
++**Last Updated:** 2025-12-24
++---
++
  == Version History ==
  |=Version|=Date|=Changes
  |0.4.1|2025-12-24|Applied 9 critical fixes: file format notice, verdict taxonomy, canonicalization algorithm, Stage 1 cost policy, BullMQ fix, language in cache key, historical claims TTL, idempotency, copyright policy
  |0.4|2025-12-24|**BREAKING:** 3-stage pipeline with claim-level caching, user tier system, cache-only mode for free users, Redis cache architecture
--|0.3.1|2025-12-24|Fixed single-prompt strategy, SSE clarification, schema canonicalization, cost constraints
--|0.3|2025-12-24|Added complete API endpoints, LLM config, risk tiers, scraping details
++|0.3.1|2025-12-24|Fixed single-prompt strategy, SSE clarification, schema canonicalization, cost constraints, chain-of-thought, evidence citation, Jina safety, gate numbering
++|0.3|2025-12-24|Added complete API endpoints, LLM config, risk tiers, scraping details, quality gate logging, temporal separation note, cross-references
++|0.2|2025-12-24|Initial rebased version with holistic assessment
++|0.1|2025-12-24|Original specification
------
++---
++---
--== 1. Core Objective (POC1) ==
++== File Format Notice ==
--The primary technical goal of POC1 is to validate **Approach 1 (Single-Pass Holistic Analysis)** while implementing **claim-level caching** to achieve cost sustainability.
++**⚠️ Important:** This file is stored as {{code}}.md{{/code}} for transport/versioning, but the content is **xWiki 2.1 syntax** (not Markdown).
--The system must prove that AI can identify an article's **Main Thesis** and determine if supporting claims logically support that thesis without committing fallacies.
++**When importing to xWiki:**
++* Use "Import as XWiki content" (not "Import as Markdown")
++* The xWiki parser will correctly interpret {{code}}==}} headers, {{{{code}}}}}} blocks, etc.
--=== Success Criteria: ===
++**Alternate naming:** If your workflow supports it, rename to {{code}}.xwiki.txt{{/code}} to avoid ambiguity.
++---
++
++== 1. Core Objective (POC1) ==
++
++The primary technical goal of POC1 is to validate **Approach 1 (Single-Pass Holistic Analysis)** while implementing **claim-level caching** to achieve cost sustainability:
++
++The system must prove that AI can identify an article's **Main Thesis** and determine if the supporting claims (even if individually accurate) logically support that thesis without committing fallacies (e.g., correlation vs. causation, cherry-picking, hasty generalization).
++
++**Success Criteria:**
  * Test with 30 diverse articles
  * Target: ≥70% accuracy detecting misleading articles
  * Cost: <$0.25 per NEW analysis (uncached)
@@ -27,13 +27,14 @@
  * Cache hit rate: ≥50% after 1,000 articles
  * Processing time: <2 minutes (standard depth)
--=== Economic Model: ===
++**Economic Model:**
++* Free tier: $10 credit per month (~40-140 articles depending on cache hits)
++* After limit: Cache-only mode (instant, free access to cached claims)
++* Paid tier: Unlimited new analyses
--* **Free tier:** $10 credit per month (~~40-140 articles depending on cache hits)
--* **After limit:** Cache-only mode (instant, free access to cached claims)
--* **Paid tier:** Unlimited new analyses
++**See:** [[Article Verdict Problem>>Test.FactHarbor.Specification.POC.Article-Verdict-Problem]] for complete investigation of 7 approaches.
------
++---
  == 2. Architecture Overview ==
@@ -41,7 +41,8 @@
  FactHarbor POC1 uses a **3-stage architecture** designed for claim-level caching and cost efficiency:
--{{{graph TD
++{{code language="mermaid"}}
++graph TD
      A[Article Input] --> B[Stage 1: Extract Claims]
      B --> C{For Each Claim}
      C --> D[Check Cache]
@@ -51,46 +51,41 @@
      G --> E
      E --> H[Stage 3: Holistic Assessment]
      H --> I[Final Report]
--}}}
++{{/code}}
--==== Stage 1: Claim Extraction (Haiku, no cache) ====
++**Stage 1: Claim Extraction** (Haiku, no cache)
++* Input: Article text
++* Output: 5 canonical claims (normalized, deduplicated)
++* Model: Claude Haiku 4
++* Cost: $0.003 per article
++* Cache strategy: No caching (article-specific)
--* **Input:** Article text
--* **Output:** 5 canonical claims (normalized, deduplicated)
--* **Model:** Claude Haiku 4
--* **Cost:** $0.003 per article
--* **Cache strategy:** No caching (article-specific)
++**Stage 2: Claim Analysis** (Sonnet, CACHED)
++* Input: Single canonical claim
++* Output: Scenarios + Evidence + Verdicts
++* Model: Claude Sonnet 3.5
++* Cost: $0.081 per NEW claim
++* Cache strategy: **Redis, 90-day TTL**
++* Cache key: {{code}}claim:v1norm1:{language}:{sha256(canonical_claim)}{{/code}}
--==== Stage 2: Claim Analysis (Sonnet, CACHED) ====
++**Stage 3: Holistic Assessment** (Sonnet, no cache)
++* Input: Article + Claim verdicts (from cache or Stage 2)
++* Output: Article verdict + Fallacies + Logic quality
++* Model: Claude Sonnet 3.5
++* Cost: $0.030 per article
++* Cache strategy: No caching (article-specific)
--* **Input:** Single canonical claim
--* **Output:** Scenarios + Evidence + Verdicts
--* **Model:** Claude Sonnet 3.5
--* **Cost:** $0.081 per NEW claim
--* **Cache strategy:** Redis, 90-day TTL
--* **Cache key:** claim:v1norm1:{language}:{sha256(canonical_claim)}
++**Total Cost Formula:**
++{{code}}
++Cost = $0.003 (extraction) + (N_new_claims × $0.081) + $0.030 (holistic)
--==== Stage 3: Holistic Assessment (Sonnet, no cache) ====
--
--* **Input:** Article + Claim verdicts (from cache or Stage 2)
--* **Output:** Article verdict + Fallacies + Logic quality
--* **Model:** Claude Sonnet 3.5
--* **Cost:** $0.030 per article
--* **Cache strategy:** No caching (article-specific)
--
--=== Total Cost Formula: ===
--
--{{{Cost = $0.003 (extraction) + (N_new_claims × $0.081) + $0.030 (holistic)
--
  Examples:
  - 0 new claims (100% cache hit): $0.033
  - 1 new claim (80% cache hit): $0.114
  - 3 new claims (40% cache hit): $0.276
  - 5 new claims (0% cache hit): $0.438
--}}}
++{{/code}}
------
--
  === 2.2 User Tier System ===
  |=Tier|=Monthly Credit|=After Limit|=Cache Access|=Analytics
@@ -99,21 +99,17 @@
  |**Enterprise** (future)|Custom|Continues|✅ Full + Priority|Full
  **Free Tier Economics:**
--
  * $10 credit = 40-140 articles analyzed (depending on cache hit rate)
  * Average 70 articles/month at 70% cache hit rate
--* After limit: Cache-only mode
++* After limit: Cache-only mode (see Section 2.3)
------
--
  === 2.3 Cache-Only Mode (Free Tier Feature) ===
  When free users reach their $10 monthly limit, they enter **Cache-Only Mode**:
--==== What Cache-Only Mode Provides: ====
++**What Cache-Only Mode Provides:**
  ✅ **Claim Extraction (Platform-Funded):**
--
  * Stage 1 extraction runs at $0.003 per article
  * **Cost: Absorbed by platform** (not charged to user credit)
  * Rationale: Extraction is necessary to check cache, and cost is negligible
@@ -120,31 +120,27 @@
  * Rate limit: Max 50 extractions/day in cache-only mode (prevents abuse)
  ✅ **Instant Access to Cached Claims:**
--
  * Any claim that exists in cache → Full verdict returned
  * Cost: $0 (no LLM calls)
  * Response time: <100ms
  ✅ **Partial Article Analysis:**
--
  * Check each claim against cache
  * Return verdicts for ALL cached claims
--* For uncached claims: Return "status": "cache_miss"
++* For uncached claims: Return {{code}}"status": "cache_miss"{{/code}}
  ✅ **Cache Coverage Report:**
--
  * "3 of 5 claims available in cache (60% coverage)"
  * Links to cached analyses
  * Estimated cost to complete: $0.162 (2 new claims)
  ❌ **Not Available in Cache-Only Mode:**
--
  * New claim analysis (Stage 2 LLM calls blocked)
  * Full holistic assessment (Stage 3 blocked if any claims missing)
--==== User Experience Example: ====
--
--{{{{
++**User Experience:**
++{{code language="json"}}
++{
    "status": "cache_only_mode",
    "message": "Monthly credit limit reached. Showing cached results only.",
    "cache_coverage": {
@@ -167,26 +167,26 @@
      "pro_tier": "$50/month unlimited"
    }
  }
--}}}
++{{/code}}
  **Design Rationale:**
--
  * Free users still get value (cached claims often answer their question)
  * Demonstrates FactHarbor's value (partial results encourage upgrade)
  * Sustainable for platform (no additional cost)
  * Fair to all users (everyone contributes to cache)
------
++---
  == 3. REST API Contract ==
  === 3.1 User Credit Tracking ===
--**Endpoint:** GET /v1/user/credit
++**Endpoint:** {{code}}GET /v1/user/credit{{/code}}
--**Response:** 200 OK
++**Response:** {{code}}200 OK{{/code}}
--{{{{
++{{code language="json"}}
++{
    "user_id": "user_abc123",
    "tier": "free",
    "credit_limit": 10.00,
@@ -201,25 +201,30 @@
      "cache_hit_rate": 0.626
    }
  }
--}}}
++{{/code}}
------
++---
  === 3.2 Create Analysis Job (3-Stage) ===
--**Endpoint:** POST /v1/analyze
++**Endpoint:** {{code}}POST /v1/analyze{{/code}}
--==== Idempotency Support: ====
++**Request Body:**
++
++**Idempotency Support:**
++
  To prevent duplicate job creation on network retries, clients SHOULD include:
--{{{POST /v1/analyze
++{{code language="http"}}
++POST /v1/analyze
  Idempotency-Key: {client-generated-uuid}
--}}}
++{{/code}}
--OR use the client.request_id field:
++OR use the {{code}}client.request_id{{/code}} field:
--{{{{
++{{code language="json"}}
++{
    "input_url": "...",
    "client": {
      "request_id": "client-uuid-12345",
@@ -226,18 +226,17 @@
      "source_label": "optional"
    }
  }
--}}}
++{{/code}}
  **Server Behavior:**
--
--* If Idempotency-Key or request_id seen before (within 24 hours):
--** Return existing job (200 OK, not 202 Accepted)
--** Do NOT create duplicate job or charge twice
++* If {{code}}Idempotency-Key{{/code}} or {{code}}request_id{{/code}} seen before (within 24 hours):
++  - Return existing job ({{code}}200 OK{{/code}}, not {{code}}202 Accepted{{/code}})
++  - Do NOT create duplicate job or charge twice
  * Idempotency keys expire after 24 hours (matches job retention)
  **Example Response (Idempotent):**
--
--{{{{
++{{code language="json"}}
++{
    "job_id": "01J...ULID",
    "status": "RUNNING",
    "idempotent": true,
@@ -244,11 +244,11 @@
    "original_request_at": "2025-12-24T10:31:00Z",
    "message": "Returning existing job (idempotency key matched)"
  }
--}}}
++{{/code}}
--==== Request Body: ====
--{{{{
++{{code language="json"}}
++{
    "input_type": "url",
    "input_url": "https://example.com/medical-report-01",
    "input_text": null,
@@ -256,9 +256,8 @@
      "browsing": "on",
      "depth": "standard",
      "max_claims": 5,
--    "scenarios_per_claim": 2,
--    "max_evidence_per_scenario": 6,
--    "context_aware_analysis": true
++    "context_aware_analysis": true,
++    "cache_preference": "prefer_cache"
    },
    "client": {
      "request_id": "optional-client-tracking-id",
@@ -265,20 +265,18 @@
      "source_label": "optional"
    }
  }
--}}}
++{{/code}}
  **Options:**
++* {{code}}cache_preference{{/code}}: {{code}}prefer_cache{{/code}} | {{code}}require_fresh{{/code}} | {{code}}allow_partial{{/code}}
++  - {{code}}prefer_cache{{/code}}: Use cache when available, analyze new claims (default)
++  - {{code}}require_fresh{{/code}}: Force re-analysis of all claims (ignores cache, costs more)
++  - {{code}}allow_partial{{/code}}: Return partial results if some claims uncached (for free tier cache-only mode)
--* browsing: on | off (retrieve web sources or just output queries)
--* depth: standard | deep (evidence thoroughness)
--* max_claims: 1-10 (default: **5** for cost control)
--* scenarios_per_claim: 1-5 (default: **2** for cost control)
--* max_evidence_per_scenario: 3-10 (default: **6**)
--* context_aware_analysis: true | false (experimental)
++**Response:** {{code}}202 Accepted{{/code}}
--**Response:** 202 Accepted
--
--{{{{
++{{code language="json"}}
++{
    "job_id": "01J...ULID",
    "status": "QUEUED",
    "created_at": "2025-12-24T10:31:00Z",
@@ -301,13 +301,13 @@
      "events": "/v1/jobs/01J...ULID/events"
    }
  }
--}}}
++{{/code}}
  **Error Responses:**
--402 Payment Required - Free tier limit reached, cache-only mode
--
--{{{{
++{{code}}402 Payment Required{{/code}} - Free tier limit reached, cache-only mode
++{{code language="json"}}
++{
    "error": "credit_limit_reached",
    "message": "Monthly credit limit reached. Entering cache-only mode.",
    "cache_only_mode": true,
@@ -315,15 +315,199 @@
    "reset_date": "2025-02-01T00:00:00Z",
    "action": "Resubmit with cache_preference=allow_partial for cached results"
  }
--}}}
++{{/code}}
------
++---
++=== 3.3 Get Job Status ===
++
++**Endpoint:** {{code}}GET /v1/jobs/{job_id}{{/code}}
++
++**Response:** {{code}}200 OK{{/code}}
++
++{{code language="json"}}
++{
++  "job_id": "01J...ULID",
++  "status": "RUNNING",
++  "created_at": "2025-12-24T10:31:00Z",
++  "updated_at": "2025-12-24T10:31:22Z",
++  "progress": {
++    "stage": "stage2_claim_analysis",
++    "percent": 65,
++    "message": "Analyzing claim 3 of 5 (2 from cache)",
++    "current_claim_id": "C3",
++    "cache_hits": 2,
++    "cache_misses": 1
++  },
++  "actual_cost": 0.084,
++  "cost_breakdown": {
++    "stage1_extraction": 0.003,
++    "stage2_new_claims": 0.081,
++    "stage2_cached_claims": 0.000,
++    "stage3_holistic": null
++  },
++  "input_echo": {
++    "input_type": "url",
++    "input_url": "https://example.com/medical-report-01"
++  },
++  "links": {
++    "self": "/v1/jobs/01J...ULID",
++    "result": "/v1/jobs/01J...ULID/result",
++    "report": "/v1/jobs/01J...ULID/report"
++  },
++  "error": null
++}
++{{/code}}
++
++---
++
++=== 3.4 Get Analysis Result ===
++
++**Endpoint:** {{code}}GET /v1/jobs/{job_id}/result{{/code}}
++
++**Response:** {{code}}200 OK{{/code}}
++
++Returns complete **AnalysisResult** schema (see Section 4).
++
++**Cache-Only Mode Response:** {{code}}206 Partial Content{{/code}}
++
++{{code language="json"}}
++{
++  "cache_only_mode": true,
++  "cache_coverage": {
++    "claims_total": 5,
++    "claims_cached": 3,
++    "claims_missing": 2,
++    "coverage_percent": 60
++  },
++  "partial_result": {
++    "metadata": {
++      "job_id": "01J...ULID",
++      "timestamp_utc": "2025-12-24T10:31:30Z",
++      "engine_version": "POC1-v0.4",
++      "cache_only": true
++    },
++    "claims": [
++      {
++        "claim_id": "C1",
++        "claim_text": "...",
++        "canonical_claim": "...",
++        "source": "cache",
++        "cached_at": "2025-12-20T15:30:00Z",
++        "cache_hit_count": 47,
++        "scenarios": [...]
++      },
++      {
++        "claim_id": "C3",
++        "claim_text": "...",
++        "canonical_claim": "...",
++        "source": "not_analyzed",
++        "status": "cache_miss",
++        "estimated_cost": 0.081
++      }
++    ],
++    "article_holistic_assessment": null,
++    "upgrade_prompt": {
++      "message": "Upgrade to Pro for full analysis of all claims",
++      "missing_claims": 2,
++      "cost_to_complete": 0.192
++    }
++  }
++}
++{{/code}}
++
++**Other Responses:**
++* {{code}}409 Conflict{{/code}} - Job not finished yet
++* {{code}}404 Not Found{{/code}} - Job ID unknown
++
++---
++
++=== 3.5 Stage-Specific Endpoints (Optional, Advanced) ===
++
++For direct stage access (useful for cache debugging, custom workflows):
++
++**Extract Claims Only:**
++{{code}}POST /v1/analyze/extract-claims{{/code}}
++
++**Analyze Single Claim:**
++{{code}}POST /v1/analyze/claim{{/code}}
++
++**Assess Article (with claim verdicts):**
++{{code}}POST /v1/analyze/assess-article{{/code}}
++
++**Check Claim Cache:**
++{{code}}GET /v1/cache/claim/{claim_hash}{{/code}}
++
++**Cache Statistics:**
++{{code}}GET /v1/cache/stats{{/code}}
++
++---
++
++=== 3.6 Download Markdown Report ===
++
++**Endpoint:** {{code}}GET /v1/jobs/{job_id}/report{{/code}}
++
++**Response:** {{code}}200 OK{{/code}} with {{code}}text/markdown; charset=utf-8{{/code}} content
++
++**Headers:**
++* {{code}}Content-Disposition: attachment; filename="factharbor_poc1_{job_id}.md"{{/code}}
++
++**Cache-Only Mode:** Report includes "Partial Analysis" watermark and upgrade prompt.
++
++---
++
++=== 3.7 Stream Job Events (Backend Progress) ===
++
++**Endpoint:** {{code}}GET /v1/jobs/{job_id}/events{{/code}}
++
++**Response:** Server-Sent Events (SSE) stream
++
++**Event Types:**
++* {{code}}progress{{/code}} - Backend progress (e.g., "Stage 1: Extracting claims")
++* {{code}}cache_hit{{/code}} - Claim found in cache
++* {{code}}cache_miss{{/code}} - Claim requires new analysis
++* {{code}}stage_complete{{/code}} - Stage 1/2/3 finished
++* {{code}}complete{{/code}} - Job finished
++* {{code}}error{{/code}} - Error occurred
++* {{code}}credit_warning{{/code}} - User approaching limit
++
++---
++
++=== 3.8 Cancel Job ===
++
++**Endpoint:** {{code}}DELETE /v1/jobs/{job_id}{{/code}}
++
++**Note:** If job is mid-stage (e.g., analyzing claim 3 of 5), user is charged for completed work only.
++
++---
++
++=== 3.9 Health Check ===
++
++**Endpoint:** {{code}}GET /v1/health{{/code}}
++
++{{code language="json"}}
++{
++  "status": "ok",
++  "version": "POC1-v0.4",
++  "model_stage1": "claude-haiku-4",
++  "model_stage2": "claude-3-5-sonnet-20241022",
++  "model_stage3": "claude-3-5-sonnet-20241022",
++  "cache": {
++    "status": "connected",
++    "total_claims": 12847,
++    "avg_hit_rate_24h": 0.73
++  }
++}
++{{/code}}
++
++---
++
  == 4. Data Schemas ==
  === 4.1 Stage 1 Output: ClaimExtraction ===
--{{{{
++{{code language="json"}}
++{
    "job_id": "01J...ULID",
    "stage": "stage1_extraction",
    "article_metadata": {
@@ -348,10 +348,219 @@
    "article_thesis": "Main argument detected",
    "cost": 0.003
  }
--}}}
++{{/code}}
------
++=== 4.2 Stage 2 Output: ClaimAnalysis (CACHED) ===
++This is the CACHEABLE unit. Stored in Redis with 90-day TTL.
++
++{{code language="json"}}
++{
++  "claim_hash": "sha256:abc123...",
++  "canonical_claim": "COVID vaccines are 95% effective",
++  "language": "en",
++  "domain": "public_health",
++  "analysis_version": "v1.0",
++  "scenarios": [
++    {
++      "scenario_id": "S1",
++      "scenario_title": "mRNA vaccines (Pfizer/Moderna) in clinical trials",
++      "definitions": {"95% effective": "95% reduction in symptomatic infection"},
++      "assumptions": ["Based on phase 3 trial data", "Against original strain"],
++      "boundaries": {
++        "time": "2020-2021 trials",
++        "geography": "Multi-country trials",
++        "population": "Adult population (16+)",
++        "conditions": "Before widespread variants"
++      },
++      "verdict": {
++        "label": "Highly Likely",
++        "probability_range": [0.88, 0.97],
++        "confidence": 0.92,
++        "reasoning_chain": [
++          "Pfizer/BioNTech trial: 95% efficacy (n=43,548)",
++          "Moderna trial: 94.1% efficacy (n=30,420)",
++          "Peer-reviewed publications in NEJM",
++          "FDA independent analysis confirmed"
++        ],
++        "key_supporting_evidence_ids": ["E1", "E2"],
++        "key_counter_evidence_ids": ["E3"],
++        "uncertainty_factors": [
++          "Limited data on long-term effectiveness",
++          "Variant-specific performance not yet measured"
++        ]
++      },
++      "evidence": [
++        {
++          "evidence_id": "E1",
++          "stance": "supports",
++          "relevance_to_scenario": 0.98,
++          "evidence_summary": [
++            "Pfizer trial showed 170 cases in placebo vs 8 in vaccine group",
++            "Follow-up period median 2 months post-dose 2",
++            "Efficacy consistent across age, sex, race, ethnicity"
++          ],
++          "citation": {
++            "title": "Safety and Efficacy of the BNT162b2 mRNA Covid-19 Vaccine",
++            "author_or_org": "Polack et al.",
++            "publication_date": "2020-12-31",
++            "url": "https://nejm.org/doi/full/10.1056/NEJMoa2034577",
++            "publisher": "New England Journal of Medicine",
++            "retrieved_at_utc": "2025-12-20T15:30:00Z"
++          },
++          "excerpt": ["The vaccine was 95% effective in preventing Covid-19"],
++          "excerpt_word_count": 9,
++          "source_reliability_score": 0.95,
++          "reliability_justification": "Peer-reviewed, high-impact journal, large RCT",
++          "limitations_and_reservations": [
++            "Short follow-up period (2 months)",
++            "Primarily measures symptomatic infection, not transmission"
++          ],
++          "retraction_or_dispute_signal": "none"
++        }
++      ]
++    }
++  ],
++  "cache_metadata": {
++    "first_analyzed": "2025-12-01T10:00:00Z",
++    "last_updated": "2025-12-20T15:30:00Z",
++    "hit_count": 47,
++    "version": "v1.0",
++    "ttl_expires": "2026-03-20T15:30:00Z"
++  },
++  "cost": 0.081
++}
++{{/code}}
++
++**Cache Key Structure:**
++{{code}}
++Redis Key: claim:v1norm1:{language}:{sha256(canonical_claim)}
++TTL: 90 days (7,776,000 seconds)
++Size: ~15KB JSON (compressed: ~5KB)
++{{/code}}
++
++=== 4.3 Stage 3 Output: HolisticAssessment ===
++
++{{code language="json"}}
++{
++  "job_id": "01J...ULID",
++  "stage": "stage3_holistic",
++  "article_metadata": {
++    "title": "...",
++    "main_thesis": "...",
++    "source_url": "..."
++  },
++  "article_holistic_assessment": {
++    "overall_verdict": "MISLEADING",
++    "logic_quality_score": 0.42,
++    "fallacies_detected": [
++      "correlation-causation",
++      "cherry-picking"
++    ],
++    "verdict_reasoning": [
++      "Central claim C1 is REFUTED by multiple systematic reviews",
++      "Supporting claims C2-C4 are TRUE but do not support the thesis",
++      "Article commits correlation-causation fallacy",
++      "Selective citation of evidence (cherry-picking detected)"
++    ],
++    "experimental_feature": true
++  },
++  "claims_summary": [
++    {
++      "claim_id": "C1",
++      "is_central_to_thesis": true,
++      "verdict": "Refuted",
++      "confidence": 0.89,
++      "source": "cache",
++      "cache_hit": true
++    },
++    {
++      "claim_id": "C2",
++      "is_central_to_thesis": false,
++      "verdict": "Highly Likely",
++      "confidence": 0.91,
++      "source": "new_analysis",
++      "cache_hit": false
++    }
++  ],
++  "quality_gates": {
++    "gate1_claim_validation": "pass",
++    "gate4_verdict_confidence": "pass",
++    "passed_all": true
++  },
++  "cost": 0.030,
++  "total_job_cost": 0.114
++}
++{{/code}}
++
++=== 4.4 Complete AnalysisResult (All 3 Stages Combined) ===
++
++{{code language="json"}}
++{
++  "metadata": {
++    "job_id": "01J...ULID",
++    "timestamp_utc": "2025-12-24T10:31:30Z",
++    "engine_version": "POC1-v0.4",
++    "llm_stage1": "claude-haiku-4",
++    "llm_stage2": "claude-3-5-sonnet-20241022",
++    "llm_stage3": "claude-3-5-sonnet-20241022",
++    "usage_stats": {
++      "stage1_tokens": {"input": 10000, "output": 500},
++      "stage2_tokens": {"input": 2000, "output": 5000},
++      "stage3_tokens": {"input": 5000, "output": 1000},
++      "total_input_tokens": 17000,
++      "total_output_tokens": 6500,
++      "estimated_cost_usd": 0.114,
++      "response_time_sec": 45.2
++    },
++    "cache_stats": {
++      "claims_total": 5,
++      "claims_from_cache": 4,
++      "claims_new_analysis": 1,
++      "cache_hit_rate": 0.80,
++      "cache_savings_usd": 0.324
++    }
++  },
++  "article_holistic_assessment": {
++    "main_thesis": "...",
++    "overall_verdict": "MISLEADING",
++    "logic_quality_score": 0.42,
++    "fallacies_detected": ["correlation-causation", "cherry-picking"],
++    "verdict_reasoning": ["...", "...", "..."],
++    "experimental_feature": true
++  },
++  "claims": [
++    {
++      "claim_id": "C1",
++      "is_central_to_thesis": true,
++      "claim_text": "...",
++      "canonical_claim": "...",
++      "claim_hash": "sha256:abc123...",
++      "claim_type": "causal",
++      "evaluability": "evaluable",
++      "risk_tier": "B",
++      "source": "cache",
++      "cached_at": "2025-12-20T15:30:00Z",
++      "cache_hit_count": 47,
++      "scenarios": [...]
++    },
++    {
++      "claim_id": "C2",
++      "source": "new_analysis",
++      "analyzed_at": "2025-12-24T10:31:15Z",
++      "scenarios": [...]
++    }
++  ],
++  "quality_gates": {
++    "gate1_claim_validation": "pass",
++    "gate4_verdict_confidence": "pass",
++    "passed_all": true
++  }
++}
++{{/code}}
++
++
++
  === 4.5 Verdict Label Taxonomy ===
  FactHarbor uses **three distinct verdict taxonomies** depending on analysis level:
@@ -361,26 +361,23 @@
  Used for individual scenario verdicts within a claim.
  **Enum Values:**
++* {{code}}Highly Likely{{/code}} - Probability 0.85-1.0, high confidence
++* {{code}}Likely{{/code}} - Probability 0.65-0.84, moderate-high confidence
++* {{code}}Unclear{{/code}} - Probability 0.35-0.64, or low confidence
++* {{code}}Unlikely{{/code}} - Probability 0.16-0.34, moderate-high confidence
++* {{code}}Highly Unlikely{{/code}} - Probability 0.0-0.15, high confidence
++* {{code}}Unsubstantiated{{/code}} - Insufficient evidence to determine probability
--* Highly Likely - Probability 0.85-1.0, high confidence
--* Likely - Probability 0.65-0.84, moderate-high confidence
--* Unclear - Probability 0.35-0.64, or low confidence
--* Unlikely - Probability 0.16-0.34, moderate-high confidence
--* Highly Unlikely - Probability 0.0-0.15, high confidence
--* Unsubstantiated - Insufficient evidence to determine probability
--
  ==== 4.5.2 Claim Verdict Labels (Rollup) ====
  Used when summarizing a claim across all scenarios.
  **Enum Values:**
++* {{code}}Supported{{/code}} - Majority of scenarios are Likely or Highly Likely
++* {{code}}Refuted{{/code}} - Majority of scenarios are Unlikely or Highly Unlikely
++* {{code}}Inconclusive{{/code}} - Mixed scenarios or majority Unclear/Unsubstantiated
--* Supported - Majority of scenarios are Likely or Highly Likely
--* Refuted - Majority of scenarios are Unlikely or Highly Unlikely
--* Inconclusive - Mixed scenarios or majority Unclear/Unsubstantiated
--
  **Mapping Logic:**
--
  * If ≥60% scenarios are (Highly Likely | Likely) → Supported
  * If ≥60% scenarios are (Highly Unlikely | Unlikely) → Refuted
  * Otherwise → Inconclusive
@@ -390,23 +390,23 @@
  Used for holistic article-level assessment.
  **Enum Values:**
++* {{code}}WELL-SUPPORTED{{/code}} - Article thesis logically follows from supported claims
++* {{code}}MISLEADING{{/code}} - Claims may be true but article commits logical fallacies
++* {{code}}REFUTED{{/code}} - Central claims are refuted, invalidating thesis
++* {{code}}UNCERTAIN{{/code}} - Insufficient evidence or highly mixed claim verdicts
--* WELL-SUPPORTED - Article thesis logically follows from supported claims
--* MISLEADING - Claims may be true but article commits logical fallacies
--* REFUTED - Central claims are refuted, invalidating thesis
--* UNCERTAIN - Insufficient evidence or highly mixed claim verdicts
--
  **Note:** Article verdict considers **claim centrality** (central claims override supporting claims).
  ==== 4.5.4 API Field Mapping ====
  |=Level|=API Field|=Enum Name
--|Scenario|scenarios[].verdict.label|scenario_verdict_label
--|Claim|claims[].rollup_verdict (optional)|claim_verdict_label
--|Article|article_holistic_assessment.overall_verdict|article_verdict_label
++|Scenario|{{code}}scenarios[].verdict.label{{/code}}|scenario_verdict_label
++|Claim|{{code}}claims[].rollup_verdict{{/code}} (optional)|claim_verdict_label
++|Article|{{code}}article_holistic_assessment.overall_verdict{{/code}}|article_verdict_label
------
++---
++
  == 5. Cache Architecture ==
  === 5.1 Redis Cache Design ===
@@ -414,29 +414,117 @@
  **Technology:** Redis 7.0+ (in-memory key-value store)
  **Cache Key Schema:**
++{{code}}
++claim:v1norm1:{language}:{sha256(canonical_claim)}
++{{/code}}
--{{{claim:v1norm1:{language}:{sha256(canonical_claim)}
--}}}
--
  **Example:**
--
--{{{Claim (English): "COVID vaccines are 95% effective"
++{{code}}
++Claim (English): "COVID vaccines are 95% effective"
  Canonical: "covid vaccines are 95 percent effective"
  Language: "en"
  SHA256: abc123...def456
  Key: claim:v1norm1:en:abc123...def456
--}}}
++{{/code}}
  **Rationale:** Prevents cross-language collisions and enables per-language cache analytics.
  **Data Structure:**
++{{code language="redis"}}
++SET claim:v1:abc123...def456 '{...ClaimAnalysis JSON...}'
++EXPIRE claim:v1:abc123...def456 7776000  # 90 days
++{{/code}}
--{{{SET claim:v1norm1:en:abc123...def456 '{...ClaimAnalysis JSON...}'
--EXPIRE claim:v1norm1:en:abc123...def456 7776000  # 90 days
--}}}
++**Additional Keys:**
++{{code}}
------
++==== 5.1.1 Canonical Claim Normalization (v1) ====
++The cache key depends on deterministic claim normalization. All implementations MUST follow this algorithm exactly.
++
++**Algorithm: Canonical Claim Normalization v1**
++
++{{code language="python"}}
++def normalize_claim_v1(claim_text: str, language: str) -> str:
++    """
++    Normalizes claim to canonical form for cache key generation.
++    Version: v1norm1 (POC1)
++    """
++    import re
++    import unicodedata
++
++    # Step 1: Unicode normalization (NFC)
++    text = unicodedata.normalize('NFC', claim_text)
++
++    # Step 2: Lowercase
++    text = text.lower()
++
++    # Step 3: Remove punctuation (except hyphens in words)
++    text = re.sub(r'[^\w\s-]', '', text)
++
++    # Step 4: Normalize whitespace (collapse multiple spaces)
++    text = re.sub(r'\s+', ' ', text).strip()
++
++    # Step 5: Numeric normalization
++    text = text.replace('%', ' percent')
++    # Spell out single-digit numbers
++    num_to_word = {'0':'zero', '1':'one', '2':'two', '3':'three',
++                   '4':'four', '5':'five', '6':'six', '7':'seven',
++                   '8':'eight', '9':'nine'}
++    for num, word in num_to_word.items():
++        text = re.sub(rf'\b{num}\b', word, text)
++
++    # Step 6: Common abbreviations (English only in v1)
++    if language == 'en':
++        text = text.replace('covid-19', 'covid')
++        text = text.replace('u.s.', 'us')
++        text = text.replace('u.k.', 'uk')
++
++    # Step 7: NO entity normalization in v1
++    # (Trump vs Donald Trump vs President Trump remain distinct)
++
++    return text
++
++# Version identifier (include in cache namespace)
++CANONICALIZER_VERSION = "v1norm1"
++{{/code}}
++
++**Cache Key Formula (Updated):**
++
++{{code}}
++language = "en"
++canonical = normalize_claim_v1(claim_text, language)
++cache_key = f"claim:{CANONICALIZER_VERSION}:{language}:{sha256(canonical)}"
++
++Example:
++  claim: "COVID-19 vaccines are 95% effective"
++  canonical: "covid vaccines are 95 percent effective"
++  sha256: abc123...def456
++  key: "claim:v1norm1:en:abc123...def456"
++{{/code}}
++
++**Cache Metadata MUST Include:**
++
++{{code language="json"}}
++{
++  "canonical_claim": "covid vaccines are 95 percent effective",
++  "canonicalizer_version": "v1norm1",
++  "language": "en",
++  "original_claim_samples": ["COVID-19 vaccines are 95% effective"]
++}
++{{/code}}
++
++**Version Upgrade Path:**
++* v1norm1 → v1norm2: Cache namespace changes, old keys remain valid until TTL
++* v1normN → v2norm1: Major version bump, invalidate all v1 caches
++
++
++claim:stats:hit_count:{claim_hash}  # Counter
++claim:index:domain:{domain}  # Set of claim hashes by domain
++claim:index:language:{lang}  # Set of claim hashes by language
++{{/code}}
++
++
  === 5.1.1 Canonical Claim Normalization (v1) ===
  The cache key depends on deterministic claim normalization. All implementations MUST follow this algorithm exactly.
@@ -443,7 +443,8 @@
  **Algorithm: Canonical Claim Normalization v1**
--{{{def normalize_claim_v1(claim_text: str, language: str) -> str:
++{{code language="python"}}
++def normalize_claim_v1(claim_text: str, language: str) -> str:
      """
      Normalizes claim to canonical form for cache key generation.
      Version: v1norm1 (POC1)
@@ -485,11 +485,12 @@
  # Version identifier (include in cache namespace)
  CANONICALIZER_VERSION = "v1norm1"
--}}}
++{{/code}}
  **Cache Key Formula (Updated):**
--{{{language = "en"
++{{code}}
++language = "en"
  canonical = normalize_claim_v1(claim_text, language)
  cache_key = f"claim:{CANONICALIZER_VERSION}:{language}:{sha256(canonical)}"
@@ -498,25 +498,25 @@
    canonical: "covid vaccines are 95 percent effective"
    sha256: abc123...def456
    key: "claim:v1norm1:en:abc123...def456"
--}}}
++{{/code}}
  **Cache Metadata MUST Include:**
--{{{{
++{{code language="json"}}
++{
    "canonical_claim": "covid vaccines are 95 percent effective",
    "canonicalizer_version": "v1norm1",
    "language": "en",
    "original_claim_samples": ["COVID-19 vaccines are 95% effective"]
  }
--}}}
++{{/code}}
  **Version Upgrade Path:**
--
  * v1norm1 → v1norm2: Cache namespace changes, old keys remain valid until TTL
  * v1normN → v2norm1: Major version bump, invalidate all v1 caches
------
++
  === 5.1.2 Copyright & Data Retention Policy ===
  **Evidence Excerpt Storage:**
@@ -524,7 +524,6 @@
  To comply with copyright law and fair use principles:
  **What We Store:**
--
  * **Metadata only:** Title, author, publisher, URL, publication date
  * **Short excerpts:** Max 25 words per quote, max 3 quotes per evidence item
  * **Summaries:** AI-generated bullet points (not verbatim text)
@@ -531,20 +531,17 @@
  * **No full articles:** Never store complete article text beyond job processing
  **Total per Cached Claim:**
--
  * Scenarios: 2 per claim
  * Evidence items: 6 per scenario (12 total)
  * Quotes: 3 per evidence × 25 words = 75 words per item
--* **Maximum stored verbatim text:** ~~900 words per claim (12 × 75)
++* **Maximum stored verbatim text:** ~900 words per claim (12 × 75)
  **Retention:**
--
  * Cache TTL: 90 days
  * Job outputs: 24 hours (then archived or deleted)
  * No persistent full-text article storage
  **Rationale:**
--
  * Short excerpts for citation = fair use
  * Summaries are transformative (not copyrightable)
  * Limited retention (90 days max)
@@ -551,27 +551,480 @@
  * No commercial republication of excerpts
  **DMCA Compliance:**
--
  * Cache invalidation endpoint available for rights holders
  * Contact: dmca@factharbor.org
------
--== Summary ==
++=== 5.2 Cache Invalidation Strategy ===
--This WYSIWYG preview shows the **structure and key sections** of the 1,515-line API specification.
++**Time-Based (Primary):**
++* TTL: 90 days for most claims
++* Reasoning: Evidence freshness, news cycles
--**Full specification includes:**
++**Event-Based (Manual):**
++* Admin can flag claims for invalidation
++* Example: "Major study retracts findings"
++* Tool: {{code}}DELETE /v1/cache/claim/{claim_hash}?reason=retraction{{/code}}
--* Complete API endpoints (7 total)
--* All data schemas (ClaimExtraction, ClaimAnalysis, HolisticAssessment, Complete)
--* Quality gates & validation rules
--* LLM configuration for all 3 stages
--* Implementation notes with code samples
--* Testing strategy
--* Cross-references to other pages
++**Version-Based (Automatic):**
++* AKEL v2.0 release → Invalidate all v1.0 caches
++* Cache keys include version: {{code}}claim:v1:*{{/code}} vs {{code}}claim:v2:*{{/code}}
--**The complete specification is available in:**
++**Long-Lived Historical Claims:**
++* Historical claims about completed events generally have stable verdicts
++* Example: "2024 US presidential election results"
++* **Policy:** Extended TTL (365-3,650 days) instead of "never invalidate"
++* **Reason:** Even historical data gets revisions (updated counts, corrections)
++* **Mechanism:** Admin can still manually invalidate if major correction issued
++* **Flag:** {{code}}is_historical=true{{/code}} in cache metadata → longer TTL
--* FactHarbor_POC1_API_and_Schemas_Spec_v0_4_1_PATCHED.md (45 KB standalone)
--* Export files (TEST/PRODUCTION) for xWiki import
++=== 5.3 Cache Warming Strategy ===
++
++**Proactive Cache Building (Future):**
++
++**Trending Topics:**
++* Monitor news APIs for trending topics
++* Pre-analyze top 20 common claims
++* Example: New health study published → Pre-cache related claims
++
++**Predictable Events:**
++* Elections, sporting events, earnings reports
++* Pre-cache expected claims before event
++* Reduces load during traffic spikes
++
++**User Patterns:**
++* Analyze query logs
++* Identify frequently requested claims
++* Prioritize cache warming for these
++
++---
++
++== 6. Quality Gates & Validation Rules ==
++
++=== 6.1 Quality Gate Overview ===
++
++|=Gate|=Name|=POC1 Status|=Applies To|=Notes
++|**Gate 1**|Claim Validation|✅ Hard gate|Stage 1: Extraction|Filters opinions, compound claims
++|**Gate 2**|Contradiction Search|✅ Mandatory rule|Stage 2: Analysis|Enforced per cached claim
++|**Gate 3**|Uncertainty Disclosure|⚠️ Soft guidance|Stage 2: Analysis|Best practice
++|**Gate 4**|Verdict Confidence|✅ Hard gate|Stage 2: Analysis|Confidence ≥ 0.5 required
++
++**Hard Gate Failures:**
++* Gate 1 fail → Claim excluded from analysis
++* Gate 4 fail → Claim marked "Unsubstantiated" but included
++
++=== 6.2 Validation Rules ===
++
++|=Rule|=Requirement
++|**Mandatory Contradiction**|Stage 2 MUST search for "undermines" evidence. If none found, reasoning must state: "No counter-evidence found despite targeted search."
++|**Context-Aware Logic**|Stage 3 must prioritize central claims. If {{code}}is_central_to_thesis=true{{/code}} claim is REFUTED, article cannot be WELL-SUPPORTED.
++|**Cache Consistency**|Cached claims must match current AKEL version. Version mismatch → cache miss.
++|**Author Identification**|All outputs MUST include {{code}}author_type: "AI/AKEL"{{/code}}.
++
++---
++
++== 7. Deterministic Markdown Template ==
++
++Report generation uses **fixed template** (not LLM-generated).
++
++**Cache-Only Mode Template:**
++{{code language="markdown"}}
++# FactHarbor Analysis Report: PARTIAL ANALYSIS
++
++**Job ID:** {job_id} | **Generated:** {timestamp_utc}
++**Mode:** Cache-Only (Free Tier)
++
++---
++
++## ⚠️ Partial Analysis Notice
++
++This is a **cache-only analysis** based on previously analyzed claims.
++{cache_coverage_percent}% of claims were available in cache.
++
++**What's Included:**
++* {claims_cached} of {claims_total} claims analyzed
++* Evidence and verdicts from cache (last updated: {oldest_cache_date})
++
++**What's Missing:**
++* {claims_missing} claims require new analysis
++* Full article holistic assessment unavailable
++* Estimated cost to complete: ${cost_to_complete}
++
++**[Upgrade to Pro]** for complete analysis
++
++---
++
++## Cached Claims
++
++### [C1] {claim_text} ✅ From Cache
++* **Cached:** {cached_at} ({cache_age} ago)
++* **Times Used:** {hit_count} articles
++* **Verdict:** {verdict} (Confidence: {confidence})
++* **Evidence:** {evidence_count} sources
++
++[Full claim details...]
++
++### [C3] {claim_text} ⚠️ Not In Cache
++* **Status:** Requires new analysis
++* **Cost:** $0.081
++* **Upgrade to analyze this claim**
++
++---
++
++**Powered by FactHarbor POC1-v0.4** | [Upgrade](https://factharbor.org/upgrade)
++{{/code}}
++
++---
++
++== 8. LLM Configuration (3-Stage) ==
++
++=== 8.1 Stage 1: Claim Extraction (Haiku) ===
++
++|=Parameter|=Value|=Notes
++|**Model**|{{code}}claude-haiku-4-20250108{{/code}}|Fast, cheap, sufficient for extraction
++|**Input Tokens**|~10K|Article text after URL extraction
++|**Output Tokens**|~500|5 claims @ ~100 tokens each
++|**Cost**|$0.003 per article|($0.25/M input + $1.25/M output)
++|**Temperature**|0.0|Deterministic
++|**Max Tokens**|1000|Generous buffer
++
++**Prompt Strategy:**
++* Extract 5 verifiable factual claims
++* Mark central vs. supporting claims
++* Canonicalize (normalize phrasing)
++* Deduplicate similar claims
++* Output structured JSON only
++
++=== 8.2 Stage 2: Claim Analysis (Sonnet, CACHED) ===
++
++|=Parameter|=Value|=Notes
++|**Model**|{{code}}claude-3-5-sonnet-20241022{{/code}}|High quality for verdicts
++|**Input Tokens**|~2K|Single claim + prompt + context
++|**Output Tokens**|~5K|2 scenarios × ~2.5K tokens
++|**Cost**|$0.081 per NEW claim|($3/M input + $15/M output)
++|**Temperature**|0.0|Deterministic (cache consistency)
++|**Max Tokens**|8000|Sufficient for 2 scenarios
++|**Cache Strategy**|Redis, 90-day TTL|Key: {{code}}claim:v1norm1:{language}:{sha256(canonical_claim)}{{/code}}
++
++**Prompt Strategy:**
++* Generate 2 scenario interpretations
++* Search for supporting AND undermining evidence (mandatory)
++* 6 evidence items per scenario maximum
++* Compute verdict with reasoning chain (3-4 bullets)
++* Output structured JSON only
++
++**Output Constraints (Cost Control):**
++* Scenarios: Max 2 per claim
++* Evidence: Max 6 per scenario
++* Evidence summary: Max 3 bullets
++* Reasoning chain: Max 4 bullets
++
++=== 8.3 Stage 3: Holistic Assessment (Sonnet) ===
++
++|=Parameter|=Value|=Notes
++|**Model**|{{code}}claude-3-5-sonnet-20241022{{/code}}|Context-aware analysis
++|**Input Tokens**|~5K|Article + claim verdicts
++|**Output Tokens**|~1K|Article verdict + fallacies
++|**Cost**|$0.030 per article|($3/M input + $15/M output)
++|**Temperature**|0.0|Deterministic
++|**Max Tokens**|2000|Sufficient for assessment
++
++**Prompt Strategy:**
++* Detect main thesis
++* Evaluate logical coherence (claim verdicts → thesis)
++* Identify fallacies (correlation-causation, cherry-picking, etc.)
++* Compute logic_quality_score
++* Explain article verdict reasoning (3-4 bullets)
++* Output structured JSON only
++
++=== 8.4 Cost Projections by Cache Hit Rate ===
++
++|=Cache Hit Rate|=Cost per Article|=10K Articles Cost|=100K Articles Cost
++|0% (cold start)|$0.438|$4,380|$43,800
++|20%|$0.357|$3,570|$35,700
++|40%|$0.276|$2,760|$27,600
++|**60%**|**$0.195**|**$1,950**|**$19,500**
++|**70%** (target)|**$0.155**|**$1,550**|**$15,500**
++|**80%**|**$0.114**|**$1,140**|**$11,400**
++|**90%**|**$0.073**|**$730**|**$7,300**
++|95%|$0.053|$530|$5,300
++
++**Break-Even Analysis:**
++* Monolithic (v0.3.1): $0.15 per article constant
++* 3-stage breaks even at **70% cache hit rate**
++* Expected after ~1,500 articles in same domain
++
++---
++
++== 9. Implementation Notes ==
++
++=== 9.1 Recommended Tech Stack ===
++
++* **Framework:** Next.js 14+ with App Router (TypeScript)
++* **Cache:** Redis 7.0+ (managed: AWS ElastiCache, Redis Cloud, Upstash)
++* **Storage:** Filesystem JSON for jobs + S3/R2 for archival
++* **Queue:** BullMQ with Redis (for 3-stage pipeline orchestration)
++* **LLM Client:** Anthropic Python SDK or TypeScript SDK
++* **Cost Tracking:** PostgreSQL for user credit ledger
++* **Deployment:** Vercel (frontend + API) + Redis Cloud
++
++=== 9.2 3-Stage Pipeline Implementation ===
++
++**Job Queue Flow (Conceptual):**
++
++{{code language="typescript"}}
++// Stage 1: Extract Claims
++const stage1Job = await queue.add('stage1-extract-claims', {
++  jobId: 'job123',
++  articleUrl: 'https://example.com/article'
++});
++
++// On Stage 1 completion → enqueue Stage 2 jobs
++stage1Job.on('completed', async (result) => {
++  const { claims } = result;
++
++  // Stage 2: Analyze each claim (with cache check)
++  const stage2Jobs = await Promise.all(
++    claims.map(claim =>
++      queue.add('stage2-analyze-claim', {
++        jobId: 'job123',
++        claimId: claim.claim_id,
++        canonicalClaim: claim.canonical_claim,
++        checkCache: true
++      })
++    )
++  );
++
++  // On all Stage 2 completions → enqueue Stage 3
++  await Promise.all(stage2Jobs.map(j => j.waitUntilFinished()));
++
++  const claimVerdicts = await gatherStage2Results('job123');
++
++  await queue.add('stage3-holistic', {
++    jobId: 'job123',
++    articleUrl: 'https://example.com/article',
++    claimVerdicts: claimVerdicts
++  });
++});
++{{/code}}
++
++**Note:** This is a conceptual sketch. Actual implementation may use BullMQ Flow API or custom orchestration.
++
++**Cache Check Logic:**
++{{code language="typescript"}}
++async function analyzeClaimWithCache(claim: string): Promise<ClaimAnalysis> {
++  const canonicalClaim = normalizeClaim(claim);
++  const claimHash = sha256(canonicalClaim);
++  const cacheKey = `claim:v1:${claimHash}`;
++
++  // Check cache
++  const cached = await redis.get(cacheKey);
++  if (cached) {
++    await redis.incr(`claim:stats:hit_count:${claimHash}`);
++    return JSON.parse(cached);
++  }
++
++  // Cache miss - analyze with LLM
++  const analysis = await analyzeClaim_Stage2(canonicalClaim);
++
++  // Store in cache
++  await redis.set(cacheKey, JSON.stringify(analysis), 'EX', 7776000); // 90 days
++
++  return analysis;
++}
++{{/code}}
++
++=== 9.3 User Credit Management ===
++
++**PostgreSQL Schema:**
++{{code language="sql"}}
++CREATE TABLE user_credits (
++  user_id UUID PRIMARY KEY,
++  tier VARCHAR(20) DEFAULT 'free',
++  credit_limit DECIMAL(10,2) DEFAULT 10.00,
++  credit_used DECIMAL(10,2) DEFAULT 0.00,
++  reset_date TIMESTAMP,
++  cache_only_mode BOOLEAN DEFAULT false,
++  created_at TIMESTAMP DEFAULT NOW()
++);
++
++CREATE TABLE usage_log (
++  id SERIAL PRIMARY KEY,
++  user_id UUID REFERENCES user_credits(user_id),
++  job_id VARCHAR(50),
++  stage VARCHAR(20),
++  cost DECIMAL(10,4),
++  cache_hit BOOLEAN,
++  created_at TIMESTAMP DEFAULT NOW()
++);
++{{/code}}
++
++**Credit Deduction Logic:**
++{{code language="typescript"}}
++async function deductCredit(userId: string, cost: number): Promise<boolean> {
++  const user = await db.query('SELECT * FROM user_credits WHERE user_id = $1', [userId]);
++
++  const newUsed = user.credit_used + cost;
++
++  if (newUsed > user.credit_limit && user.tier === 'free') {
++    // Trigger cache-only mode
++    await db.query(
++      'UPDATE user_credits SET cache_only_mode = true WHERE user_id = $1',
++      [userId]
++    );
++    throw new Error('CREDIT_LIMIT_REACHED');
++  }
++
++  await db.query(
++    'UPDATE user_credits SET credit_used = $1 WHERE user_id = $2',
++    [newUsed, userId]
++  );
++
++  return true;
++}
++{{/code}}
++
++=== 9.4 Cache-Only Mode Implementation ===
++
++**Middleware:**
++{{code language="typescript"}}
++async function checkCacheOnlyMode(req, res, next) {
++  const user = await getUserCredit(req.userId);
++
++  if (user.cache_only_mode) {
++    // Allow only cache reads
++    if (req.body.options?.cache_preference !== 'allow_partial') {
++      return res.status(402).json({
++        error: 'credit_limit_reached',
++        message: 'Resubmit with cache_preference=allow_partial',
++        cache_only_mode: true
++      });
++    }
++
++    // Modify request to skip Stage 2 for uncached claims
++    req.cacheOnlyMode = true;
++  }
++
++  next();
++}
++{{/code}}
++
++=== 9.5 Estimated Timeline ===
++
++**POC1 with 3-Stage Architecture:**
++* Week 1: Stage 1 (Haiku extraction) + Redis setup
++* Week 2: Stage 2 (Sonnet analysis + caching)
++* Week 3: Stage 3 (Holistic assessment) + pipeline orchestration
++* Week 4: User credit system + cache-only mode
++* Week 5: Testing with 100 articles (measure cache hit rate)
++* Week 6: Optimization + bug fixes
++* **Total: 6-8 weeks**
++
++**Manual coding:** 12-16 weeks
++
++---
++
++== 10. Testing Strategy ==
++
++=== 10.1 Cache Performance Testing ===
++
++**Test Scenarios:**
++
++**Scenario 1: Cold Start (0 cache)**
++* Analyze 100 diverse articles
++* Measure: Cost per article, cache growth rate
++* Expected: $0.35-0.40 avg, ~400 unique claims cached
++
++**Scenario 2: Warm Cache (Overlapping Domain)**
++* Analyze 100 articles on SAME topic (e.g., "2024 election")
++* Measure: Cache hit rate growth
++* Expected: Hit rate 20% → 60% by article 100
++
++**Scenario 3: Mature Cache (1,000 articles)**
++* Analyze next 100 articles (diverse topics)
++* Measure: Steady-state cache hit rate
++* Expected: 60-70% hit rate, $0.15-0.18 avg cost
++
++**Scenario 4: Cache-Only Mode**
++* Free user reaches $10 limit (67 articles at 70% hit rate)
++* Submit 10 more articles with {{code}}cache_preference=allow_partial{{/code}}
++* Measure: Coverage %, user satisfaction
++* Expected: 60-70% coverage, instant results
++
++=== 10.2 Success Metrics ===
++
++**Cache Performance:**
++* Week 1: 5-10% hit rate
++* Week 2: 15-25% hit rate
++* Week 3: 30-40% hit rate
++* Week 4: 45-55% hit rate
++* Target: ≥50% by 1,000 articles
++
++**Cost Targets:**
++* Articles 1-100: $0.35-0.40 avg ⚠️ (expected)
++* Articles 100-500: $0.25-0.30 avg
++* Articles 500-1,000: $0.18-0.22 avg
++* Articles 1,000+: $0.12-0.15 avg ✅
++
++**Quality Metrics (same as v0.3.1):**
++* Hallucination rate: <5%
++* Context-aware accuracy: ≥70%
++* False positive rate: <15%
++* Mandatory contradiction search: 100% compliance
++
++=== 10.3 Free Tier Economics Validation ===
++
++**Test with simulated 1,000 users:**
++* Each user: $10 credit
++* 70% cache hit rate
++* Avg 70 articles/user/month
++
++**Projected Costs:**
++* Total credits: 1,000 × $10 = $10,000
++* Actual LLM costs: ~$9,000 (cache savings)
++* Margin: 10%
++
++**Sustainability Check:**
++* If margin <5% → Reduce free tier limit
++* If margin >20% → Consider increasing free tier
++
++---
++
++== 11. Cross-References ==
++
++This API specification implements requirements from:
++
++* **[[POC Requirements>>Test.FactHarbor.Specification.POC.Requirements]]**
++** FR-POC-1 through FR-POC-6 (3-stage architecture)
++** NFR-POC-1 through NFR-POC-3 (quality gates, caching)
++** NEW: FR-POC-7 (Claim-level caching)
++** NEW: FR-POC-8 (User credit system)
++** NEW: FR-POC-9 (Cache-only mode)
++
++* **[[Article Verdict Problem>>Test.FactHarbor.Specification.POC.Article-Verdict-Problem]]**
++** Approach 1 implemented in Stage 3
++** Context-aware holistic assessment
++
++* **[[Requirements>>Test.FactHarbor.Specification.Requirements.WebHome]]**
++** FR4 (Analysis Summary) - enhanced with caching
++** FR7 (Verdict Calculation) - cached per claim
++** NFR11 (Quality Gates) - enforced across stages
++** NEW: NFR19 (Cost Efficiency via Caching)
++** NEW: NFR20 (Free Tier Sustainability)
++
++* **[[Architecture>>Test.FactHarbor.Specification.Architecture.WebHome]]**
++** POC1 3-stage pipeline architecture
++** Redis cache layer
++** User credit system
++
++* **[[Data Model>>Test.FactHarbor.Specification.Data Model.WebHome]]**
++** Claim structure (cacheable unit)
++** Evidence structure
++** Scenario boundaries
++
++---
++
++**End of Specification - FactHarbor POC1 API v0.4**
++
++**3-stage caching architecture with free tier cache-only mode. Ready for sustainable, scalable implementation!** 🚀
++

Changes for page POC1 API & Schemas Specification

Summary

Details

Applications

Navigation

Need help?