Changes for page POC1 API & Schemas Specification

Last modified by Robert Schaub on 2025/12/24 18:26

From 1.1 to 2.1 From 4.1 to 5.1

From version 2.1

edited by Robert Schaub
on 2025/12/24 13:58

Change comment: Imported from XAR

To version 4.1

edited by Robert Schaub
on 2025/12/24 16:55

Change comment: Imported from XAR

Raw
Rendered

Summary

Page properties (1 modified, 0 added, 0 removed)

Details

Page properties

Content

@@ -1,44 +1,25 @@
--# FactHarbor POC1 — API & Schemas Specification
++= POC1 API & Schemas Specification =
--**Version:** 0.4.1 (POC1 - 3-Stage Caching Architecture)
--**Namespace:** FactHarbor.*
--**Syntax:** xWiki 2.1
--**Last Updated:** 2025-12-24
++----
-----
--
  == Version History ==
  |=Version|=Date|=Changes
  |0.4.1|2025-12-24|Applied 9 critical fixes: file format notice, verdict taxonomy, canonicalization algorithm, Stage 1 cost policy, BullMQ fix, language in cache key, historical claims TTL, idempotency, copyright policy
  |0.4|2025-12-24|**BREAKING:** 3-stage pipeline with claim-level caching, user tier system, cache-only mode for free users, Redis cache architecture
--|0.3.1|2025-12-24|Fixed single-prompt strategy, SSE clarification, schema canonicalization, cost constraints, chain-of-thought, evidence citation, Jina safety, gate numbering
--|0.3|2025-12-24|Added complete API endpoints, LLM config, risk tiers, scraping details, quality gate logging, temporal separation note, cross-references
--|0.2|2025-12-24|Initial rebased version with holistic assessment
--|0.1|2025-12-24|Original specification
++|0.3.1|2025-12-24|Fixed single-prompt strategy, SSE clarification, schema canonicalization, cost constraints
++|0.3|2025-12-24|Added complete API endpoints, LLM config, risk tiers, scraping details
-----
-----
++----
--== File Format Notice ==
--
--**⚠️ Important:** This file is stored as {{code}}.md{{/code}} for transport/versioning, but the content is **xWiki 2.1 syntax** (not Markdown).
--
--**When importing to xWiki:**
--* Use "Import as XWiki content" (not "Import as Markdown")
--* The xWiki parser will correctly interpret {{code}}==}} headers, {{{{code}}}}}} blocks, etc.
--
--**Alternate naming:** If your workflow supports it, rename to {{code}}.xwiki.txt{{/code}} to avoid ambiguity.
--
-----
--
  == 1. Core Objective (POC1) ==
--The primary technical goal of POC1 is to validate **Approach 1 (Single-Pass Holistic Analysis)** while implementing **claim-level caching** to achieve cost sustainability:
++The primary technical goal of POC1 is to validate **Approach 1 (Single-Pass Holistic Analysis)** while implementing **claim-level caching** to achieve cost sustainability.
--The system must prove that AI can identify an article's **Main Thesis** and determine if the supporting claims (even if individually accurate) logically support that thesis without committing fallacies (e.g., correlation vs. causation, cherry-picking, hasty generalization).
++The system must prove that AI can identify an article's **Main Thesis** and determine if supporting claims logically support that thesis without committing fallacies.
--**Success Criteria:**
++=== Success Criteria: ===
++
  * Test with 30 diverse articles
  * Target: ≥70% accuracy detecting misleading articles
  * Cost: <$0.25 per NEW analysis (uncached)
@@ -46,14 +46,13 @@
  * Cache hit rate: ≥50% after 1,000 articles
  * Processing time: <2 minutes (standard depth)
--**Economic Model:**
--* Free tier: $10 credit per month (~40-140 articles depending on cache hits)
--* After limit: Cache-only mode (instant, free access to cached claims)
--* Paid tier: Unlimited new analyses
++=== Economic Model: ===
--**See:** [[Article Verdict Problem>>Test.FactHarbor.Specification.POC.Article-Verdict-Problem]] for complete investigation of 7 approaches.
++* **Free tier:** $10 credit per month (~~40-140 articles depending on cache hits)
++* **After limit:** Cache-only mode (instant, free access to cached claims)
++* **Paid tier:** Unlimited new analyses
-----
++----
  == 2. Architecture Overview ==
@@ -61,7 +61,7 @@
  FactHarbor POC1 uses a **3-stage architecture** designed for claim-level caching and cost efficiency:
--{{code language="mermaid"}}
++{{mermaid}}
  graph TD
      A[Article Input] --> B[Stage 1: Extract Claims]
      B --> C{For Each Claim}
@@ -72,41 +72,46 @@
      G --> E
      E --> H[Stage 3: Holistic Assessment]
      H --> I[Final Report]
--{{/code}}
++{{/mermaid}}
--**Stage 1: Claim Extraction** (Haiku, no cache)
--* Input: Article text
--* Output: 5 canonical claims (normalized, deduplicated)
--* Model: Claude Haiku 4
--* Cost: $0.003 per article
--* Cache strategy: No caching (article-specific)
++==== Stage 1: Claim Extraction (Haiku, no cache) ====
--**Stage 2: Claim Analysis** (Sonnet, CACHED)
--* Input: Single canonical claim
--* Output: Scenarios + Evidence + Verdicts
--* Model: Claude Sonnet 3.5
--* Cost: $0.081 per NEW claim
--* Cache strategy: **Redis, 90-day TTL**
--* Cache key: {{code}}claim:v1norm1:{language}:{sha256(canonical_claim)}{{/code}}
++* **Input:** Article text
++* **Output:** 5 canonical claims (normalized, deduplicated)
++* **Model:** Claude Haiku 4
++* **Cost:** $0.003 per article
++* **Cache strategy:** No caching (article-specific)
--**Stage 3: Holistic Assessment** (Sonnet, no cache)
--* Input: Article + Claim verdicts (from cache or Stage 2)
--* Output: Article verdict + Fallacies + Logic quality
--* Model: Claude Sonnet 3.5
--* Cost: $0.030 per article
--* Cache strategy: No caching (article-specific)
++==== Stage 2: Claim Analysis (Sonnet, CACHED) ====
--**Total Cost Formula:**
--{{code}}
--Cost = $0.003 (extraction) + (N_new_claims × $0.081) + $0.030 (holistic)
++* **Input:** Single canonical claim
++* **Output:** Scenarios + Evidence + Verdicts
++* **Model:** Claude Sonnet 3.5
++* **Cost:** $0.081 per NEW claim
++* **Cache strategy:** Redis, 90-day TTL
++* **Cache key:** claim:v1norm1:{language}:{sha256(canonical_claim)}
++==== Stage 3: Holistic Assessment (Sonnet, no cache) ====
++
++* **Input:** Article + Claim verdicts (from cache or Stage 2)
++* **Output:** Article verdict + Fallacies + Logic quality
++* **Model:** Claude Sonnet 3.5
++* **Cost:** $0.030 per article
++* **Cache strategy:** No caching (article-specific)
++
++=== Total Cost Formula: ===
++
++{{{Cost = $0.003 (extraction) + (N_new_claims × $0.081) + $0.030 (holistic)
++
  Examples:
  - 0 new claims (100% cache hit): $0.033
  - 1 new claim (80% cache hit): $0.114
  - 3 new claims (40% cache hit): $0.276
  - 5 new claims (0% cache hit): $0.438
--{{/code}}
++}}}
++----
++
  === 2.2 User Tier System ===
  |=Tier|=Monthly Credit|=After Limit|=Cache Access|=Analytics
@@ -115,17 +115,21 @@
  |**Enterprise** (future)|Custom|Continues|✅ Full + Priority|Full
  **Free Tier Economics:**
++
  * $10 credit = 40-140 articles analyzed (depending on cache hit rate)
  * Average 70 articles/month at 70% cache hit rate
--* After limit: Cache-only mode (see Section 2.3)
++* After limit: Cache-only mode
++----
++
  === 2.3 Cache-Only Mode (Free Tier Feature) ===
  When free users reach their $10 monthly limit, they enter **Cache-Only Mode**:
--**What Cache-Only Mode Provides:**
++==== What Cache-Only Mode Provides: ====
  ✅ **Claim Extraction (Platform-Funded):**
++
  * Stage 1 extraction runs at $0.003 per article
  * **Cost: Absorbed by platform** (not charged to user credit)
  * Rationale: Extraction is necessary to check cache, and cost is negligible
@@ -132,27 +132,31 @@
  * Rate limit: Max 50 extractions/day in cache-only mode (prevents abuse)
  ✅ **Instant Access to Cached Claims:**
++
  * Any claim that exists in cache → Full verdict returned
  * Cost: $0 (no LLM calls)
  * Response time: <100ms
  ✅ **Partial Article Analysis:**
++
  * Check each claim against cache
  * Return verdicts for ALL cached claims
--* For uncached claims: Return {{code}}"status": "cache_miss"{{/code}}
++* For uncached claims: Return "status": "cache_miss"
  ✅ **Cache Coverage Report:**
++
  * "3 of 5 claims available in cache (60% coverage)"
  * Links to cached analyses
  * Estimated cost to complete: $0.162 (2 new claims)
  ❌ **Not Available in Cache-Only Mode:**
++
  * New claim analysis (Stage 2 LLM calls blocked)
  * Full holistic assessment (Stage 3 blocked if any claims missing)
--**User Experience:**
--{{code language="json"}}
--{
++==== User Experience Example: ====
++
++{{{{
    "status": "cache_only_mode",
    "message": "Monthly credit limit reached. Showing cached results only.",
    "cache_coverage": {
@@ -175,26 +175,26 @@
      "pro_tier": "$50/month unlimited"
    }
  }
--{{/code}}
++}}}
  **Design Rationale:**
++
  * Free users still get value (cached claims often answer their question)
  * Demonstrates FactHarbor's value (partial results encourage upgrade)
  * Sustainable for platform (no additional cost)
  * Fair to all users (everyone contributes to cache)
-----
++----
  == 3. REST API Contract ==
  === 3.1 User Credit Tracking ===
--**Endpoint:** {{code}}GET /v1/user/credit{{/code}}
++**Endpoint:** GET /v1/user/credit
--**Response:** {{code}}200 OK{{/code}}
++**Response:** 200 OK
--{{code language="json"}}
--{
++{{{{
    "user_id": "user_abc123",
    "tier": "free",
    "credit_limit": 10.00,
@@ -209,30 +209,25 @@
      "cache_hit_rate": 0.626
    }
  }
--{{/code}}
++}}}
-----
++----
  === 3.2 Create Analysis Job (3-Stage) ===
--**Endpoint:** {{code}}POST /v1/analyze{{/code}}
++**Endpoint:** POST /v1/analyze
--**Request Body:**
++==== Idempotency Support: ====
--
--**Idempotency Support:**
--
  To prevent duplicate job creation on network retries, clients SHOULD include:
--{{code language="http"}}
--POST /v1/analyze
++{{{POST /v1/analyze
  Idempotency-Key: {client-generated-uuid}
--{{/code}}
++}}}
--OR use the {{code}}client.request_id{{/code}} field:
++OR use the client.request_id field:
--{{code language="json"}}
--{
++{{{{
    "input_url": "...",
    "client": {
      "request_id": "client-uuid-12345",
@@ -239,17 +239,18 @@
      "source_label": "optional"
    }
  }
--{{/code}}
++}}}
  **Server Behavior:**
--* If {{code}}Idempotency-Key{{/code}} or {{code}}request_id{{/code}} seen before (within 24 hours):
--  - Return existing job ({{code}}200 OK{{/code}}, not {{code}}202 Accepted{{/code}})
--  - Do NOT create duplicate job or charge twice
++
++* If Idempotency-Key or request_id seen before (within 24 hours):
++** Return existing job (200 OK, not 202 Accepted)
++** Do NOT create duplicate job or charge twice
  * Idempotency keys expire after 24 hours (matches job retention)
  **Example Response (Idempotent):**
--{{code language="json"}}
--{
++
++{{{{
    "job_id": "01J...ULID",
    "status": "RUNNING",
    "idempotent": true,
@@ -256,11 +256,11 @@
    "original_request_at": "2025-12-24T10:31:00Z",
    "message": "Returning existing job (idempotency key matched)"
  }
--{{/code}}
++}}}
++==== Request Body: ====
--{{code language="json"}}
--{
++{{{{
    "input_type": "url",
    "input_url": "https://example.com/medical-report-01",
    "input_text": null,
@@ -268,8 +268,9 @@
      "browsing": "on",
      "depth": "standard",
      "max_claims": 5,
--    "context_aware_analysis": true,
--    "cache_preference": "prefer_cache"
++    "scenarios_per_claim": 2,
++    "max_evidence_per_scenario": 6,
++    "context_aware_analysis": true
    },
    "client": {
      "request_id": "optional-client-tracking-id",
@@ -276,18 +276,20 @@
      "source_label": "optional"
    }
  }
--{{/code}}
++}}}
  **Options:**
--* {{code}}cache_preference{{/code}}: {{code}}prefer_cache{{/code}} | {{code}}require_fresh{{/code}} | {{code}}allow_partial{{/code}}
--  - {{code}}prefer_cache{{/code}}: Use cache when available, analyze new claims (default)
--  - {{code}}require_fresh{{/code}}: Force re-analysis of all claims (ignores cache, costs more)
--  - {{code}}allow_partial{{/code}}: Return partial results if some claims uncached (for free tier cache-only mode)
--**Response:** {{code}}202 Accepted{{/code}}
++* browsing: on | off (retrieve web sources or just output queries)
++* depth: standard | deep (evidence thoroughness)
++* max_claims: 1-10 (default: **5** for cost control)
++* scenarios_per_claim: 1-5 (default: **2** for cost control)
++* max_evidence_per_scenario: 3-10 (default: **6**)
++* context_aware_analysis: true | false (experimental)
--{{code language="json"}}
--{
++**Response:** 202 Accepted
++
++{{{{
    "job_id": "01J...ULID",
    "status": "QUEUED",
    "created_at": "2025-12-24T10:31:00Z",
@@ -310,13 +310,13 @@
      "events": "/v1/jobs/01J...ULID/events"
    }
  }
--{{/code}}
++}}}
  **Error Responses:**
--{{code}}402 Payment Required{{/code}} - Free tier limit reached, cache-only mode
--{{code language="json"}}
--{
++402 Payment Required - Free tier limit reached, cache-only mode
++
++{{{{
    "error": "credit_limit_reached",
    "message": "Monthly credit limit reached. Entering cache-only mode.",
    "cache_only_mode": true,
@@ -324,199 +324,15 @@
    "reset_date": "2025-02-01T00:00:00Z",
    "action": "Resubmit with cache_preference=allow_partial for cached results"
  }
--{{/code}}
++}}}
-----
++----
--=== 3.3 Get Job Status ===
--
--**Endpoint:** {{code}}GET /v1/jobs/{job_id}{{/code}}
--
--**Response:** {{code}}200 OK{{/code}}
--
--{{code language="json"}}
--{
--  "job_id": "01J...ULID",
--  "status": "RUNNING",
--  "created_at": "2025-12-24T10:31:00Z",
--  "updated_at": "2025-12-24T10:31:22Z",
--  "progress": {
--    "stage": "stage2_claim_analysis",
--    "percent": 65,
--    "message": "Analyzing claim 3 of 5 (2 from cache)",
--    "current_claim_id": "C3",
--    "cache_hits": 2,
--    "cache_misses": 1
--  },
--  "actual_cost": 0.084,
--  "cost_breakdown": {
--    "stage1_extraction": 0.003,
--    "stage2_new_claims": 0.081,
--    "stage2_cached_claims": 0.000,
--    "stage3_holistic": null
--  },
--  "input_echo": {
--    "input_type": "url",
--    "input_url": "https://example.com/medical-report-01"
--  },
--  "links": {
--    "self": "/v1/jobs/01J...ULID",
--    "result": "/v1/jobs/01J...ULID/result",
--    "report": "/v1/jobs/01J...ULID/report"
--  },
--  "error": null
--}
--{{/code}}
--
-----
--
--=== 3.4 Get Analysis Result ===
--
--**Endpoint:** {{code}}GET /v1/jobs/{job_id}/result{{/code}}
--
--**Response:** {{code}}200 OK{{/code}}
--
--Returns complete **AnalysisResult** schema (see Section 4).
--
--**Cache-Only Mode Response:** {{code}}206 Partial Content{{/code}}
--
--{{code language="json"}}
--{
--  "cache_only_mode": true,
--  "cache_coverage": {
--    "claims_total": 5,
--    "claims_cached": 3,
--    "claims_missing": 2,
--    "coverage_percent": 60
--  },
--  "partial_result": {
--    "metadata": {
--      "job_id": "01J...ULID",
--      "timestamp_utc": "2025-12-24T10:31:30Z",
--      "engine_version": "POC1-v0.4",
--      "cache_only": true
--    },
--    "claims": [
--      {
--        "claim_id": "C1",
--        "claim_text": "...",
--        "canonical_claim": "...",
--        "source": "cache",
--        "cached_at": "2025-12-20T15:30:00Z",
--        "cache_hit_count": 47,
--        "scenarios": [...]
--      },
--      {
--        "claim_id": "C3",
--        "claim_text": "...",
--        "canonical_claim": "...",
--        "source": "not_analyzed",
--        "status": "cache_miss",
--        "estimated_cost": 0.081
--      }
--    ],
--    "article_holistic_assessment": null,
--    "upgrade_prompt": {
--      "message": "Upgrade to Pro for full analysis of all claims",
--      "missing_claims": 2,
--      "cost_to_complete": 0.192
--    }
--  }
--}
--{{/code}}
--
--**Other Responses:**
--* {{code}}409 Conflict{{/code}} - Job not finished yet
--* {{code}}404 Not Found{{/code}} - Job ID unknown
--
-----
--
--=== 3.5 Stage-Specific Endpoints (Optional, Advanced) ===
--
--For direct stage access (useful for cache debugging, custom workflows):
--
--**Extract Claims Only:**
--{{code}}POST /v1/analyze/extract-claims{{/code}}
--
--**Analyze Single Claim:**
--{{code}}POST /v1/analyze/claim{{/code}}
--
--**Assess Article (with claim verdicts):**
--{{code}}POST /v1/analyze/assess-article{{/code}}
--
--**Check Claim Cache:**
--{{code}}GET /v1/cache/claim/{claim_hash}{{/code}}
--
--**Cache Statistics:**
--{{code}}GET /v1/cache/stats{{/code}}
--
-----
--
--=== 3.6 Download Markdown Report ===
--
--**Endpoint:** {{code}}GET /v1/jobs/{job_id}/report{{/code}}
--
--**Response:** {{code}}200 OK{{/code}} with {{code}}text/markdown; charset=utf-8{{/code}} content
--
--**Headers:**
--* {{code}}Content-Disposition: attachment; filename="factharbor_poc1_{job_id}.md"{{/code}}
--
--**Cache-Only Mode:** Report includes "Partial Analysis" watermark and upgrade prompt.
--
-----
--
--=== 3.7 Stream Job Events (Backend Progress) ===
--
--**Endpoint:** {{code}}GET /v1/jobs/{job_id}/events{{/code}}
--
--**Response:** Server-Sent Events (SSE) stream
--
--**Event Types:**
--* {{code}}progress{{/code}} - Backend progress (e.g., "Stage 1: Extracting claims")
--* {{code}}cache_hit{{/code}} - Claim found in cache
--* {{code}}cache_miss{{/code}} - Claim requires new analysis
--* {{code}}stage_complete{{/code}} - Stage 1/2/3 finished
--* {{code}}complete{{/code}} - Job finished
--* {{code}}error{{/code}} - Error occurred
--* {{code}}credit_warning{{/code}} - User approaching limit
--
-----
--
--=== 3.8 Cancel Job ===
--
--**Endpoint:** {{code}}DELETE /v1/jobs/{job_id}{{/code}}
--
--**Note:** If job is mid-stage (e.g., analyzing claim 3 of 5), user is charged for completed work only.
--
-----
--
--=== 3.9 Health Check ===
--
--**Endpoint:** {{code}}GET /v1/health{{/code}}
--
--{{code language="json"}}
--{
--  "status": "ok",
--  "version": "POC1-v0.4",
--  "model_stage1": "claude-haiku-4",
--  "model_stage2": "claude-3-5-sonnet-20241022",
--  "model_stage3": "claude-3-5-sonnet-20241022",
--  "cache": {
--    "status": "connected",
--    "total_claims": 12847,
--    "avg_hit_rate_24h": 0.73
--  }
--}
--{{/code}}
--
-----
--
  == 4. Data Schemas ==
  === 4.1 Stage 1 Output: ClaimExtraction ===
--{{code language="json"}}
--{
++{{{{
    "job_id": "01J...ULID",
    "stage": "stage1_extraction",
    "article_metadata": {
@@ -541,219 +541,10 @@
    "article_thesis": "Main argument detected",
    "cost": 0.003
  }
--{{/code}}
++}}}
--=== 4.2 Stage 2 Output: ClaimAnalysis (CACHED) ===
++----
--This is the CACHEABLE unit. Stored in Redis with 90-day TTL.
--
--{{code language="json"}}
--{
--  "claim_hash": "sha256:abc123...",
--  "canonical_claim": "COVID vaccines are 95% effective",
--  "language": "en",
--  "domain": "public_health",
--  "analysis_version": "v1.0",
--  "scenarios": [
--    {
--      "scenario_id": "S1",
--      "scenario_title": "mRNA vaccines (Pfizer/Moderna) in clinical trials",
--      "definitions": {"95% effective": "95% reduction in symptomatic infection"},
--      "assumptions": ["Based on phase 3 trial data", "Against original strain"],
--      "boundaries": {
--        "time": "2020-2021 trials",
--        "geography": "Multi-country trials",
--        "population": "Adult population (16+)",
--        "conditions": "Before widespread variants"
--      },
--      "verdict": {
--        "label": "Highly Likely",
--        "probability_range": [0.88, 0.97],
--        "confidence": 0.92,
--        "reasoning_chain": [
--          "Pfizer/BioNTech trial: 95% efficacy (n=43,548)",
--          "Moderna trial: 94.1% efficacy (n=30,420)",
--          "Peer-reviewed publications in NEJM",
--          "FDA independent analysis confirmed"
--        ],
--        "key_supporting_evidence_ids": ["E1", "E2"],
--        "key_counter_evidence_ids": ["E3"],
--        "uncertainty_factors": [
--          "Limited data on long-term effectiveness",
--          "Variant-specific performance not yet measured"
--        ]
--      },
--      "evidence": [
--        {
--          "evidence_id": "E1",
--          "stance": "supports",
--          "relevance_to_scenario": 0.98,
--          "evidence_summary": [
--            "Pfizer trial showed 170 cases in placebo vs 8 in vaccine group",
--            "Follow-up period median 2 months post-dose 2",
--            "Efficacy consistent across age, sex, race, ethnicity"
--          ],
--          "citation": {
--            "title": "Safety and Efficacy of the BNT162b2 mRNA Covid-19 Vaccine",
--            "author_or_org": "Polack et al.",
--            "publication_date": "2020-12-31",
--            "url": "https://nejm.org/doi/full/10.1056/NEJMoa2034577",
--            "publisher": "New England Journal of Medicine",
--            "retrieved_at_utc": "2025-12-20T15:30:00Z"
--          },
--          "excerpt": ["The vaccine was 95% effective in preventing Covid-19"],
--          "excerpt_word_count": 9,
--          "source_reliability_score": 0.95,
--          "reliability_justification": "Peer-reviewed, high-impact journal, large RCT",
--          "limitations_and_reservations": [
--            "Short follow-up period (2 months)",
--            "Primarily measures symptomatic infection, not transmission"
--          ],
--          "retraction_or_dispute_signal": "none"
--        }
--      ]
--    }
--  ],
--  "cache_metadata": {
--    "first_analyzed": "2025-12-01T10:00:00Z",
--    "last_updated": "2025-12-20T15:30:00Z",
--    "hit_count": 47,
--    "version": "v1.0",
--    "ttl_expires": "2026-03-20T15:30:00Z"
--  },
--  "cost": 0.081
--}
--{{/code}}
--
--**Cache Key Structure:**
--{{code}}
--Redis Key: claim:v1norm1:{language}:{sha256(canonical_claim)}
--TTL: 90 days (7,776,000 seconds)
--Size: ~15KB JSON (compressed: ~5KB)
--{{/code}}
--
--=== 4.3 Stage 3 Output: HolisticAssessment ===
--
--{{code language="json"}}
--{
--  "job_id": "01J...ULID",
--  "stage": "stage3_holistic",
--  "article_metadata": {
--    "title": "...",
--    "main_thesis": "...",
--    "source_url": "..."
--  },
--  "article_holistic_assessment": {
--    "overall_verdict": "MISLEADING",
--    "logic_quality_score": 0.42,
--    "fallacies_detected": [
--      "correlation-causation",
--      "cherry-picking"
--    ],
--    "verdict_reasoning": [
--      "Central claim C1 is REFUTED by multiple systematic reviews",
--      "Supporting claims C2-C4 are TRUE but do not support the thesis",
--      "Article commits correlation-causation fallacy",
--      "Selective citation of evidence (cherry-picking detected)"
--    ],
--    "experimental_feature": true
--  },
--  "claims_summary": [
--    {
--      "claim_id": "C1",
--      "is_central_to_thesis": true,
--      "verdict": "Refuted",
--      "confidence": 0.89,
--      "source": "cache",
--      "cache_hit": true
--    },
--    {
--      "claim_id": "C2",
--      "is_central_to_thesis": false,
--      "verdict": "Highly Likely",
--      "confidence": 0.91,
--      "source": "new_analysis",
--      "cache_hit": false
--    }
--  ],
--  "quality_gates": {
--    "gate1_claim_validation": "pass",
--    "gate4_verdict_confidence": "pass",
--    "passed_all": true
--  },
--  "cost": 0.030,
--  "total_job_cost": 0.114
--}
--{{/code}}
--
--=== 4.4 Complete AnalysisResult (All 3 Stages Combined) ===
--
--{{code language="json"}}
--{
--  "metadata": {
--    "job_id": "01J...ULID",
--    "timestamp_utc": "2025-12-24T10:31:30Z",
--    "engine_version": "POC1-v0.4",
--    "llm_stage1": "claude-haiku-4",
--    "llm_stage2": "claude-3-5-sonnet-20241022",
--    "llm_stage3": "claude-3-5-sonnet-20241022",
--    "usage_stats": {
--      "stage1_tokens": {"input": 10000, "output": 500},
--      "stage2_tokens": {"input": 2000, "output": 5000},
--      "stage3_tokens": {"input": 5000, "output": 1000},
--      "total_input_tokens": 17000,
--      "total_output_tokens": 6500,
--      "estimated_cost_usd": 0.114,
--      "response_time_sec": 45.2
--    },
--    "cache_stats": {
--      "claims_total": 5,
--      "claims_from_cache": 4,
--      "claims_new_analysis": 1,
--      "cache_hit_rate": 0.80,
--      "cache_savings_usd": 0.324
--    }
--  },
--  "article_holistic_assessment": {
--    "main_thesis": "...",
--    "overall_verdict": "MISLEADING",
--    "logic_quality_score": 0.42,
--    "fallacies_detected": ["correlation-causation", "cherry-picking"],
--    "verdict_reasoning": ["...", "...", "..."],
--    "experimental_feature": true
--  },
--  "claims": [
--    {
--      "claim_id": "C1",
--      "is_central_to_thesis": true,
--      "claim_text": "...",
--      "canonical_claim": "...",
--      "claim_hash": "sha256:abc123...",
--      "claim_type": "causal",
--      "evaluability": "evaluable",
--      "risk_tier": "B",
--      "source": "cache",
--      "cached_at": "2025-12-20T15:30:00Z",
--      "cache_hit_count": 47,
--      "scenarios": [...]
--    },
--    {
--      "claim_id": "C2",
--      "source": "new_analysis",
--      "analyzed_at": "2025-12-24T10:31:15Z",
--      "scenarios": [...]
--    }
--  ],
--  "quality_gates": {
--    "gate1_claim_validation": "pass",
--    "gate4_verdict_confidence": "pass",
--    "passed_all": true
--  }
--}
--{{/code}}
--
--
--
  === 4.5 Verdict Label Taxonomy ===
  FactHarbor uses **three distinct verdict taxonomies** depending on analysis level:
@@ -763,23 +763,26 @@
  Used for individual scenario verdicts within a claim.
  **Enum Values:**
--* {{code}}Highly Likely{{/code}} - Probability 0.85-1.0, high confidence
--* {{code}}Likely{{/code}} - Probability 0.65-0.84, moderate-high confidence
--* {{code}}Unclear{{/code}} - Probability 0.35-0.64, or low confidence
--* {{code}}Unlikely{{/code}} - Probability 0.16-0.34, moderate-high confidence
--* {{code}}Highly Unlikely{{/code}} - Probability 0.0-0.15, high confidence
--* {{code}}Unsubstantiated{{/code}} - Insufficient evidence to determine probability
++* Highly Likely - Probability 0.85-1.0, high confidence
++* Likely - Probability 0.65-0.84, moderate-high confidence
++* Unclear - Probability 0.35-0.64, or low confidence
++* Unlikely - Probability 0.16-0.34, moderate-high confidence
++* Highly Unlikely - Probability 0.0-0.15, high confidence
++* Unsubstantiated - Insufficient evidence to determine probability
++
  ==== 4.5.2 Claim Verdict Labels (Rollup) ====
  Used when summarizing a claim across all scenarios.
  **Enum Values:**
--* {{code}}Supported{{/code}} - Majority of scenarios are Likely or Highly Likely
--* {{code}}Refuted{{/code}} - Majority of scenarios are Unlikely or Highly Unlikely
--* {{code}}Inconclusive{{/code}} - Mixed scenarios or majority Unclear/Unsubstantiated
++* Supported - Majority of scenarios are Likely or Highly Likely
++* Refuted - Majority of scenarios are Unlikely or Highly Unlikely
++* Inconclusive - Mixed scenarios or majority Unclear/Unsubstantiated
++
  **Mapping Logic:**
++
  * If ≥60% scenarios are (Highly Likely | Likely) → Supported
  * If ≥60% scenarios are (Highly Unlikely | Unlikely) → Refuted
  * Otherwise → Inconclusive
@@ -789,23 +789,23 @@
  Used for holistic article-level assessment.
  **Enum Values:**
--* {{code}}WELL-SUPPORTED{{/code}} - Article thesis logically follows from supported claims
--* {{code}}MISLEADING{{/code}} - Claims may be true but article commits logical fallacies
--* {{code}}REFUTED{{/code}} - Central claims are refuted, invalidating thesis
--* {{code}}UNCERTAIN{{/code}} - Insufficient evidence or highly mixed claim verdicts
++* WELL-SUPPORTED - Article thesis logically follows from supported claims
++* MISLEADING - Claims may be true but article commits logical fallacies
++* REFUTED - Central claims are refuted, invalidating thesis
++* UNCERTAIN - Insufficient evidence or highly mixed claim verdicts
++
  **Note:** Article verdict considers **claim centrality** (central claims override supporting claims).
  ==== 4.5.4 API Field Mapping ====
  |=Level|=API Field|=Enum Name
--|Scenario|{{code}}scenarios[].verdict.label{{/code}}|scenario_verdict_label
--|Claim|{{code}}claims[].rollup_verdict{{/code}} (optional)|claim_verdict_label
--|Article|{{code}}article_holistic_assessment.overall_verdict{{/code}}|article_verdict_label
++|Scenario|scenarios[].verdict.label|scenario_verdict_label
++|Claim|claims[].rollup_verdict (optional)|claim_verdict_label
++|Article|article_holistic_assessment.overall_verdict|article_verdict_label
++----
-----
--
  == 5. Cache Architecture ==
  === 5.1 Redis Cache Design ===
@@ -813,117 +813,29 @@
  **Technology:** Redis 7.0+ (in-memory key-value store)
  **Cache Key Schema:**
--{{code}}
--claim:v1norm1:{language}:{sha256(canonical_claim)}
--{{/code}}
++{{{claim:v1norm1:{language}:{sha256(canonical_claim)}
++}}}
++
  **Example:**
--{{code}}
--Claim (English): "COVID vaccines are 95% effective"
++
++{{{Claim (English): "COVID vaccines are 95% effective"
  Canonical: "covid vaccines are 95 percent effective"
  Language: "en"
  SHA256: abc123...def456
  Key: claim:v1norm1:en:abc123...def456
--{{/code}}
++}}}
  **Rationale:** Prevents cross-language collisions and enables per-language cache analytics.
  **Data Structure:**
--{{code language="redis"}}
--SET claim:v1:abc123...def456 '{...ClaimAnalysis JSON...}'
--EXPIRE claim:v1:abc123...def456 7776000  # 90 days
--{{/code}}
--**Additional Keys:**
--{{code}}
++{{{SET claim:v1norm1:en:abc123...def456 '{...ClaimAnalysis JSON...}'
++EXPIRE claim:v1norm1:en:abc123...def456 7776000  # 90 days
++}}}
--==== 5.1.1 Canonical Claim Normalization (v1) ====
++----
--The cache key depends on deterministic claim normalization. All implementations MUST follow this algorithm exactly.
--
--**Algorithm: Canonical Claim Normalization v1**
--
--{{code language="python"}}
--def normalize_claim_v1(claim_text: str, language: str) -> str:
--    """
--    Normalizes claim to canonical form for cache key generation.
--    Version: v1norm1 (POC1)
--    """
--    import re
--    import unicodedata
--
--    # Step 1: Unicode normalization (NFC)
--    text = unicodedata.normalize('NFC', claim_text)
--
--    # Step 2: Lowercase
--    text = text.lower()
--
--    # Step 3: Remove punctuation (except hyphens in words)
--    text = re.sub(r'[^\w\s-]', '', text)
--
--    # Step 4: Normalize whitespace (collapse multiple spaces)
--    text = re.sub(r'\s+', ' ', text).strip()
--
--    # Step 5: Numeric normalization
--    text = text.replace('%', ' percent')
--    # Spell out single-digit numbers
--    num_to_word = {'0':'zero', '1':'one', '2':'two', '3':'three',
--                   '4':'four', '5':'five', '6':'six', '7':'seven',
--                   '8':'eight', '9':'nine'}
--    for num, word in num_to_word.items():
--        text = re.sub(rf'\b{num}\b', word, text)
--
--    # Step 6: Common abbreviations (English only in v1)
--    if language == 'en':
--        text = text.replace('covid-19', 'covid')
--        text = text.replace('u.s.', 'us')
--        text = text.replace('u.k.', 'uk')
--
--    # Step 7: NO entity normalization in v1
--    # (Trump vs Donald Trump vs President Trump remain distinct)
--
--    return text
--
--# Version identifier (include in cache namespace)
--CANONICALIZER_VERSION = "v1norm1"
--{{/code}}
--
--**Cache Key Formula (Updated):**
--
--{{code}}
--language = "en"
--canonical = normalize_claim_v1(claim_text, language)
--cache_key = f"claim:{CANONICALIZER_VERSION}:{language}:{sha256(canonical)}"
--
--Example:
--  claim: "COVID-19 vaccines are 95% effective"
--  canonical: "covid vaccines are 95 percent effective"
--  sha256: abc123...def456
--  key: "claim:v1norm1:en:abc123...def456"
--{{/code}}
--
--**Cache Metadata MUST Include:**
--
--{{code language="json"}}
--{
--  "canonical_claim": "covid vaccines are 95 percent effective",
--  "canonicalizer_version": "v1norm1",
--  "language": "en",
--  "original_claim_samples": ["COVID-19 vaccines are 95% effective"]
--}
--{{/code}}
--
--**Version Upgrade Path:**
--* v1norm1 → v1norm2: Cache namespace changes, old keys remain valid until TTL
--* v1normN → v2norm1: Major version bump, invalidate all v1 caches
--
--
--claim:stats:hit_count:{claim_hash}  # Counter
--claim:index:domain:{domain}  # Set of claim hashes by domain
--claim:index:language:{lang}  # Set of claim hashes by language
--{{/code}}
--
--
  === 5.1.1 Canonical Claim Normalization (v1) ===
  The cache key depends on deterministic claim normalization. All implementations MUST follow this algorithm exactly.
@@ -930,8 +930,7 @@
  **Algorithm: Canonical Claim Normalization v1**
--{{code language="python"}}
--def normalize_claim_v1(claim_text: str, language: str) -> str:
++{{{def normalize_claim_v1(claim_text: str, language: str) -> str:
      """
      Normalizes claim to canonical form for cache key generation.
      Version: v1norm1 (POC1)
@@ -973,12 +973,11 @@
  # Version identifier (include in cache namespace)
  CANONICALIZER_VERSION = "v1norm1"
--{{/code}}
++}}}
  **Cache Key Formula (Updated):**
--{{code}}
--language = "en"
++{{{language = "en"
  canonical = normalize_claim_v1(claim_text, language)
  cache_key = f"claim:{CANONICALIZER_VERSION}:{language}:{sha256(canonical)}"
@@ -987,25 +987,25 @@
    canonical: "covid vaccines are 95 percent effective"
    sha256: abc123...def456
    key: "claim:v1norm1:en:abc123...def456"
--{{/code}}
++}}}
  **Cache Metadata MUST Include:**
--{{code language="json"}}
--{
++{{{{
    "canonical_claim": "covid vaccines are 95 percent effective",
    "canonicalizer_version": "v1norm1",
    "language": "en",
    "original_claim_samples": ["COVID-19 vaccines are 95% effective"]
  }
--{{/code}}
++}}}
  **Version Upgrade Path:**
++
  * v1norm1 → v1norm2: Cache namespace changes, old keys remain valid until TTL
  * v1normN → v2norm1: Major version bump, invalidate all v1 caches
++----
--
  === 5.1.2 Copyright & Data Retention Policy ===
  **Evidence Excerpt Storage:**
@@ -1013,6 +1013,7 @@
  To comply with copyright law and fair use principles:
  **What We Store:**
++
  * **Metadata only:** Title, author, publisher, URL, publication date
  * **Short excerpts:** Max 25 words per quote, max 3 quotes per evidence item
  * **Summaries:** AI-generated bullet points (not verbatim text)
@@ -1019,17 +1019,20 @@
  * **No full articles:** Never store complete article text beyond job processing
  **Total per Cached Claim:**
++
  * Scenarios: 2 per claim
  * Evidence items: 6 per scenario (12 total)
  * Quotes: 3 per evidence × 25 words = 75 words per item
--* **Maximum stored verbatim text:** ~900 words per claim (12 × 75)
++* **Maximum stored verbatim text:** ~~900 words per claim (12 × 75)
  **Retention:**
++
  * Cache TTL: 90 days
  * Job outputs: 24 hours (then archived or deleted)
  * No persistent full-text article storage
  **Rationale:**
++
  * Short excerpts for citation = fair use
  * Summaries are transformative (not copyrightable)
  * Limited retention (90 days max)
@@ -1036,480 +1036,27 @@
  * No commercial republication of excerpts
  **DMCA Compliance:**
++
  * Cache invalidation endpoint available for rights holders
  * Contact: dmca@factharbor.org
++----
--=== 5.2 Cache Invalidation Strategy ===
++== Summary ==
--**Time-Based (Primary):**
--* TTL: 90 days for most claims
--* Reasoning: Evidence freshness, news cycles
++This WYSIWYG preview shows the **structure and key sections** of the 1,515-line API specification.
--**Event-Based (Manual):**
--* Admin can flag claims for invalidation
--* Example: "Major study retracts findings"
--* Tool: {{code}}DELETE /v1/cache/claim/{claim_hash}?reason=retraction{{/code}}
++**Full specification includes:**
--**Version-Based (Automatic):**
--* AKEL v2.0 release → Invalidate all v1.0 caches
--* Cache keys include version: {{code}}claim:v1:*{{/code}} vs {{code}}claim:v2:*{{/code}}
++* Complete API endpoints (7 total)
++* All data schemas (ClaimExtraction, ClaimAnalysis, HolisticAssessment, Complete)
++* Quality gates & validation rules
++* LLM configuration for all 3 stages
++* Implementation notes with code samples
++* Testing strategy
++* Cross-references to other pages
--**Long-Lived Historical Claims:**
--* Historical claims about completed events generally have stable verdicts
--* Example: "2024 US presidential election results"
--* **Policy:** Extended TTL (365-3,650 days) instead of "never invalidate"
--* **Reason:** Even historical data gets revisions (updated counts, corrections)
--* **Mechanism:** Admin can still manually invalidate if major correction issued
--* **Flag:** {{code}}is_historical=true{{/code}} in cache metadata → longer TTL
++**The complete specification is available in:**
--=== 5.3 Cache Warming Strategy ===
--
--**Proactive Cache Building (Future):**
--
--**Trending Topics:**
--* Monitor news APIs for trending topics
--* Pre-analyze top 20 common claims
--* Example: New health study published → Pre-cache related claims
--
--**Predictable Events:**
--* Elections, sporting events, earnings reports
--* Pre-cache expected claims before event
--* Reduces load during traffic spikes
--
--**User Patterns:**
--* Analyze query logs
--* Identify frequently requested claims
--* Prioritize cache warming for these
--
-----
--
--== 6. Quality Gates & Validation Rules ==
--
--=== 6.1 Quality Gate Overview ===
--
--|=Gate|=Name|=POC1 Status|=Applies To|=Notes
--|**Gate 1**|Claim Validation|✅ Hard gate|Stage 1: Extraction|Filters opinions, compound claims
--|**Gate 2**|Contradiction Search|✅ Mandatory rule|Stage 2: Analysis|Enforced per cached claim
--|**Gate 3**|Uncertainty Disclosure|⚠️ Soft guidance|Stage 2: Analysis|Best practice
--|**Gate 4**|Verdict Confidence|✅ Hard gate|Stage 2: Analysis|Confidence ≥ 0.5 required
--
--**Hard Gate Failures:**
--* Gate 1 fail → Claim excluded from analysis
--* Gate 4 fail → Claim marked "Unsubstantiated" but included
--
--=== 6.2 Validation Rules ===
--
--|=Rule|=Requirement
--|**Mandatory Contradiction**|Stage 2 MUST search for "undermines" evidence. If none found, reasoning must state: "No counter-evidence found despite targeted search."
--|**Context-Aware Logic**|Stage 3 must prioritize central claims. If {{code}}is_central_to_thesis=true{{/code}} claim is REFUTED, article cannot be WELL-SUPPORTED.
--|**Cache Consistency**|Cached claims must match current AKEL version. Version mismatch → cache miss.
--|**Author Identification**|All outputs MUST include {{code}}author_type: "AI/AKEL"{{/code}}.
--
-----
--
--== 7. Deterministic Markdown Template ==
--
--Report generation uses **fixed template** (not LLM-generated).
--
--**Cache-Only Mode Template:**
--{{code language="markdown"}}
--# FactHarbor Analysis Report: PARTIAL ANALYSIS
--
--**Job ID:** {job_id} | **Generated:** {timestamp_utc}
--**Mode:** Cache-Only (Free Tier)
--
-----
--
--## ⚠️ Partial Analysis Notice
--
--This is a **cache-only analysis** based on previously analyzed claims.
--{cache_coverage_percent}% of claims were available in cache.
--
--**What's Included:**
--* {claims_cached} of {claims_total} claims analyzed
--* Evidence and verdicts from cache (last updated: {oldest_cache_date})
--
--**What's Missing:**
--* {claims_missing} claims require new analysis
--* Full article holistic assessment unavailable
--* Estimated cost to complete: ${cost_to_complete}
--
--**[Upgrade to Pro]** for complete analysis
--
-----
--
--## Cached Claims
--
--### [C1] {claim_text} ✅ From Cache
--* **Cached:** {cached_at} ({cache_age} ago)
--* **Times Used:** {hit_count} articles
--* **Verdict:** {verdict} (Confidence: {confidence})
--* **Evidence:** {evidence_count} sources
--
--[Full claim details...]
--
--### [C3] {claim_text} ⚠️ Not In Cache
--* **Status:** Requires new analysis
--* **Cost:** $0.081
--* **Upgrade to analyze this claim**
--
-----
--
--**Powered by FactHarbor POC1-v0.4** | [Upgrade](https://factharbor.org/upgrade)
--{{/code}}
--
-----
--
--== 8. LLM Configuration (3-Stage) ==
--
--=== 8.1 Stage 1: Claim Extraction (Haiku) ===
--
--|=Parameter|=Value|=Notes
--|**Model**|{{code}}claude-haiku-4-20250108{{/code}}|Fast, cheap, sufficient for extraction
--|**Input Tokens**|~10K|Article text after URL extraction
--|**Output Tokens**|~500|5 claims @ ~100 tokens each
--|**Cost**|$0.003 per article|($0.25/M input + $1.25/M output)
--|**Temperature**|0.0|Deterministic
--|**Max Tokens**|1000|Generous buffer
--
--**Prompt Strategy:**
--* Extract 5 verifiable factual claims
--* Mark central vs. supporting claims
--* Canonicalize (normalize phrasing)
--* Deduplicate similar claims
--* Output structured JSON only
--
--=== 8.2 Stage 2: Claim Analysis (Sonnet, CACHED) ===
--
--|=Parameter|=Value|=Notes
--|**Model**|{{code}}claude-3-5-sonnet-20241022{{/code}}|High quality for verdicts
--|**Input Tokens**|~2K|Single claim + prompt + context
--|**Output Tokens**|~5K|2 scenarios × ~2.5K tokens
--|**Cost**|$0.081 per NEW claim|($3/M input + $15/M output)
--|**Temperature**|0.0|Deterministic (cache consistency)
--|**Max Tokens**|8000|Sufficient for 2 scenarios
--|**Cache Strategy**|Redis, 90-day TTL|Key: {{code}}claim:v1norm1:{language}:{sha256(canonical_claim)}{{/code}}
--
--**Prompt Strategy:**
--* Generate 2 scenario interpretations
--* Search for supporting AND undermining evidence (mandatory)
--* 6 evidence items per scenario maximum
--* Compute verdict with reasoning chain (3-4 bullets)
--* Output structured JSON only
--
--**Output Constraints (Cost Control):**
--* Scenarios: Max 2 per claim
--* Evidence: Max 6 per scenario
--* Evidence summary: Max 3 bullets
--* Reasoning chain: Max 4 bullets
--
--=== 8.3 Stage 3: Holistic Assessment (Sonnet) ===
--
--|=Parameter|=Value|=Notes
--|**Model**|{{code}}claude-3-5-sonnet-20241022{{/code}}|Context-aware analysis
--|**Input Tokens**|~5K|Article + claim verdicts
--|**Output Tokens**|~1K|Article verdict + fallacies
--|**Cost**|$0.030 per article|($3/M input + $15/M output)
--|**Temperature**|0.0|Deterministic
--|**Max Tokens**|2000|Sufficient for assessment
--
--**Prompt Strategy:**
--* Detect main thesis
--* Evaluate logical coherence (claim verdicts → thesis)
--* Identify fallacies (correlation-causation, cherry-picking, etc.)
--* Compute logic_quality_score
--* Explain article verdict reasoning (3-4 bullets)
--* Output structured JSON only
--
--=== 8.4 Cost Projections by Cache Hit Rate ===
--
--|=Cache Hit Rate|=Cost per Article|=10K Articles Cost|=100K Articles Cost
--|0% (cold start)|$0.438|$4,380|$43,800
--|20%|$0.357|$3,570|$35,700
--|40%|$0.276|$2,760|$27,600
--|**60%**|**$0.195**|**$1,950**|**$19,500**
--|**70%** (target)|**$0.155**|**$1,550**|**$15,500**
--|**80%**|**$0.114**|**$1,140**|**$11,400**
--|**90%**|**$0.073**|**$730**|**$7,300**
--|95%|$0.053|$530|$5,300
--
--**Break-Even Analysis:**
--* Monolithic (v0.3.1): $0.15 per article constant
--* 3-stage breaks even at **70% cache hit rate**
--* Expected after ~1,500 articles in same domain
--
-----
--
--== 9. Implementation Notes ==
--
--=== 9.1 Recommended Tech Stack ===
--
--* **Framework:** Next.js 14+ with App Router (TypeScript)
--* **Cache:** Redis 7.0+ (managed: AWS ElastiCache, Redis Cloud, Upstash)
--* **Storage:** Filesystem JSON for jobs + S3/R2 for archival
--* **Queue:** BullMQ with Redis (for 3-stage pipeline orchestration)
--* **LLM Client:** Anthropic Python SDK or TypeScript SDK
--* **Cost Tracking:** PostgreSQL for user credit ledger
--* **Deployment:** Vercel (frontend + API) + Redis Cloud
--
--=== 9.2 3-Stage Pipeline Implementation ===
--
--**Job Queue Flow (Conceptual):**
--
--{{code language="typescript"}}
--// Stage 1: Extract Claims
--const stage1Job = await queue.add('stage1-extract-claims', {
--  jobId: 'job123',
--  articleUrl: 'https://example.com/article'
--});
--
--// On Stage 1 completion → enqueue Stage 2 jobs
--stage1Job.on('completed', async (result) => {
--  const { claims } = result;
--
--  // Stage 2: Analyze each claim (with cache check)
--  const stage2Jobs = await Promise.all(
--    claims.map(claim =>
--      queue.add('stage2-analyze-claim', {
--        jobId: 'job123',
--        claimId: claim.claim_id,
--        canonicalClaim: claim.canonical_claim,
--        checkCache: true
--      })
--    )
--  );
--
--  // On all Stage 2 completions → enqueue Stage 3
--  await Promise.all(stage2Jobs.map(j => j.waitUntilFinished()));
--
--  const claimVerdicts = await gatherStage2Results('job123');
--
--  await queue.add('stage3-holistic', {
--    jobId: 'job123',
--    articleUrl: 'https://example.com/article',
--    claimVerdicts: claimVerdicts
--  });
--});
--{{/code}}
--
--**Note:** This is a conceptual sketch. Actual implementation may use BullMQ Flow API or custom orchestration.
--
--**Cache Check Logic:**
--{{code language="typescript"}}
--async function analyzeClaimWithCache(claim: string): Promise<ClaimAnalysis> {
--  const canonicalClaim = normalizeClaim(claim);
--  const claimHash = sha256(canonicalClaim);
--  const cacheKey = `claim:v1:${claimHash}`;
--
--  // Check cache
--  const cached = await redis.get(cacheKey);
--  if (cached) {
--    await redis.incr(`claim:stats:hit_count:${claimHash}`);
--    return JSON.parse(cached);
--  }
--
--  // Cache miss - analyze with LLM
--  const analysis = await analyzeClaim_Stage2(canonicalClaim);
--
--  // Store in cache
--  await redis.set(cacheKey, JSON.stringify(analysis), 'EX', 7776000); // 90 days
--
--  return analysis;
--}
--{{/code}}
--
--=== 9.3 User Credit Management ===
--
--**PostgreSQL Schema:**
--{{code language="sql"}}
--CREATE TABLE user_credits (
--  user_id UUID PRIMARY KEY,
--  tier VARCHAR(20) DEFAULT 'free',
--  credit_limit DECIMAL(10,2) DEFAULT 10.00,
--  credit_used DECIMAL(10,2) DEFAULT 0.00,
--  reset_date TIMESTAMP,
--  cache_only_mode BOOLEAN DEFAULT false,
--  created_at TIMESTAMP DEFAULT NOW()
--);
--
--CREATE TABLE usage_log (
--  id SERIAL PRIMARY KEY,
--  user_id UUID REFERENCES user_credits(user_id),
--  job_id VARCHAR(50),
--  stage VARCHAR(20),
--  cost DECIMAL(10,4),
--  cache_hit BOOLEAN,
--  created_at TIMESTAMP DEFAULT NOW()
--);
--{{/code}}
--
--**Credit Deduction Logic:**
--{{code language="typescript"}}
--async function deductCredit(userId: string, cost: number): Promise<boolean> {
--  const user = await db.query('SELECT * FROM user_credits WHERE user_id = $1', [userId]);
--
--  const newUsed = user.credit_used + cost;
--
--  if (newUsed > user.credit_limit && user.tier === 'free') {
--    // Trigger cache-only mode
--    await db.query(
--      'UPDATE user_credits SET cache_only_mode = true WHERE user_id = $1',
--      [userId]
--    );
--    throw new Error('CREDIT_LIMIT_REACHED');
--  }
--
--  await db.query(
--    'UPDATE user_credits SET credit_used = $1 WHERE user_id = $2',
--    [newUsed, userId]
--  );
--
--  return true;
--}
--{{/code}}
--
--=== 9.4 Cache-Only Mode Implementation ===
--
--**Middleware:**
--{{code language="typescript"}}
--async function checkCacheOnlyMode(req, res, next) {
--  const user = await getUserCredit(req.userId);
--
--  if (user.cache_only_mode) {
--    // Allow only cache reads
--    if (req.body.options?.cache_preference !== 'allow_partial') {
--      return res.status(402).json({
--        error: 'credit_limit_reached',
--        message: 'Resubmit with cache_preference=allow_partial',
--        cache_only_mode: true
--      });
--    }
--
--    // Modify request to skip Stage 2 for uncached claims
--    req.cacheOnlyMode = true;
--  }
--
--  next();
--}
--{{/code}}
--
--=== 9.5 Estimated Timeline ===
--
--**POC1 with 3-Stage Architecture:**
--* Week 1: Stage 1 (Haiku extraction) + Redis setup
--* Week 2: Stage 2 (Sonnet analysis + caching)
--* Week 3: Stage 3 (Holistic assessment) + pipeline orchestration
--* Week 4: User credit system + cache-only mode
--* Week 5: Testing with 100 articles (measure cache hit rate)
--* Week 6: Optimization + bug fixes
--* **Total: 6-8 weeks**
--
--**Manual coding:** 12-16 weeks
--
-----
--
--== 10. Testing Strategy ==
--
--=== 10.1 Cache Performance Testing ===
--
--**Test Scenarios:**
--
--**Scenario 1: Cold Start (0 cache)**
--* Analyze 100 diverse articles
--* Measure: Cost per article, cache growth rate
--* Expected: $0.35-0.40 avg, ~400 unique claims cached
--
--**Scenario 2: Warm Cache (Overlapping Domain)**
--* Analyze 100 articles on SAME topic (e.g., "2024 election")
--* Measure: Cache hit rate growth
--* Expected: Hit rate 20% → 60% by article 100
--
--**Scenario 3: Mature Cache (1,000 articles)**
--* Analyze next 100 articles (diverse topics)
--* Measure: Steady-state cache hit rate
--* Expected: 60-70% hit rate, $0.15-0.18 avg cost
--
--**Scenario 4: Cache-Only Mode**
--* Free user reaches $10 limit (67 articles at 70% hit rate)
--* Submit 10 more articles with {{code}}cache_preference=allow_partial{{/code}}
--* Measure: Coverage %, user satisfaction
--* Expected: 60-70% coverage, instant results
--
--=== 10.2 Success Metrics ===
--
--**Cache Performance:**
--* Week 1: 5-10% hit rate
--* Week 2: 15-25% hit rate
--* Week 3: 30-40% hit rate
--* Week 4: 45-55% hit rate
--* Target: ≥50% by 1,000 articles
--
--**Cost Targets:**
--* Articles 1-100: $0.35-0.40 avg ⚠️ (expected)
--* Articles 100-500: $0.25-0.30 avg
--* Articles 500-1,000: $0.18-0.22 avg
--* Articles 1,000+: $0.12-0.15 avg ✅
--
--**Quality Metrics (same as v0.3.1):**
--* Hallucination rate: <5%
--* Context-aware accuracy: ≥70%
--* False positive rate: <15%
--* Mandatory contradiction search: 100% compliance
--
--=== 10.3 Free Tier Economics Validation ===
--
--**Test with simulated 1,000 users:**
--* Each user: $10 credit
--* 70% cache hit rate
--* Avg 70 articles/user/month
--
--**Projected Costs:**
--* Total credits: 1,000 × $10 = $10,000
--* Actual LLM costs: ~$9,000 (cache savings)
--* Margin: 10%
--
--**Sustainability Check:**
--* If margin <5% → Reduce free tier limit
--* If margin >20% → Consider increasing free tier
--
-----
--
--== 11. Cross-References ==
--
--This API specification implements requirements from:
--
--* **[[POC Requirements>>Test.FactHarbor.Specification.POC.Requirements]]**
--** FR-POC-1 through FR-POC-6 (3-stage architecture)
--** NFR-POC-1 through NFR-POC-3 (quality gates, caching)
--** NEW: FR-POC-7 (Claim-level caching)
--** NEW: FR-POC-8 (User credit system)
--** NEW: FR-POC-9 (Cache-only mode)
--
--* **[[Article Verdict Problem>>Test.FactHarbor.Specification.POC.Article-Verdict-Problem]]**
--** Approach 1 implemented in Stage 3
--** Context-aware holistic assessment
--
--* **[[Requirements>>Test.FactHarbor.Specification.Requirements.WebHome]]**
--** FR4 (Analysis Summary) - enhanced with caching
--** FR7 (Verdict Calculation) - cached per claim
--** NFR11 (Quality Gates) - enforced across stages
--** NEW: NFR19 (Cost Efficiency via Caching)
--** NEW: NFR20 (Free Tier Sustainability)
--
--* **[[Architecture>>Test.FactHarbor.Specification.Architecture.WebHome]]**
--** POC1 3-stage pipeline architecture
--** Redis cache layer
--** User credit system
--
--* **[[Data Model>>Test.FactHarbor.Specification.Data Model.WebHome]]**
--** Claim structure (cacheable unit)
--** Evidence structure
--** Scenario boundaries
--
-----
--
--**End of Specification - FactHarbor POC1 API v0.4**
--
--**3-stage caching architecture with free tier cache-only mode. Ready for sustainable, scalable implementation!** 🚀
--
++* FactHarbor_POC1_API_and_Schemas_Spec_v0_4_1_PATCHED.md (45 KB standalone)
++* Export files (TEST/PRODUCTION) for xWiki import

Changes for page POC1 API & Schemas Specification

Summary

Details

Applications

Navigation

Need help?