Changes for page POC2: Robust Quality & Reliability

Last modified by Robert Schaub on 2025/12/24 09:59

From 2.1 to 2.2

From version 1.1

edited by Robert Schaub
on 2025/12/23 18:19

Change comment: Imported from XAR

To version 2.1

edited by Robert Schaub
on 2025/12/24 09:44

Change comment: Imported from XAR

Raw
Rendered

Summary

Page properties (1 modified, 0 added, 0 removed)

Details

Page properties

Content

@@ -160,6 +160,95 @@
  **See:** [[Architecture>>Test.FactHarbor pre10 V0\.9\.70.Specification.Architecture.WebHome]] for details
++== 5. Context-Aware Analysis (Conditional Feature) ==
++
++**Status:** Depends on POC1 experimental test results
++
++**Background:**
++
++POC1 tested context-aware analysis as an experimental feature using Approach 1 (Single-Pass Holistic Analysis). The goal is to detect when articles use accurate individual claims but reach misleading conclusions through faulty logic or selective presentation.
++
++**See:** [[Article Verdict Problem>>Test.FactHarbor.Specification.POC.Article-Verdict-Problem]] for complete investigation
++
++=== 5.1 POC2 Implementation Path ===
++
++**Decision based on POC1 test results (30-article test set):**
++
++==== If POC1 Accuracy ≥70% (Success) ====
++
++**Action:** Implement as standard feature (no longer experimental)
++
++**Enhancement to FR4:**
++* Context-aware analysis becomes part of standard Analysis Summary
++* Article verdict may differ from simple claim average
++* AI evaluates logical structure and reasoning quality
++
++**Potential Upgrade to Approach 6 (Hybrid):**
++* Add weighted claim importance (some claims more central than others)
++* Add rule-based fallacy detection alongside AI reasoning
++* Combine AI judgment with heuristic checks for robustness
++
++**Target:** Maintain ≥70% accuracy at detecting misleading articles
++
++==== If POC1 Accuracy 50-70% (Promising) ====
++
++**Action:** Implement alternative Approach 4 (Weighted Aggregation)
++
++**Instead of holistic analysis:**
++* AI assigns importance weights (0-1) to each claim
++* Weight based on: claim centrality, evidence strength, logical role
++* Article verdict = weighted average of claim verdicts
++* More structured than pure AI reasoning
++
++**Rationale:** If holistic reasoning is inconsistent, structured weighting may work better
++
++==== If POC1 Accuracy <50% (Insufficient) ====
++
++**Action:** Defer context-aware analysis to post-POC2
++
++**Fallback:**
++* Focus on individual claim accuracy only
++* Article verdict = simple average of claim verdicts
++* Note limitation: May miss misleading articles built from accurate claims
++
++**Future consideration:** Try Approach 7 (LLM-as-Judge) with better models in future releases
++
++=== 5.2 Testing in POC2 ===
++
++**If context-aware feature is implemented:**
++
++* Expand test set from 30 to 100 articles
++* Include more diverse article types (op-eds, news, analysis, advocacy)
++* Track false positive rate (flagging good articles as misleading)
++* Validate with subject matter experts when possible
++
++**Success Metrics:**
++* ≥70% accuracy on misleading article detection
++* <15% false positive rate
++* Reasoning is comprehensible to users
++
++=== 5.3 Architecture Notes ===
++
++**Context-aware analysis adds NO additional API calls**
++
++The enhanced analysis happens within the existing AKEL workflow:
++
++{{code}}
++Standard Flow:           Context-Aware Enhancement:
++1. Extract claims        1. Extract claims + mark central claims
++2. Find evidence         2. Find evidence
++3. Generate verdicts     3. Generate verdicts
++4. Write summary         4. Write context-aware summary
++                            (evaluates article structure)
++{{/code}}
++
++**Cost:** $0 increase (same API calls, enhanced prompt only)
++
++**See:** [[POC Requirements>>Test.FactHarbor.Specification.POC.Requirements]] Component 1 for implementation details
++
++
++
++
  == Related Pages ==
  * [[POC1>>Test.FactHarbor pre10 V0\.9\.70.Roadmap.POC1.WebHome]] - Previous phase

Changes for page POC2: Robust Quality & Reliability

Summary

Details

Applications

Navigation

Need help?