Changes for page Automation

Last modified by Robert Schaub on 2025/12/24 21:46

From version 3.1
edited by Robert Schaub
on 2025/12/18 22:28
Change comment: Imported from XAR
To version 1.1
edited by Robert Schaub
on 2025/12/18 12:03
Change comment: Imported from XAR

Summary

Details

Page properties
Content
... ... @@ -37,78 +37,6 @@
37 37  * Detected manipulation attempt
38 38  * Unusual pattern
39 39  * Moderator reviews and may take action
40 -
41 -== 2.5 LLM-Based Processing Architecture ==
42 -
43 -FactHarbor delegates complex reasoning and analysis tasks to Large Language Models (LLMs). The architecture evolves from POC to production:
44 -
45 -=== POC: Two-Phase Approach ===
46 -
47 -**Phase 1: Claim Extraction**
48 -* Single LLM call to extract all claims from submitted content
49 -* Light structure, focused on identifying distinct verifiable claims
50 -* Output: List of claims with context
51 -
52 -**Phase 2: Claim Analysis (Parallel)**
53 -* Single LLM call per claim (parallelizable)
54 -* Full structured output: Evidence, Scenarios, Sources, Verdict, Risk
55 -* Each claim analyzed independently
56 -
57 -**Advantages:**
58 -* Fast to implement (2-4 weeks to working POC)
59 -* Only 2-3 API calls total (1 + N claims)
60 -* Simple to debug (claim-level isolation)
61 -* Proves concept viability
62 -
63 -=== Production: Three-Phase Approach ===
64 -
65 -**Phase 1: Claim Extraction + Validation**
66 -* Extract distinct verifiable claims
67 -* Validate claim clarity and uniqueness
68 -* Remove duplicates and vague claims
69 -
70 -**Phase 2: Evidence Gathering (Parallel)**
71 -* For each claim independently:
72 - * Find supporting and contradicting evidence
73 - * Identify authoritative sources
74 - * Generate test scenarios
75 -* Validation: Check evidence quality and source validity
76 -* Error containment: Issues in one claim don't affect others
77 -
78 -**Phase 3: Verdict Generation (Parallel)**
79 -* For each claim:
80 - * Generate verdict based on validated evidence
81 - * Assess confidence and risk level
82 - * Flag low-confidence results for human review
83 -* Validation: Check verdict consistency with evidence
84 -
85 -**Advantages:**
86 -* Error containment between phases
87 -* Clear quality gates and validation
88 -* Observable metrics per phase
89 -* Scalable (parallel processing across claims)
90 -* Adaptable (can optimize each phase independently)
91 -
92 -=== LLM Task Delegation ===
93 -
94 -All complex cognitive tasks are delegated to LLMs:
95 -* **Claim Extraction**: Understanding context, identifying distinct claims
96 -* **Evidence Finding**: Analyzing sources, assessing relevance
97 -* **Scenario Generation**: Creating testable hypotheses
98 -* **Source Evaluation**: Assessing reliability and authority
99 -* **Verdict Generation**: Synthesizing evidence into conclusions
100 -* **Risk Assessment**: Evaluating potential impact
101 -
102 -=== Error Mitigation ===
103 -
104 -Research shows sequential LLM calls face compound error risks. FactHarbor mitigates this through:
105 -* **Validation gates** between phases
106 -* **Confidence thresholds** for quality control
107 -* **Parallel processing** to avoid error propagation across claims
108 -* **Human review queue** for low-confidence verdicts
109 -* **Independent claim processing** - errors in one claim don't cascade to others
110 -
111 -
112 112  == 3. Risk Tiers ==
113 113  Risk tiers classify claims by potential impact and guide audit sampling rates.
114 114  === 3.1 Tier A (High Risk) ===
... ... @@ -151,11 +151,6 @@
151 151  **Release 1.0** (Initial): Tier B/C auto-published, Tier A flagged for review
152 152  **Release 2.0** (Mature): All tiers auto-published with risk labels, sampling audits
153 153  See [[Automation Roadmap>>FactHarbor.Specification.Diagrams.Automation Roadmap.WebHome]] for detailed progression.
154 -
155 -== 5.5 Automation Roadmap ==
156 -
157 -{{include reference="FactHarbor.Specification.Diagrams.Automation Roadmap.WebHome"/}}
158 -
159 159  == 6. Human Role ==
160 160  Humans do NOT review content for approval. Instead:
161 161  **Monitoring**: Watch aggregate performance metrics
... ... @@ -163,11 +163,6 @@
163 163  **Exception handling**: Review AKEL-flagged items
164 164  **Governance**: Set policies AKEL applies
165 165  See [[Contributor Processes>>FactHarbor.Organisation.Contributor-Processes]] for how to improve the system.
166 -
167 -== 6.5 Manual vs Automated Matrix ==
168 -
169 -{{include reference="FactHarbor.Specification.Diagrams.Manual vs Automated matrix.WebHome"/}}
170 -
171 171  == 7. Moderation ==
172 172  Moderators handle items AKEL flags:
173 173  **Abuse detection**: Spam, manipulation, harassment