Changes for page Automation

Last modified by Robert Schaub on 2025/12/24 21:53

From version 4.1
edited by Robert Schaub
on 2025/12/24 21:53
Change comment: Imported from XAR
To version 1.1
edited by Robert Schaub
on 2025/12/18 12:03
Change comment: Imported from XAR

Summary

Details

Page properties
Content
... ... @@ -37,77 +37,6 @@
37 37  * Detected manipulation attempt
38 38  * Unusual pattern
39 39  * Moderator reviews and may take action
40 -
41 -== 2.5 LLM-Based Processing Architecture ==
42 -
43 -FactHarbor delegates complex reasoning and analysis tasks to Large Language Models (LLMs). The architecture evolves from POC to production:
44 -
45 -=== POC: Two-Phase Approach ===
46 -
47 -**Phase 1: Claim Extraction**
48 -* Single LLM call to extract all claims from submitted content
49 -* Light structure, focused on identifying distinct verifiable claims
50 -* Output: List of claims with context
51 -
52 -**Phase 2: Claim Analysis (Parallel)**
53 -* Single LLM call per claim (parallelizable)
54 -* Full structured output: Evidence, Scenarios, Sources, Verdict, Risk
55 -* Each claim analyzed independently
56 -
57 -**Advantages:**
58 -* Fast to implement ( to working POC)
59 -* Only 2-3 API calls total (1 + N claims)
60 -* Simple to debug (claim-level isolation)
61 -* Proves concept viability
62 -
63 -=== Production: Three-Phase Approach ===
64 -
65 -**Phase 1: Claim Extraction + Validation**
66 -* Extract distinct verifiable claims
67 -* Validate claim clarity and uniqueness
68 -* Remove duplicates and vague claims
69 -
70 -**Phase 2: Evidence Gathering (Parallel)**
71 -* For each claim independently:
72 - * Find supporting and contradicting evidence
73 - * Identify authoritative sources
74 - * Generate test scenarios
75 -* Validation: Check evidence quality and source validity
76 -* Error containment: Issues in one claim don't affect others
77 -
78 -**Phase 3: Verdict Generation (Parallel)**
79 -* For each claim:
80 - * Generate verdict based on validated evidence
81 - * Assess confidence and risk level
82 - * Flag low-confidence results for human review
83 -* Validation: Check verdict consistency with evidence
84 -
85 -**Advantages:**
86 -* Error containment between phases
87 -* Clear quality gates and validation
88 -* Observable metrics per phase
89 -* Scalable (parallel processing across claims)
90 -* Adaptable (can optimize each phase independently)
91 -
92 -=== LLM Task Delegation ===
93 -
94 -All complex cognitive tasks are delegated to LLMs:
95 -* **Claim Extraction**: Understanding context, identifying distinct claims
96 -* **Evidence Finding**: Analyzing sources, assessing relevance
97 -* **Scenario Generation**: Creating testable hypotheses
98 -* **Source Evaluation**: Assessing reliability and authority
99 -* **Verdict Generation**: Synthesizing evidence into conclusions
100 -* **Risk Assessment**: Evaluating potential impact
101 -
102 -=== Error Mitigation ===
103 -
104 -Research shows sequential LLM calls face compound error risks. FactHarbor mitigates this through:
105 -* **Validation gates** between phases
106 -* **Confidence thresholds** for quality control
107 -* **Parallel processing** to avoid error propagation across claims
108 -* **Human review queue** for low-confidence verdicts
109 -* **Independent claim processing** - errors in one claim don't cascade to others
110 -
111 111  == 3. Risk Tiers ==
112 112  Risk tiers classify claims by potential impact and guide audit sampling rates.
113 113  === 3.1 Tier A (High Risk) ===
... ... @@ -150,11 +150,6 @@
150 150  **Release 1.0** (Initial): Tier B/C auto-published, Tier A flagged for review
151 151  **Release 2.0** (Mature): All tiers auto-published with risk labels, sampling audits
152 152  See [[Automation Roadmap>>FactHarbor.Specification.Diagrams.Automation Roadmap.WebHome]] for detailed progression.
153 -
154 -== 5.5 Automation Roadmap ==
155 -
156 -{{include reference="FactHarbor.Specification.Diagrams.Automation Roadmap.WebHome"/}}
157 -
158 158  == 6. Human Role ==
159 159  Humans do NOT review content for approval. Instead:
160 160  **Monitoring**: Watch aggregate performance metrics
... ... @@ -162,11 +162,6 @@
162 162  **Exception handling**: Review AKEL-flagged items
163 163  **Governance**: Set policies AKEL applies
164 164  See [[Contributor Processes>>FactHarbor.Organisation.Contributor-Processes]] for how to improve the system.
165 -
166 -== 6.5 Manual vs Automated Matrix ==
167 -
168 -{{include reference="FactHarbor.Specification.Diagrams.Manual vs Automated matrix.WebHome"/}}
169 -
170 170  == 7. Moderation ==
171 171  Moderators handle items AKEL flags:
172 172  **Abuse detection**: Spam, manipulation, harassment