Changes for page Automation
Last modified by Robert Schaub on 2025/12/24 20:34
From version 7.3
edited by Robert Schaub
on 2025/12/16 20:26
Change comment:
Update document after refactoring.
Summary
Page properties (2 modified, 0 added, 0 removed)
Details
- Page properties
- Parent
@@ -1,1 +1,1 @@
-FactHarbor.Archive.FactHarbor V0\.9\.18.Specification.WebHome
+FactHarbor.Specification.WebHome

- Content
@@ -1,271 +1,167 @@
 = Automation =

-Automation in FactHarbor amplifies human capability while implementing risk-based oversight.
+Automation in FactHarbor amplifies human capability but never replaces human oversight.
+All automated outputs require human review before publication.

 This chapter defines:
-* Risk-based publication model
-* Quality gates for AI-generated content
+
 * What must remain human-only
-* What AI (AKEL) can draft and publish
+* What AI (AKEL) can draft
 * What can be fully automated
 * How automation evolves through POC → Beta 0 → Release 1.0

-== POC v1 (AI-Generated Publication Demonstration) ==
-
-The goal of POC v1 is to validate the automated reasoning capabilities and demonstrate AI-generated content publication.
-
-=== Workflow ===
-
-1. **Input**: User pastes a block of raw text.
-1. **Deep Analysis (Background)**: The system autonomously performs the full pipeline **before** displaying the text:
-* Extraction & Normalisation
-* Scenario & Sub-query generation
-* Evidence retrieval with **contradiction search**
-* Quality gate validation
-* Verdict computation
-1. **Visualisation (Extraction & Marking)**: The system displays the text with claims extracted and marked.
-* **Verdict-Based Coloring**: The extraction highlights (e.g. Orange/Green) are chosen **according to the computed verdict** for each claim.
-* **AI-Generated Label**: Clear indication that content is AI-produced
-1. **Inspection**: User clicks a highlighted claim to see the **Reasoning Trail**, showing exactly which evidence and sub-queries led to that verdict.
-
-=== Technical Scope ===
-
-* **AI-Generated Publication**: Content published as Mode 2 (AI-Generated, no prior human review)
-* **Quality Gates Active**: All automated quality checks enforced
-* **Contradiction Search Demonstrated**: Shows counter-evidence and reservation detection
-* **Risk Tier Classification**: POC shows tier assignment (demo purposes)
-* **No Human Approval Gate**: Demonstrates scalable AI publication
-* **Structured Sub-Queries**: Logic generated by decomposing claims into the FactHarbor data model
-
 ----

-== Publication Model ==
+= Manual vs Automated Responsibilities =
-
-FactHarbor implements a risk-based publication model with three modes:
-
-=== Mode 1: Draft-Only ===
-* Failed quality gates
-* High-risk content pending expert review
-* Internal review queue only
-
-=== Mode 2: AI-Generated (Public) ===
-* Passed all quality gates
-* Risk tier B or C
-* Clear AI-generated labeling
-* Users can request human review
-
-=== Mode 3: Human-Reviewed ===
-* Validated by human reviewers/experts
-* "Human-Reviewed" status badge
-* Required for Tier A content publication
-
-See [[AKEL page>>FactHarbor.Specification.AI Knowledge Extraction Layer (AKEL).WebHome]] for detailed publication mode descriptions.
-
-----
-
-== Risk Tiers and Automation Levels ==
-
-=== Tier A (High Risk) ===
-* **Domains**: Medical, legal, elections, safety, security
-* **Automation**: AI can draft, human review required for "Human-Reviewed" status
-* **AI publication**: Allowed with prominent disclaimers and warnings
-* **Audit rate**: Recommendation: 30-50%
-
-=== Tier B (Medium Risk) ===
-* **Domains**: Complex policy, science, causality claims
-* **Automation**: AI can draft and publish (Mode 2)
-* **Human review**: Optional, audit-based
-* **Audit rate**: Recommendation: 10-20%
-
-=== Tier C (Low Risk) ===
-* **Domains**: Definitions, established facts, historical data
-* **Automation**: AI publication default
-* **Human review**: On request or via sampling
-* **Audit rate**: Recommendation: 5-10%
-
-----
-
 == Human-Only Tasks ==

-These require human judgment and cannot be automated:
+These require human judgment, ethics, or contextual interpretation:

-* **Ethical boundary decisions** (especially medical, political, psychological harm assessment)
-* **Dispute resolution** between conflicting expert opinions
-* **Governance policy** setting and enforcement
-* **Final authority** on Tier A "Human-Reviewed" status
-* **Audit system oversight** and quality standard definition
-* **Risk tier policy** adjustments based on societal context
+* Definition of key terms in claims
+* Approval or rejection of scenarios
+* Interpretation of evidence in context
+* Final verdict approval
+* Governance decisions and dispute resolution
+* High-risk domain oversight
+* Ethical boundary decisions (especially medical, political, psychological)

-----
+== Semi-Automated (AI Draft → Human Review) ==

-== AI-Draft with Audit (Semi-Automated) ==
+AKEL can draft these, but humans must refine/approve:

-AKEL drafts these; humans validate via sampling audits:
+* Scenario structures (definitions, assumptions, context)
+* Evaluation methods
+* Evidence relevance suggestions
+* Reliability hints
+* Verdict reasoning chains
+* Uncertainty and limitations
+* Scenario comparison explanations
+* Suggestions for merging or splitting scenarios
+* Draft public summaries

-* **Scenario structures** (definitions, assumptions, context)
-* **Evaluation methods** and reasoning chains
-* **Evidence relevance** assessment and ranking
-* **Reliability scoring** and source evaluation
-* **Verdict reasoning** with uncertainty quantification
-* **Contradiction and reservation** identification
-* **Scenario comparison** explanations
-* **Public summaries** and accessibility text
-
-Most Tier B and C content remains in AI-draft status unless:
-* Users request human review
-* Audits identify errors
-* High engagement triggers review
-* Community flags issues
-
-----
-
 == Fully Automated Structural Tasks ==

 These require no human interpretation:

-* **Claim normalization** (canonical form generation)
-* **Duplicate detection** (vector embeddings, clustering)
-* **Evidence metadata extraction** (dates, authors, publication info)
-* **Basic reliability heuristics** (source reputation scoring)
-* **Contradiction detection** (conflicting statements across sources)
-* **Re-evaluation triggers** (new evidence, source updates)
-* **Layout generation** (diagrams, summaries, UI presentation)
-* **Federation integrity checks** (cross-node data validation)
+* Claim normalization
+* Duplicate & cluster detection (vector embeddings)
+* Evidence metadata extraction
+* Basic reliability heuristics
+* Contradiction detection
+* Re-evaluation triggers
+* Batch layout generation (diagrams, summaries)
+* Federation integrity checks

 ----

-== Quality Gates (Automated) ==
+= Automation Roadmap =

-Before AI-draft publication (Mode 2), content must pass:
+Automation increases with maturity.

-1. **Source Quality Gate**
-   * Primary sources verified
-   * Citations complete and accessible
-   * Source reliability scored
+== POC (Low Automation) ==

-2. **Contradiction Search Gate** (MANDATORY)
-   * Counter-evidence actively sought
-   * Reservations and limitations identified
-   * Bubble detection (echo chambers, conspiracy theories)
-   * Diverse perspective verification
+=== Automated ===

-3. **Uncertainty Quantification Gate**
-   * Confidence scores calculated
-   * Limitations stated
-   * Data gaps disclosed
+* Claim normalization
+* Light scenario templates
+* Evidence metadata extraction
+* Simple verdict drafts (internal only)

-4. **Structural Integrity Gate**
-   * No hallucinations detected
-   * Logic chain valid
-   * References verifiable
+=== Human ===

-See [[AKEL page>>FactHarbor.Specification.AI Knowledge Extraction Layer (AKEL).WebHome]] for detailed quality gate specifications.
+* All scenario definitions
+* Evidence interpretation
+* Verdict creation
+* Governance

-----
+== Beta 0 (Medium Automation) ==

-== Audit System ==
+=== Automated ===

-Instead of reviewing all AI output, systematic sampling audits ensure quality:
+* Detailed scenario drafts
+* Evidence reliability scoring
+* Cross-scenario comparisons
+* Contradiction detection (local + remote nodes)
+* Internal Truth Landscape drafts

-=== Stratified Sampling ===
-* Risk tier (A > B > C sampling rates)
-* Confidence scores (low confidence → more audits)
-* Traffic/engagement (popular content audited more)
-* Novelty (new topics/claim types prioritized)
-* User flags and disagreement signals
+=== Human ===

-=== Continuous Improvement Loop ===
-Audit findings improve:
-* Query templates
-* Source reliability weights
-* Contradiction detection algorithms
-* Risk tier assignment rules
-* Bubble detection heuristics
+* Scenario approval
+* Final verdict validation

-=== Transparency ===
-* Audit statistics published
-* Accuracy rates by tier reported
-* System improvements documented
+== Release 1.0 (High Automation) ==

-----
+=== Automated ===

-== Automation Roadmap ==
+* Full scenario generation (definitions, assumptions, boundaries)
+* Evidence relevance scoring and ranking
+* Bayesian verdict scoring across scenario sets
+* Multi-scenario summary generation
+* Anomaly detection across nodes
+* AKEL-assisted federated synchronization

-Automation capabilities increase with system maturity while maintaining quality oversight.
+=== Human ===

-=== POC (Current Focus) ===
+* Final approval of all scenarios and verdicts
+* Ethical decisions
+* Oversight and conflict resolution

-**Automated:**
-* Claim normalization
-* Scenario template generation
-* Evidence metadata extraction
-* Simple verdict drafts
-* **AI-generated publication** (Mode 2, with quality gates)
-* **Contradiction search**
-* **Risk tier assignment**
+----

-**Human:**
-* High-risk content validation (Tier A)
-* Sampling audits across all tiers
-* Quality standard refinement
-* Governance decisions
+= Automation Levels =

-=== Beta 0 (Enhanced Automation) ===
+== Level 0 — Human-Centric (POC) ==

-**Automated:**
-* Detailed scenario generation
-* Advanced evidence reliability scoring
-* Cross-scenario comparisons
-* Multi-source contradiction detection
-* Internal Truth Landscape generation
-* **Increased AI-draft coverage** (more Tier B content)
+AI is purely advisory, nothing auto-published.

-**Human:**
-* Tier A final approval
-* Audit sampling (continued)
-* Expert validation of complex domains
-* Quality improvement oversight
+== Level 1 — Assisted (Beta 0) ==

-=== Release 1.0 (High Automation) ===
+AI drafts structures; humans approve each part.

-**Automated:**
-* Full scenario generation (comprehensive)
-* Bayesian verdict scoring across scenarios
-* Multi-scenario summary generation
-* Anomaly detection across federated nodes
-* AKEL-assisted cross-node synchronization
-* **Most Tier B and all Tier C** auto-published
+== Level 2 — Structured (Release 1.0) ==

-**Human:**
-* Tier A oversight (still required)
-* Strategic audits (lower sampling rates, higher value)
-* Ethical decisions and policy
-* Conflict resolution
+AI produces near-complete drafts; humans refine.

-----
+== Level 3 — Distributed Intelligence (Future) ==

-== Automation Levels Diagram ==
+Nodes exchange embeddings, contradiction alerts, and scenario templates.
+Humans still approve everything.

-{{include reference="Test.FactHarborV09.Specification.Diagrams.Automation Level.WebHome"/}}
-
 ----

-== Automation Roadmap Diagram ==
+= Automation Matrix =

-{{include reference="Test.FactHarborV09.Specification.Diagrams.AutomationRoadmap.WebHome"/}}
+== Always Human ==

-----
+* Final verdict approval
+* Scenario validity
+* Ethical decisions
+* Dispute resolution

-== Manual vs Automated Matrix ==
+== Mostly AI (Human Validation Needed) ==

-{{include reference="Test.FactHarborV09.Specification.Diagrams.Manual vs Automated matrix.WebHome"/}}
+* Claim normalization
+* Clustering
+* Evidence metadata
+* Reliability heuristics
+* Scenario drafts
+* Contradiction detection

+== Mixed ==
+
+* Definitions of ambiguous terms
+* Boundary choices
+* Assumption evaluation
+* Evidence selection
+* Verdict reasoning
+
 ----

-== Related Pages ==
+= Diagram References =

-* [[AKEL (AI Knowledge Extraction Layer)>>FactHarbor.Specification.AI Knowledge Extraction Layer (AKEL).WebHome]]
-* [[Requirements (Roles)>>FactHarbor.Specification.Requirements.WebHome]]
-* [[Workflows>>FactHarbor.Specification.Workflows.WebHome]]
-* [[Governance>>FactHarbor.Organisation.Governance]]
+{{include reference="FactHarbor.Archive.Diagrams v0\.8q.Automation Roadmap.WebHome"/}}

+{{include reference="FactHarbor.Archive.Diagrams v0\.8q.Automation Level.WebHome"/}}
+
+{{include reference="FactHarbor.Specification.Diagrams.Manual vs Automated matrix.WebHome"/}}
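
Reading aid (not part of the page revision): the "Automation Matrix" added on the "+" side of this change could be sketched as a simple lookup table. This is a minimal Python sketch under stated assumptions. The names `Responsibility`, `AUTOMATION_MATRIX`, `ai_may_draft`, and `may_publish` are hypothetical and do not come from FactHarbor. The task names are copied from the matrix, and the publication rule follows the revised introduction ("All automated outputs require human review before publication").

```python
from enum import Enum


class Responsibility(Enum):
    # Category names mirror the three headings of the Automation Matrix.
    ALWAYS_HUMAN = "Always Human"
    MOSTLY_AI = "Mostly AI (Human Validation Needed)"
    MIXED = "Mixed"


# Task names copied from the "Automation Matrix" section of the new revision.
AUTOMATION_MATRIX = {
    "Final verdict approval": Responsibility.ALWAYS_HUMAN,
    "Scenario validity": Responsibility.ALWAYS_HUMAN,
    "Ethical decisions": Responsibility.ALWAYS_HUMAN,
    "Dispute resolution": Responsibility.ALWAYS_HUMAN,
    "Claim normalization": Responsibility.MOSTLY_AI,
    "Clustering": Responsibility.MOSTLY_AI,
    "Evidence metadata": Responsibility.MOSTLY_AI,
    "Reliability heuristics": Responsibility.MOSTLY_AI,
    "Scenario drafts": Responsibility.MOSTLY_AI,
    "Contradiction detection": Responsibility.MOSTLY_AI,
    "Definitions of ambiguous terms": Responsibility.MIXED,
    "Boundary choices": Responsibility.MIXED,
    "Assumption evaluation": Responsibility.MIXED,
    "Evidence selection": Responsibility.MIXED,
    "Verdict reasoning": Responsibility.MIXED,
}


def ai_may_draft(task: str) -> bool:
    """Assumption: AI may draft anything not listed under "Always Human"."""
    return AUTOMATION_MATRIX[task] is not Responsibility.ALWAYS_HUMAN


def may_publish(task: str, human_approved: bool) -> bool:
    """Assumption: every known task needs human sign-off before publication."""
    if task not in AUTOMATION_MATRIX:
        raise KeyError(f"unknown task: {task}")
    return human_approved
```

Under these assumptions, `ai_may_draft("Scenario drafts")` is true while `ai_may_draft("Final verdict approval")` is false, and `may_publish` never returns true without human approval, whatever the category.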