Changes for page Automation
Last modified by Robert Schaub on 2025/12/24 20:34
Summary
Page properties (1 modified, 0 added, 0 removed)
Details
- Content
@@ -1,111 +1,296 @@
 = Automation =
 
-Automation in FactHarbor amplifies human capability but never replaces human oversight.
-All automated outputs require human review before publication.
+Automation in FactHarbor amplifies human capability while implementing risk-based oversight.
 
 This chapter defines:
+
+* Risk-based publication model
+* Quality gates for AI-generated content
 * What must remain human-only
-* What AI (AKEL) can draft
+* What AI (AKEL) can draft and publish
 * What can be fully automated
 * How automation evolves through POC → Beta 0 → Release 1.0
 
-== POC v1 (Fully Automated "Text to Truth Landscape") ==
+== POC v1 (AI-Generated Publication Demonstration) ==
 
-The goal of POC v1 is to validate the automated reasoning capabilities of the data model without human intervention.
+The goal of POC v1 is to validate the automated reasoning capabilities and demonstrate AI-generated content publication.
 
 === Workflow ===
 
 1. **Input**: User pastes a block of raw text.
 1. **Deep Analysis (Background)**: The system autonomously performs the full pipeline **before** displaying the text:
+
 * Extraction & Normalisation
 * Scenario & Sub-query generation
-* Evidence retrieval & Verdict computation
+* Evidence retrieval with **contradiction search**
+* Quality gate validation
+* Verdict computation
+
 1. **Visualisation (Extraction & Marking)**: The system displays the text with claims extracted and marked.
+
 * **Verdict-Based Coloring**: The extraction highlights (e.g. Orange/Green) are chosen **according to the computed verdict** for each claim.
+* **AI-Generated Label**: Clear indication that content is AI-produced
+
 1. **Inspection**: User clicks a highlighted claim to see the **Reasoning Trail**, showing exactly which evidence and sub-queries led to that verdict.
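The visualisation and inspection steps of the revised workflow could be sketched roughly as follows. This is only an illustration: the verdict names, the color mapping, the grey fallback, and the `MarkedClaim` and `render_highlight` names are all assumptions, since the page gives "Orange/Green" as examples without saying which verdict maps to which color.

```python
from dataclasses import dataclass, field

# Hypothetical mapping: the page only names Orange/Green as example colors.
HIGHLIGHT_COLORS = {"supported": "green", "contested": "orange"}


@dataclass
class MarkedClaim:
    """One claim extracted from the pasted text (illustrative structure)."""
    text: str
    verdict: str  # computed in the background, before anything is displayed
    reasoning_trail: list = field(default_factory=list)  # evidence and sub-queries


def render_highlight(claim: MarkedClaim) -> dict:
    """Build the display record for one highlighted claim."""
    return {
        "text": claim.text,
        # The color follows the computed verdict; unknown verdicts fall back to grey.
        "color": HIGHLIGHT_COLORS.get(claim.verdict, "grey"),
        # Every highlight carries the AI-generated label added in this revision.
        "label": "AI-Generated",
        # Shown when the user clicks the highlight (the Reasoning Trail).
        "trail": list(claim.reasoning_trail),
    }
```

Keeping the reasoning trail on the claim record means the inspection step needs no second lookup, which is one plausible way to guarantee that the trail shown matches the verdict that chose the color.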
 
 === Technical Scope ===
 
-* **Fully Automated**: No human-in-the-loop for this phase.
-* **Structured Sub-Queries**: Logic is generated by decomposing claims into the FactHarbor data model.
-* **Latency**: Focus on accuracy of reasoning over real-time speed for v1.
+* **AI-Generated Publication**: Content published as Mode 2 (AI-Generated, no prior human review)
+* **Quality Gates Active**: All automated quality checks enforced
+* **Contradiction Search Demonstrated**: Shows counter-evidence and reservation detection
+* **Risk Tier Classification**: POC shows tier assignment (demo purposes)
+* **No Human Approval Gate**: Demonstrates scalable AI publication
+* **Structured Sub-Queries**: Logic generated by decomposing claims into the FactHarbor data model
 
 ----
 
-== Manual vs Automated Responsibilities ==
+== Publication Model ==
 
-=== Human-Only Tasks ===
+FactHarbor implements a risk-based publication model with three modes:
 
-These require human judgment, ethics, or contextual interpretation:
+=== Mode 1: Draft-Only ===
 
-* Definition of key terms in claims
-* Approval or rejection of scenarios
-* Interpretation of evidence in context
-* Final verdict approval
-* Governance decisions and dispute resolution
-* High-risk domain oversight
-* Ethical boundary decisions (especially medical, political, psychological)
+* Failed quality gates
+* High-risk content pending expert review
+* Internal review queue only
 
-=== Semi-Automated (AI Draft → Human Review) ===
+=== Mode 2: AI-Generated (Public) ===
 
-AKEL can draft these, but humans must refine/approve:
+* Passed all quality gates
+* Risk tier B or C
+* Clear AI-generated labeling
+* Users can request human review
 
-* Scenario structures (definitions, assumptions, context)
-* Evaluation methods
-* Evidence relevance suggestions
-* Reliability hints
-* Verdict reasoning chains
-* Uncertainty and limitations
-* Scenario comparison explanations
-* Suggestions for merging or splitting scenarios
-* Draft public summaries
+=== Mode 3: Human-Reviewed ===
 
-=== Fully Automated Structural Tasks ===
+* Validated by human reviewers/experts
+* "Human-Reviewed" status badge
+* Required for Tier A content publication
 
+See [[AKEL page>>FactHarbor.Specification V0\.9\.18.AI Knowledge Extraction Layer (AKEL).WebHome]] for detailed publication mode descriptions.
+
+----
+
+== Risk Tiers and Automation Levels ==
+
+=== Tier A (High Risk) ===
+
+* **Domains**: Medical, legal, elections, safety, security
+* **Automation**: AI can draft, human review required for "Human-Reviewed" status
+* **AI publication**: Allowed with prominent disclaimers and warnings
+* **Audit rate**: Recommendation: 30-50%
+
+=== Tier B (Medium Risk) ===
+
+* **Domains**: Complex policy, science, causality claims
+* **Automation**: AI can draft and publish (Mode 2)
+* **Human review**: Optional, audit-based
+* **Audit rate**: Recommendation: 10-20%
+
+=== Tier C (Low Risk) ===
+
+* **Domains**: Definitions, established facts, historical data
+* **Automation**: AI publication default
+* **Human review**: On request or via sampling
+* **Audit rate**: Recommendation: 5-10%
+
+----
+
+== Human-Only Tasks ==
+
+These require human judgment and cannot be automated:
+
+* **Ethical boundary decisions** (especially medical, political, psychological harm assessment)
+* **Dispute resolution** between conflicting expert opinions
+* **Governance policy** setting and enforcement
+* **Final authority** on Tier A "Human-Reviewed" status
+* **Audit system oversight** and quality standard definition
+* **Risk tier policy** adjustments based on societal context
+
+----
+
+== AI-Draft with Audit (Semi-Automated) ==
+
+AKEL drafts these; humans validate via sampling audits:
+
+* **Scenario structures** (definitions, assumptions, context)
+* **Evaluation methods** and reasoning chains
+* **Evidence relevance** assessment and ranking
+* **Reliability scoring** and source evaluation
+* **Verdict reasoning** with uncertainty quantification
+* **Contradiction and reservation** identification
+* **Scenario comparison** explanations
+* **Public summaries** and accessibility text
+
+Most Tier B and C content remains in AI-draft status unless:
+
+* Users request human review
+* Audits identify errors
+* High engagement triggers review
+* Community flags issues
+
+----
+
+== Fully Automated Structural Tasks ==
+
 These require no human interpretation:
 
+* **Claim normalization** (canonical form generation)
+* **Duplicate detection** (vector embeddings, clustering)
+* **Evidence metadata extraction** (dates, authors, publication info)
+* **Basic reliability heuristics** (source reputation scoring)
+* **Contradiction detection** (conflicting statements across sources)
+* **Re-evaluation triggers** (new evidence, source updates)
+* **Layout generation** (diagrams, summaries, UI presentation)
+* **Federation integrity checks** (cross-node data validation)
+
+----
+
+== Quality Gates (Automated) ==
+
+Before AI-draft publication (Mode 2), content must pass:
+
+1. **Source Quality Gate**
+
+* Primary sources verified
+* Citations complete and accessible
+* Source reliability scored
+
+2. **Contradiction Search Gate** (MANDATORY)
+
+* Counter-evidence actively sought
+* Reservations and limitations identified
+* Bubble detection (echo chambers, conspiracy theories)
+* Diverse perspective verification
+
+3. **Uncertainty Quantification Gate**
+
+* Confidence scores calculated
+* Limitations stated
+* Data gaps disclosed
+
+4. **Structural Integrity Gate**
+
+* No hallucinations detected
+* Logic chain valid
+* References verifiable
+
+See [[AKEL page>>FactHarbor.Specification V0\.9\.18.AI Knowledge Extraction Layer (AKEL).WebHome]] for detailed quality gate specifications.
+
+----
+
+== Audit System ==
+
+Instead of reviewing all AI output, systematic sampling audits ensure quality:
+
+=== Stratified Sampling ===
+
+* Risk tier (A > B > C sampling rates)
+* Confidence scores (low confidence → more audits)
+* Traffic/engagement (popular content audited more)
+* Novelty (new topics/claim types prioritized)
+* User flags and disagreement signals
+
+=== Continuous Improvement Loop ===
+
+Audit findings improve:
+
+* Query templates
+* Source reliability weights
+* Contradiction detection algorithms
+* Risk tier assignment rules
+* Bubble detection heuristics
+
+=== Transparency ===
+
+* Audit statistics published
+* Accuracy rates by tier reported
+* System improvements documented
+
+----
+
+== Automation Roadmap ==
+
+Automation capabilities increase with system maturity while maintaining quality oversight.
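As a rough illustration of how the four quality gates and the three risk tiers in this revision might combine into a publication mode, consider the sketch below. The names `GateResults`, `Mode`, and `publication_mode` are hypothetical. Two points are assumptions rather than statements from the page: unreviewed Tier A content is kept in Mode 1 (following "High-risk content pending expert review" and "Required for Tier A content publication"), so the Tier A bullet that allows AI publication with disclaimers is not modeled; and human-reviewed content that fails a gate is also treated as draft-only.

```python
from dataclasses import dataclass
from enum import Enum


class Mode(Enum):
    DRAFT_ONLY = 1       # Mode 1: internal review queue only
    AI_GENERATED = 2     # Mode 2: public, labeled as AI-generated
    HUMAN_REVIEWED = 3   # Mode 3: validated by human reviewers/experts


@dataclass
class GateResults:
    """Pass/fail outcome of the four automated quality gates."""
    source_quality: bool
    contradiction_search: bool       # marked MANDATORY on the page
    uncertainty_quantification: bool
    structural_integrity: bool

    def all_passed(self) -> bool:
        return all((
            self.source_quality,
            self.contradiction_search,
            self.uncertainty_quantification,
            self.structural_integrity,
        ))


def publication_mode(tier: str, gates: GateResults, human_reviewed: bool = False) -> Mode:
    """Pick a publication mode from risk tier ('A', 'B' or 'C') and gate results."""
    if not gates.all_passed():
        return Mode.DRAFT_ONLY       # failed quality gates (assumed to apply to all content)
    if human_reviewed:
        return Mode.HUMAN_REVIEWED
    if tier == "A":
        return Mode.DRAFT_ONLY       # assumption: high-risk content waits for expert review
    return Mode.AI_GENERATED         # Tier B or C, all gates passed
```

For example, `publication_mode("B", GateResults(True, True, True, True))` yields `Mode.AI_GENERATED`, while the same gate results for Tier A stay in `Mode.DRAFT_ONLY` until `human_reviewed=True` is passed.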
+
+=== POC (Current Focus) ===
+
+**Automated:**
+
 * Claim normalization
-* Duplicate & cluster detection (vector embeddings)
+* Scenario template generation
 * Evidence metadata extraction
-* Basic reliability heuristics
-* Contradiction detection
-* Re-evaluation triggers
-* Batch layout generation (diagrams, summaries)
-* Federation integrity checks
+* Simple verdict drafts
+* **AI-generated publication** (Mode 2, with quality gates)
+* **Contradiction search**
+* **Risk tier assignment**
 
-== Automation Roadmap ==
+**Human:**
 
-Automation increases with maturity.
+* High-risk content validation (Tier A)
+* Sampling audits across all tiers
+* Quality standard refinement
+* Governance decisions
 
-=== POC (Low Automation) ===
-* **Automated**: Claim normalization, Light scenario templates, Metadata extraction, Internal drafts.
-* **Human**: All scenario definitions, Evidence interpretation, Verdict creation, Governance.
+=== Beta 0 (Enhanced Automation) ===
 
-=== Beta 0 (Medium Automation) ===
-* **Automated**: Detailed scenario drafts, Evidence reliability scoring, Cross-scenario comparisons, Contradiction detection.
-* **Human**: Scenario approval, Final verdict validation.
+**Automated:**
 
+* Detailed scenario generation
+* Advanced evidence reliability scoring
+* Cross-scenario comparisons
+* Multi-source contradiction detection
+* Internal Truth Landscape generation
+* **Increased AI-draft coverage** (more Tier B content)
+
+**Human:**
+
+* Tier A final approval
+* Audit sampling (continued)
+* Expert validation of complex domains
+* Quality improvement oversight
+
 === Release 1.0 (High Automation) ===
-* **Automated**: Full scenario generation, Evidence relevance ranking, Bayesian verdict scoring, Anomaly detection, Federation sync.
-* **Human**: Final approval, Ethical decisions, Oversight.
 
-== Automation Levels ==
+**Automated:**
 
-* **Level 0 — Human-Centric (POC)**: AI is purely advisory, nothing auto-published.
-* **Level 1 — Assisted (Beta 0)**: AI drafts structures; humans approve each part.
-* **Level 2 — Structured (Release 1.0)**: AI produces near-complete drafts; humans refine.
-* **Level 3 — Distributed Intelligence (Future)**: Nodes exchange embeddings and alerts; humans still approve.
+* Full scenario generation (comprehensive)
+* Bayesian verdict scoring across scenarios
+* Multi-scenario summary generation
+* Anomaly detection across federated nodes
+* AKEL-assisted cross-node synchronization
+* **Most Tier B and all Tier C** auto-published
 
-== Automation Matrix ==
+**Human:**
 
-* **Always Human**: Final verdict, Scenario validity, Ethics, Disputes.
-* **Mostly AI**: Normalization, Clustering, Metadata, Heuristics, Alerts.
-* **Mixed**: Definitions, Boundaries, Assumptions, Reasoning.
+* Tier A oversight (still required)
+* Strategic audits (lower sampling rates, higher value)
+* Ethical decisions and policy
+* Conflict resolution
 
-== Diagram References ==
+----
 
-{{include reference="FactHarbor.Specification.Diagrams.AutomationRoadmap.WebHome"/}}
+== Automation Levels Diagram ==
 
-{{include reference="FactHarbor.Specification.Diagrams.Automation Level.WebHome"/}}
+{{include reference="FactHarbor.Archive.FactHarbor V0\.9\.23 Lost Data.Specification.Diagrams.Automation Level.WebHome"/}}
 
-{{include reference="FactHarbor.Specification.Diagrams.Manual vs Automated matrix.WebHome"/}}
+----
+
+== Automation Roadmap Diagram ==
+
+{{include reference="FactHarbor.Archive.FactHarbor V0\.9\.23 Lost Data.Specification.Diagrams.Automation Roadmap.WebHome"/}}
+
+----
+
+== Manual vs Automated Matrix ==
+
+{{include reference="FactHarbor.Archive.FactHarbor V0\.9\.23 Lost Data.Specification.Diagrams.Manual vs Automated matrix.WebHome"/}}
+
+----
+
+== Related Pages ==
+
+* [[AKEL (AI Knowledge Extraction Layer)>>FactHarbor.Specification V0\.9\.18.AI Knowledge Extraction Layer (AKEL).WebHome]]
+* [[Requirements (Roles)>>FactHarbor.Specification.Requirements.WebHome]]
+* [[Workflows>>FactHarbor.Specification.Workflows.WebHome]]
+* [[Governance>>FactHarbor.Organisation.Governance]]
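The stratified sampling introduced in the Audit System section of this revision could look something like the sketch below. The rate ranges are the recommended audit rates per tier from the page; everything else is an assumption made for illustration: the linear interpolation on confidence, the `audit_rate` and `select_for_audit` names, and the omission of the traffic, novelty, and user-flag strata.

```python
import random

# Recommended audit-rate ranges per risk tier, from the revised page (as fractions).
AUDIT_RATE_RANGES = {"A": (0.30, 0.50), "B": (0.10, 0.20), "C": (0.05, 0.10)}


def audit_rate(tier: str, confidence: float) -> float:
    """Audit probability for one item.

    Assumption: interpolate linearly inside the tier's recommended range,
    so that low confidence gives the upper bound ("low confidence -> more audits").
    """
    low, high = AUDIT_RATE_RANGES[tier]
    confidence = min(max(confidence, 0.0), 1.0)  # clamp to [0, 1]
    return high - (high - low) * confidence


def select_for_audit(items, rng):
    """Return the ids chosen for audit.

    items: iterable of (item_id, tier, confidence) tuples.
    rng:   any object with a random() method, e.g. random.Random(seed).
    """
    return [
        item_id
        for item_id, tier, confidence in items
        if rng.random() < audit_rate(tier, confidence)
    ]
```

Passing a seeded `random.Random` instance makes an audit draw reproducible, which may help when audit statistics are published and need to be re-derived later.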