Changes for page AI Knowledge Extraction Layer (AKEL)
Last modified by Robert Schaub on 2025/12/24 20:33
Summary
Page properties (1 modified, 0 added, 0 removed)
Details
- Page properties
- Content
... ... @@ -1,9 +1,9 @@
1 1 = AKEL — AI Knowledge Extraction Layer =
2 2
3 -AKEL is FactHarbor’s automated intelligence subsystem.
3 +AKEL is FactHarbor's automated intelligence subsystem.
4 4 Its purpose is to reduce human workload, enhance consistency, and enable scalable knowledge processing — **without ever replacing human judgment**.
5 5
6 -All AKEL outputs are marked with **AuthorType = AI** and require human approval before publication.
6 +AKEL outputs are marked with **AuthorType = AI** and published according to risk-based review policies (see Publication Modes below).
7 7
8 8 AKEL operates in two modes:
9 9
... ... @@ -10,78 +10,325 @@
10 10 * **Single-node mode** (POC & Beta 0)
11 11 * **Federated multi-node mode** (Release 1.0+)
12 12
13 -Human reviewers, experts, and moderators always retain final authority.
13 +Human reviewers, experts, and moderators always retain final authority over content marked as "Human-Reviewed."
14 14
15 +----
16 +
15 15 == Purpose and Role ==
16 16
17 -AKEL transforms unstructured inputs into structured, review-ready drafts.
19 +AKEL transforms unstructured inputs into structured, publication-ready content.
18 18
19 19 Core responsibilities:
20 20
21 21 * Claim extraction from arbitrary text
22 -* Claim classification (domain, type, evaluability, safety)
24 +* Claim classification (domain, type, evaluability, safety, **risk tier**)
23 23 * Scenario generation (definitions, boundaries, assumptions, methodology)
24 24 * Evidence summarization and metadata extraction
25 -* Contradiction detection
27 +* **Contradiction detection and counter-evidence search**
28 +* **Reservation and limitation identification**
29 +* **Bubble detection** (echo chambers, conspiracy theories, isolated sources)
26 26 * Re-evaluation proposal generation
27 27 * Cross-node embedding exchange (Release 1.0+)
28 28
33 +----
34 +
29 29 == Components ==
30 30
31 31 * **AKEL Orchestrator** – central coordinator
32 32 * **Claim Extractor**
33 -* **Claim Classifier**
39 +* **Claim Classifier** (with risk tier assignment)
34 34 * **Scenario Generator**
35 35 * **Evidence Summarizer**
36 -* **Contradiction Detector**
42 +* **Contradiction Detector** (enhanced with counter-evidence search)
43 +* **Quality Gate Validator**
44 +* **Audit Sampling Scheduler**
37 37 * **Embedding Handler** (Release 1.0+)
38 38 * **Federation Sync Adapter** (Release 1.0+)
39 39
48 +----
49 +
40 40 == Inputs and Outputs ==
41 41
42 42 === Inputs ===
43 -* User-submitted claims or evidence
44 -* Uploaded documents
45 -* URLs or citations
46 -* External LLM API (optional)
53 +* User-submitted claims or evidence
54 +* Uploaded documents
55 +* URLs or citations
56 +* External LLM API (optional)
47 47 * Embeddings (from local or federated peers)
48 48
49 -=== Outputs (all require human approval) ===
50 -* ClaimVersion (draft)
51 -* ScenarioVersion (draft)
52 -* EvidenceVersion (summary + metadata draft)
53 -* VerdictVersion (draft; internal only)
54 -* Contradiction alerts
55 -* Re-evaluation proposals
59 +=== Outputs (publication mode varies by risk tier) ===
60 +* ClaimVersion (draft or AI-generated)
61 +* ScenarioVersion (draft or AI-generated)
62 +* EvidenceVersion (summary + metadata, draft or AI-generated)
63 +* VerdictVersion (draft, AI-generated, or human-reviewed)
64 +* Contradiction alerts
65 +* Reservation and limitation notices
66 +* Re-evaluation proposals
56 56 * Updated embeddings
57 57
69 +----
70 +
71 +== Publication Modes ==
72 +
73 +AKEL content is published according to three modes:
74 +
75 +=== Mode 1: Draft-Only (Never Public) ===
76 +
77 +**Used for:**
78 +* Failed quality gate checks
79 +* Sensitive topics flagged for expert review
80 +* Unclear scope or missing critical sources
81 +* High reputational risk content
82 +
83 +**Visibility:** Internal review queue only
84 +
85 +=== Mode 2: Published as AI-Generated (No Prior Human Review) ===
86 +
87 +**Requirements:**
88 +* All automated quality gates passed (see below)
89 +* Risk tier permits AI-draft publication (Tier B or C)
90 +* Contradiction search completed successfully
91 +* Clear labeling as "AI-Generated, Awaiting Human Review"
92 +
93 +**Label shown to users:**
94 +```
95 +[AI-Generated] This content was produced by AI and has not yet been human-reviewed.
96 +Source: AI | Review Status: Pending | Risk Tier: [B/C]
97 +Contradiction Search: Completed | Last Updated: [timestamp]
98 +```
99 +
100 +**User actions:**
101 +* Browse and read content
102 +* Request human review (escalates to review queue)
103 +* Flag for expert attention
104 +
105 +=== Mode 3: Published as Human-Reviewed ===
106 +
107 +**Requirements:**
108 +* Human reviewer or domain expert validated
109 +* All quality gates passed
110 +* Visible "Human-Reviewed" mark with reviewer role and timestamp
111 +
112 +**Label shown to users:**
113 +```
114 +[Human-Reviewed] This content has been validated by human reviewers.
115 +Source: AI+Human | Review Status: Approved | Reviewed by: [Role] on [timestamp]
116 +Risk Tier: [A/B/C] | Contradiction Search: Completed
117 +```
118 +
119 +----
120 +
121 +== Risk Tiers ==
122 +
123 +AKEL assigns risk tiers to all content to determine appropriate review requirements:
124 +
125 +=== Tier A — High Risk / High Impact ===
126 +
127 +**Domains:** Medical, legal, elections, safety/security, major reputational harm
128 +
129 +**Publication policy:**
130 +* Human review REQUIRED before "Human-Reviewed" status
131 +* AI-generated content MAY be published but:
132 + ** Clearly flagged as AI-draft with prominent disclaimer
133 + ** May have limited visibility
134 + ** Auto-escalated to expert review queue
135 + ** User warnings displayed
136 +
137 +**Audit rate:** Recommendation: 30-50% of published AI-drafts sampled in first 6 months
138 +
139 +=== Tier B — Medium Risk ===
140 +
141 +**Domains:** Contested public policy, complex science, causality claims, significant financial impact
142 +
143 +**Publication policy:**
144 +* AI-draft CAN publish immediately with clear labeling
145 +* Sampling audits conducted (see Audit System below)
146 +* High-engagement items auto-escalated to expert review
147 +* Users can request human review
148 +
149 +**Audit rate:** Recommendation: 10-20% of published AI-drafts sampled
150 +
151 +=== Tier C — Low Risk ===
152 +
153 +**Domains:** Definitions, simple factual lookups with strong primary sources, historical facts, established scientific consensus
154 +
155 +**Publication policy:**
156 +* AI-draft default publication mode
157 +* Sampling audits sufficient
158 +* Community flagging available
159 +* Human review on request
160 +
161 +**Audit rate:** Recommendation: 5-10% of published AI-drafts sampled
162 +
163 +----
164 +
165 +== Quality Gates (Mandatory Before AI-Draft Publication) ==
166 +
167 +All AI-generated content must pass these automated checks before Mode 2 publication:
168 +
169 +=== Gate 1: Source Quality ===
170 +* Primary sources identified and accessible
171 +* Source reliability scored against whitelist
172 +* Citation completeness verified
173 +* Publication dates checked
174 +* Author credentials validated (where applicable)
175 +
176 +=== Gate 2: Contradiction Search (MANDATORY) ===
177 +
178 +**The system MUST actively search for:**
179 +
180 +* **Counter-evidence** – Rebuttals, conflicting results, contradictory studies
181 +* **Reservations** – Caveats, limitations, boundary conditions, applicability constraints
182 +* **Alternative interpretations** – Different framings, definitions, contextual variations
183 +* **Bubble detection** – Conspiracy theories, echo chambers, ideologically isolated sources
184 +
185 +**Search coverage requirements:**
186 +* Academic literature (BOTH supporting AND opposing views)
187 +* Reputable media across diverse political/ideological perspectives
188 +* Official contradictions (retractions, corrections, updates, amendments)
189 +* Domain-specific skeptics, critics, and alternative expert opinions
190 +* Cross-cultural and international perspectives
191 +
192 +**Search must actively avoid algorithmic bubbles:**
193 +* Deliberately seek opposing viewpoints
194 +* Check for echo chamber patterns in source clusters
195 +* Identify tribal or ideological source clustering
196 +* Flag when search space appears artificially constrained
197 +* Verify diversity of perspectives represented
198 +
199 +**Outcomes:**
200 +* **Strong counter-evidence found** → Auto-escalate to Tier B or draft-only mode
201 +* **Significant uncertainty detected** → Require uncertainty disclosure in verdict
202 +* **Bubble indicators present** → Flag for expert review and human validation
203 +* **Limited perspective diversity** → Expand search or flag for human review
204 +
205 +=== Gate 3: Uncertainty Quantification ===
206 +* Confidence scores calculated for all claims and verdicts
207 +* Limitations explicitly stated
208 +* Data gaps identified and disclosed
209 +* Strength of evidence assessed
210 +* Alternative scenarios considered
211 +
212 +=== Gate 4: Structural Integrity ===
213 +* No hallucinations detected (fact-checking against sources)
214 +* Logic chain valid and traceable
215 +* References accessible and verifiable
216 +* No circular reasoning
217 +* Premises clearly stated
218 +
219 +**If any gate fails:**
220 +* Content remains in draft-only mode
221 +* Failure reason logged
222 +* Human review required before publication
223 +* Failure patterns analyzed for system improvement
224 +
225 +----
226 +
227 +== Audit System (Sampling-Based Quality Assurance) ==
228 +
229 +Instead of reviewing ALL AI output, FactHarbor implements stratified sampling audits:
230 +
231 +=== Sampling Strategy ===
232 +
233 +Audits prioritize:
234 +* **Risk tier** (higher tiers get more frequent audits)
235 +* **AI confidence score** (low confidence → higher sampling rate)
236 +* **Traffic and engagement** (high-visibility content audited more)
237 +* **Novelty** (new claim types, new domains, emerging topics)
238 +* **Disagreement signals** (user flags, contradiction alerts, community reports)
239 +
240 +=== Audit Process ===
241 +
242 +1. System selects content for audit based on sampling strategy
243 +2. Human auditor reviews AI-generated content against quality standards
244 +3. Auditor validates or corrects:
245 + * Claim extraction accuracy
246 + * Scenario appropriateness
247 + * Evidence relevance and interpretation
248 + * Verdict reasoning
249 + * Contradiction search completeness
250 +4. Audit outcome recorded (pass/fail + detailed feedback)
251 +5. Failed audits trigger immediate content review
252 +6. Audit results feed back into system improvement
253 +
254 +=== Feedback Loop (Continuous Improvement) ===
255 +
256 +Audit outcomes systematically improve:
257 +* **Query templates** – Refined based on missed evidence patterns
258 +* **Retrieval source weights** – Adjusted for accuracy and reliability
259 +* **Contradiction detection heuristics** – Enhanced to catch missed counter-evidence
260 +* **Model prompts and extraction rules** – Tuned for better claim extraction
261 +* **Risk tier assignments** – Recalibrated based on error patterns
262 +* **Bubble detection algorithms** – Improved to identify echo chambers
263 +
264 +=== Audit Transparency ===
265 +
266 +* Audit statistics published regularly
267 +* Accuracy rates by risk tier tracked and reported
268 +* System improvements documented
269 +* Community can view aggregate audit performance
270 +
271 +----
272 +
58 58 == Architecture Overview ==
59 59
60 60 {{include reference="FactHarbor.Specification.Diagrams.AKEL Architecture.WebHome"/}}
61 61
277 +----
278 +
62 62 == AKEL and Federation ==
63 63
64 64 In Release 1.0+, AKEL participates in cross-node knowledge alignment:
65 65
66 -* Shares embeddings
67 -* Exchanges canonicalized claim forms
68 -* Exchanges scenario templates
69 -* Sends + receives contradiction alerts
70 -* Never shares model weights
283 +* Shares embeddings
284 +* Exchanges canonicalized claim forms
285 +* Exchanges scenario templates
286 +* Sends + receives contradiction alerts
287 +* Shares audit findings (with privacy controls)
288 +* Never shares model weights
71 71 * Never overrides local governance
72 72
73 73 Nodes may choose trust levels for AKEL-related data:
74 74
75 -* Trusted nodes: auto-merge embeddings + templates
76 -* Neutral nodes: require reviewer approval
293 +* Trusted nodes: auto-merge embeddings + templates
294 +* Neutral nodes: require reviewer approval
77 77 * Untrusted nodes: fully manual import
78 78
79 -== Human Approval Workflow ==
297 +----
80 80
81 -1. AKEL generates draft outputs (AuthorType = AI)
82 -2. Reviewers inspect and approve/moderate the drafts
83 -3. Experts validate high-risk or domain-specific outputs
84 -4. Moderators finalize publication
85 -5. Version numbers increment, history preserved
299 +== Human Review Workflow (Mode 3 Publication) ==
86 86
87 -No AKEL output is ever published automatically.
301 +For content requiring human validation before "Human-Reviewed" status:
302 +
303 +1. AKEL generates content and publishes as AI-draft (Mode 2) or keeps as draft (Mode 1)
304 +2. Reviewers inspect content in review queue
305 +3. Reviewers validate quality gates were correctly applied
306 +4. Experts validate high-risk (Tier A) or domain-specific outputs
307 +5. Moderators finalize "Human-Reviewed" publication
308 +6. Version numbers increment, full history preserved
309 +
310 +**Note:** Most AI-generated content (Tier B and C) can remain in Mode 2 (AI-Generated) indefinitely. Human review is optional for these tiers unless users or audits flag issues.
311 +
312 +----
313 +
314 +== POC v1 Behavior ==
315 +
316 +The POC explicitly demonstrates AI-generated content publication:
317 +
318 +* Produces public AI-generated output (Mode 2)
319 +* No human data sources required
320 +* No human approval gate
321 +* Clear "AI-Generated - POC/Demo" labeling
322 +* All quality gates active (including contradiction search)
323 +* Users understand this demonstrates AI reasoning capabilities
324 +* Risk tier classification shown (demo purposes)
325 +
326 +----
327 +
328 +== Related Pages ==
329 +
330 +* [[Automation>>FactHarbor.Specification.Automation.WebHome]]
331 +* [[Requirements (Roles)>>FactHarbor.Specification.Requirements.WebHome]]
332 +* [[Workflows>>FactHarbor.Specification.Workflows.WebHome]]
333 +* [[Governance>>FactHarbor.Organisation.Governance]]
334 +
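
The policy introduced by this revision (risk tier plus four quality gates deciding between three publication modes, with tier-dependent audit sampling) might be sketched roughly as follows. This is a hypothetical Python illustration, not part of AKEL: the names `RiskTier`, `Mode`, `GateResults`, `publication_mode`, and `audit_rate` are assumptions. Two behaviors are also assumed rather than stated by the page: Tier A content is held as draft until a human approves it (the Mode 2 requirements list only Tier B or C, although the Tier A policy also allows restricted AI-draft publication), and low-confidence content is sampled at the upper end of the recommended audit range.

```python
from dataclasses import dataclass
from enum import Enum


class RiskTier(Enum):
    A = "A"  # high risk / high impact
    B = "B"  # medium risk
    C = "C"  # low risk


class Mode(Enum):
    DRAFT_ONLY = 1      # Mode 1: internal review queue only
    AI_GENERATED = 2    # Mode 2: public, labeled as AI-generated
    HUMAN_REVIEWED = 3  # Mode 3: public, validated by a human


@dataclass(frozen=True)
class GateResults:
    """Pass/fail outcome of the four quality gates (hypothetical structure)."""
    source_quality: bool          # Gate 1
    contradiction_search: bool    # Gate 2
    uncertainty_quantified: bool  # Gate 3
    structural_integrity: bool    # Gate 4

    def all_passed(self) -> bool:
        return all((
            self.source_quality,
            self.contradiction_search,
            self.uncertainty_quantified,
            self.structural_integrity,
        ))


# Recommended audit sampling ranges per tier, as (low, high) fractions.
AUDIT_RATE_RANGE = {
    RiskTier.A: (0.30, 0.50),
    RiskTier.B: (0.10, 0.20),
    RiskTier.C: (0.05, 0.10),
}


def publication_mode(tier: RiskTier, gates: GateResults,
                     human_approved: bool = False) -> Mode:
    """Pick a publication mode for one piece of AKEL content."""
    if not gates.all_passed():
        # Any failed gate keeps the content in draft-only mode.
        return Mode.DRAFT_ONLY
    if human_approved:
        return Mode.HUMAN_REVIEWED
    if tier in (RiskTier.B, RiskTier.C):
        return Mode.AI_GENERATED
    # Assumption: Tier A waits for human review instead of restricted publication.
    return Mode.DRAFT_ONLY


def audit_rate(tier: RiskTier, low_confidence: bool = False) -> float:
    """Assumed rule: low AI confidence moves sampling to the top of the range."""
    low, high = AUDIT_RATE_RANGE[tier]
    return high if low_confidence else low
```

For example, `publication_mode(RiskTier.B, GateResults(True, True, True, True))` selects Mode 2, while the same call with any gate set to `False` keeps the content in draft. A real scheduler would also weigh engagement, novelty, and disagreement signals, which this sketch leaves out.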