Last modified by Robert Schaub on 2025/12/24 20:33

From version 4.1
edited by Robert Schaub
on 2025/12/12 15:41
Change comment: Imported from XAR
To version 6.4
edited by Robert Schaub
on 2025/12/16 20:28
Change comment: Renamed back-links.

Summary

Details

Page properties
Parent
... ... @@ -1,1 +1,1 @@
1 -FactHarbor.Specification.WebHome
1 +FactHarbor.Archive.FactHarbor V0\.9\.18.Specification.WebHome
Content
... ... @@ -1,9 +1,9 @@
1 1  = AKEL — AI Knowledge Extraction Layer =
2 2  
3 -AKEL is FactHarbors automated intelligence subsystem.
3 +AKEL is FactHarbor's automated intelligence subsystem.
4 4  Its purpose is to reduce human workload, enhance consistency, and enable scalable knowledge processing — **without ever replacing human judgment**.
5 5  
6 -All AKEL outputs are marked with **AuthorType = AI** and require human approval before publication.
6 +AKEL outputs are marked with **AuthorType = AI** and published according to risk-based review policies (see Publication Modes below).
7 7  
8 8  AKEL operates in two modes:
9 9  
... ... @@ -10,21 +10,23 @@
10 10  * **Single-node mode** (POC & Beta 0)
11 11  * **Federated multi-node mode** (Release 1.0+)
12 12  
13 -Human reviewers, experts, and moderators always retain final authority.
13 +Human reviewers, experts, and moderators always retain final authority over content marked as "Human-Reviewed."
14 14  
15 15  ----
16 16  
17 17  == Purpose and Role ==
18 18  
19 -AKEL transforms unstructured inputs into structured, review-ready drafts.
19 +AKEL transforms unstructured inputs into structured, publication-ready content.
20 20  
21 21  Core responsibilities:
22 22  
23 23  * Claim extraction from arbitrary text
24 -* Claim classification (domain, type, evaluability, safety)
24 +* Claim classification (domain, type, evaluability, safety, **risk tier**)
25 25  * Scenario generation (definitions, boundaries, assumptions, methodology)
26 26  * Evidence summarization and metadata extraction
27 -* Contradiction detection
27 +* **Contradiction detection and counter-evidence search**
28 +* **Reservation and limitation identification**
29 +* **Bubble detection** (echo chambers, conspiracy theories, isolated sources)
28 28  * Re-evaluation proposal generation
29 29  * Cross-node embedding exchange (Release 1.0+)
30 30  
... ... @@ -34,10 +34,12 @@
34 34  
35 35  * **AKEL Orchestrator** – central coordinator
36 36  * **Claim Extractor**
37 -* **Claim Classifier**
39 +* **Claim Classifier** (with risk tier assignment)
38 38  * **Scenario Generator**
39 39  * **Evidence Summarizer**
40 -* **Contradiction Detector**
42 +* **Contradiction Detector** (enhanced with counter-evidence search)
43 +* **Quality Gate Validator**
44 +* **Audit Sampling Scheduler**
41 41  * **Embedding Handler** (Release 1.0+)
42 42  * **Federation Sync Adapter** (Release 1.0+)
43 43  
... ... @@ -46,6 +46,7 @@
46 46  == Inputs and Outputs ==
47 47  
48 48  === Inputs ===
53 +
49 49  * User-submitted claims or evidence
50 50  * Uploaded documents
51 51  * URLs or citations
... ... @@ -52,20 +52,241 @@
52 52  * External LLM API (optional)
53 53  * Embeddings (from local or federated peers)
54 54  
55 -=== Outputs (all require human approval) ===
56 -* ClaimVersion (draft)
57 -* ScenarioVersion (draft)
58 -* EvidenceVersion (summary + metadata draft)
59 -* VerdictVersion (draft; internal only)
60 +=== Outputs (publication mode varies by risk tier) ===
61 +
62 +* ClaimVersion (draft or AI-generated)
63 +* ScenarioVersion (draft or AI-generated)
64 +* EvidenceVersion (summary + metadata, draft or AI-generated)
65 +* VerdictVersion (draft, AI-generated, or human-reviewed)
60 60  * Contradiction alerts
67 +* Reservation and limitation notices
61 61  * Re-evaluation proposals
62 62  * Updated embeddings
63 63  
64 64  ----
65 65  
73 +== Publication Modes ==
74 +
75 +AKEL content is published according to three modes:
76 +
77 +=== Mode 1: Draft-Only (Never Public) ===
78 +
79 +**Used for:**
80 +
81 +* Failed quality gate checks
82 +* Sensitive topics flagged for expert review
83 +* Unclear scope or missing critical sources
84 +* High reputational risk content
85 +
86 +**Visibility:** Internal review queue only
87 +
88 +=== Mode 2: Published as AI-Generated (No Prior Human Review) ===
89 +
90 +**Requirements:**
91 +
92 +* All automated quality gates passed (see below)
93 +* Risk tier permits AI-draft publication (Tier B or C)
94 +* Contradiction search completed successfully
95 +* Clear labeling as "AI-Generated, Awaiting Human Review"
96 +
97 +**Label shown to users:**
98 +```
99 +[AI-Generated] This content was produced by AI and has not yet been human-reviewed.
100 +Source: AI | Review Status: Pending | Risk Tier: [B/C]
101 +Contradiction Search: Completed | Last Updated: [timestamp]
102 +```
103 +
104 +**User actions:**
105 +
106 +* Browse and read content
107 +* Request human review (escalates to review queue)
108 +* Flag for expert attention
109 +
110 +=== Mode 3: Published as Human-Reviewed ===
111 +
112 +**Requirements:**
113 +
114 +* Human reviewer or domain expert validated
115 +* All quality gates passed
116 +* Visible "Human-Reviewed" mark with reviewer role and timestamp
117 +
118 +**Label shown to users:**
119 +```
120 +[Human-Reviewed] This content has been validated by human reviewers.
121 +Source: AI+Human | Review Status: Approved | Reviewed by: [Role] on [timestamp]
122 +Risk Tier: [A/B/C] | Contradiction Search: Completed
123 +```
124 +
125 +----
126 +
127 +== Risk Tiers ==
128 +
129 +AKEL assigns risk tiers to all content to determine appropriate review requirements:
130 +
131 +=== Tier A — High Risk / High Impact ===
132 +
133 +**Domains:** Medical, legal, elections, safety/security, major reputational harm
134 +
135 +**Publication policy:**
136 +
137 +* Human review REQUIRED before "Human-Reviewed" status
138 +* AI-generated content MAY be published but:
139 +** Clearly flagged as AI-draft with prominent disclaimer
140 +** May have limited visibility
141 +** Auto-escalated to expert review queue
142 +** User warnings displayed
143 +
144 +**Audit rate:** Recommendation: 30-50% of published AI-drafts sampled in first 6 months
145 +
146 +=== Tier B — Medium Risk ===
147 +
148 +**Domains:** Contested public policy, complex science, causality claims, significant financial impact
149 +
150 +**Publication policy:**
151 +
152 +* AI-draft CAN publish immediately with clear labeling
153 +* Sampling audits conducted (see Audit System below)
154 +* High-engagement items auto-escalated to expert review
155 +* Users can request human review
156 +
157 +**Audit rate:** Recommendation: 10-20% of published AI-drafts sampled
158 +
159 +=== Tier C — Low Risk ===
160 +
161 +**Domains:** Definitions, simple factual lookups with strong primary sources, historical facts, established scientific consensus
162 +
163 +**Publication policy:**
164 +
165 +* AI-draft default publication mode
166 +* Sampling audits sufficient
167 +* Community flagging available
168 +* Human review on request
169 +
170 +**Audit rate:** Recommendation: 5-10% of published AI-drafts sampled
171 +
172 +----
173 +
174 +== Quality Gates (Mandatory Before AI-Draft Publication) ==
175 +
176 +All AI-generated content must pass these automated checks before Mode 2 publication:
177 +
178 +=== Gate 1: Source Quality ===
179 +
180 +* Primary sources identified and accessible
181 +* Source reliability scored against whitelist
182 +* Citation completeness verified
183 +* Publication dates checked
184 +* Author credentials validated (where applicable)
185 +
186 +=== Gate 2: Contradiction Search (MANDATORY) ===
187 +
188 +**The system MUST actively search for:**
189 +
190 +* **Counter-evidence** – Rebuttals, conflicting results, contradictory studies
191 +* **Reservations** – Caveats, limitations, boundary conditions, applicability constraints
192 +* **Alternative interpretations** – Different framings, definitions, contextual variations
193 +* **Bubble detection** – Conspiracy theories, echo chambers, ideologically isolated sources
194 +
195 +**Search coverage requirements:**
196 +
197 +* Academic literature (BOTH supporting AND opposing views)
198 +* Reputable media across diverse political/ideological perspectives
199 +* Official contradictions (retractions, corrections, updates, amendments)
200 +* Domain-specific skeptics, critics, and alternative expert opinions
201 +* Cross-cultural and international perspectives
202 +
203 +**Search must actively avoid algorithmic bubbles:**
204 +
205 +* Deliberately seek opposing viewpoints
206 +* Check for echo chamber patterns in source clusters
207 +* Identify tribal or ideological source clustering
208 +* Flag when search space appears artificially constrained
209 +* Verify diversity of perspectives represented
210 +
211 +**Outcomes:**
212 +
213 +* **Strong counter-evidence found** → Auto-escalate to Tier B or draft-only mode
214 +* **Significant uncertainty detected** → Require uncertainty disclosure in verdict
215 +* **Bubble indicators present** → Flag for expert review and human validation
216 +* **Limited perspective diversity** → Expand search or flag for human review
217 +
218 +=== Gate 3: Uncertainty Quantification ===
219 +
220 +* Confidence scores calculated for all claims and verdicts
221 +* Limitations explicitly stated
222 +* Data gaps identified and disclosed
223 +* Strength of evidence assessed
224 +* Alternative scenarios considered
225 +
226 +=== Gate 4: Structural Integrity ===
227 +
228 +* No hallucinations detected (fact-checking against sources)
229 +* Logic chain valid and traceable
230 +* References accessible and verifiable
231 +* No circular reasoning
232 +* Premises clearly stated
233 +
234 +**If any gate fails:**
235 +
236 +* Content remains in draft-only mode
237 +* Failure reason logged
238 +* Human review required before publication
239 +* Failure patterns analyzed for system improvement
240 +
241 +----
242 +
243 +== Audit System (Sampling-Based Quality Assurance) ==
244 +
245 +Instead of reviewing ALL AI output, FactHarbor implements stratified sampling audits:
246 +
247 +=== Sampling Strategy ===
248 +
249 +Audits prioritize:
250 +
251 +* **Risk tier** (higher tiers get more frequent audits)
252 +* **AI confidence score** (low confidence → higher sampling rate)
253 +* **Traffic and engagement** (high-visibility content audited more)
254 +* **Novelty** (new claim types, new domains, emerging topics)
255 +* **Disagreement signals** (user flags, contradiction alerts, community reports)
256 +
257 +=== Audit Process ===
258 +
259 +1. System selects content for audit based on sampling strategy
260 +2. Human auditor reviews AI-generated content against quality standards
261 +3. Auditor validates or corrects:
262 +
263 +* Claim extraction accuracy
264 +* Scenario appropriateness
265 +* Evidence relevance and interpretation
266 +* Verdict reasoning
267 +* Contradiction search completeness
268 +4. Audit outcome recorded (pass/fail + detailed feedback)
269 +5. Failed audits trigger immediate content review
270 +6. Audit results feed back into system improvement
271 +
272 +=== Feedback Loop (Continuous Improvement) ===
273 +
274 +Audit outcomes systematically improve:
275 +
276 +* **Query templates** – Refined based on missed evidence patterns
277 +* **Retrieval source weights** – Adjusted for accuracy and reliability
278 +* **Contradiction detection heuristics** – Enhanced to catch missed counter-evidence
279 +* **Model prompts and extraction rules** – Tuned for better claim extraction
280 +* **Risk tier assignments** – Recalibrated based on error patterns
281 +* **Bubble detection algorithms** – Improved to identify echo chambers
282 +
283 +=== Audit Transparency ===
284 +
285 +* Audit statistics published regularly
286 +* Accuracy rates by risk tier tracked and reported
287 +* System improvements documented
288 +* Community can view aggregate audit performance
289 +
290 +----
291 +
66 66  == Architecture Overview ==
67 67  
68 -{{include reference="FactHarbor.Specification.Diagrams.AKEL Architecture.WebHome"/}}
294 +{{include reference="FactHarbor.Archive.FactHarbor V0\.9\.23 Lost Data.Specification.Diagrams.AKEL Architecture.WebHome"/}}
69 69  
70 70  ----
71 71  
... ... @@ -77,6 +77,7 @@
77 77  * Exchanges canonicalized claim forms
78 78  * Exchanges scenario templates
79 79  * Sends + receives contradiction alerts
306 +* Shares audit findings (with privacy controls)
80 80  * Never shares model weights
81 81  * Never overrides local governance
82 82  
... ... @@ -88,14 +88,38 @@
88 88  
89 89  ----
90 90  
91 -== Human Approval Workflow ==
318 +== Human Review Workflow (Mode 3 Publication) ==
92 92  
93 -1. AKEL generates draft outputs (AuthorType = AI)
94 -2. Reviewers inspect and approve/moderate the drafts
95 -3. Experts validate high-risk or domain-specific outputs
96 -4. Moderators finalize publication
97 -5. Version numbers increment, history preserved
320 +For content requiring human validation before "Human-Reviewed" status:
98 98  
99 -No AKEL output is ever published automatically.
322 +1. AKEL generates content and publishes as AI-draft (Mode 2) or keeps as draft (Mode 1)
323 +2. Reviewers inspect content in review queue
324 +3. Reviewers validate quality gates were correctly applied
325 +4. Experts validate high-risk (Tier A) or domain-specific outputs
326 +5. Moderators finalize "Human-Reviewed" publication
327 +6. Version numbers increment, full history preserved
100 100  
329 +**Note:** Most AI-generated content (Tier B and C) can remain in Mode 2 (AI-Generated) indefinitely. Human review is optional for these tiers unless users or audits flag issues.
330 +
101 101  ----
332 +
333 +== POC v1 Behavior ==
334 +
335 +The POC explicitly demonstrates AI-generated content publication:
336 +
337 +* Produces public AI-generated output (Mode 2)
338 +* No human data sources required
339 +* No human approval gate
340 +* Clear "AI-Generated - POC/Demo" labeling
341 +* All quality gates active (including contradiction search)
342 +* Users understand this demonstrates AI reasoning capabilities
343 +* Risk tier classification shown (demo purposes)
344 +
345 +----
346 +
347 +== Related Pages ==
348 +
349 +* [[Automation>>FactHarbor.Specification.Automation.WebHome]]
350 +* [[Requirements (Roles)>>FactHarbor.Specification.Requirements.WebHome]]
351 +* [[Workflows>>FactHarbor.Specification.Workflows.WebHome]]
352 +* [[Governance>>FactHarbor.Organisation.Governance]]