Changes for page Automation

Last modified by Robert Schaub on 2026/02/08 08:22

From 4.5 to 4.4

From version 4.4

edited by Robert Schaub
on 2026/01/20 20:27

Change comment: Renamed back-links.

To version 1.1

edited by Robert Schaub
on 2025/12/18 12:03

Change comment: Imported from XAR

Raw
Rendered

Summary

Page properties (2 modified, 0 added, 0 removed)

Details

Page properties

Parent

@@ -1,1 +1,1 @@
--Archive.FactHarbor.Specification.WebHome
++FactHarbor.Specification.WebHome

Content

@@ -1,31 +1,21 @@
  = Automation =
--
  **How FactHarbor scales through automated claim evaluation.**
--
  == 1. Automation Philosophy ==
--
  FactHarbor is **automation-first**: AKEL (AI Knowledge Extraction Layer) makes all content decisions. Humans monitor system performance and improve algorithms.
  **Why automation:**
--
  * **Scale**: Can process millions of claims
  * **Consistency**: Same evaluation criteria applied uniformly
  * **Transparency**: Algorithms are auditable
  * **Speed**: Results in <20 seconds typically
  See [[Automation Philosophy>>FactHarbor.Organisation.Automation-Philosophy]] for detailed principles.
--
  == 2. Claim Processing Flow ==
--
  === 2.1 User Submits Claim ===
--
  * User provides claim text + source URLs
  * System validates format
  * Assigns processing ID
  * Queues for AKEL processing
--
  === 2.2 AKEL Processing ===
--
  **AKEL automatically:**
--
 . Parses claim into testable components
 . Extracts evidence from sources
 . Scores source credibility
@@ -35,12 +35,9 @@
 . Publishes result
  **Processing time**: Typically <20 seconds
  **No human approval required** - publication is automatic
--
  === 2.3 Publication States ===
--
  **Processing**: AKEL working on claim (not visible to public)
  **Published**: AKEL completed evaluation (public)
--
  * Verdict displayed with confidence score
  * Evidence and sources shown
  * Risk tier indicated
@@ -50,128 +50,35 @@
  * Detected manipulation attempt
  * Unusual pattern
  * Moderator reviews and may take action
--
--== 2.5 LLM-Based Processing Architecture ==
--
--FactHarbor delegates complex reasoning and analysis tasks to Large Language Models (LLMs). The architecture evolves from POC to production:
--
--=== POC: Two-Phase Approach ===
--
--**Phase 1: Claim Extraction**
--
--* Single LLM call to extract all claims from submitted content
--* Light structure, focused on identifying distinct verifiable claims
--* Output: List of claims with context
--
--**Phase 2: Claim Analysis (Parallel)**
--
--* Single LLM call per claim (parallelizable)
--* Full structured output: Evidence, Scenarios, Sources, Verdict, Risk
--* Each claim analyzed independently
--
--**Advantages:**
--
--* Fast to implement ( to working POC)
--* Only 2-3 API calls total (1 + N claims)
--* Simple to debug (claim-level isolation)
--* Proves concept viability
--
--=== Production: Three-Phase Approach ===
--
--**Phase 1: Claim Extraction + Validation**
--
--* Extract distinct verifiable claims
--* Validate claim clarity and uniqueness
--* Remove duplicates and vague claims
--
--**Phase 2: Evidence Gathering (Parallel)**
--
--* For each claim independently:
--* Find supporting and contradicting evidence
--* Identify authoritative sources
--* Generate test scenarios
--* Validation: Check evidence quality and source validity
--* Error containment: Issues in one claim don't affect others
--
--**Phase 3: Verdict Generation (Parallel)**
--
--* For each claim:
--* Generate verdict based on validated evidence
--* Assess confidence and risk level
--* Flag low-confidence results for human review
--* Validation: Check verdict consistency with evidence
--
--**Advantages:**
--
--* Error containment between phases
--* Clear quality gates and validation
--* Observable metrics per phase
--* Scalable (parallel processing across claims)
--* Adaptable (can optimize each phase independently)
--
--=== LLM Task Delegation ===
--
--All complex cognitive tasks are delegated to LLMs:
--
--* **Claim Extraction**: Understanding context, identifying distinct claims
--* **Evidence Finding**: Analyzing sources, assessing relevance
--* **Scenario Generation**: Creating testable hypotheses
--* **Source Evaluation**: Assessing reliability and authority
--* **Verdict Generation**: Synthesizing evidence into conclusions
--* **Risk Assessment**: Evaluating potential impact
--
--=== Error Mitigation ===
--
--Research shows sequential LLM calls face compound error risks. FactHarbor mitigates this through:
--
--* **Validation gates** between phases
--* **Confidence thresholds** for quality control
--* **Parallel processing** to avoid error propagation across claims
--* **Human review queue** for low-confidence verdicts
--* **Independent claim processing** - errors in one claim don't cascade to others
--
  == 3. Risk Tiers ==
--
  Risk tiers classify claims by potential impact and guide audit sampling rates.
--
  === 3.1 Tier A (High Risk) ===
--
  **Domains**: Medical, legal, elections, safety, security
  **Characteristics**:
--
  * High potential for harm if incorrect
  * Complex specialized knowledge required
  * Often subject to regulation
  **Publication**: AKEL publishes automatically with prominent risk warning
  **Audit rate**: Higher sampling recommended
--
  === 3.2 Tier B (Medium Risk) ===
--
  **Domains**: Complex policy, science, causality claims
  **Characteristics**:
--
  * Moderate potential impact
  * Requires careful evidence evaluation
  * Multiple valid interpretations possible
  **Publication**: AKEL publishes automatically with standard risk label
  **Audit rate**: Moderate sampling recommended
--
  === 3.3 Tier C (Low Risk) ===
--
  **Domains**: Definitions, established facts, historical data
  **Characteristics**:
--
  * Low potential for harm
  * Well-documented information
  * Clear right/wrong answers typically
  **Publication**: AKEL publishes by default
  **Audit rate**: Lower sampling recommended
--
  == 4. Quality Gates ==
--
  AKEL applies quality gates before publication. If any fail, claim is **flagged** (not blocked - still published).
  **Quality gates**:
--
  * Sufficient evidence extracted (≥2 sources)
  * Sources meet minimum credibility threshold
  * Confidence score calculable
@@ -178,22 +178,14 @@
  * No detected manipulation patterns
  * Claim parseable into testable form
  **Failed gates**: Claim published with flag for moderator review
--
  == 5. Automation Levels ==
--
--{{include reference="Archive.FactHarbor.Specification.Diagrams.Automation Level.WebHome"/}}
++{{include reference="FactHarbor.Specification.Diagrams.Automation Level.WebHome"/}}
  FactHarbor progresses through automation maturity levels:
  **Release 0.5** (Proof-of-Concept): Tier C only, human review required
  **Release 1.0** (Initial): Tier B/C auto-published, Tier A flagged for review
  **Release 2.0** (Mature): All tiers auto-published with risk labels, sampling audits
--See [[Automation Roadmap>>Archive.FactHarbor.Specification.Diagrams.Automation Roadmap.WebHome]] for detailed progression.
--
--== 5.5 Automation Roadmap ==
--
--{{include reference="Archive.FactHarbor.Specification.Diagrams.Automation Roadmap.WebHome"/}}
--
++See [[Automation Roadmap>>FactHarbor.Specification.Diagrams.Automation Roadmap.WebHome]] for detailed progression.
  == 6. Human Role ==
--
  Humans do NOT review content for approval. Instead:
  **Monitoring**: Watch aggregate performance metrics
  **Improvement**: Fix algorithms when patterns show issues
@@ -200,13 +200,7 @@
  **Exception handling**: Review AKEL-flagged items
  **Governance**: Set policies AKEL applies
  See [[Contributor Processes>>FactHarbor.Organisation.Contributor-Processes]] for how to improve the system.
--
--== 6.5 Manual vs Automated Matrix ==
--
--{{include reference="FactHarbor.Specification.Diagrams.Manual vs Automated matrix.WebHome"/}}
--
  == 7. Moderation ==
--
  Moderators handle items AKEL flags:
  **Abuse detection**: Spam, manipulation, harassment
  **Safety issues**: Content that could cause immediate harm
@@ -214,9 +214,7 @@
  **Action**: May temporarily hide content, ban users, or propose algorithm improvements
  **Does NOT**: Routinely review claims or override verdicts
  See [[Organisational Model>>FactHarbor.Organisation.Organisational-Model]] for moderator role details.
--
  == 8. Continuous Improvement ==
--
  **Performance monitoring**: Track AKEL accuracy, speed, coverage
  **Issue identification**: Find systematic errors from metrics
  **Algorithm updates**: Deploy improvements to fix patterns
@@ -223,21 +223,15 @@
  **A/B testing**: Validate changes before full rollout
  **Retrospectives**: Learn from failures systematically
  See [[Continuous Improvement>>FactHarbor.Organisation.How-We-Work-Together.Continuous-Improvement]] for improvement cycle.
--
  == 9. Scalability ==
--
  Automation enables FactHarbor to scale:
--
  * **Millions of claims** processable
  * **Consistent quality** at any volume
  * **Cost efficiency** through automation
  * **Rapid iteration** on algorithms
  Without automation: Human review doesn't scale, creates bottlenecks, introduces inconsistency.
--
  == 10. Transparency ==
--
  All automation is transparent:
--
  * **Algorithm parameters** documented
  * **Evaluation criteria** public
  * **Source scoring rules** explicit

Changes for page Automation

Summary

Details

Applications

Navigation

Need help?