Changes for page Requirements

Last modified by Robert Schaub on 2026/02/08 21:32

From version 7.4
edited by Robert Schaub
on 2026/01/20 20:25
Change comment: Renamed back-links.
To version 7.2
edited by Robert Schaub
on 2025/12/23 11:03
Change comment: Update document after refactoring.

Summary

Details

Page properties
Content
... ... @@ -32,7 +32,6 @@
32 32  **Who**: Anyone (no login required)
33 33  
34 34  **Can**:
35 -
36 36  * Browse and search claims
37 37  * View scenarios, evidence, verdicts, and confidence scores
38 38  * Flag issues or errors
... ... @@ -40,7 +40,6 @@
40 40  * Submit claims automatically (new claims added if not duplicates)
41 41  
42 42  **Cannot**:
43 -
44 44  * Modify content
45 45  * Access edit history details
46 46  
... ... @@ -51,7 +51,6 @@
51 51  **Who**: Registered users (earns reputation through contributions)
52 52  
53 53  **Can**:
54 -
55 55  * Everything a Reader can do
56 56  * Edit claims, evidence, and scenarios
57 57  * Add sources and citations
... ... @@ -60,7 +60,6 @@
60 60  * Earn reputation points for quality contributions
61 61  
62 62  **Reputation System**:
63 -
64 64  * New contributors: Limited edit privileges
65 65  * Established contributors (established reputation): Full edit access
66 66  * Trusted contributors (substantial reputation): Can approve certain changes
... ... @@ -68,7 +68,6 @@
68 68  * Reputation lost through: Reverted edits, invalid flags, abuse
69 69  
70 70  **Cannot**:
71 -
72 72  * Delete or hide content (only moderators)
73 73  * Override moderation decisions
74 74  
... ... @@ -79,7 +79,6 @@
79 79  **Who**: Trusted community members with proven track record, appointed by governance board
80 80  
81 81  **Can**:
82 -
83 83  * Review flagged content
84 84  * Hide harmful or abusive content
85 85  * Resolve disputes between contributors
... ... @@ -88,7 +88,6 @@
88 88  * Access full audit logs
89 89  
90 90  **Cannot**:
91 -
92 92  * Change governance rules
93 93  * Permanently ban users without board approval
94 94  * Override technical quality gates
... ... @@ -102,7 +102,6 @@
102 102  **Not a permanent role**: Contacted externally when needed for contested claims in their domain
103 103  
104 104  **When used**:
105 -
106 106  * Medical claims with life/safety implications
107 107  * Legal interpretations with significant impact
108 108  * Scientific claims with high controversy
... ... @@ -109,7 +109,6 @@
109 109  * Technical claims requiring specialized knowledge
110 110  
111 111  **Process**:
112 -
113 113  * Moderator identifies need for expert input
114 114  * Contact expert externally (don't require them to be users)
115 115  * Trusted Contributor provides written opinion with sources
... ... @@ -129,13 +129,11 @@
129 129  **Status**: Visible to all users
130 130  
131 131  **Includes**:
132 -
133 133  * AI-generated analyses (default state)
134 134  * User-contributed content
135 135  * Edited/improved content
136 136  
137 137  **Quality Indicators** (displayed with content):
138 -
139 139  * **Confidence Score**: 0-100% (AI's confidence in analysis)
140 140  * **Source Quality Score**: 0-100% (based on source track record)
141 141  * **Controversy Flag**: If high dispute/edit activity
... ... @@ -145,7 +145,6 @@
145 145  * **Review Status**: AI-generated / Human-reviewed / Expert-validated
146 146  
147 147  **Automatic Warnings**:
148 -
149 149  * Confidence < 60%: "Low confidence - use caution"
150 150  * Source quality < 40%: "Sources may be unreliable"
151 151  * High controversy: "Disputed - multiple interpretations exist"
... ... @@ -158,7 +158,6 @@
158 158  **Status**: Not visible to regular users (only to moderators)
159 159  
160 160  **Reasons**:
161 -
162 162  * Spam or advertising
163 163  * Personal attacks or harassment
164 164  * Illegal content
... ... @@ -167,7 +167,6 @@
167 167  * Abuse or harmful content
168 168  
169 169  **Process**:
170 -
171 171  * Automated detection flags for moderator review
172 172  * Moderator confirms and hides
173 173  * Original author notified with reason
... ... @@ -190,7 +190,6 @@
190 190  **AKEL is the primary system**. Human contributions supplement and train AKEL.
191 191  
192 192  **AKEL Must**:
193 -
194 194  * Mark all outputs as AI-generated
195 195  * Display confidence scores prominently
196 196  * Provide source citations
... ... @@ -199,7 +199,6 @@
199 199  * Learn from human corrections
200 200  
201 201  **When AKEL Makes Errors**:
202 -
203 203  1. Capture the error pattern (what, why, how common)
204 204  2. Improve the system (better prompt, model, validation)
205 205  3. Re-process affected claims automatically
... ... @@ -230,7 +230,6 @@
230 230  === 4.1 Source Requirements ===
231 231  
232 232  **Track Record Over Credentials**:
233 -
234 234  * Sources evaluated by historical accuracy
235 235  * Correction policy matters
236 236  * Independence from conflicts of interest
... ... @@ -237,7 +237,6 @@
237 237  * Methodology transparency
238 238  
239 239  **Source Quality Database**:
240 -
241 241  * Automated tracking of source accuracy
242 242  * Correction frequency
243 243  * Reliability score (updated continuously)
... ... @@ -269,7 +269,6 @@
269 269  === 4.4 Confidence Scoring ===
270 270  
271 271  **Automated confidence calculation based on**:
272 -
273 273  * Source quality scores
274 274  * Evidence consistency
275 275  * Contradiction detection
... ... @@ -277,7 +277,6 @@
277 277  * Historical accuracy of similar claims
278 278  
279 279  **Thresholds**:
280 -
281 281  * < 40%: Too low to publish (needs improvement)
282 282  * 40-60%: Published with "Low confidence" warning
283 283  * 60-80%: Published as standard
... ... @@ -294,7 +294,6 @@
294 294  === 5.1 Risk Score Calculation ===
295 295  
296 296  **Factors** (weighted algorithm):
297 -
298 298  * **Domain sensitivity**: Medical, legal, safety auto-flagged higher
299 299  * **Potential impact**: Views, citations, spread
300 300  * **Controversy level**: Flags, disputes, edit wars
... ... @@ -321,7 +321,6 @@
321 321  === 6.1 Error Capture ===
322 322  
323 323  **When users flag errors or make corrections**:
324 -
325 325  1. What was wrong? (categorize)
326 326  2. What should it have been?
327 327  3. Why did the system fail? (root cause)
... ... @@ -340,7 +340,6 @@
340 340  === 6.3 Quality Metrics Dashboard ===
341 341  
342 342  **Track continuously**:
343 -
344 344  * Error rate by category
345 345  * Source quality distribution
346 346  * Confidence score trends
... ... @@ -366,7 +366,6 @@
366 366  === 7.2 Anomaly Detection ===
367 367  
368 368  **Automated alerts for**:
369 -
370 370  * Sudden quality drops
371 371  * Unusual patterns
372 372  * Contradiction clusters
... ... @@ -419,7 +419,6 @@
419 419  **Fulfills**: UN-2 (Context-dependent verification), UN-3 (Article summary with FactHarbor analysis summary), UN-8 (Understanding disagreement)
420 420  
421 421  **Automated scenario creation**:
422 -
423 423  * AKEL analyzes claim and generates likely scenarios (use-cases and contexts)
424 424  * Each scenario includes: assumptions, definitions, boundaries, evidence context
425 425  * Users can flag incorrect scenarios
... ... @@ -480,7 +480,6 @@
480 480  **Purpose**: Provide side-by-side comparison of what a document claims vs. FactHarbor's complete analysis of its credibility
481 481  
482 482  **Left Panel: Article Summary**:
483 -
484 484  * Document title, source, and claimed credibility
485 485  * "The Big Picture" - main thesis or position change
486 486  * "Key Findings" - structured summary of document's main claims
... ... @@ -488,7 +488,6 @@
488 488  * "Conclusion" - document's bottom line
489 489  
490 490  **Right Panel: FactHarbor Analysis Summary**:
491 -
492 492  * FactHarbor's independent source credibility assessment
493 493  * Claim-by-claim verdicts with confidence scores
494 494  * Methodology assessment (strengths, limitations)
... ... @@ -496,7 +496,6 @@
496 496  * Analysis ID for reference
497 497  
498 498  **Design Principles**:
499 -
500 500  * No scrolling required - both panels visible simultaneously
501 501  * Visual distinction between "what they say" and "FactHarbor's analysis"
502 502  * Color coding for verdicts (supported, uncertain, refuted)
... ... @@ -504,7 +504,6 @@
504 504  * Mobile responsive (panels stack vertically on small screens)
505 505  
506 506  **Implementation Notes**:
507 -
508 508  * Generated automatically by AKEL for every analyzed document
509 509  * Updates when verdict evolves (maintains version history)
510 510  * Exportable as standalone summary report
... ... @@ -531,8 +531,7 @@
531 531  (% style="font-size:0.9em; color:#666;" %)
532 532  ↑ WELL SUPPORTED • 87% confidence
533 533  [[Click for evidence details →]]
534 -
535 -
505 +(%%)
536 536  )))
537 537  
538 538  The study, which followed 10,000 participants over five years, showed significant improvements in cardiovascular health markers.
... ... @@ -545,8 +545,7 @@
545 545  ↑ UNCERTAIN • 45% confidence
546 546  Overstated - evidence shows risk reduction, not prevention
547 547  [[Click for details →]]
548 -
549 -
518 +(%%)
550 550  )))
551 551  
552 552  Dr. Maria Rodriguez, lead researcher, recommends incorporating more olive oil, fish, and vegetables into daily meals.
... ... @@ -559,8 +559,7 @@
559 559  ↑ REFUTED • 15% confidence
560 560  Claim not supported by study design; correlation ≠ causation
561 561  [[Click for counter-evidence →]]
562 -
563 -
531 +(%%)
564 564  )))
565 565  
566 566  Participants also reported feeling more energetic and experiencing better sleep quality, though these were secondary measures.
... ... @@ -567,7 +567,6 @@
567 567  )))
568 568  
569 569  **Legend:**
570 -
571 571  * 🟢 = Well-supported claim (confidence ≥75%)
572 572  * 🟡 = Uncertain claim (confidence 40-74%)
573 573  * 🔴 = Refuted/unsupported claim (confidence <40%)
... ... @@ -586,13 +586,11 @@
586 586  **Confidence:** 87%
587 587  
588 588  **Evidence Summary:**
589 -
590 590  * Meta-analysis of 12 RCTs confirms 23-28% risk reduction
591 591  * Consistent findings across multiple populations
592 592  * Published in peer-reviewed journal (high credibility)
593 593  
594 594  **Uncertainty Factors:**
595 -
596 596  * Exact percentage varies by study (20-30% range)
597 597  
598 598  [[View Full Analysis →]]
... ... @@ -599,7 +599,6 @@
599 599  )))
600 600  
601 601  **Color-Coding System**:
602 -
603 603  * **Green**: Well-supported claims (confidence ≥75%, strong evidence)
604 604  * **Yellow/Orange**: Uncertain claims (confidence 40-74%, conflicting or limited evidence)
605 605  * **Red**: Refuted or unsupported claims (confidence <40%, contradicted by evidence)
... ... @@ -609,12 +609,8 @@
609 609  
610 610  (% style="width:100%; border-collapse:collapse;" %)
611 611  |=**Article Text**|=**Status**|=**Analysis**
612 -|(((
613 -A recent study published in the Journal of Nutrition has revealed new findings about the Mediterranean diet.
614 -)))|(% style="text-align:center;" %)Plain text|(% style="font-style:italic; color:#888;" %)Context - no highlighting
615 -|(((
616 -//Researchers found that Mediterranean diet followers had a 25% lower risk of heart disease compared to control groups//
617 -)))|(% style="background-color:#D4EDDA; text-align:center; padding:8px;" %)🟢 **WELL SUPPORTED**|(((
576 +|(((A recent study published in the Journal of Nutrition has revealed new findings about the Mediterranean diet.)))|(% style="text-align:center;" %)Plain text|(% style="font-style:italic; color:#888;" %)Context - no highlighting
577 +|(((//Researchers found that Mediterranean diet followers had a 25% lower risk of heart disease compared to control groups//)))|(% style="background-color:#D4EDDA; text-align:center; padding:8px;" %)🟢 **WELL SUPPORTED**|(((
618 618  **87% confidence**
619 619  
620 620  Meta-analysis of 12 RCTs confirms 23-28% risk reduction
... ... @@ -621,12 +621,8 @@
621 621  
622 622  [[View Full Analysis]]
623 623  )))
624 -|(((
625 -The study, which followed 10,000 participants over five years, showed significant improvements in cardiovascular health markers.
626 -)))|(% style="text-align:center;" %)Plain text|(% style="font-style:italic; color:#888;" %)Methodology - no highlighting
627 -|(((
628 -//Some experts believe this diet can completely prevent heart attacks//
629 -)))|(% style="background-color:#FFF3CD; text-align:center; padding:8px;" %)🟡 **UNCERTAIN**|(((
584 +|(((The study, which followed 10,000 participants over five years, showed significant improvements in cardiovascular health markers.)))|(% style="text-align:center;" %)Plain text|(% style="font-style:italic; color:#888;" %)Methodology - no highlighting
585 +|(((//Some experts believe this diet can completely prevent heart attacks//)))|(% style="background-color:#FFF3CD; text-align:center; padding:8px;" %)🟡 **UNCERTAIN**|(((
630 630  **45% confidence**
631 631  
632 632  Overstated - evidence shows risk reduction, not prevention
... ... @@ -633,12 +633,8 @@
633 633  
634 634  [[View Details]]
635 635  )))
636 -|(((
637 -Dr. Rodriguez recommends incorporating more olive oil, fish, and vegetables into daily meals.
638 -)))|(% style="text-align:center;" %)Plain text|(% style="font-style:italic; color:#888;" %)Recommendation - no highlighting
639 -|(((
640 -//The study proves that saturated fats cause heart disease//
641 -)))|(% style="background-color:#F8D7DA; text-align:center; padding:8px;" %)🔴 **REFUTED**|(((
592 +|(((Dr. Rodriguez recommends incorporating more olive oil, fish, and vegetables into daily meals.)))|(% style="text-align:center;" %)Plain text|(% style="font-style:italic; color:#888;" %)Recommendation - no highlighting
593 +|(((//The study proves that saturated fats cause heart disease//)))|(% style="background-color:#F8D7DA; text-align:center; padding:8px;" %)🔴 **REFUTED**|(((
642 642  **15% confidence**
643 643  
644 644  Claim not supported by study; correlation ≠ causation
... ... @@ -647,7 +647,6 @@
647 647  )))
648 648  
649 649  **Design Notes:**
650 -
651 651  * Highlighted claims use italics to distinguish from plain text
652 652  * Color backgrounds match XWiki message box colors (success/warning/error)
653 653  * Status column shows verdict prominently
... ... @@ -654,7 +654,6 @@
654 654  * Analysis column provides quick summary with link to details
655 655  
656 656  **User Actions**:
657 -
658 658  * **Hover** over highlighted claim → Tooltip appears
659 659  * **Click** highlighted claim → Detailed analysis modal/panel
660 660  * **Toggle** button to turn highlighting on/off
... ... @@ -661,18 +661,16 @@
661 661  * **Keyboard**: Tab through highlighted claims
662 662  
663 663  **Interaction Design**:
664 -
665 665  * Hover/click on highlighted claim → Show tooltip with:
666 -* Claim text
667 -* Verdict (e.g., "WELL SUPPORTED")
668 -* Confidence score (e.g., "85%")
669 -* Brief evidence summary
670 -* Link to detailed analysis
615 + * Claim text
616 + * Verdict (e.g., "WELL SUPPORTED")
617 + * Confidence score (e.g., "85%")
618 + * Brief evidence summary
619 + * Link to detailed analysis
671 671  * Toggle highlighting on/off (user preference)
672 672  * Adjustable color intensity for accessibility
673 673  
674 674  **Technical Requirements**:
675 -
676 676  * Real-time highlighting as page loads (non-blocking)
677 677  * Claim boundary detection (start/end of assertion)
678 678  * Handle nested or overlapping claims
... ... @@ -680,19 +680,16 @@
680 680  * Work with various content formats (HTML, plain text, PDFs)
681 681  
682 682  **Performance Requirements**:
683 -
684 684  * Highlighting renders within 500ms of page load
685 685  * No perceptible delay in reading experience
686 686  * Efficient DOM manipulation (avoid reflows)
687 687  
688 688  **Accessibility**:
689 -
690 690  * Color-blind friendly palette (use patterns/icons in addition to color)
691 691  * Screen reader compatible (ARIA labels for claim credibility)
692 692  * Keyboard navigation to highlighted claims
693 693  
694 694  **Implementation Notes**:
695 -
696 696  * Claims extracted and analyzed by AKEL during initial processing
697 697  * Highlighting data stored as annotations with byte offsets
698 698  * Client-side rendering of highlights based on verdict data
... ... @@ -705,7 +705,6 @@
705 705  **Fulfills**: UN-1 (Fast access to verified content), UN-16 (Clear review status)
706 706  
707 707  **Simple flow**:
708 -
709 709  1. Claim submitted
710 710  2. AKEL processes (automated)
711 711  3. If confidence > threshold: Publish (labeled as AI-generated)
... ... @@ -717,7 +717,6 @@
717 717  ==== FR10 — Moderation ====
718 718  
719 719  **Focus on abuse, not routine quality**:
720 -
721 721  * Automated abuse detection
722 722  * Moderators handle flags
723 723  * Quick response to harmful content
... ... @@ -788,7 +788,6 @@
788 788  **Purpose:** Ensure extracted claims are factual assertions (not opinions/predictions)
789 789  
790 790  **Checks:**
791 -
792 792  1. **Factual Statement Test:** Is this verifiable? (Yes/No)
793 793  2. **Opinion Detection:** Contains hedging language? ("I think", "probably", "best")
794 794  3. **Future Prediction Test:** Makes claims about future events?
... ... @@ -795,7 +795,6 @@
795 795  4. **Specificity Score:** Contains specific entities, numbers, dates?
796 796  
797 797  **Thresholds:**
798 -
799 799  * Factual: Must be "Yes"
800 800  * Opinion markers: <2 hedging phrases
801 801  * Specificity: ≥3 specific elements
... ... @@ -807,13 +807,11 @@
807 807  **Purpose:** Ensure AI-linked evidence actually relates to claim
808 808  
809 809  **Checks:**
810 -
811 811  1. **Semantic Similarity Score:** Evidence vs. claim (embeddings)
812 812  2. **Entity Overlap:** Shared people/places/things?
813 813  3. **Topic Relevance:** Discusses claim subject?
814 814  
815 815  **Thresholds:**
816 -
817 817  * Similarity: ≥0.6 (cosine similarity)
818 818  * Entity overlap: ≥1 shared entity
819 819  * Topic relevance: ≥0.5
... ... @@ -825,13 +825,11 @@
825 825  **Purpose:** Validate scenario assumptions are logical and complete
826 826  
827 827  **Checks:**
828 -
829 829  1. **Completeness:** All required fields populated
830 830  2. **Internal Consistency:** Assumptions don't contradict
831 831  3. **Distinguishability:** Scenarios meaningfully different
832 832  
833 833  **Thresholds:**
834 -
835 835  * Required fields: 100%
836 836  * Contradiction score: <0.3
837 837  * Scenario similarity: <0.8
... ... @@ -843,7 +843,6 @@
843 843  **Purpose:** Only publish high-confidence verdicts
844 844  
845 845  **Checks:**
846 -
847 847  1. **Evidence Count:** Minimum 2 sources
848 848  2. **Source Quality:** Average reliability ≥0.6
849 849  3. **Evidence Agreement:** Supporting vs. contradicting ≥0.6
... ... @@ -850,7 +850,6 @@
850 850  4. **Uncertainty Factors:** Hedging in reasoning
851 851  
852 852  **Confidence Tiers:**
853 -
854 854  * **HIGH (80-100%):** ≥3 sources, ≥0.7 quality, ≥80% agreement
855 855  * **MEDIUM (50-79%):** ≥2 sources, ≥0.6 quality, ≥60% agreement
856 856  * **LOW (0-49%):** <2 sources OR low quality/agreement
... ... @@ -857,13 +857,11 @@
857 857  * **INSUFFICIENT:** <2 sources → DO NOT PUBLISH
858 858  
859 859  **Implementation Phases:**
860 -
861 861  * **POC1:** Gates 1 & 4 only (basic validation)
862 862  * **POC2:** All 4 gates (complete framework)
863 863  * **V1.0:** Hardened with <5% hallucination rate
864 864  
865 865  **Acceptance Criteria:**
866 -
867 867  * ✅ All gates operational
868 868  * ✅ Hallucination rate <5%
869 869  * ✅ Quality metrics public
... ... @@ -881,7 +881,6 @@
881 881  ==== API Security ====
882 882  
883 883  **Rate Limiting:**
884 -
885 885  * **Analysis endpoints:** 100 requests/hour per IP
886 886  * **Read endpoints:** 1,000 requests/hour per IP
887 887  * **Search:** 500 requests/hour per IP
... ... @@ -889,24 +889,21 @@
889 889  * **Burst protection:** Max 10 requests/second
890 890  
891 891  **Authentication & Authorization:**
892 -
893 893  * **API Keys:** Required for programmatic access
894 894  * **JWT tokens:** For user sessions (1-hour expiry)
895 895  * **OAuth2:** For third-party integrations
896 896  * **Role-Based Access Control (RBAC):**
897 -* Public: Read-only access to published claims
898 -* Contributor: Submit claims, provide evidence
899 -* Moderator: Review contributions, manage quality
900 -* Admin: System configuration, user management
828 + * Public: Read-only access to published claims
829 + * Contributor: Submit claims, provide evidence
830 + * Moderator: Review contributions, manage quality
831 + * Admin: System configuration, user management
901 901  
902 902  **CORS Policies:**
903 -
904 904  * Whitelist approved domains only
905 905  * No wildcard origins in production
906 906  * Credentials required for sensitive endpoints
907 907  
908 908  **Input Sanitization:**
909 -
910 910  * Validate all user input against schemas
911 911  * Sanitize HTML/JavaScript in text submissions
912 912  * Prevent SQL injection (use parameterized queries)
... ... @@ -914,12 +914,11 @@
914 914  * Max request size: 10MB
915 915  * File upload restrictions: Whitelist file types, scan for malware
916 916  
917 -----
846 +---
918 918  
919 919  ==== Data Security ====
920 920  
921 921  **Encryption at Rest:**
922 -
923 923  * Database encryption using AES-256
924 924  * Encrypted backups
925 925  * Key management via cloud provider KMS (AWS KMS, Google Cloud KMS)
... ... @@ -926,7 +926,6 @@
926 926  * Regular key rotation (90-day cycle)
927 927  
928 928  **Encryption in Transit:**
929 -
930 930  * HTTPS/TLS 1.3 only (no TLS 1.0/1.1)
931 931  * Strong cipher suites only
932 932  * HSTS (HTTP Strict Transport Security) enabled
... ... @@ -933,7 +933,6 @@
933 933  * Certificate pinning for mobile apps
934 934  
935 935  **Secure Credential Storage:**
936 -
937 937  * Passwords hashed with bcrypt (cost factor 12+)
938 938  * API keys encrypted in database
939 939  * Secrets stored in environment variables (never in code)
... ... @@ -940,13 +940,12 @@
940 940  * Use secrets manager (AWS Secrets Manager, HashiCorp Vault)
941 941  
942 942  **Data Privacy:**
943 -
944 944  * Minimal data collection (privacy by design)
945 945  * User data deletion on request (GDPR compliance)
946 946  * PII encryption in database
947 947  * Anonymize logs (no PII in log files)
948 948  
949 -----
874 +---
950 950  
951 951  ==== Application Security ====
952 952  
... ... @@ -964,7 +964,6 @@
964 964  10. **Server-Side Request Forgery:** URL validation, whitelist domains
965 965  
966 966  **Security Headers:**
967 -
968 968  * `Content-Security-Policy`: Strict CSP to prevent XSS
969 969  * `X-Frame-Options`: DENY (prevent clickjacking)
970 970  * `X-Content-Type-Options`: nosniff
... ... @@ -972,7 +972,6 @@
972 972  * `Permissions-Policy`: Restrict browser features
973 973  
974 974  **Dependency Vulnerability Scanning:**
975 -
976 976  * **Tools:** Snyk, Dependabot, npm audit, pip-audit
977 977  * **Frequency:** Daily automated scans
978 978  * **Action:** Patch critical vulnerabilities within 24 hours
... ... @@ -979,34 +979,30 @@
979 979  * **Policy:** No known high/critical CVEs in production
980 980  
981 981  **Security Audits:**
982 -
983 983  * **Internal:** Quarterly security reviews
984 984  * **External:** Annual penetration testing by certified firm
985 985  * **Bug Bounty:** Public bug bounty program (V1.1+)
986 986  * **Compliance:** SOC 2 Type II certification target (V1.5)
987 987  
988 -----
910 +---
989 989  
990 990  ==== Operational Security ====
991 991  
992 992  **DDoS Protection:**
993 -
994 994  * CloudFlare or AWS Shield
995 995  * Rate limiting at CDN layer
996 996  * Automatic IP blocking for abuse patterns
997 997  
998 998  **Monitoring & Alerting:**
999 -
1000 1000  * Real-time security event monitoring
1001 1001  * Alerts for:
1002 -* Failed login attempts (>5 in 10 minutes)
1003 -* API abuse patterns
1004 -* Unusual data access patterns
1005 -* Security scan detections
922 + * Failed login attempts (>5 in 10 minutes)
923 + * API abuse patterns
924 + * Unusual data access patterns
925 + * Security scan detections
1006 1006  * Integration with SIEM (Security Information and Event Management)
1007 1007  
1008 1008  **Incident Response:**
1009 -
1010 1010  * Documented incident response plan
1011 1011  * Security incident classification (P1-P4)
1012 1012  * On-call rotation for security issues
... ... @@ -1014,18 +1014,16 @@
1014 1014  * Public disclosure policy (coordinated disclosure)
1015 1015  
1016 1016  **Backup & Recovery:**
1017 -
1018 1018  * Daily encrypted backups
1019 1019  * 30-day retention period
1020 1020  * Tested recovery procedures (quarterly)
1021 1021  * Disaster recovery plan (RTO: 4 hours, RPO: 1 hour)
1022 1022  
1023 -----
941 +---
1024 1024  
1025 1025  ==== Compliance & Standards ====
1026 1026  
1027 1027  **GDPR Compliance:**
1028 -
1029 1029  * User consent management
1030 1030  * Right to access data
1031 1031  * Right to deletion
... ... @@ -1033,7 +1033,6 @@
1033 1033  * Privacy policy published
1034 1034  
1035 1035  **Accessibility:**
1036 -
1037 1037  * WCAG 2.1 AA compliance
1038 1038  * Screen reader compatibility
1039 1039  * Keyboard navigation
... ... @@ -1040,7 +1040,6 @@
1040 1040  * Alt text for images
1041 1041  
1042 1042  **Browser Support:**
1043 -
1044 1044  * Modern browsers only (Chrome/Edge/Firefox/Safari latest 2 versions)
1045 1045  * No IE11 support
1046 1046  
... ... @@ -1055,6 +1055,7 @@
1055 1055  * ✅ Incident response plan documented and tested
1056 1056  * ✅ 95% uptime over 30-day period
1057 1057  
973 +
1058 1058  === NFR13: Quality Metrics Transparency ===
1059 1059  
1060 1060  **Fulfills:** User trust, transparency, continuous improvement, IFCN methodology transparency
... ... @@ -1069,18 +1069,16 @@
1069 1069  
1070 1070  **Core Metrics to Display:**
1071 1071  
1072 -* \\
1073 -** \\
1074 -**1. Verdict Quality Metrics
988 +**1. Verdict Quality Metrics**
1075 1075  
1076 1076  **TIGERScore (Fact-Checking Quality):**
1077 -
1078 1078  * **Definition:** Measures how well generated verdicts match expert fact-checker judgments
1079 1079  * **Scale:** 0-100 (higher is better)
1080 1080  * **Calculation:** Using TIGERScore framework (Truth-conditional accuracy, Informativeness, Generality, Evaluativeness, Relevance)
1081 1081  * **Target:** Average ≥80 for production release
1082 1082  * **Display:**
1083 -{{code}}Verdict Quality (TIGERScore):
996 +{{code}}
997 +Verdict Quality (TIGERScore):
1084 1084  Overall: 84.2 ▲ (+2.1 from last month)
1085 1085  
1086 1086  Distribution:
... ... @@ -1088,18 +1088,19 @@
1088 1088   Good (60-80): 28%
1089 1089   Needs Improvement (<60): 5%
1090 1090  
1091 -Trend: [Graph showing improvement over time]{{/code}}
1005 +Trend: [Graph showing improvement over time]
1006 +{{/code}}
1092 1092  
1093 1093  **2. Hallucination & Faithfulness Metrics**
1094 1094  
1095 1095  **AlignScore (Faithfulness to Evidence):**
1096 -
1097 1097  * **Definition:** Measures how well verdicts align with actual evidence content
1098 1098  * **Scale:** 0-1 (higher is better)
1099 1099  * **Purpose:** Detect AI hallucinations (making claims not supported by evidence)
1100 1100  * **Target:** Average ≥0.85, hallucination rate <5%
1101 1101  * **Display:**
1102 -{{code}}Evidence Faithfulness (AlignScore):
1016 +{{code}}
1017 +Evidence Faithfulness (AlignScore):
1103 1103  Average: 0.87 ▼ (-0.02 from last month)
1104 1104  
1105 1105  Hallucination Rate: 4.2%
... ... @@ -1106,24 +1106,24 @@
1106 1106   - Claims without evidence support: 3.1%
1107 1107   - Misrepresented evidence: 1.1%
1108 1108  
1109 -Action: Prompt engineering review scheduled{{/code}}
1024 +Action: Prompt engineering review scheduled
1025 +{{/code}}
1110 1110  
1111 1111  **3. Evidence Quality Metrics**
1112 1112  
1113 1113  **Source Reliability:**
1114 -
1115 1115  * Average source quality score (0-1 scale)
1116 1116  * Distribution of high/medium/low quality sources
1117 1117  * Publisher track record trends
1118 1118  
1119 1119  **Evidence Coverage:**
1120 -
1121 1121  * Average number of sources per claim
1122 1122  * Percentage of claims with ≥2 sources (EFCSN minimum)
1123 1123  * Geographic diversity of sources
1124 1124  
1125 1125  **Display:**
1126 -{{code}}Evidence Quality:
1040 +{{code}}
1041 +Evidence Quality:
1127 1127  
1128 1128  Average Sources per Claim: 4.2
1129 1129  Claims with ≥2 sources: 94% (EFCSN compliant)
... ... @@ -1133,23 +1133,24 @@
1133 1133   Medium quality (0.5-0.8): 43%
1134 1134   Low quality (<0.5): 9%
1135 1135  
1136 -Geographic Diversity: 23 countries represented{{/code}}
1051 +Geographic Diversity: 23 countries represented
1052 +{{/code}}
1137 1137  
1138 1138  **4. Contributor Consensus Metrics** (when human reviewers involved)
1139 1139  
1140 1140  **Inter-Rater Reliability (IRR):**
1141 -
1142 1142  * **Calculation:** Cohen's Kappa or Fleiss' Kappa for multiple raters
1143 1143  * **Scale:** 0-1 (higher is better)
1144 1144  * **Interpretation:**
1145 -* >0.8: Almost perfect agreement
1146 -* 0.6-0.8: Substantial agreement
1147 -* 0.4-0.6: Moderate agreement
1148 -* <0.4: Poor agreement
1060 + * >0.8: Almost perfect agreement
1061 + * 0.6-0.8: Substantial agreement
1062 + * 0.4-0.6: Moderate agreement
1063 + * <0.4: Poor agreement
1149 1149  * **Target:** Maintain ≥0.7 (substantial agreement)
1150 1150  
1151 1151  **Display:**
1152 -{{code}}Contributor Consensus:
1067 +{{code}}
1068 +Contributor Consensus:
1153 1153  
1154 1154  Inter-Rater Reliability (IRR): 0.73 (Substantial agreement)
1155 1155   - Verdict agreement: 78%
... ... @@ -1157,9 +1157,10 @@
1157 1157   - Scenario structure agreement: 69%
1158 1158  
1159 1159  Cases requiring moderator review: 12
1160 -Moderator override rate: 8%{{/code}}
1076 +Moderator override rate: 8%
1077 +{{/code}}
1161 1161  
1162 -----
1079 +---
1163 1163  
1164 1164  ==== Quality Dashboard Implementation ====
1165 1165  
... ... @@ -1166,7 +1166,6 @@
1166 1166  **Dashboard Location:** `/quality-metrics`
1167 1167  
1168 1168  **Update Frequency:**
1169 -
1170 1170  * **POC2:** Weekly manual updates
1171 1171  * **Beta 0:** Daily automated updates
1172 1172  * **V1.0:** Real-time metrics (updated hourly)
... ... @@ -1216,7 +1216,7 @@
1216 1216  
1217 1217  {{/code}}
1218 1218  
1219 -----
1135 +---
1220 1220  
1221 1221  ==== Continuous Improvement Feedback Loop ====
1222 1222  
... ... @@ -1223,36 +1223,31 @@
1223 1223  **How Metrics Inform AKEL Improvements:**
1224 1224  
1225 1225  1. **Identify Weak Areas:**
1142 + * Low TIGERScore → Review prompt engineering
1143 + * High hallucination → Strengthen evidence grounding
1144 + * Low IRR → Clarify evaluation criteria
1226 1226  
1227 -* Low TIGERScore → Review prompt engineering
1228 -* High hallucination → Strengthen evidence grounding
1229 -* Low IRR → Clarify evaluation criteria
1230 -
1231 1231  2. **A/B Testing Integration:**
1147 + * Test prompt variations
1148 + * Measure impact on quality metrics
1149 + * Deploy winners automatically
1232 1232  
1233 -* Test prompt variations
1234 -* Measure impact on quality metrics
1235 -* Deploy winners automatically
1236 -
1237 1237  3. **Alert Thresholds:**
1152 + * TIGERScore drops below 75 → Alert team
1153 + * Hallucination rate exceeds 7% → Pause auto-publishing
1154 + * IRR below 0.6 → Moderator training needed
1238 1238  
1239 -* TIGERScore drops below 75 → Alert team
1240 -* Hallucination rate exceeds 7% → Pause auto-publishing
1241 -* IRR below 0.6 → Moderator training needed
1242 -
1243 1243  4. **Monthly Quality Reviews:**
1157 + * Analyze trends
1158 + * Identify systematic issues
1159 + * Plan prompt improvements
1160 + * Update AKEL models
1244 1244  
1245 -* Analyze trends
1246 -* Identify systematic issues
1247 -* Plan prompt improvements
1248 -* Update AKEL models
1162 +---
1249 1249  
1250 -----
1251 -
1252 1252  ==== Metric Calculation Details ====
1253 1253  
1254 1254  **TIGERScore Implementation:**
1255 -
1256 1256  * Reference: https://github.com/TIGER-AI-Lab/TIGERScore
1257 1257  * Input: Generated verdict + reference verdict (from expert)
1258 1258  * Output: 0-100 score across 5 dimensions
... ... @@ -1259,7 +1259,6 @@
1259 1259  * Requires: Test set of expert-reviewed claims (minimum 100)
1260 1260  
1261 1261  **AlignScore Implementation:**
1262 -
1263 1263  * Reference: https://github.com/yuh-zha/AlignScore
1264 1264  * Input: Generated verdict + source evidence text
1265 1265  * Output: 0-1 faithfulness score
... ... @@ -1266,12 +1266,11 @@
1266 1266  * Calculation: Semantic alignment between claim and evidence
1267 1267  
1268 1268  **Source Quality Scoring:**
1269 -
1270 1270  * Use existing source reliability database (e.g., NewsGuard, MBFC)
1271 1271  * Factor in: Publication history, corrections record, transparency
1272 1272  * Scale: 0-1 (weighted average across sources)
1273 1273  
1274 -----
1183 +---
1275 1275  
1276 1276  ==== Integration Points ====
1277 1277  
... ... @@ -1302,13 +1302,11 @@
1302 1302  == 14. Related Pages ==
1303 1303  
1304 1304  **Non-Functional Requirements (see Section 9):**
1305 -
1306 1306  * [[NFR11 — AKEL Quality Assurance Framework>>#NFR11]]
1307 1307  * [[NFR12 — Security Controls>>#NFR12]]
1308 1308  * [[NFR13 — Quality Metrics Transparency>>#NFR13]]
1309 1309  
1310 1310  **Other Requirements:**
1311 -
1312 1312  * [[User Needs>>FactHarbor.Specification.Requirements.User Needs.WebHome]]
1313 1313  * [[V1.0 Requirements>>FactHarbor.Specification.Requirements.V10.]]
1314 1314  * [[Gap Analysis>>FactHarbor.Specification.Requirements.GapAnalysis]]
... ... @@ -1317,8 +1317,8 @@
1317 1317  * [[Architecture>>FactHarbor.Specification.Architecture.WebHome]] - How requirements are implemented
1318 1318  * [[Data Model>>FactHarbor.Specification.Data Model.WebHome]] - Data structures supporting requirements
1319 1319  * [[Workflows>>FactHarbor.Specification.Workflows.WebHome]] - User interaction workflows
1320 -* [[AKEL>>Archive.FactHarbor.Specification.AI Knowledge Extraction Layer (AKEL).WebHome]] - AI system fulfilling automation requirements
1321 -* [[Global Rules>>Archive.FactHarbor.Organisation.How-We-Work-Together.GlobalRules.WebHome]]
1227 +* [[AKEL>>FactHarbor.Specification.AI Knowledge Extraction Layer (AKEL).WebHome]] - AI system fulfilling automation requirements
1228 +* [[Global Rules>>FactHarbor.Organisation.How-We-Work-Together.GlobalRules.WebHome]]
1322 1322  * [[Privacy Policy>>FactHarbor.Organisation.How-We-Work-Together.Privacy-Policy]]
1323 1323  
1324 1324  = V0.9.70 Additional Requirements =
... ... @@ -1378,7 +1378,6 @@
1378 1378  **FactHarbor-Specific Mapping:**
1379 1379  
1380 1380  **Likelihood Score to Rating Scale:**
1381 -
1382 1382  * 80-100% likelihood → 5 (Highly Supported)
1383 1383  * 60-79% likelihood → 4 (Supported)
1384 1384  * 40-59% likelihood → 3 (Mixed/Uncertain)
... ... @@ -1386,7 +1386,6 @@
1386 1386  * 0-19% likelihood → 1 (Refuted)
1387 1387  
1388 1388  **Multiple Scenarios Handling:**
1389 -
1390 1390  * If claim has multiple scenarios with different verdicts, generate **separate ClaimReview** for each scenario
1391 1391  * Add `disambiguatingDescription` field explaining scenario context
1392 1392  * Example: "Scenario: If interpreted as referring to 2023 data..."
... ... @@ -1419,6 +1419,7 @@
1419 1419  * ✅ All required fields populated with correct data types
1420 1420  * ✅ Handles multi-scenario claims correctly (separate ClaimReview per scenario)
1421 1421  
1327 +
1422 1422  === FR45: User Corrections Notification System ===
1423 1423  
1424 1424  **Fulfills:** IFCN Principle 5 (Open & Honest Corrections), EFCSN compliance
... ... @@ -1440,9 +1440,7 @@
1440 1440  
1441 1441  ==== Notification Mechanisms ====
1442 1442  
1443 -* \\
1444 -** \\
1445 -**1. In-Page Banner:
1349 +**1. In-Page Banner:**
1446 1446  
1447 1447  Display prominent banner on claim page:
1448 1448  
... ... @@ -1462,10 +1462,10 @@
1462 1462  
1463 1463  * Public changelog at `/claims/{id}/corrections`
1464 1464  * Displays for each correction:
1465 -* Date/time of correction
1466 -* What changed (before/after comparison)
1467 -* Why changed (reason if provided)
1468 -* Who made change (AKEL auto-update vs. contributor override)
1369 + * Date/time of correction
1370 + * What changed (before/after comparison)
1371 + * Why changed (reason if provided)
1372 + * Who made change (AKEL auto-update vs. contributor override)
1469 1469  
1470 1470  **3. Email Notifications (opt-in):**
1471 1471  
... ... @@ -1509,6 +1509,7 @@
1509 1509  * ✅ Mobile-responsive banner design
1510 1510  * ✅ Accessible (screen reader compatible)
1511 1511  
1416 +
1512 1512  === FR46: Image Verification System ===
1513 1513  
1514 1514  **Fulfills:** UN-27 (Visual Claim Verification)
... ... @@ -1526,25 +1526,23 @@
1526 1526  **Purpose:** Find earlier uses of the image to verify context
1527 1527  
1528 1528  **Implementation:**
1529 -
1530 1530  * Integrate APIs:
1531 -* **Google Vision AI** (reverse search)
1532 -* **TinEye** (oldest known uses)
1533 -* **Bing Visual Search** (broad coverage)
1435 + * **Google Vision AI** (reverse search)
1436 + * **TinEye** (oldest known uses)
1437 + * **Bing Visual Search** (broad coverage)
1534 1534  
1535 1535  **Process:**
1536 -
1537 1537  1. Extract image from claim or user upload
1538 1538  2. Query multiple reverse search services
1539 1539  3. Analyze results for:
1443 + * Earliest known publication
1444 + * Original context (what was it really showing?)
1445 + * Publication timeline
1446 + * Geographic spread
1540 1540  
1541 -* Earliest known publication
1542 -* Original context (what was it really showing?)
1543 -* Publication timeline
1544 -* Geographic spread
1545 -
1546 1546  **Output:**
1547 -{{code}}Reverse Image Search Results:
1449 +{{code}}
1450 +Reverse Image Search Results:
1548 1548  
1549 1549  Earliest known use: 2019-03-15 (5 years before claim)
1550 1550  Original context: "Photo from 2019 flooding in Mumbai"
... ... @@ -1557,9 +1557,10 @@
1557 1557  • 2020-07-22: Bangladesh monsoon
1558 1558  • 2024-10-15: Current claim (misattributed)
1559 1559  
1560 -[View full timeline]{{/code}}
1463 +[View full timeline]
1464 +{{/code}}
1561 1561  
1562 -----
1466 +---
1563 1563  
1564 1564  **Method 2: AI Manipulation Detection**
1565 1565  
... ... @@ -1566,41 +1566,36 @@
1566 1566  **Purpose:** Detect deepfakes, face swaps, and digital alterations
1567 1567  
1568 1568  **Implementation:**
1569 -
1570 1570  * Integrate detection services:
1571 -* **Sensity AI** (deepfake detection)
1572 -* **Reality Defender** (multimodal analysis)
1573 -* **AWS Rekognition** (face detection inconsistencies)
1474 + * **Sensity AI** (deepfake detection)
1475 + * **Reality Defender** (multimodal analysis)
1476 + * **AWS Rekognition** (face detection inconsistencies)
1574 1574  
1575 1575  **Detection Categories:**
1576 -
1577 1577  1. **Face Manipulation:**
1480 + * Deepfake face swaps
1481 + * Expression manipulation
1482 + * Identity replacement
1578 1578  
1579 -* Deepfake face swaps
1580 -* Expression manipulation
1581 -* Identity replacement
1582 -
1583 1583  2. **Image Manipulation:**
1485 + * Copy-paste artifacts
1486 + * Clone stamp detection
1487 + * Content-aware fill detection
1488 + * JPEG compression inconsistencies
1584 1584  
1585 -* Copy-paste artifacts
1586 -* Clone stamp detection
1587 -* Content-aware fill detection
1588 -* JPEG compression inconsistencies
1589 -
1590 1590  3. **AI Generation:**
1491 + * Detect fully AI-generated images
1492 + * Identify generation artifacts
1493 + * Check for model signatures
1591 1591  
1592 -* Detect fully AI-generated images
1593 -* Identify generation artifacts
1594 -* Check for model signatures
1595 -
1596 1596  **Confidence Scoring:**
1597 -
1598 1598  * **HIGH (80-100%):** Strong evidence of manipulation
1599 1599  * **MEDIUM (50-79%):** Suspicious artifacts detected
1600 1600  * **LOW (0-49%):** Minor inconsistencies or inconclusive
1601 1601  
1602 1602  **Output:**
1603 -{{code}}Manipulation Analysis:
1501 +{{code}}
1502 +Manipulation Analysis:
1604 1604  
1605 1605  Face Manipulation: LOW RISK (12%)
1606 1606  Image Editing: MEDIUM RISK (64%)
... ... @@ -1609,9 +1609,10 @@
1609 1609  
1610 1610  AI Generation: LOW RISK (8%)
1611 1611  
1612 -⚠️ Possible manipulation detected. Manual review recommended.{{/code}}
1511 +⚠️ Possible manipulation detected. Manual review recommended.
1512 +{{/code}}
1613 1613  
1614 -----
1514 +---
1615 1615  
1616 1616  **Method 3: Metadata Analysis (EXIF)**
1617 1617  
... ... @@ -1618,7 +1618,6 @@
1618 1618  **Purpose:** Extract technical details that may reveal manipulation or misattribution
1619 1619  
1620 1620  **Extracted Data:**
1621 -
1622 1622  * **Camera/Device:** Make, model, software
1623 1623  * **Timestamps:** Original date, modification dates
1624 1624  * **Location:** GPS coordinates (if present)
... ... @@ -1626,7 +1626,6 @@
1626 1626  * **File Properties:** Resolution, compression, format conversions
1627 1627  
1628 1628  **Red Flags:**
1629 -
1630 1630  * Metadata completely stripped (suspicious)
1631 1631  * Timestamp conflicts with claimed date
1632 1632  * GPS location conflicts with claimed location
... ... @@ -1634,7 +1634,8 @@
1634 1634  * Creation date after modification date (impossible)
1635 1635  
1636 1636  **Output:**
1637 -{{code}}Image Metadata:
1535 +{{code}}
1536 +Image Metadata:
1638 1638  
1639 1639  Camera: iPhone 14 Pro
1640 1640  Original date: 2023-08-12 14:32:15
... ... @@ -1646,20 +1646,19 @@
1646 1646  Claim says: "Taken in Los Angeles"
1647 1647  EXIF says: New York City
1648 1648  
1649 -⚠️ Edited 14 months after capture{{/code}}
1548 +⚠️ Edited 14 months after capture
1549 +{{/code}}
1650 1650  
1651 -----
1551 +---
1652 1652  
1653 1653  ==== Verification Workflow ====
1654 1654  
1655 1655  **Automatic Triggers:**
1656 -
1657 1657  1. User submits claim with image
1658 1658  2. Article being analyzed contains images
1659 1659  3. Social media post includes photos
1660 1660  
1661 1661  **Process:**
1662 -
1663 1663  1. Extract images from content
1664 1664  2. Run all 3 verification methods in parallel
1665 1665  3. Aggregate results into confidence score
... ... @@ -1694,16 +1694,14 @@
1694 1694  ==== Cost Considerations ====
1695 1695  
1696 1696  **API Costs (estimated per image):**
1697 -
1698 1698  * Google Vision AI: $0.001-0.003
1699 1699  * TinEye: $0.02 (commercial API)
1700 1700  * Sensity AI: $0.05-0.10
1701 1701  * AWS Rekognition: $0.001-0.002
1702 1702  
1703 -**Total per image:** $0.07-0.15**
1600 +**Total per image:** ~$0.07-0.15
1704 1704  
1705 1705  **Mitigation Strategies:**
1706 -
1707 1707  * Cache results for duplicate images
1708 1708  * Use free tier quotas where available
1709 1709  * Prioritize higher-value claims for deep analysis
... ... @@ -1718,6 +1718,7 @@
1718 1718  * ✅ Mobile-friendly image comparison interface
1719 1719  * ✅ False positive rate <15%
1720 1720  
1617 +
1721 1721  === FR47: Archive.org Integration ===
1722 1722  
1723 1723  **Priority:** CRITICAL
... ... @@ -1731,7 +1731,6 @@
1731 1731  **Automatic Archiving:**
1732 1732  
1733 1733  When AKEL links evidence:
1734 -
1735 1735  1. Check if URL already archived (Wayback Machine API)
1736 1736  2. If not, submit for archiving (Save Page Now API)
1737 1737  3. Store both original URL and archive URL
... ... @@ -1766,9 +1766,8 @@
1766 1766  * ✅ API rate limits respected
1767 1767  * ✅ Archive status visible in evidence display
1768 1768  
1769 -== Category 4: Community Safety ==
1770 1770  
1771 - FR48: Contributor Safety Framework ===
1666 +== Category 4: Community Safety ===== FR48: Contributor Safety Framework ===
1772 1772  
1773 1773  **Priority:** CRITICAL
1774 1774  **Fulfills:** UN-28 (Safe contribution environment)
... ... @@ -1778,9 +1778,7 @@
1778 1778  
1779 1779  **Specification:**
1780 1780  
1781 -* \\
1782 -** \\
1783 -**1. Privacy Protection:
1676 +**1. Privacy Protection:**
1784 1784  
1785 1785  * **Optional Pseudonymity:** Contributors can use pseudonyms
1786 1786  * **Email Privacy:** Emails never displayed publicly
... ... @@ -1823,9 +1823,8 @@
1823 1823  * ✅ Moderator tools implemented
1824 1824  * ✅ Safety policy published
1825 1825  
1826 -== Category 5: Continuous Improvement ==
1827 1827  
1828 - FR49: A/B Testing Framework ===
1720 +== Category 5: Continuous Improvement ===== FR49: A/B Testing Framework ===
1829 1829  
1830 1830  **Priority:** CRITICAL
1831 1831  **Fulfills:** Continuous system improvement
... ... @@ -1838,23 +1838,20 @@
1838 1838  **Test Capabilities:**
1839 1839  
1840 1840  1. **Prompt Variations:**
1733 + * Test different claim extraction prompts
1734 + * Test different verdict generation prompts
1735 + * Measure: Accuracy, clarity, completeness
1841 1841  
1842 -* Test different claim extraction prompts
1843 -* Test different verdict generation prompts
1844 -* Measure: Accuracy, clarity, completeness
1845 -
1846 1846  2. **Algorithm Variations:**
1738 + * Test different source scoring algorithms
1739 + * Test different confidence calculations
1740 + * Measure: Audit accuracy, user satisfaction
1847 1847  
1848 -* Test different source scoring algorithms
1849 -* Test different confidence calculations
1850 -* Measure: Audit accuracy, user satisfaction
1851 -
1852 1852  3. **Workflow Variations:**
1743 + * Test different quality gate thresholds
1744 + * Test different risk tier assignments
1745 + * Measure: Publication rate, quality scores
1853 1853  
1854 -* Test different quality gate thresholds
1855 -* Test different risk tier assignments
1856 -* Measure: Publication rate, quality scores
1857 -
1858 1858  **Implementation:**
1859 1859  
1860 1860  * **Traffic Split:** 50/50 or 90/10 splits
... ... @@ -1883,6 +1883,7 @@
1883 1883  * ✅ Statistical significance calculated
1884 1884  * ✅ Results inform system improvements
1885 1885  
1775 +
1886 1886  === FR54: Evidence Deduplication ===
1887 1887  
1888 1888  **Priority:** CRITICAL (POC2/Beta)
... ... @@ -1896,24 +1896,21 @@
1896 1896  **Deduplication Logic:**
1897 1897  
1898 1898  1. **URL Normalization:**
1789 + * Remove tracking parameters (?utm_source=...)
1790 + * Normalize http/https
1791 + * Normalize www/non-www
1792 + * Handle redirects
1899 1899  
1900 -* Remove tracking parameters (?utm_source=...)
1901 -* Normalize http/https
1902 -* Normalize www/non-www
1903 -* Handle redirects
1904 -
1905 1905  2. **Content Similarity:**
1795 + * If two sources have >90% text similarity → Same source
1796 + * If one is subset of other → Same source
1797 + * Use fuzzy matching for minor differences
1906 1906  
1907 -* If two sources have >90% text similarity → Same source
1908 -* If one is subset of other → Same source
1909 -* Use fuzzy matching for minor differences
1910 -
1911 1911  3. **Cross-Domain Syndication:**
1800 + * Detect wire service content (AP, Reuters)
1801 + * Mark as single source if syndicated
1802 + * Count original publication only
1912 1912  
1913 -* Detect wire service content (AP, Reuters)
1914 -* Mark as single source if syndicated
1915 -* Count original publication only
1916 -
1917 1917  **Display:**
1918 1918  
1919 1919  {{code}}
... ... @@ -1935,9 +1935,8 @@
1935 1935  * ✅ Unique vs. total counts accurate
1936 1936  * ✅ Improves evidence quality metrics
1937 1937  
1938 -== Additional Requirements (Lower Priority) ==
1939 1939  
1940 - FR50: OSINT Toolkit Integration ===
1826 +== Additional Requirements (Lower Priority) ===== FR50: OSINT Toolkit Integration ===
1941 1941  
1942 1942  
1943 1943  
... ... @@ -1947,7 +1947,6 @@
1947 1947  **Purpose:** Integrate open-source intelligence tools for advanced verification.
1948 1948  
1949 1949  **Tools to Integrate:**
1950 -
1951 1951  * InVID/WeVerify (video verification)
1952 1952  * Bellingcat toolkit
1953 1953  * Additional TBD based on V1.0 learnings
... ... @@ -1962,7 +1962,6 @@
1962 1962  **Purpose:** Verify video-based claims.
1963 1963  
1964 1964  **Specification:**
1965 -
1966 1966  * Keyframe extraction
1967 1967  * Reverse video search
1968 1968  * Deepfake detection (AI-powered)
... ... @@ -1979,7 +1979,6 @@
1979 1979  **Purpose:** Teach users to identify misinformation.
1980 1980  
1981 1981  **Specification:**
1982 -
1983 1983  * Interactive tutorials
1984 1984  * Practice exercises
1985 1985  * Detection quizzes
... ... @@ -1995,12 +1995,12 @@
1995 1995  **Purpose:** Share findings with IFCN/EFCSN members.
1996 1996  
1997 1997  **Specification:**
1998 -
1999 1999  * API for fact-checking organizations
2000 2000  * Structured data exchange
2001 2001  * Privacy controls
2002 2002  * Attribution requirements
2003 2003  
1886 +
2004 2004  == Summary ==
2005 2005  
2006 2006  **V1.0 Critical Requirements (Must Have):**
... ... @@ -2023,6 +2023,7 @@
2023 2023  * FR52: Detection Training
2024 2024  * FR53: Cross-Org Sharing
2025 2025  
1909 +
2026 2026  **Total:** 11 critical requirements for V1.0
2027 2027  
2028 2028  === FR54: Evidence Deduplication ===
... ... @@ -2039,24 +2039,21 @@
2039 2039  **Deduplication Logic:**
2040 2040  
2041 2041  1. **URL Normalization:**
1926 + * Remove tracking parameters (?utm_source=...)
1927 + * Normalize http/https
1928 + * Normalize www/non-www
1929 + * Handle redirects
2042 2042  
2043 -* Remove tracking parameters (?utm_source=...)
2044 -* Normalize http/https
2045 -* Normalize www/non-www
2046 -* Handle redirects
2047 -
2048 2048  2. **Content Similarity:**
1932 + * If two sources have >90% text similarity → Same source
1933 + * If one is subset of other → Same source
1934 + * Use fuzzy matching for minor differences
2049 2049  
2050 -* If two sources have >90% text similarity → Same source
2051 -* If one is subset of other → Same source
2052 -* Use fuzzy matching for minor differences
2053 -
2054 2054  3. **Cross-Domain Syndication:**
1937 + * Detect wire service content (AP, Reuters)
1938 + * Mark as single source if syndicated
1939 + * Count original publication only
2055 2055  
2056 -* Detect wire service content (AP, Reuters)
2057 -* Mark as single source if syndicated
2058 -* Count original publication only
2059 -
2060 2060  **Display:**
2061 2061  
2062 2062  {{code}}
... ... @@ -2078,9 +2078,8 @@
2078 2078  * ✅ Unique vs. total counts accurate
2079 2079  * ✅ Improves evidence quality metrics
2080 2080  
2081 -== Additional Requirements (Lower Priority) ==
2082 2082  
2083 - FR7: Automated Verdicts (Enhanced with Quality Gates) ===
1963 +== Additional Requirements (Lower Priority) ===== FR7: Automated Verdicts (Enhanced with Quality Gates) ===
2084 2084  
2085 2085  **POC1+ Enhancement:**
2086 2086  
... ... @@ -2102,7 +2102,6 @@
2102 2102  {{/code}}
2103 2103  
2104 2104  **Updated Verdict States:**
2105 -
2106 2106  * PUBLISHED
2107 2107  * INSUFFICIENT_EVIDENCE
2108 2108  * NON_FACTUAL_CLAIM
... ... @@ -2124,3 +2124,4 @@
2124 2124   Avg Source Quality: 0.73
2125 2125   Quality Score: 8.5/10
2126 2126  {{/code}}
2006 +