Changes for page Requirements
Last modified by Robert Schaub on 2026/02/08 21:32
To version 7.2
edited by Robert Schaub
on 2025/12/23 11:03
on 2025/12/23 11:03
Change comment:
Update document after refactoring.
Summary
-
Page properties (1 modified, 0 added, 0 removed)
Details
- Page properties
-
- Content
-
... ... @@ -32,7 +32,6 @@ 32 32 **Who**: Anyone (no login required) 33 33 34 34 **Can**: 35 - 36 36 * Browse and search claims 37 37 * View scenarios, evidence, verdicts, and confidence scores 38 38 * Flag issues or errors ... ... @@ -40,7 +40,6 @@ 40 40 * Submit claims automatically (new claims added if not duplicates) 41 41 42 42 **Cannot**: 43 - 44 44 * Modify content 45 45 * Access edit history details 46 46 ... ... @@ -51,7 +51,6 @@ 51 51 **Who**: Registered users (earns reputation through contributions) 52 52 53 53 **Can**: 54 - 55 55 * Everything a Reader can do 56 56 * Edit claims, evidence, and scenarios 57 57 * Add sources and citations ... ... @@ -60,7 +60,6 @@ 60 60 * Earn reputation points for quality contributions 61 61 62 62 **Reputation System**: 63 - 64 64 * New contributors: Limited edit privileges 65 65 * Established contributors (established reputation): Full edit access 66 66 * Trusted contributors (substantial reputation): Can approve certain changes ... ... @@ -68,7 +68,6 @@ 68 68 * Reputation lost through: Reverted edits, invalid flags, abuse 69 69 70 70 **Cannot**: 71 - 72 72 * Delete or hide content (only moderators) 73 73 * Override moderation decisions 74 74 ... ... @@ -79,7 +79,6 @@ 79 79 **Who**: Trusted community members with proven track record, appointed by governance board 80 80 81 81 **Can**: 82 - 83 83 * Review flagged content 84 84 * Hide harmful or abusive content 85 85 * Resolve disputes between contributors ... ... @@ -88,7 +88,6 @@ 88 88 * Access full audit logs 89 89 90 90 **Cannot**: 91 - 92 92 * Change governance rules 93 93 * Permanently ban users without board approval 94 94 * Override technical quality gates ... ... @@ -102,7 +102,6 @@ 102 102 **Not a permanent role**: Contacted externally when needed for contested claims in their domain 103 103 104 104 **When used**: 105 - 106 106 * Medical claims with life/safety implications 107 107 * Legal interpretations with significant impact 108 108 * Scientific claims with high controversy ... ... @@ -109,7 +109,6 @@ 109 109 * Technical claims requiring specialized knowledge 110 110 111 111 **Process**: 112 - 113 113 * Moderator identifies need for expert input 114 114 * Contact expert externally (don't require them to be users) 115 115 * Trusted Contributor provides written opinion with sources ... ... @@ -129,13 +129,11 @@ 129 129 **Status**: Visible to all users 130 130 131 131 **Includes**: 132 - 133 133 * AI-generated analyses (default state) 134 134 * User-contributed content 135 135 * Edited/improved content 136 136 137 137 **Quality Indicators** (displayed with content): 138 - 139 139 * **Confidence Score**: 0-100% (AI's confidence in analysis) 140 140 * **Source Quality Score**: 0-100% (based on source track record) 141 141 * **Controversy Flag**: If high dispute/edit activity ... ... @@ -145,7 +145,6 @@ 145 145 * **Review Status**: AI-generated / Human-reviewed / Expert-validated 146 146 147 147 **Automatic Warnings**: 148 - 149 149 * Confidence < 60%: "Low confidence - use caution" 150 150 * Source quality < 40%: "Sources may be unreliable" 151 151 * High controversy: "Disputed - multiple interpretations exist" ... ... @@ -158,7 +158,6 @@ 158 158 **Status**: Not visible to regular users (only to moderators) 159 159 160 160 **Reasons**: 161 - 162 162 * Spam or advertising 163 163 * Personal attacks or harassment 164 164 * Illegal content ... ... @@ -167,7 +167,6 @@ 167 167 * Abuse or harmful content 168 168 169 169 **Process**: 170 - 171 171 * Automated detection flags for moderator review 172 172 * Moderator confirms and hides 173 173 * Original author notified with reason ... ... @@ -190,7 +190,6 @@ 190 190 **AKEL is the primary system**. Human contributions supplement and train AKEL. 191 191 192 192 **AKEL Must**: 193 - 194 194 * Mark all outputs as AI-generated 195 195 * Display confidence scores prominently 196 196 * Provide source citations ... ... @@ -199,7 +199,6 @@ 199 199 * Learn from human corrections 200 200 201 201 **When AKEL Makes Errors**: 202 - 203 203 1. Capture the error pattern (what, why, how common) 204 204 2. Improve the system (better prompt, model, validation) 205 205 3. Re-process affected claims automatically ... ... @@ -230,7 +230,6 @@ 230 230 === 4.1 Source Requirements === 231 231 232 232 **Track Record Over Credentials**: 233 - 234 234 * Sources evaluated by historical accuracy 235 235 * Correction policy matters 236 236 * Independence from conflicts of interest ... ... @@ -237,7 +237,6 @@ 237 237 * Methodology transparency 238 238 239 239 **Source Quality Database**: 240 - 241 241 * Automated tracking of source accuracy 242 242 * Correction frequency 243 243 * Reliability score (updated continuously) ... ... @@ -269,7 +269,6 @@ 269 269 === 4.4 Confidence Scoring === 270 270 271 271 **Automated confidence calculation based on**: 272 - 273 273 * Source quality scores 274 274 * Evidence consistency 275 275 * Contradiction detection ... ... @@ -277,7 +277,6 @@ 277 277 * Historical accuracy of similar claims 278 278 279 279 **Thresholds**: 280 - 281 281 * < 40%: Too low to publish (needs improvement) 282 282 * 40-60%: Published with "Low confidence" warning 283 283 * 60-80%: Published as standard ... ... @@ -294,7 +294,6 @@ 294 294 === 5.1 Risk Score Calculation === 295 295 296 296 **Factors** (weighted algorithm): 297 - 298 298 * **Domain sensitivity**: Medical, legal, safety auto-flagged higher 299 299 * **Potential impact**: Views, citations, spread 300 300 * **Controversy level**: Flags, disputes, edit wars ... ... @@ -321,7 +321,6 @@ 321 321 === 6.1 Error Capture === 322 322 323 323 **When users flag errors or make corrections**: 324 - 325 325 1. What was wrong? (categorize) 326 326 2. What should it have been? 327 327 3. Why did the system fail? (root cause) ... ... @@ -340,7 +340,6 @@ 340 340 === 6.3 Quality Metrics Dashboard === 341 341 342 342 **Track continuously**: 343 - 344 344 * Error rate by category 345 345 * Source quality distribution 346 346 * Confidence score trends ... ... @@ -366,7 +366,6 @@ 366 366 === 7.2 Anomaly Detection === 367 367 368 368 **Automated alerts for**: 369 - 370 370 * Sudden quality drops 371 371 * Unusual patterns 372 372 * Contradiction clusters ... ... @@ -419,7 +419,6 @@ 419 419 **Fulfills**: UN-2 (Context-dependent verification), UN-3 (Article summary with FactHarbor analysis summary), UN-8 (Understanding disagreement) 420 420 421 421 **Automated scenario creation**: 422 - 423 423 * AKEL analyzes claim and generates likely scenarios (use-cases and contexts) 424 424 * Each scenario includes: assumptions, definitions, boundaries, evidence context 425 425 * Users can flag incorrect scenarios ... ... @@ -480,7 +480,6 @@ 480 480 **Purpose**: Provide side-by-side comparison of what a document claims vs. FactHarbor's complete analysis of its credibility 481 481 482 482 **Left Panel: Article Summary**: 483 - 484 484 * Document title, source, and claimed credibility 485 485 * "The Big Picture" - main thesis or position change 486 486 * "Key Findings" - structured summary of document's main claims ... ... @@ -488,7 +488,6 @@ 488 488 * "Conclusion" - document's bottom line 489 489 490 490 **Right Panel: FactHarbor Analysis Summary**: 491 - 492 492 * FactHarbor's independent source credibility assessment 493 493 * Claim-by-claim verdicts with confidence scores 494 494 * Methodology assessment (strengths, limitations) ... ... @@ -496,7 +496,6 @@ 496 496 * Analysis ID for reference 497 497 498 498 **Design Principles**: 499 - 500 500 * No scrolling required - both panels visible simultaneously 501 501 * Visual distinction between "what they say" and "FactHarbor's analysis" 502 502 * Color coding for verdicts (supported, uncertain, refuted) ... ... @@ -504,7 +504,6 @@ 504 504 * Mobile responsive (panels stack vertically on small screens) 505 505 506 506 **Implementation Notes**: 507 - 508 508 * Generated automatically by AKEL for every analyzed document 509 509 * Updates when verdict evolves (maintains version history) 510 510 * Exportable as standalone summary report ... ... @@ -531,8 +531,7 @@ 531 531 (% style="font-size:0.9em; color:#666;" %) 532 532 ↑ WELL SUPPORTED • 87% confidence 533 533 [[Click for evidence details →]] 534 - 535 - 505 +(%%) 536 536 ))) 537 537 538 538 The study, which followed 10,000 participants over five years, showed significant improvements in cardiovascular health markers. ... ... @@ -545,8 +545,7 @@ 545 545 ↑ UNCERTAIN • 45% confidence 546 546 Overstated - evidence shows risk reduction, not prevention 547 547 [[Click for details →]] 548 - 549 - 518 +(%%) 550 550 ))) 551 551 552 552 Dr. Maria Rodriguez, lead researcher, recommends incorporating more olive oil, fish, and vegetables into daily meals. ... ... @@ -559,8 +559,7 @@ 559 559 ↑ REFUTED • 15% confidence 560 560 Claim not supported by study design; correlation ≠ causation 561 561 [[Click for counter-evidence →]] 562 - 563 - 531 +(%%) 564 564 ))) 565 565 566 566 Participants also reported feeling more energetic and experiencing better sleep quality, though these were secondary measures. ... ... @@ -567,7 +567,6 @@ 567 567 ))) 568 568 569 569 **Legend:** 570 - 571 571 * 🟢 = Well-supported claim (confidence ≥75%) 572 572 * 🟡 = Uncertain claim (confidence 40-74%) 573 573 * 🔴 = Refuted/unsupported claim (confidence <40%) ... ... @@ -586,13 +586,11 @@ 586 586 **Confidence:** 87% 587 587 588 588 **Evidence Summary:** 589 - 590 590 * Meta-analysis of 12 RCTs confirms 23-28% risk reduction 591 591 * Consistent findings across multiple populations 592 592 * Published in peer-reviewed journal (high credibility) 593 593 594 594 **Uncertainty Factors:** 595 - 596 596 * Exact percentage varies by study (20-30% range) 597 597 598 598 [[View Full Analysis →]] ... ... @@ -599,7 +599,6 @@ 599 599 ))) 600 600 601 601 **Color-Coding System**: 602 - 603 603 * **Green**: Well-supported claims (confidence ≥75%, strong evidence) 604 604 * **Yellow/Orange**: Uncertain claims (confidence 40-74%, conflicting or limited evidence) 605 605 * **Red**: Refuted or unsupported claims (confidence <40%, contradicted by evidence) ... ... @@ -609,12 +609,8 @@ 609 609 610 610 (% style="width:100%; border-collapse:collapse;" %) 611 611 |=**Article Text**|=**Status**|=**Analysis** 612 -|((( 613 -A recent study published in the Journal of Nutrition has revealed new findings about the Mediterranean diet. 614 -)))|(% style="text-align:center;" %)Plain text|(% style="font-style:italic; color:#888;" %)Context - no highlighting 615 -|((( 616 -//Researchers found that Mediterranean diet followers had a 25% lower risk of heart disease compared to control groups// 617 -)))|(% style="background-color:#D4EDDA; text-align:center; padding:8px;" %)🟢 **WELL SUPPORTED**|((( 576 +|(((A recent study published in the Journal of Nutrition has revealed new findings about the Mediterranean diet.)))|(% style="text-align:center;" %)Plain text|(% style="font-style:italic; color:#888;" %)Context - no highlighting 577 +|(((//Researchers found that Mediterranean diet followers had a 25% lower risk of heart disease compared to control groups//)))|(% style="background-color:#D4EDDA; text-align:center; padding:8px;" %)🟢 **WELL SUPPORTED**|((( 618 618 **87% confidence** 619 619 620 620 Meta-analysis of 12 RCTs confirms 23-28% risk reduction ... ... @@ -621,12 +621,8 @@ 621 621 622 622 [[View Full Analysis]] 623 623 ))) 624 -|((( 625 -The study, which followed 10,000 participants over five years, showed significant improvements in cardiovascular health markers. 626 -)))|(% style="text-align:center;" %)Plain text|(% style="font-style:italic; color:#888;" %)Methodology - no highlighting 627 -|((( 628 -//Some experts believe this diet can completely prevent heart attacks// 629 -)))|(% style="background-color:#FFF3CD; text-align:center; padding:8px;" %)🟡 **UNCERTAIN**|((( 584 +|(((The study, which followed 10,000 participants over five years, showed significant improvements in cardiovascular health markers.)))|(% style="text-align:center;" %)Plain text|(% style="font-style:italic; color:#888;" %)Methodology - no highlighting 585 +|(((//Some experts believe this diet can completely prevent heart attacks//)))|(% style="background-color:#FFF3CD; text-align:center; padding:8px;" %)🟡 **UNCERTAIN**|((( 630 630 **45% confidence** 631 631 632 632 Overstated - evidence shows risk reduction, not prevention ... ... @@ -633,12 +633,8 @@ 633 633 634 634 [[View Details]] 635 635 ))) 636 -|((( 637 -Dr. Rodriguez recommends incorporating more olive oil, fish, and vegetables into daily meals. 638 -)))|(% style="text-align:center;" %)Plain text|(% style="font-style:italic; color:#888;" %)Recommendation - no highlighting 639 -|((( 640 -//The study proves that saturated fats cause heart disease// 641 -)))|(% style="background-color:#F8D7DA; text-align:center; padding:8px;" %)🔴 **REFUTED**|((( 592 +|(((Dr. Rodriguez recommends incorporating more olive oil, fish, and vegetables into daily meals.)))|(% style="text-align:center;" %)Plain text|(% style="font-style:italic; color:#888;" %)Recommendation - no highlighting 593 +|(((//The study proves that saturated fats cause heart disease//)))|(% style="background-color:#F8D7DA; text-align:center; padding:8px;" %)🔴 **REFUTED**|((( 642 642 **15% confidence** 643 643 644 644 Claim not supported by study; correlation ≠ causation ... ... @@ -647,7 +647,6 @@ 647 647 ))) 648 648 649 649 **Design Notes:** 650 - 651 651 * Highlighted claims use italics to distinguish from plain text 652 652 * Color backgrounds match XWiki message box colors (success/warning/error) 653 653 * Status column shows verdict prominently ... ... @@ -654,7 +654,6 @@ 654 654 * Analysis column provides quick summary with link to details 655 655 656 656 **User Actions**: 657 - 658 658 * **Hover** over highlighted claim → Tooltip appears 659 659 * **Click** highlighted claim → Detailed analysis modal/panel 660 660 * **Toggle** button to turn highlighting on/off ... ... @@ -661,18 +661,16 @@ 661 661 * **Keyboard**: Tab through highlighted claims 662 662 663 663 **Interaction Design**: 664 - 665 665 * Hover/click on highlighted claim → Show tooltip with: 666 -* Claim text 667 -* Verdict (e.g., "WELL SUPPORTED") 668 -* Confidence score (e.g., "85%") 669 -* Brief evidence summary 670 -* Link to detailed analysis 615 + * Claim text 616 + * Verdict (e.g., "WELL SUPPORTED") 617 + * Confidence score (e.g., "85%") 618 + * Brief evidence summary 619 + * Link to detailed analysis 671 671 * Toggle highlighting on/off (user preference) 672 672 * Adjustable color intensity for accessibility 673 673 674 674 **Technical Requirements**: 675 - 676 676 * Real-time highlighting as page loads (non-blocking) 677 677 * Claim boundary detection (start/end of assertion) 678 678 * Handle nested or overlapping claims ... ... @@ -680,19 +680,16 @@ 680 680 * Work with various content formats (HTML, plain text, PDFs) 681 681 682 682 **Performance Requirements**: 683 - 684 684 * Highlighting renders within 500ms of page load 685 685 * No perceptible delay in reading experience 686 686 * Efficient DOM manipulation (avoid reflows) 687 687 688 688 **Accessibility**: 689 - 690 690 * Color-blind friendly palette (use patterns/icons in addition to color) 691 691 * Screen reader compatible (ARIA labels for claim credibility) 692 692 * Keyboard navigation to highlighted claims 693 693 694 694 **Implementation Notes**: 695 - 696 696 * Claims extracted and analyzed by AKEL during initial processing 697 697 * Highlighting data stored as annotations with byte offsets 698 698 * Client-side rendering of highlights based on verdict data ... ... @@ -705,7 +705,6 @@ 705 705 **Fulfills**: UN-1 (Fast access to verified content), UN-16 (Clear review status) 706 706 707 707 **Simple flow**: 708 - 709 709 1. Claim submitted 710 710 2. AKEL processes (automated) 711 711 3. If confidence > threshold: Publish (labeled as AI-generated) ... ... @@ -717,7 +717,6 @@ 717 717 ==== FR10 — Moderation ==== 718 718 719 719 **Focus on abuse, not routine quality**: 720 - 721 721 * Automated abuse detection 722 722 * Moderators handle flags 723 723 * Quick response to harmful content ... ... @@ -788,7 +788,6 @@ 788 788 **Purpose:** Ensure extracted claims are factual assertions (not opinions/predictions) 789 789 790 790 **Checks:** 791 - 792 792 1. **Factual Statement Test:** Is this verifiable? (Yes/No) 793 793 2. **Opinion Detection:** Contains hedging language? ("I think", "probably", "best") 794 794 3. **Future Prediction Test:** Makes claims about future events? ... ... @@ -795,7 +795,6 @@ 795 795 4. **Specificity Score:** Contains specific entities, numbers, dates? 796 796 797 797 **Thresholds:** 798 - 799 799 * Factual: Must be "Yes" 800 800 * Opinion markers: <2 hedging phrases 801 801 * Specificity: ≥3 specific elements ... ... @@ -807,13 +807,11 @@ 807 807 **Purpose:** Ensure AI-linked evidence actually relates to claim 808 808 809 809 **Checks:** 810 - 811 811 1. **Semantic Similarity Score:** Evidence vs. claim (embeddings) 812 812 2. **Entity Overlap:** Shared people/places/things? 813 813 3. **Topic Relevance:** Discusses claim subject? 814 814 815 815 **Thresholds:** 816 - 817 817 * Similarity: ≥0.6 (cosine similarity) 818 818 * Entity overlap: ≥1 shared entity 819 819 * Topic relevance: ≥0.5 ... ... @@ -825,13 +825,11 @@ 825 825 **Purpose:** Validate scenario assumptions are logical and complete 826 826 827 827 **Checks:** 828 - 829 829 1. **Completeness:** All required fields populated 830 830 2. **Internal Consistency:** Assumptions don't contradict 831 831 3. **Distinguishability:** Scenarios meaningfully different 832 832 833 833 **Thresholds:** 834 - 835 835 * Required fields: 100% 836 836 * Contradiction score: <0.3 837 837 * Scenario similarity: <0.8 ... ... @@ -843,7 +843,6 @@ 843 843 **Purpose:** Only publish high-confidence verdicts 844 844 845 845 **Checks:** 846 - 847 847 1. **Evidence Count:** Minimum 2 sources 848 848 2. **Source Quality:** Average reliability ≥0.6 849 849 3. **Evidence Agreement:** Supporting vs. contradicting ≥0.6 ... ... @@ -850,7 +850,6 @@ 850 850 4. **Uncertainty Factors:** Hedging in reasoning 851 851 852 852 **Confidence Tiers:** 853 - 854 854 * **HIGH (80-100%):** ≥3 sources, ≥0.7 quality, ≥80% agreement 855 855 * **MEDIUM (50-79%):** ≥2 sources, ≥0.6 quality, ≥60% agreement 856 856 * **LOW (0-49%):** <2 sources OR low quality/agreement ... ... @@ -857,13 +857,11 @@ 857 857 * **INSUFFICIENT:** <2 sources → DO NOT PUBLISH 858 858 859 859 **Implementation Phases:** 860 - 861 861 * **POC1:** Gates 1 & 4 only (basic validation) 862 862 * **POC2:** All 4 gates (complete framework) 863 863 * **V1.0:** Hardened with <5% hallucination rate 864 864 865 865 **Acceptance Criteria:** 866 - 867 867 * ✅ All gates operational 868 868 * ✅ Hallucination rate <5% 869 869 * ✅ Quality metrics public ... ... @@ -881,7 +881,6 @@ 881 881 ==== API Security ==== 882 882 883 883 **Rate Limiting:** 884 - 885 885 * **Analysis endpoints:** 100 requests/hour per IP 886 886 * **Read endpoints:** 1,000 requests/hour per IP 887 887 * **Search:** 500 requests/hour per IP ... ... @@ -889,24 +889,21 @@ 889 889 * **Burst protection:** Max 10 requests/second 890 890 891 891 **Authentication & Authorization:** 892 - 893 893 * **API Keys:** Required for programmatic access 894 894 * **JWT tokens:** For user sessions (1-hour expiry) 895 895 * **OAuth2:** For third-party integrations 896 896 * **Role-Based Access Control (RBAC):** 897 -* Public: Read-only access to published claims 898 -* Contributor: Submit claims, provide evidence 899 -* Moderator: Review contributions, manage quality 900 -* Admin: System configuration, user management 828 + * Public: Read-only access to published claims 829 + * Contributor: Submit claims, provide evidence 830 + * Moderator: Review contributions, manage quality 831 + * Admin: System configuration, user management 901 901 902 902 **CORS Policies:** 903 - 904 904 * Whitelist approved domains only 905 905 * No wildcard origins in production 906 906 * Credentials required for sensitive endpoints 907 907 908 908 **Input Sanitization:** 909 - 910 910 * Validate all user input against schemas 911 911 * Sanitize HTML/JavaScript in text submissions 912 912 * Prevent SQL injection (use parameterized queries) ... ... @@ -914,12 +914,11 @@ 914 914 * Max request size: 10MB 915 915 * File upload restrictions: Whitelist file types, scan for malware 916 916 917 ---- --846 +--- 918 918 919 919 ==== Data Security ==== 920 920 921 921 **Encryption at Rest:** 922 - 923 923 * Database encryption using AES-256 924 924 * Encrypted backups 925 925 * Key management via cloud provider KMS (AWS KMS, Google Cloud KMS) ... ... @@ -926,7 +926,6 @@ 926 926 * Regular key rotation (90-day cycle) 927 927 928 928 **Encryption in Transit:** 929 - 930 930 * HTTPS/TLS 1.3 only (no TLS 1.0/1.1) 931 931 * Strong cipher suites only 932 932 * HSTS (HTTP Strict Transport Security) enabled ... ... @@ -933,7 +933,6 @@ 933 933 * Certificate pinning for mobile apps 934 934 935 935 **Secure Credential Storage:** 936 - 937 937 * Passwords hashed with bcrypt (cost factor 12+) 938 938 * API keys encrypted in database 939 939 * Secrets stored in environment variables (never in code) ... ... @@ -940,13 +940,12 @@ 940 940 * Use secrets manager (AWS Secrets Manager, HashiCorp Vault) 941 941 942 942 **Data Privacy:** 943 - 944 944 * Minimal data collection (privacy by design) 945 945 * User data deletion on request (GDPR compliance) 946 946 * PII encryption in database 947 947 * Anonymize logs (no PII in log files) 948 948 949 ---- --874 +--- 950 950 951 951 ==== Application Security ==== 952 952 ... ... @@ -964,7 +964,6 @@ 964 964 10. **Server-Side Request Forgery:** URL validation, whitelist domains 965 965 966 966 **Security Headers:** 967 - 968 968 * `Content-Security-Policy`: Strict CSP to prevent XSS 969 969 * `X-Frame-Options`: DENY (prevent clickjacking) 970 970 * `X-Content-Type-Options`: nosniff ... ... @@ -972,7 +972,6 @@ 972 972 * `Permissions-Policy`: Restrict browser features 973 973 974 974 **Dependency Vulnerability Scanning:** 975 - 976 976 * **Tools:** Snyk, Dependabot, npm audit, pip-audit 977 977 * **Frequency:** Daily automated scans 978 978 * **Action:** Patch critical vulnerabilities within 24 hours ... ... @@ -979,34 +979,30 @@ 979 979 * **Policy:** No known high/critical CVEs in production 980 980 981 981 **Security Audits:** 982 - 983 983 * **Internal:** Quarterly security reviews 984 984 * **External:** Annual penetration testing by certified firm 985 985 * **Bug Bounty:** Public bug bounty program (V1.1+) 986 986 * **Compliance:** SOC 2 Type II certification target (V1.5) 987 987 988 ---- --910 +--- 989 989 990 990 ==== Operational Security ==== 991 991 992 992 **DDoS Protection:** 993 - 994 994 * CloudFlare or AWS Shield 995 995 * Rate limiting at CDN layer 996 996 * Automatic IP blocking for abuse patterns 997 997 998 998 **Monitoring & Alerting:** 999 - 1000 1000 * Real-time security event monitoring 1001 1001 * Alerts for: 1002 -* Failed login attempts (>5 in 10 minutes) 1003 -* API abuse patterns 1004 -* Unusual data access patterns 1005 -* Security scan detections 922 + * Failed login attempts (>5 in 10 minutes) 923 + * API abuse patterns 924 + * Unusual data access patterns 925 + * Security scan detections 1006 1006 * Integration with SIEM (Security Information and Event Management) 1007 1007 1008 1008 **Incident Response:** 1009 - 1010 1010 * Documented incident response plan 1011 1011 * Security incident classification (P1-P4) 1012 1012 * On-call rotation for security issues ... ... @@ -1014,18 +1014,16 @@ 1014 1014 * Public disclosure policy (coordinated disclosure) 1015 1015 1016 1016 **Backup & Recovery:** 1017 - 1018 1018 * Daily encrypted backups 1019 1019 * 30-day retention period 1020 1020 * Tested recovery procedures (quarterly) 1021 1021 * Disaster recovery plan (RTO: 4 hours, RPO: 1 hour) 1022 1022 1023 ---- --941 +--- 1024 1024 1025 1025 ==== Compliance & Standards ==== 1026 1026 1027 1027 **GDPR Compliance:** 1028 - 1029 1029 * User consent management 1030 1030 * Right to access data 1031 1031 * Right to deletion ... ... @@ -1033,7 +1033,6 @@ 1033 1033 * Privacy policy published 1034 1034 1035 1035 **Accessibility:** 1036 - 1037 1037 * WCAG 2.1 AA compliance 1038 1038 * Screen reader compatibility 1039 1039 * Keyboard navigation ... ... @@ -1040,7 +1040,6 @@ 1040 1040 * Alt text for images 1041 1041 1042 1042 **Browser Support:** 1043 - 1044 1044 * Modern browsers only (Chrome/Edge/Firefox/Safari latest 2 versions) 1045 1045 * No IE11 support 1046 1046 ... ... @@ -1055,6 +1055,7 @@ 1055 1055 * ✅ Incident response plan documented and tested 1056 1056 * ✅ 95% uptime over 30-day period 1057 1057 973 + 1058 1058 === NFR13: Quality Metrics Transparency === 1059 1059 1060 1060 **Fulfills:** User trust, transparency, continuous improvement, IFCN methodology transparency ... ... @@ -1069,18 +1069,16 @@ 1069 1069 1070 1070 **Core Metrics to Display:** 1071 1071 1072 -* 1073 -** 1074 -**1. Verdict Quality Metrics 988 +**1. Verdict Quality Metrics** 1075 1075 1076 1076 **TIGERScore (Fact-Checking Quality):** 1077 - 1078 1078 * **Definition:** Measures how well generated verdicts match expert fact-checker judgments 1079 1079 * **Scale:** 0-100 (higher is better) 1080 1080 * **Calculation:** Using TIGERScore framework (Truth-conditional accuracy, Informativeness, Generality, Evaluativeness, Relevance) 1081 1081 * **Target:** Average ≥80 for production release 1082 1082 * **Display:** 1083 -{{code}}Verdict Quality (TIGERScore): 996 +{{code}} 997 +Verdict Quality (TIGERScore): 1084 1084 Overall: 84.2 ▲ (+2.1 from last month) 1085 1085 1086 1086 Distribution: ... ... @@ -1088,18 +1088,19 @@ 1088 1088 Good (60-80): 28% 1089 1089 Needs Improvement (<60): 5% 1090 1090 1091 -Trend: [Graph showing improvement over time]{{/code}} 1005 +Trend: [Graph showing improvement over time] 1006 +{{/code}} 1092 1092 1093 1093 **2. Hallucination & Faithfulness Metrics** 1094 1094 1095 1095 **AlignScore (Faithfulness to Evidence):** 1096 - 1097 1097 * **Definition:** Measures how well verdicts align with actual evidence content 1098 1098 * **Scale:** 0-1 (higher is better) 1099 1099 * **Purpose:** Detect AI hallucinations (making claims not supported by evidence) 1100 1100 * **Target:** Average ≥0.85, hallucination rate <5% 1101 1101 * **Display:** 1102 -{{code}}Evidence Faithfulness (AlignScore): 1016 +{{code}} 1017 +Evidence Faithfulness (AlignScore): 1103 1103 Average: 0.87 ▼ (-0.02 from last month) 1104 1104 1105 1105 Hallucination Rate: 4.2% ... ... @@ -1106,24 +1106,24 @@ 1106 1106 - Claims without evidence support: 3.1% 1107 1107 - Misrepresented evidence: 1.1% 1108 1108 1109 -Action: Prompt engineering review scheduled{{/code}} 1024 +Action: Prompt engineering review scheduled 1025 +{{/code}} 1110 1110 1111 1111 **3. Evidence Quality Metrics** 1112 1112 1113 1113 **Source Reliability:** 1114 - 1115 1115 * Average source quality score (0-1 scale) 1116 1116 * Distribution of high/medium/low quality sources 1117 1117 * Publisher track record trends 1118 1118 1119 1119 **Evidence Coverage:** 1120 - 1121 1121 * Average number of sources per claim 1122 1122 * Percentage of claims with ≥2 sources (EFCSN minimum) 1123 1123 * Geographic diversity of sources 1124 1124 1125 1125 **Display:** 1126 -{{code}}Evidence Quality: 1040 +{{code}} 1041 +Evidence Quality: 1127 1127 1128 1128 Average Sources per Claim: 4.2 1129 1129 Claims with ≥2 sources: 94% (EFCSN compliant) ... ... @@ -1133,23 +1133,24 @@ 1133 1133 Medium quality (0.5-0.8): 43% 1134 1134 Low quality (<0.5): 9% 1135 1135 1136 -Geographic Diversity: 23 countries represented{{/code}} 1051 +Geographic Diversity: 23 countries represented 1052 +{{/code}} 1137 1137 1138 1138 **4. Contributor Consensus Metrics** (when human reviewers involved) 1139 1139 1140 1140 **Inter-Rater Reliability (IRR):** 1141 - 1142 1142 * **Calculation:** Cohen's Kappa or Fleiss' Kappa for multiple raters 1143 1143 * **Scale:** 0-1 (higher is better) 1144 1144 * **Interpretation:** 1145 -* >0.8: Almost perfect agreement 1146 -* 0.6-0.8: Substantial agreement 1147 -* 0.4-0.6: Moderate agreement 1148 -* <0.4: Poor agreement 1060 + * >0.8: Almost perfect agreement 1061 + * 0.6-0.8: Substantial agreement 1062 + * 0.4-0.6: Moderate agreement 1063 + * <0.4: Poor agreement 1149 1149 * **Target:** Maintain ≥0.7 (substantial agreement) 1150 1150 1151 1151 **Display:** 1152 -{{code}}Contributor Consensus: 1067 +{{code}} 1068 +Contributor Consensus: 1153 1153 1154 1154 Inter-Rater Reliability (IRR): 0.73 (Substantial agreement) 1155 1155 - Verdict agreement: 78% ... ... @@ -1157,9 +1157,10 @@ 1157 1157 - Scenario structure agreement: 69% 1158 1158 1159 1159 Cases requiring moderator review: 12 1160 -Moderator override rate: 8%{{/code}} 1076 +Moderator override rate: 8% 1077 +{{/code}} 1161 1161 1162 ---- --1079 +--- 1163 1163 1164 1164 ==== Quality Dashboard Implementation ==== 1165 1165 ... ... @@ -1166,7 +1166,6 @@ 1166 1166 **Dashboard Location:** `/quality-metrics` 1167 1167 1168 1168 **Update Frequency:** 1169 - 1170 1170 * **POC2:** Weekly manual updates 1171 1171 * **Beta 0:** Daily automated updates 1172 1172 * **V1.0:** Real-time metrics (updated hourly) ... ... @@ -1216,7 +1216,7 @@ 1216 1216 1217 1217 {{/code}} 1218 1218 1219 ---- --1135 +--- 1220 1220 1221 1221 ==== Continuous Improvement Feedback Loop ==== 1222 1222 ... ... @@ -1223,36 +1223,31 @@ 1223 1223 **How Metrics Inform AKEL Improvements:** 1224 1224 1225 1225 1. **Identify Weak Areas:** 1142 + * Low TIGERScore → Review prompt engineering 1143 + * High hallucination → Strengthen evidence grounding 1144 + * Low IRR → Clarify evaluation criteria 1226 1226 1227 -* Low TIGERScore → Review prompt engineering 1228 -* High hallucination → Strengthen evidence grounding 1229 -* Low IRR → Clarify evaluation criteria 1230 - 1231 1231 2. **A/B Testing Integration:** 1147 + * Test prompt variations 1148 + * Measure impact on quality metrics 1149 + * Deploy winners automatically 1232 1232 1233 -* Test prompt variations 1234 -* Measure impact on quality metrics 1235 -* Deploy winners automatically 1236 - 1237 1237 3. **Alert Thresholds:** 1152 + * TIGERScore drops below 75 → Alert team 1153 + * Hallucination rate exceeds 7% → Pause auto-publishing 1154 + * IRR below 0.6 → Moderator training needed 1238 1238 1239 -* TIGERScore drops below 75 → Alert team 1240 -* Hallucination rate exceeds 7% → Pause auto-publishing 1241 -* IRR below 0.6 → Moderator training needed 1242 - 1243 1243 4. **Monthly Quality Reviews:** 1157 + * Analyze trends 1158 + * Identify systematic issues 1159 + * Plan prompt improvements 1160 + * Update AKEL models 1244 1244 1245 -* Analyze trends 1246 -* Identify systematic issues 1247 -* Plan prompt improvements 1248 -* Update AKEL models 1162 +--- 1249 1249 1250 ------ 1251 - 1252 1252 ==== Metric Calculation Details ==== 1253 1253 1254 1254 **TIGERScore Implementation:** 1255 - 1256 1256 * Reference: https://github.com/TIGER-AI-Lab/TIGERScore 1257 1257 * Input: Generated verdict + reference verdict (from expert) 1258 1258 * Output: 0-100 score across 5 dimensions ... ... @@ -1259,7 +1259,6 @@ 1259 1259 * Requires: Test set of expert-reviewed claims (minimum 100) 1260 1260 1261 1261 **AlignScore Implementation:** 1262 - 1263 1263 * Reference: https://github.com/yuh-zha/AlignScore 1264 1264 * Input: Generated verdict + source evidence text 1265 1265 * Output: 0-1 faithfulness score ... ... @@ -1266,12 +1266,11 @@ 1266 1266 * Calculation: Semantic alignment between claim and evidence 1267 1267 1268 1268 **Source Quality Scoring:** 1269 - 1270 1270 * Use existing source reliability database (e.g., NewsGuard, MBFC) 1271 1271 * Factor in: Publication history, corrections record, transparency 1272 1272 * Scale: 0-1 (weighted average across sources) 1273 1273 1274 ---- --1183 +--- 1275 1275 1276 1276 ==== Integration Points ==== 1277 1277 ... ... @@ -1302,13 +1302,11 @@ 1302 1302 == 14. Related Pages == 1303 1303 1304 1304 **Non-Functional Requirements (see Section 9):** 1305 - 1306 1306 * [[NFR11 — AKEL Quality Assurance Framework>>#NFR11]] 1307 1307 * [[NFR12 — Security Controls>>#NFR12]] 1308 1308 * [[NFR13 — Quality Metrics Transparency>>#NFR13]] 1309 1309 1310 1310 **Other Requirements:** 1311 - 1312 1312 * [[User Needs>>FactHarbor.Specification.Requirements.User Needs.WebHome]] 1313 1313 * [[V1.0 Requirements>>FactHarbor.Specification.Requirements.V10.]] 1314 1314 * [[Gap Analysis>>FactHarbor.Specification.Requirements.GapAnalysis]] ... ... @@ -1318,7 +1318,7 @@ 1318 1318 * [[Data Model>>FactHarbor.Specification.Data Model.WebHome]] - Data structures supporting requirements 1319 1319 * [[Workflows>>FactHarbor.Specification.Workflows.WebHome]] - User interaction workflows 1320 1320 * [[AKEL>>FactHarbor.Specification.AI Knowledge Extraction Layer (AKEL).WebHome]] - AI system fulfilling automation requirements 1321 -* [[Global Rules>> Archive.FactHarbor.Organisation.How-We-Work-Together.GlobalRules.WebHome]]1228 +* [[Global Rules>>FactHarbor.Organisation.How-We-Work-Together.GlobalRules.WebHome]] 1322 1322 * [[Privacy Policy>>FactHarbor.Organisation.How-We-Work-Together.Privacy-Policy]] 1323 1323 1324 1324 = V0.9.70 Additional Requirements = ... ... @@ -1378,7 +1378,6 @@ 1378 1378 **FactHarbor-Specific Mapping:** 1379 1379 1380 1380 **Likelihood Score to Rating Scale:** 1381 - 1382 1382 * 80-100% likelihood → 5 (Highly Supported) 1383 1383 * 60-79% likelihood → 4 (Supported) 1384 1384 * 40-59% likelihood → 3 (Mixed/Uncertain) ... ... @@ -1386,7 +1386,6 @@ 1386 1386 * 0-19% likelihood → 1 (Refuted) 1387 1387 1388 1388 **Multiple Scenarios Handling:** 1389 - 1390 1390 * If claim has multiple scenarios with different verdicts, generate **separate ClaimReview** for each scenario 1391 1391 * Add `disambiguatingDescription` field explaining scenario context 1392 1392 * Example: "Scenario: If interpreted as referring to 2023 data..." ... ... @@ -1419,6 +1419,7 @@ 1419 1419 * ✅ All required fields populated with correct data types 1420 1420 * ✅ Handles multi-scenario claims correctly (separate ClaimReview per scenario) 1421 1421 1327 + 1422 1422 === FR45: User Corrections Notification System === 1423 1423 1424 1424 **Fulfills:** IFCN Principle 5 (Open & Honest Corrections), EFCSN compliance ... ... @@ -1440,9 +1440,7 @@ 1440 1440 1441 1441 ==== Notification Mechanisms ==== 1442 1442 1443 -* 1444 -** 1445 -**1. In-Page Banner: 1349 +**1. In-Page Banner:** 1446 1446 1447 1447 Display prominent banner on claim page: 1448 1448 ... ... @@ -1462,10 +1462,10 @@ 1462 1462 1463 1463 * Public changelog at `/claims/{id}/corrections` 1464 1464 * Displays for each correction: 1465 -* Date/time of correction 1466 -* What changed (before/after comparison) 1467 -* Why changed (reason if provided) 1468 -* Who made change (AKEL auto-update vs. contributor override) 1369 + * Date/time of correction 1370 + * What changed (before/after comparison) 1371 + * Why changed (reason if provided) 1372 + * Who made change (AKEL auto-update vs. contributor override) 1469 1469 1470 1470 **3. Email Notifications (opt-in):** 1471 1471 ... ... @@ -1509,6 +1509,7 @@ 1509 1509 * ✅ Mobile-responsive banner design 1510 1510 * ✅ Accessible (screen reader compatible) 1511 1511 1416 + 1512 1512 === FR46: Image Verification System === 1513 1513 1514 1514 **Fulfills:** UN-27 (Visual Claim Verification) ... ... @@ -1526,25 +1526,23 @@ 1526 1526 **Purpose:** Find earlier uses of the image to verify context 1527 1527 1528 1528 **Implementation:** 1529 - 1530 1530 * Integrate APIs: 1531 -* **Google Vision AI** (reverse search) 1532 -* **TinEye** (oldest known uses) 1533 -* **Bing Visual Search** (broad coverage) 1435 + * **Google Vision AI** (reverse search) 1436 + * **TinEye** (oldest known uses) 1437 + * **Bing Visual Search** (broad coverage) 1534 1534 1535 1535 **Process:** 1536 - 1537 1537 1. Extract image from claim or user upload 1538 1538 2. Query multiple reverse search services 1539 1539 3. Analyze results for: 1443 + * Earliest known publication 1444 + * Original context (what was it really showing?) 1445 + * Publication timeline 1446 + * Geographic spread 1540 1540 1541 -* Earliest known publication 1542 -* Original context (what was it really showing?) 1543 -* Publication timeline 1544 -* Geographic spread 1545 - 1546 1546 **Output:** 1547 -{{code}}Reverse Image Search Results: 1449 +{{code}} 1450 +Reverse Image Search Results: 1548 1548 1549 1549 Earliest known use: 2019-03-15 (5 years before claim) 1550 1550 Original context: "Photo from 2019 flooding in Mumbai" ... ... @@ -1557,9 +1557,10 @@ 1557 1557 • 2020-07-22: Bangladesh monsoon 1558 1558 • 2024-10-15: Current claim (misattributed) 1559 1559 1560 -[View full timeline]{{/code}} 1463 +[View full timeline] 1464 +{{/code}} 1561 1561 1562 ---- --1466 +--- 1563 1563 1564 1564 **Method 2: AI Manipulation Detection** 1565 1565 ... ... @@ -1566,41 +1566,36 @@ 1566 1566 **Purpose:** Detect deepfakes, face swaps, and digital alterations 1567 1567 1568 1568 **Implementation:** 1569 - 1570 1570 * Integrate detection services: 1571 -* **Sensity AI** (deepfake detection) 1572 -* **Reality Defender** (multimodal analysis) 1573 -* **AWS Rekognition** (face detection inconsistencies) 1474 + * **Sensity AI** (deepfake detection) 1475 + * **Reality Defender** (multimodal analysis) 1476 + * **AWS Rekognition** (face detection inconsistencies) 1574 1574 1575 1575 **Detection Categories:** 1576 - 1577 1577 1. **Face Manipulation:** 1480 + * Deepfake face swaps 1481 + * Expression manipulation 1482 + * Identity replacement 1578 1578 1579 -* Deepfake face swaps 1580 -* Expression manipulation 1581 -* Identity replacement 1582 - 1583 1583 2. **Image Manipulation:** 1485 + * Copy-paste artifacts 1486 + * Clone stamp detection 1487 + * Content-aware fill detection 1488 + * JPEG compression inconsistencies 1584 1584 1585 -* Copy-paste artifacts 1586 -* Clone stamp detection 1587 -* Content-aware fill detection 1588 -* JPEG compression inconsistencies 1589 - 1590 1590 3. **AI Generation:** 1491 + * Detect fully AI-generated images 1492 + * Identify generation artifacts 1493 + * Check for model signatures 1591 1591 1592 -* Detect fully AI-generated images 1593 -* Identify generation artifacts 1594 -* Check for model signatures 1595 - 1596 1596 **Confidence Scoring:** 1597 - 1598 1598 * **HIGH (80-100%):** Strong evidence of manipulation 1599 1599 * **MEDIUM (50-79%):** Suspicious artifacts detected 1600 1600 * **LOW (0-49%):** Minor inconsistencies or inconclusive 1601 1601 1602 1602 **Output:** 1603 -{{code}}Manipulation Analysis: 1501 +{{code}} 1502 +Manipulation Analysis: 1604 1604 1605 1605 Face Manipulation: LOW RISK (12%) 1606 1606 Image Editing: MEDIUM RISK (64%) ... ... @@ -1609,9 +1609,10 @@ 1609 1609 1610 1610 AI Generation: LOW RISK (8%) 1611 1611 1612 -⚠️ Possible manipulation detected. Manual review recommended.{{/code}} 1511 +⚠️ Possible manipulation detected. Manual review recommended. 1512 +{{/code}} 1613 1613 1614 ---- --1514 +--- 1615 1615 1616 1616 **Method 3: Metadata Analysis (EXIF)** 1617 1617 ... ... @@ -1618,7 +1618,6 @@ 1618 1618 **Purpose:** Extract technical details that may reveal manipulation or misattribution 1619 1619 1620 1620 **Extracted Data:** 1621 - 1622 1622 * **Camera/Device:** Make, model, software 1623 1623 * **Timestamps:** Original date, modification dates 1624 1624 * **Location:** GPS coordinates (if present) ... ... @@ -1626,7 +1626,6 @@ 1626 1626 * **File Properties:** Resolution, compression, format conversions 1627 1627 1628 1628 **Red Flags:** 1629 - 1630 1630 * Metadata completely stripped (suspicious) 1631 1631 * Timestamp conflicts with claimed date 1632 1632 * GPS location conflicts with claimed location ... ... @@ -1634,7 +1634,8 @@ 1634 1634 * Creation date after modification date (impossible) 1635 1635 1636 1636 **Output:** 1637 -{{code}}Image Metadata: 1535 +{{code}} 1536 +Image Metadata: 1638 1638 1639 1639 Camera: iPhone 14 Pro 1640 1640 Original date: 2023-08-12 14:32:15 ... ... @@ -1646,20 +1646,19 @@ 1646 1646 Claim says: "Taken in Los Angeles" 1647 1647 EXIF says: New York City 1648 1648 1649 -⚠️ Edited 14 months after capture{{/code}} 1548 +⚠️ Edited 14 months after capture 1549 +{{/code}} 1650 1650 1651 ---- --1551 +--- 1652 1652 1653 1653 ==== Verification Workflow ==== 1654 1654 1655 1655 **Automatic Triggers:** 1656 - 1657 1657 1. User submits claim with image 1658 1658 2. Article being analyzed contains images 1659 1659 3. Social media post includes photos 1660 1660 1661 1661 **Process:** 1662 - 1663 1663 1. Extract images from content 1664 1664 2. Run all 3 verification methods in parallel 1665 1665 3. Aggregate results into confidence score ... ... @@ -1694,16 +1694,14 @@ 1694 1694 ==== Cost Considerations ==== 1695 1695 1696 1696 **API Costs (estimated per image):** 1697 - 1698 1698 * Google Vision AI: $0.001-0.003 1699 1699 * TinEye: $0.02 (commercial API) 1700 1700 * Sensity AI: $0.05-0.10 1701 1701 * AWS Rekognition: $0.001-0.002 1702 1702 1703 -**Total per image:** $0.07-0.15 **1600 +**Total per image:** ~$0.07-0.15 1704 1704 1705 1705 **Mitigation Strategies:** 1706 - 1707 1707 * Cache results for duplicate images 1708 1708 * Use free tier quotas where available 1709 1709 * Prioritize higher-value claims for deep analysis ... ... @@ -1718,6 +1718,7 @@ 1718 1718 * ✅ Mobile-friendly image comparison interface 1719 1719 * ✅ False positive rate <15% 1720 1720 1617 + 1721 1721 === FR47: Archive.org Integration === 1722 1722 1723 1723 **Priority:** CRITICAL ... ... @@ -1731,7 +1731,6 @@ 1731 1731 **Automatic Archiving:** 1732 1732 1733 1733 When AKEL links evidence: 1734 - 1735 1735 1. Check if URL already archived (Wayback Machine API) 1736 1736 2. If not, submit for archiving (Save Page Now API) 1737 1737 3. Store both original URL and archive URL ... ... @@ -1766,9 +1766,8 @@ 1766 1766 * ✅ API rate limits respected 1767 1767 * ✅ Archive status visible in evidence display 1768 1768 1769 -== Category 4: Community Safety == 1770 1770 1771 - FR48: Contributor Safety Framework === 1666 +== Category 4: Community Safety ===== FR48: Contributor Safety Framework === 1772 1772 1773 1773 **Priority:** CRITICAL 1774 1774 **Fulfills:** UN-28 (Safe contribution environment) ... ... @@ -1778,9 +1778,7 @@ 1778 1778 1779 1779 **Specification:** 1780 1780 1781 -* 1782 -** 1783 -**1. Privacy Protection: 1676 +**1. Privacy Protection:** 1784 1784 1785 1785 * **Optional Pseudonymity:** Contributors can use pseudonyms 1786 1786 * **Email Privacy:** Emails never displayed publicly ... ... @@ -1823,9 +1823,8 @@ 1823 1823 * ✅ Moderator tools implemented 1824 1824 * ✅ Safety policy published 1825 1825 1826 -== Category 5: Continuous Improvement == 1827 1827 1828 - FR49: A/B Testing Framework === 1720 +== Category 5: Continuous Improvement ===== FR49: A/B Testing Framework === 1829 1829 1830 1830 **Priority:** CRITICAL 1831 1831 **Fulfills:** Continuous system improvement ... ... @@ -1838,23 +1838,20 @@ 1838 1838 **Test Capabilities:** 1839 1839 1840 1840 1. **Prompt Variations:** 1733 + * Test different claim extraction prompts 1734 + * Test different verdict generation prompts 1735 + * Measure: Accuracy, clarity, completeness 1841 1841 1842 -* Test different claim extraction prompts 1843 -* Test different verdict generation prompts 1844 -* Measure: Accuracy, clarity, completeness 1845 - 1846 1846 2. **Algorithm Variations:** 1738 + * Test different source scoring algorithms 1739 + * Test different confidence calculations 1740 + * Measure: Audit accuracy, user satisfaction 1847 1847 1848 -* Test different source scoring algorithms 1849 -* Test different confidence calculations 1850 -* Measure: Audit accuracy, user satisfaction 1851 - 1852 1852 3. **Workflow Variations:** 1743 + * Test different quality gate thresholds 1744 + * Test different risk tier assignments 1745 + * Measure: Publication rate, quality scores 1853 1853 1854 -* Test different quality gate thresholds 1855 -* Test different risk tier assignments 1856 -* Measure: Publication rate, quality scores 1857 - 1858 1858 **Implementation:** 1859 1859 1860 1860 * **Traffic Split:** 50/50 or 90/10 splits ... ... @@ -1883,6 +1883,7 @@ 1883 1883 * ✅ Statistical significance calculated 1884 1884 * ✅ Results inform system improvements 1885 1885 1775 + 1886 1886 === FR54: Evidence Deduplication === 1887 1887 1888 1888 **Priority:** CRITICAL (POC2/Beta) ... ... @@ -1896,24 +1896,21 @@ 1896 1896 **Deduplication Logic:** 1897 1897 1898 1898 1. **URL Normalization:** 1789 + * Remove tracking parameters (?utm_source=...) 1790 + * Normalize http/https 1791 + * Normalize www/non-www 1792 + * Handle redirects 1899 1899 1900 -* Remove tracking parameters (?utm_source=...) 1901 -* Normalize http/https 1902 -* Normalize www/non-www 1903 -* Handle redirects 1904 - 1905 1905 2. **Content Similarity:** 1795 + * If two sources have >90% text similarity → Same source 1796 + * If one is subset of other → Same source 1797 + * Use fuzzy matching for minor differences 1906 1906 1907 -* If two sources have >90% text similarity → Same source 1908 -* If one is subset of other → Same source 1909 -* Use fuzzy matching for minor differences 1910 - 1911 1911 3. **Cross-Domain Syndication:** 1800 + * Detect wire service content (AP, Reuters) 1801 + * Mark as single source if syndicated 1802 + * Count original publication only 1912 1912 1913 -* Detect wire service content (AP, Reuters) 1914 -* Mark as single source if syndicated 1915 -* Count original publication only 1916 - 1917 1917 **Display:** 1918 1918 1919 1919 {{code}} ... ... @@ -1935,9 +1935,8 @@ 1935 1935 * ✅ Unique vs. total counts accurate 1936 1936 * ✅ Improves evidence quality metrics 1937 1937 1938 -== Additional Requirements (Lower Priority) == 1939 1939 1940 - FR50: OSINT Toolkit Integration === 1826 +== Additional Requirements (Lower Priority) ===== FR50: OSINT Toolkit Integration === 1941 1941 1942 1942 1943 1943 ... ... @@ -1947,7 +1947,6 @@ 1947 1947 **Purpose:** Integrate open-source intelligence tools for advanced verification. 1948 1948 1949 1949 **Tools to Integrate:** 1950 - 1951 1951 * InVID/WeVerify (video verification) 1952 1952 * Bellingcat toolkit 1953 1953 * Additional TBD based on V1.0 learnings ... ... @@ -1962,7 +1962,6 @@ 1962 1962 **Purpose:** Verify video-based claims. 1963 1963 1964 1964 **Specification:** 1965 - 1966 1966 * Keyframe extraction 1967 1967 * Reverse video search 1968 1968 * Deepfake detection (AI-powered) ... ... @@ -1979,7 +1979,6 @@ 1979 1979 **Purpose:** Teach users to identify misinformation. 1980 1980 1981 1981 **Specification:** 1982 - 1983 1983 * Interactive tutorials 1984 1984 * Practice exercises 1985 1985 * Detection quizzes ... ... @@ -1995,12 +1995,12 @@ 1995 1995 **Purpose:** Share findings with IFCN/EFCSN members. 1996 1996 1997 1997 **Specification:** 1998 - 1999 1999 * API for fact-checking organizations 2000 2000 * Structured data exchange 2001 2001 * Privacy controls 2002 2002 * Attribution requirements 2003 2003 1886 + 2004 2004 == Summary == 2005 2005 2006 2006 **V1.0 Critical Requirements (Must Have):** ... ... @@ -2023,6 +2023,7 @@ 2023 2023 * FR52: Detection Training 2024 2024 * FR53: Cross-Org Sharing 2025 2025 1909 + 2026 2026 **Total:** 11 critical requirements for V1.0 2027 2027 2028 2028 === FR54: Evidence Deduplication === ... ... @@ -2039,24 +2039,21 @@ 2039 2039 **Deduplication Logic:** 2040 2040 2041 2041 1. **URL Normalization:** 1926 + * Remove tracking parameters (?utm_source=...) 1927 + * Normalize http/https 1928 + * Normalize www/non-www 1929 + * Handle redirects 2042 2042 2043 -* Remove tracking parameters (?utm_source=...) 2044 -* Normalize http/https 2045 -* Normalize www/non-www 2046 -* Handle redirects 2047 - 2048 2048 2. **Content Similarity:** 1932 + * If two sources have >90% text similarity → Same source 1933 + * If one is subset of other → Same source 1934 + * Use fuzzy matching for minor differences 2049 2049 2050 -* If two sources have >90% text similarity → Same source 2051 -* If one is subset of other → Same source 2052 -* Use fuzzy matching for minor differences 2053 - 2054 2054 3. **Cross-Domain Syndication:** 1937 + * Detect wire service content (AP, Reuters) 1938 + * Mark as single source if syndicated 1939 + * Count original publication only 2055 2055 2056 -* Detect wire service content (AP, Reuters) 2057 -* Mark as single source if syndicated 2058 -* Count original publication only 2059 - 2060 2060 **Display:** 2061 2061 2062 2062 {{code}} ... ... @@ -2078,9 +2078,8 @@ 2078 2078 * ✅ Unique vs. total counts accurate 2079 2079 * ✅ Improves evidence quality metrics 2080 2080 2081 -== Additional Requirements (Lower Priority) == 2082 2082 2083 - FR7: Automated Verdicts (Enhanced with Quality Gates) === 1963 +== Additional Requirements (Lower Priority) ===== FR7: Automated Verdicts (Enhanced with Quality Gates) === 2084 2084 2085 2085 **POC1+ Enhancement:** 2086 2086 ... ... @@ -2102,7 +2102,6 @@ 2102 2102 {{/code}} 2103 2103 2104 2104 **Updated Verdict States:** 2105 - 2106 2106 * PUBLISHED 2107 2107 * INSUFFICIENT_EVIDENCE 2108 2108 * NON_FACTUAL_CLAIM ... ... @@ -2124,3 +2124,4 @@ 2124 2124 Avg Source Quality: 0.73 2125 2125 Quality Score: 8.5/10 2126 2126 {{/code}} 2006 +