Changes for page Requirements

Last modified by Robert Schaub on 2025/12/24 18:27

From version 3.1
edited by Robert Schaub
on 2025/12/24 17:59
Change comment: Imported from XAR
To version 2.1
edited by Robert Schaub
on 2025/12/24 13:58
Change comment: Imported from XAR

Summary

Details

Page properties
Content
... ... @@ -1,5 +1,6 @@
1 1  = Requirements =
2 2  
3 +
3 3  {{info}}
4 4  **Phase Assignments:** See [[Requirements Roadmap Matrix>>FactHarbor.Roadmap.Requirements-Roadmap-Matrix.WebHome]] for which requirements are implemented in which phases.
5 5  {{/info}}
... ... @@ -448,6 +448,8 @@
448 448  This requirement has been **dropped from the current architecture and design**. Versioned entities have been replaced with simple edit history tracking only. Full evolution timeline functionality is deferred to future releases beyond V1.0.
449 449  {{/warning}}
450 450  
452 +
453 +
451 451  **Fulfills**: UN-15 (Verdict evolution timeline)
452 452  
453 453  * Claims and verdicts update as new evidence emerges
... ... @@ -622,11 +622,11 @@
622 622  
623 623  **Interaction Design**:
624 624  * Hover/click on highlighted claim → Show tooltip with:
625 - * Claim text
626 - * Verdict (e.g., "WELL SUPPORTED")
627 - * Confidence score (e.g., "85%")
628 - * Brief evidence summary
629 - * Link to detailed analysis
628 + * Claim text
629 + * Verdict (e.g., "WELL SUPPORTED")
630 + * Confidence score (e.g., "85%")
631 + * Brief evidence summary
632 + * Link to detailed analysis
630 630  * Toggle highlighting on/off (user preference)
631 631  * Adjustable color intensity for accessibility
632 632  
... ... @@ -815,6 +815,7 @@
815 815  
816 816  **Fulfills:** Data protection, system integrity, user privacy, production readiness
817 817  
821 +
818 818  **Purpose:** Protect FactHarbor systems, user data, and operations from security threats, ensuring production-grade security posture.
819 819  
820 820  **Specification:**
... ... @@ -833,10 +833,10 @@
833 833  * **JWT tokens:** For user sessions (1-hour expiry)
834 834  * **OAuth2:** For third-party integrations
835 835  * **Role-Based Access Control (RBAC):**
836 - * Public: Read-only access to published claims
837 - * Contributor: Submit claims, provide evidence
838 - * Moderator: Review contributions, manage quality
839 - * Admin: System configuration, user management
840 + * Public: Read-only access to published claims
841 + * Contributor: Submit claims, provide evidence
842 + * Moderator: Review contributions, manage quality
843 + * Admin: System configuration, user management
840 840  
841 841  **CORS Policies:**
842 842  * Whitelist approved domains only
... ... @@ -927,10 +927,10 @@
927 927  **Monitoring & Alerting:**
928 928  * Real-time security event monitoring
929 929  * Alerts for:
930 - * Failed login attempts (>5 in 10 minutes)
931 - * API abuse patterns
932 - * Unusual data access patterns
933 - * Security scan detections
934 + * Failed login attempts (>5 in 10 minutes)
935 + * API abuse patterns
936 + * Unusual data access patterns
937 + * Security scan detections
934 934  * Integration with SIEM (Security Information and Event Management)
935 935  
936 936  **Incident Response:**
... ... @@ -978,10 +978,12 @@
978 978  * ✅ Incident response plan documented and tested
979 979  * ✅ 95% uptime over 30-day period
980 980  
985 +
981 981  === NFR13: Quality Metrics Transparency ===
982 982  
983 983  **Fulfills:** User trust, transparency, continuous improvement, IFCN methodology transparency
984 984  
990 +
985 985  **Purpose:** Provide transparent, measurable quality metrics that demonstrate AKEL's performance and build user trust in automated fact-checking.
986 986  
987 987  **Specification:**
... ... @@ -1003,9 +1003,9 @@
1003 1003  Overall: 84.2 ▲ (+2.1 from last month)
1004 1004  
1005 1005  Distribution:
1006 - Excellent (>80): 67%
1007 - Good (60-80): 28%
1008 - Needs Improvement (<60): 5%
1012 + Excellent (>80): 67%
1013 + Good (60-80): 28%
1014 + Needs Improvement (<60): 5%
1009 1009  
1010 1010  Trend: [Graph showing improvement over time]
1011 1011  {{/code}}
... ... @@ -1023,8 +1023,8 @@
1023 1023  Average: 0.87 ▼ (-0.02 from last month)
1024 1024  
1025 1025  Hallucination Rate: 4.2%
1026 - - Claims without evidence support: 3.1%
1027 - - Misrepresented evidence: 1.1%
1032 + - Claims without evidence support: 3.1%
1033 + - Misrepresented evidence: 1.1%
1028 1028  
1029 1029  Action: Prompt engineering review scheduled
1030 1030  {{/code}}
... ... @@ -1049,9 +1049,9 @@
1049 1049  Claims with ≥2 sources: 94% (EFCSN compliant)
1050 1050  
1051 1051  Source Quality Distribution:
1052 - High quality (>0.8): 48%
1053 - Medium quality (0.5-0.8): 43%
1054 - Low quality (<0.5): 9%
1058 + High quality (>0.8): 48%
1059 + Medium quality (0.5-0.8): 43%
1060 + Low quality (<0.5): 9%
1055 1055  
1056 1056  Geographic Diversity: 23 countries represented
1057 1057  {{/code}}
... ... @@ -1062,10 +1062,10 @@
1062 1062  * **Calculation:** Cohen's Kappa or Fleiss' Kappa for multiple raters
1063 1063  * **Scale:** 0-1 (higher is better)
1064 1064  * **Interpretation:**
1065 - * >0.8: Almost perfect agreement
1066 - * 0.6-0.8: Substantial agreement
1067 - * 0.4-0.6: Moderate agreement
1068 - * <0.4: Poor agreement
1071 + * >0.8: Almost perfect agreement
1072 + * 0.6-0.8: Substantial agreement
1073 + * 0.4-0.6: Moderate agreement
1074 + * <0.4: Poor agreement
1069 1069  * **Target:** Maintain ≥0.7 (substantial agreement)
1070 1070  
1071 1071  **Display:**
... ... @@ -1073,9 +1073,9 @@
1073 1073  Contributor Consensus:
1074 1074  
1075 1075  Inter-Rater Reliability (IRR): 0.73 (Substantial agreement)
1076 - - Verdict agreement: 78%
1077 - - Evidence quality agreement: 71%
1078 - - Scenario structure agreement: 69%
1082 + - Verdict agreement: 78%
1083 + - Evidence quality agreement: 71%
1084 + - Scenario structure agreement: 69%
1079 1079  
1080 1080  Cases requiring moderator review: 12
1081 1081  Moderator override rate: 8%
... ... @@ -1105,16 +1105,16 @@
1105 1105  
1106 1106  {{code}}
1107 1107  ┌─────────────────────────────────────────────────────────────┐
1108 -│ FactHarbor Quality Metrics Last updated: │
1109 -│ Public Dashboard 2 hours ago │
1114 +│ FactHarbor Quality Metrics Last updated: │
1115 +│ Public Dashboard 2 hours ago
1110 1110  └─────────────────────────────────────────────────────────────┘
1111 1111  
1112 1112  📊 KEY METRICS
1113 1113  ─────────────────────────────────────────────────────────────
1114 -TIGERScore (Verdict Quality): 84.2 ▲ (+2.1)
1115 -AlignScore (Faithfulness): 0.87 ▼ (-0.02)
1116 -Hallucination Rate: 4.2% ✓ (Target: <5%)
1117 -Average Sources per Claim: 4.2 ▲ (+0.3)
1120 +TIGERScore (Verdict Quality): 84.2 ▲ (+2.1)
1121 +AlignScore (Faithfulness): 0.87 ▼ (-0.02)
1122 +Hallucination Rate: 4.2% ✓ (Target: <5%)
1123 +Average Sources per Claim: 4.2 ▲ (+0.3)
1118 1118  
1119 1119  📈 TRENDS (30 days)
1120 1120  ─────────────────────────────────────────────────────────────
... ... @@ -1144,25 +1144,25 @@
1144 1144  **How Metrics Inform AKEL Improvements:**
1145 1145  
1146 1146  1. **Identify Weak Areas:**
1147 - * Low TIGERScore → Review prompt engineering
1148 - * High hallucination → Strengthen evidence grounding
1149 - * Low IRR → Clarify evaluation criteria
1153 + * Low TIGERScore → Review prompt engineering
1154 + * High hallucination → Strengthen evidence grounding
1155 + * Low IRR → Clarify evaluation criteria
1150 1150  
1151 1151  2. **A/B Testing Integration:**
1152 - * Test prompt variations
1153 - * Measure impact on quality metrics
1154 - * Deploy winners automatically
1158 + * Test prompt variations
1159 + * Measure impact on quality metrics
1160 + * Deploy winners automatically
1155 1155  
1156 1156  3. **Alert Thresholds:**
1157 - * TIGERScore drops below 75 → Alert team
1158 - * Hallucination rate exceeds 7% → Pause auto-publishing
1159 - * IRR below 0.6 → Moderator training needed
1163 + * TIGERScore drops below 75 → Alert team
1164 + * Hallucination rate exceeds 7% → Pause auto-publishing
1165 + * IRR below 0.6 → Moderator training needed
1160 1160  
1161 1161  4. **Monthly Quality Reviews:**
1162 - * Analyze trends
1163 - * Identify systematic issues
1164 - * Plan prompt improvements
1165 - * Update AKEL models
1168 + * Analyze trends
1169 + * Identify systematic issues
1170 + * Plan prompt improvements
1171 + * Update AKEL models
1166 1166  
1167 1167  ---
1168 1168  
... ... @@ -1204,6 +1204,9 @@
1204 1204  * ✅ Mobile-responsive dashboard design
1205 1205  * ✅ Metrics inform quarterly AKEL improvement planning
1206 1206  
1213 +
1214 +
1215 +
1207 1207  == 13. Requirements Traceability ==
1208 1208  
1209 1209  For full traceability matrix showing which requirements fulfill which user needs, see:
... ... @@ -1238,6 +1238,7 @@
1238 1238  
1239 1239  **Fulfills:** UN-13 (Cite FactHarbor Verdicts), UN-14 (API Access for Integration), UN-26 (Search Engine Visibility)
1240 1240  
1250 +
1241 1241  **Purpose:** Generate valid ClaimReview structured data for every published analysis to enable Google/Bing search visibility and fact-check discovery.
1242 1242  
1243 1243  **Specification:**
... ... @@ -1250,35 +1250,35 @@
1250 1250  
1251 1251  {{code language="json"}}
1252 1252  {
1253 - "@context": "https://schema.org",
1254 - "@type": "ClaimReview",
1255 - "datePublished": "YYYY-MM-DD",
1256 - "url": "https://factharbor.org/claims/{claim_id}",
1257 - "claimReviewed": "The exact claim text",
1258 - "author": {
1259 - "@type": "Organization",
1260 - "name": "FactHarbor",
1261 - "url": "https://factharbor.org"
1262 - },
1263 - "reviewRating": {
1264 - "@type": "Rating",
1265 - "ratingValue": "1-5",
1266 - "bestRating": "5",
1267 - "worstRating": "1",
1268 - "alternateName": "FactHarbor likelihood score"
1269 - },
1270 - "itemReviewed": {
1271 - "@type": "Claim",
1272 - "author": {
1273 - "@type": "Person",
1274 - "name": "Claim author if known"
1275 - },
1276 - "datePublished": "YYYY-MM-DD if known",
1277 - "appearance": {
1278 - "@type": "CreativeWork",
1279 - "url": "Original claim URL if from article"
1280 - }
1281 - }
1263 + "@context": "https://schema.org",
1264 + "@type": "ClaimReview",
1265 + "datePublished": "YYYY-MM-DD",
1266 + "url": "https://factharbor.org/claims/{claim_id}",
1267 + "claimReviewed": "The exact claim text",
1268 + "author": {
1269 + "@type": "Organization",
1270 + "name": "FactHarbor",
1271 + "url": "https://factharbor.org"
1272 + },
1273 + "reviewRating": {
1274 + "@type": "Rating",
1275 + "ratingValue": "1-5",
1276 + "bestRating": "5",
1277 + "worstRating": "1",
1278 + "alternateName": "FactHarbor likelihood score"
1279 + },
1280 + "itemReviewed": {
1281 + "@type": "Claim",
1282 + "author": {
1283 + "@type": "Person",
1284 + "name": "Claim author if known"
1285 + },
1286 + "datePublished": "YYYY-MM-DD if known",
1287 + "appearance": {
1288 + "@type": "CreativeWork",
1289 + "url": "Original claim URL if from article"
1290 + }
1291 + }
1282 1282  }
1283 1283  {{/code}}
1284 1284  
... ... @@ -1324,10 +1324,12 @@
1324 1324  * ✅ All required fields populated with correct data types
1325 1325  * ✅ Handles multi-scenario claims correctly (separate ClaimReview per scenario)
1326 1326  
1337 +
1327 1327  === FR45: User Corrections Notification System ===
1328 1328  
1329 1329  **Fulfills:** IFCN Principle 5 (Open & Honest Corrections), EFCSN compliance
1330 1330  
1342 +
1331 1331  **Purpose:** When any claim analysis is corrected, notify users who previously viewed the claim to maintain transparency and build trust.
1332 1332  
1333 1333  **Specification:**
... ... @@ -1363,10 +1363,10 @@
1363 1363  
1364 1364  * Public changelog at `/claims/{id}/corrections`
1365 1365  * Displays for each correction:
1366 - * Date/time of correction
1367 - * What changed (before/after comparison)
1368 - * Why changed (reason if provided)
1369 - * Who made change (AKEL auto-update vs. contributor override)
1378 + * Date/time of correction
1379 + * What changed (before/after comparison)
1380 + * Why changed (reason if provided)
1381 + * Who made change (AKEL auto-update vs. contributor override)
1370 1370  
1371 1371  **3. Email Notifications (opt-in):**
1372 1372  
... ... @@ -1410,10 +1410,12 @@
1410 1410  * ✅ Mobile-responsive banner design
1411 1411  * ✅ Accessible (screen reader compatible)
1412 1412  
1425 +
1413 1413  === FR46: Image Verification System ===
1414 1414  
1415 1415  **Fulfills:** UN-27 (Visual Claim Verification)
1416 1416  
1430 +
1417 1417  **Purpose:** Verify authenticity and context of images shared with claims to detect manipulation, misattribution, and out-of-context usage.
1418 1418  
1419 1419  **Specification:**
... ... @@ -1426,18 +1426,18 @@
1426 1426  
1427 1427  **Implementation:**
1428 1428  * Integrate APIs:
1429 - * **Google Vision AI** (reverse search)
1430 - * **TinEye** (oldest known uses)
1431 - * **Bing Visual Search** (broad coverage)
1443 + * **Google Vision AI** (reverse search)
1444 + * **TinEye** (oldest known uses)
1445 + * **Bing Visual Search** (broad coverage)
1432 1432  
1433 1433  **Process:**
1434 1434  1. Extract image from claim or user upload
1435 1435  2. Query multiple reverse search services
1436 1436  3. Analyze results for:
1437 - * Earliest known publication
1438 - * Original context (what was it really showing?)
1439 - * Publication timeline
1440 - * Geographic spread
1451 + * Earliest known publication
1452 + * Original context (what was it really showing?)
1453 + * Publication timeline
1454 + * Geographic spread
1441 1441  
1442 1442  **Output:**
1443 1443  {{code}}
... ... @@ -1465,26 +1465,26 @@
1465 1465  
1466 1466  **Implementation:**
1467 1467  * Integrate detection services:
1468 - * **Sensity AI** (deepfake detection)
1469 - * **Reality Defender** (multimodal analysis)
1470 - * **AWS Rekognition** (face detection inconsistencies)
1482 + * **Sensity AI** (deepfake detection)
1483 + * **Reality Defender** (multimodal analysis)
1484 + * **AWS Rekognition** (face detection inconsistencies)
1471 1471  
1472 1472  **Detection Categories:**
1473 1473  1. **Face Manipulation:**
1474 - * Deepfake face swaps
1475 - * Expression manipulation
1476 - * Identity replacement
1488 + * Deepfake face swaps
1489 + * Expression manipulation
1490 + * Identity replacement
1477 1477  
1478 1478  2. **Image Manipulation:**
1479 - * Copy-paste artifacts
1480 - * Clone stamp detection
1481 - * Content-aware fill detection
1482 - * JPEG compression inconsistencies
1493 + * Copy-paste artifacts
1494 + * Clone stamp detection
1495 + * Content-aware fill detection
1496 + * JPEG compression inconsistencies
1483 1483  
1484 1484  3. **AI Generation:**
1485 - * Detect fully AI-generated images
1486 - * Identify generation artifacts
1487 - * Check for model signatures
1499 + * Detect fully AI-generated images
1500 + * Identify generation artifacts
1501 + * Check for model signatures
1488 1488  
1489 1489  **Confidence Scoring:**
1490 1490  * **HIGH (80-100%):** Strong evidence of manipulation
... ... @@ -1497,8 +1497,8 @@
1497 1497  
1498 1498  Face Manipulation: LOW RISK (12%)
1499 1499  Image Editing: MEDIUM RISK (64%)
1500 - • Clone stamp artifacts detected in sky region
1501 - • JPEG compression inconsistent between objects
1514 + • Clone stamp artifacts detected in sky region
1515 + • JPEG compression inconsistent between objects
1502 1502  
1503 1503  AI Generation: LOW RISK (8%)
1504 1504  
... ... @@ -1608,10 +1608,11 @@
1608 1608  * ✅ Mobile-friendly image comparison interface
1609 1609  * ✅ False positive rate <15%
1610 1610  
1625 +
1611 1611  === FR47: Archive.org Integration ===
1612 1612  
1613 -**Importance:** CRITICAL
1614 -**Fulfills:** Evidence persistence, FR5 (Evidence linking)
1628 +**Importance:** CRITICAL
1629 +**Fulfills:** Evidence persistence, FR5 (Evidence linking)
1615 1615  
1616 1616  **Purpose:** Ensure evidence remains accessible even if original sources are deleted.
1617 1617  
... ... @@ -1654,10 +1654,11 @@
1654 1654  * ✅ API rate limits respected
1655 1655  * ✅ Archive status visible in evidence display
1656 1656  
1672 +
1657 1657  == Category 4: Community Safety ===== FR48: Contributor Safety Framework ===
1658 1658  
1659 -**Importance:** CRITICAL
1660 -**Fulfills:** UN-28 (Safe contribution environment)
1675 +**Importance:** CRITICAL
1676 +**Fulfills:** UN-28 (Safe contribution environment)
1661 1661  
1662 1662  **Purpose:** Protect contributors from harassment, doxxing, and coordinated attacks.
1663 1663  
... ... @@ -1706,10 +1706,11 @@
1706 1706  * ✅ Moderator tools implemented
1707 1707  * ✅ Safety policy published
1708 1708  
1725 +
1709 1709  == Category 5: Continuous Improvement ===== FR49: A/B Testing Framework ===
1710 1710  
1711 -**Importance:** CRITICAL
1712 -**Fulfills:** Continuous system improvement
1728 +**Importance:** CRITICAL
1729 +**Fulfills:** Continuous system improvement
1713 1713  
1714 1714  **Purpose:** Test and measure improvements to AKEL prompts, algorithms, and workflows.
1715 1715  
... ... @@ -1718,19 +1718,19 @@
1718 1718  **Test Capabilities:**
1719 1719  
1720 1720  1. **Prompt Variations:**
1721 - * Test different claim extraction prompts
1722 - * Test different verdict generation prompts
1723 - * Measure: Accuracy, clarity, completeness
1738 + * Test different claim extraction prompts
1739 + * Test different verdict generation prompts
1740 + * Measure: Accuracy, clarity, completeness
1724 1724  
1725 1725  2. **Algorithm Variations:**
1726 - * Test different source scoring algorithms
1727 - * Test different confidence calculations
1728 - * Measure: Audit accuracy, user satisfaction
1743 + * Test different source scoring algorithms
1744 + * Test different confidence calculations
1745 + * Measure: Audit accuracy, user satisfaction
1729 1729  
1730 1730  3. **Workflow Variations:**
1731 - * Test different quality gate thresholds
1732 - * Test different risk tier assignments
1733 - * Measure: Publication rate, quality scores
1748 + * Test different quality gate thresholds
1749 + * Test different risk tier assignments
1750 + * Measure: Publication rate, quality scores
1734 1734  
1735 1735  **Implementation:**
1736 1736  
... ... @@ -1760,10 +1760,11 @@
1760 1760  * ✅ Statistical significance calculated
1761 1761  * ✅ Results inform system improvements
1762 1762  
1780 +
1763 1763  === FR54: Evidence Deduplication ===
1764 1764  
1765 -**Importance:** CRITICAL (POC2/Beta)
1766 -**Fulfills:** Accurate evidence counting, quality metrics
1783 +**Importance:** CRITICAL (POC2/Beta)
1784 +**Fulfills:** Accurate evidence counting, quality metrics
1767 1767  
1768 1768  **Purpose:** Avoid counting the same source multiple times when it appears in different forms.
1769 1769  
... ... @@ -1772,20 +1772,20 @@
1772 1772  **Deduplication Logic:**
1773 1773  
1774 1774  1. **URL Normalization:**
1775 - * Remove tracking parameters (?utm_source=...)
1776 - * Normalize http/https
1777 - * Normalize www/non-www
1778 - * Handle redirects
1793 + * Remove tracking parameters (?utm_source=...)
1794 + * Normalize http/https
1795 + * Normalize www/non-www
1796 + * Handle redirects
1779 1779  
1780 1780  2. **Content Similarity:**
1781 - * If two sources have >90% text similarity → Same source
1782 - * If one is subset of other → Same source
1783 - * Use fuzzy matching for minor differences
1799 + * If two sources have >90% text similarity → Same source
1800 + * If one is subset of other → Same source
1801 + * Use fuzzy matching for minor differences
1784 1784  
1785 1785  3. **Cross-Domain Syndication:**
1786 - * Detect wire service content (AP, Reuters)
1787 - * Mark as single source if syndicated
1788 - * Count original publication only
1804 + * Detect wire service content (AP, Reuters)
1805 + * Mark as single source if syndicated
1806 + * Count original publication only
1789 1789  
1790 1790  **Display:**
1791 1791  
... ... @@ -1793,7 +1793,7 @@
1793 1793  Evidence Sources (3 unique, 5 total):
1794 1794  
1795 1795  1. Original Article (NYTimes)
1796 - - Also appeared in: WashPost, Guardian (syndicated)
1814 + - Also appeared in: WashPost, Guardian (syndicated)
1797 1797  
1798 1798  2. Research Paper (Nature)
1799 1799  
... ... @@ -1808,10 +1808,13 @@
1808 1808  * ✅ Unique vs. total counts accurate
1809 1809  * ✅ Improves evidence quality metrics
1810 1810  
1829 +
1811 1811  == Additional Requirements (Lower Importance) ===== FR50: OSINT Toolkit Integration ===
1812 1812  
1813 -**Fulfills:** Advanced media verification
1814 1814  
1833 +
1834 +**Fulfills:** Advanced media verification
1835 +
1815 1815  **Purpose:** Integrate open-source intelligence tools for advanced verification.
1816 1816  
1817 1817  **Tools to Integrate:**
... ... @@ -1821,8 +1821,10 @@
1821 1821  
1822 1822  === FR51: Video Verification System ===
1823 1823  
1824 -**Fulfills:** UN-27 (Visual claims), advanced media verification
1825 1825  
1846 +
1847 +**Fulfills:** UN-27 (Visual claims), advanced media verification
1848 +
1826 1826  **Purpose:** Verify video-based claims.
1827 1827  
1828 1828  **Specification:**
... ... @@ -1834,8 +1834,10 @@
1834 1834  
1835 1835  === FR52: Interactive Detection Training ===
1836 1836  
1837 -**Fulfills:** Media literacy education
1838 1838  
1861 +
1862 +**Fulfills:** Media literacy education
1863 +
1839 1839  **Purpose:** Teach users to identify misinformation.
1840 1840  
1841 1841  **Specification:**
... ... @@ -1846,8 +1846,10 @@
1846 1846  
1847 1847  === FR53: Cross-Organizational Sharing ===
1848 1848  
1849 -**Fulfills:** Collaboration with other fact-checkers
1850 1850  
1875 +
1876 +**Fulfills:** Collaboration with other fact-checkers
1877 +
1851 1851  **Purpose:** Share findings with IFCN/EFCSN members.
1852 1852  
1853 1853  **Specification:**
... ... @@ -1856,6 +1856,7 @@
1856 1856  * Privacy controls
1857 1857  * Attribution requirements
1858 1858  
1886 +
1859 1859  == Summary ==
1860 1860  
1861 1861  **V1.0 Critical Requirements (Must Have):**
... ... @@ -1878,12 +1878,15 @@
1878 1878  * FR52: Detection Training
1879 1879  * FR53: Cross-Org Sharing
1880 1880  
1909 +
1881 1881  **Total:** 11 critical requirements for V1.0
1882 1882  
1883 1883  === FR54: Evidence Deduplication ===
1884 1884  
1885 -**Fulfills:** Accurate evidence counting, quality metrics
1886 1886  
1915 +
1916 +**Fulfills:** Accurate evidence counting, quality metrics
1917 +
1887 1887  **Purpose:** Avoid counting the same source multiple times when it appears in different forms.
1888 1888  
1889 1889  **Specification:**
... ... @@ -1891,20 +1891,20 @@
1891 1891  **Deduplication Logic:**
1892 1892  
1893 1893  1. **URL Normalization:**
1894 - * Remove tracking parameters (?utm_source=...)
1895 - * Normalize http/https
1896 - * Normalize www/non-www
1897 - * Handle redirects
1925 + * Remove tracking parameters (?utm_source=...)
1926 + * Normalize http/https
1927 + * Normalize www/non-www
1928 + * Handle redirects
1898 1898  
1899 1899  2. **Content Similarity:**
1900 - * If two sources have >90% text similarity → Same source
1901 - * If one is subset of other → Same source
1902 - * Use fuzzy matching for minor differences
1931 + * If two sources have >90% text similarity → Same source
1932 + * If one is subset of other → Same source
1933 + * Use fuzzy matching for minor differences
1903 1903  
1904 1904  3. **Cross-Domain Syndication:**
1905 - * Detect wire service content (AP, Reuters)
1906 - * Mark as single source if syndicated
1907 - * Count original publication only
1936 + * Detect wire service content (AP, Reuters)
1937 + * Mark as single source if syndicated
1938 + * Count original publication only
1908 1908  
1909 1909  **Display:**
1910 1910  
... ... @@ -1912,7 +1912,7 @@
1912 1912  Evidence Sources (3 unique, 5 total):
1913 1913  
1914 1914  1. Original Article (NYTimes)
1915 - - Also appeared in: WashPost, Guardian (syndicated)
1946 + - Also appeared in: WashPost, Guardian (syndicated)
1916 1916  
1917 1917  2. Research Paper (Nature)
1918 1918  
... ... @@ -1927,6 +1927,7 @@
1927 1927  * ✅ Unique vs. total counts accurate
1928 1928  * ✅ Improves evidence quality metrics
1929 1929  
1961 +
1930 1930  == Additional Requirements (Lower Importance) ===== FR7: Automated Verdicts (Enhanced with Quality Gates) ===
1931 1931  
1932 1932  **POC1+ Enhancement:**
... ... @@ -1936,15 +1936,15 @@
1936 1936  {{code}}
1937 1937  Workflow:
1938 1938  1. Extract claims
1939 - ↓
1971 +
1940 1940  2. [GATE 1] Validate fact-checkable
1941 - ↓
1973 +
1942 1942  3. Generate scenarios
1943 - ↓
1975 +
1944 1944  4. Generate verdicts
1945 - ↓
1977 +
1946 1946  5. [GATE 4] Validate confidence
1947 - ↓
1979 +
1948 1948  6. Display to user
1949 1949  {{/code}}
1950 1950  
... ... @@ -1963,11 +1963,11 @@
1963 1963  
1964 1964  {{code}}
1965 1965  Analysis Summary:
1966 - Verifiable Claims: 3/5
1967 - High Confidence Verdicts: 1
1968 - Medium Confidence: 2
1969 - Evidence Sources: 12
1970 - Avg Source Quality: 0.73
1971 - Quality Score: 8.5/10
1998 + Verifiable Claims: 3/5
1999 + High Confidence Verdicts: 1
2000 + Medium Confidence: 2
2001 + Evidence Sources: 12
2002 + Avg Source Quality: 0.73
2003 + Quality Score: 8.5/10
1972 1972  {{/code}}
1973 1973