Changes for page Requirements

Last modified by Robert Schaub on 2025/12/23 11:03

From version 1.1
edited by Robert Schaub
on 2025/12/22 19:12
Change comment: Imported from XAR
To version 5.1
edited by Robert Schaub
on 2025/12/22 20:48
Change comment: Imported from XAR

Page properties
Content
... ... @@ -306,7 +306,7 @@
306 306  4. How common is this pattern?
307 307  5. Store in ErrorPattern table (improvement queue)
308 308  
309 -=== 6.2 Weekly Improvement Cycle ===
309 +=== 6.2 Continuous Improvement Cycle ===
310 310  
311 311  1. **Review**: Analyze top error patterns
312 312  2. **Develop**: Create fix (prompt, model, validation)
... ... @@ -326,7 +326,7 @@
326 326  * Re-work rate
327 327  * Claims processed per hour
328 328  
329 -**Goal**: 10% monthly improvement in error rate
329 +**Goal**: continuous improvement in error rate
330 330  
331 331  == 7. Automated Quality Monitoring ==
332 332  
... ... @@ -826,162 +826,8 @@
826 826  
827 827  **Milestone:** POC2 (internal), Beta 0 (public), V1.0 (real-time)
828 828  
829 -== 10. Requirements Priority Matrix ==
830 830  
831 -This table shows all functional and non-functional requirements ordered by urgency and priority.
832 832  
833 -**Note:** Implementation phases (POC1, POC2, Beta 0, V1.0) are defined in [[POC Requirements>>FactHarbor.Specification.POC.Requirements]] and [[Implementation Roadmap>>FactHarbor.Implementation-Roadmap.WebHome]], not in this priority matrix.
834 -
835 -**Priority Levels:**
836 -* **CRITICAL** - System doesn't work without it, or major safety/legal risk
837 -* **HIGH** - Core functionality, essential for success
838 -* **MEDIUM** - Important but not blocking
839 -* **LOW** - Nice to have, can be deferred
840 -
841 -**Urgency Levels:**
842 -* **HIGH** - Immediate need (critical for proof of concept)
843 -* **MEDIUM** - Important but not immediate
844 -* **LOW** - Future enhancement
845 -
846 -|= ID |= Title |= Priority |= Urgency
847 -| **HIGH URGENCY** |||
848 -| **FR1** | Claim Intake | CRITICAL | HIGH
849 -| **FR5** | Evidence Collection | CRITICAL | HIGH
850 -| **FR7** | Verdict Computation | CRITICAL | HIGH
851 -| **NFR11** | Quality Assurance Framework | CRITICAL | HIGH
852 -| **FR2** | Claim Normalization | HIGH | HIGH
853 -| **FR3** | Claim Classification | HIGH | HIGH
854 -| **FR4** | Scenario Generation | HIGH | HIGH
855 -| **FR6** | Evidence Evaluation | HIGH | HIGH
856 -| **MEDIUM URGENCY** |||
857 -| **NFR12** | Security Controls | CRITICAL | MEDIUM
858 -| **FR9** | Corrections | HIGH | MEDIUM
859 -| **FR44** | ClaimReview Schema | HIGH | MEDIUM
860 -| **FR45** | Corrections Notification | HIGH | MEDIUM
861 -| **FR48** | Safety Framework | HIGH | MEDIUM
862 -| **NFR3** | Transparency | HIGH | MEDIUM
863 -| **NFR13** | Quality Metrics | HIGH | MEDIUM
864 -| **FR8** | User Contribution | MEDIUM | MEDIUM
865 -| **FR10** | Publishing | MEDIUM | MEDIUM
866 -| **FR13** | API | MEDIUM | MEDIUM
867 -| **FR46** | Image Verification | MEDIUM | MEDIUM
868 -| **FR47** | Archive.org Integration | MEDIUM | MEDIUM
869 -| **NFR1** | Performance | MEDIUM | MEDIUM
870 -| **NFR2** | Scalability | MEDIUM | MEDIUM
871 -| **NFR4** | Security & Privacy | MEDIUM | MEDIUM
872 -| **NFR5** | Maintainability | MEDIUM | MEDIUM
873 -| **LOW URGENCY** |||
874 -| **FR11** | Social Sharing | LOW | LOW
875 -| **FR12** | Notifications | LOW | LOW
876 -| **FR49** | A/B Testing | LOW | LOW
877 -| **FR50** | OSINT Toolkit Integration | LOW | LOW
878 -| **FR51** | Video Verification System | LOW | LOW
879 -| **FR52** | Interactive Detection Training | LOW | LOW
880 -| **FR53** | Cross-Organizational Sharing | LOW | LOW
881 -
882 -**Total:** 31 requirements (23 Functional, 8 Non-Functional)
883 -
884 -**See also:**
885 -* [[POC Requirements>>FactHarbor.Specification.POC.Requirements]] - POC1 scope and simplifications
886 -* [[Implementation Roadmap>>FactHarbor.Implementation-Roadmap.WebHome]] - Phase-by-phase implementation plan
887 -* [[User Needs>>FactHarbor.Specification.Requirements.User Needs.WebHome]] - Foundation that drives these requirements
888 -
889 -=== 10.1 User Needs Priority ===
890 -
891 -User Needs (UN) are the foundation that drives functional and non-functional requirements. They are not independently prioritized; instead, their priority is inherited from the FR/NFR requirements they drive.
892 -
893 -|= ID |= Title |= Drives Requirements
894 -| **UN-1** | Trust Assessment at a Glance | Multiple FR/NFR
895 -| **UN-2** | Claim Extraction and Verification | FR1-7
896 -| **UN-3** | Article Summary with FactHarbor Analysis Summary | FR4
897 -| **UN-4** | Social Media Fact-Checking | FR1, FR4
898 -| **UN-5** | Source Provenance and Track Records | FR6
899 -| **UN-6** | Publisher Reliability History | FR6
900 -| **UN-7** | Evidence Transparency | NFR3
901 -| **UN-8** | Understanding Disagreement and Consensus | FR4
902 -| **UN-9** | Methodology Transparency | NFR3, NFR11
903 -| **UN-10** | Manipulation Tactics Detection | FR48
904 -| **UN-11** | Filtered Research | FR3
905 -| **UN-12** | Submit Unchecked Claims | FR8
906 -| **UN-13** | Cite FactHarbor Verdicts | FR10
907 -| **UN-14** | API Access for Integration | FR13
908 -| **UN-15** | Verdict Evolution Timeline | FR7
909 -| **UN-16** | AI vs. Human Review Status | FR9
910 -| **UN-17** | In-Article Claim Highlighting | FR1
911 -| **UN-26** | Search Engine Visibility | FR44
912 -| **UN-27** | Visual Claim Verification | FR46
913 -| **UN-28** | Safe Contribution Environment | FR48
914 -
915 -**Total:** 20 User Needs
916 -
917 -**Note:** Each User Need inherits priority from the requirements it drives. For example, UN-2 (Claim Extraction and Verification) drives FR1-7, which are CRITICAL/HIGH priority, therefore UN-2 is also critical to the project.
918 -
919 -== 11. MVP Scope ==
920 -
921 -**Phase 1 (Months 1-3): Read-Only MVP**
922 -
923 -Build:
924 -* Automated claim analysis
925 -* Confidence scoring
926 -* Source evaluation
927 -* Browse/search interface
928 -* User flagging system
929 -
930 -**Goal**: Prove AI quality before adding user editing
931 -
932 -**User Needs fulfilled in Phase 1**: UN-1, UN-2, UN-3, UN-4, UN-5, UN-6, UN-7, UN-8, UN-9, UN-12
933 -
934 -**Phase 2 (Months 4-6): User Contributions**
935 -
936 -Add only if needed:
937 -* Simple editing (Wikipedia-style)
938 -* Reputation system
939 -* Basic moderation
940 -* In-article claim highlighting (FR13)
941 -
942 -**Additional User Needs fulfilled**: UN-13, UN-17
943 -
944 -**Phase 3 (Months 7-12): Refinement**
945 -
946 -* Continuous quality improvement
947 -* Feature additions based on real usage
948 -* Scale infrastructure
949 -
950 -**Additional User Needs fulfilled**: UN-14 (API access), UN-15 (Full evolution tracking)
951 -
952 -**Deferred**:
953 -* Federation (until multiple successful instances exist)
954 -* Complex contribution workflows (focus on automation)
955 -* Extensive role hierarchy (keep simple)
956 -
957 -== 12. Success Metrics ==
958 -
959 -**System Quality** (track weekly):
960 -* Error rate by category (target: -10%/month)
961 -* Average confidence score (target: increase)
962 -* Source quality distribution (target: more high-quality)
963 -* Contradiction detection rate (target: increase)
964 -
965 -**Efficiency** (track monthly):
966 -* Claims processed per hour (target: increase)
967 -* Human hours per claim (target: decrease)
968 -* Automation coverage (target: >90%)
969 -* Re-work rate (target: <5%)
970 -
971 -**User Satisfaction** (track quarterly):
972 -* User flag rate (issues found)
973 -* Correction acceptance rate (flags valid)
974 -* Return user rate
975 -* Trust indicators (surveys)
976 -
977 -**User Needs Metrics** (track quarterly):
978 -* UN-1: % users who understand trust scores
979 -* UN-4: Time to verify social media claim (target: <30s)
980 -* UN-7: % users who access evidence details
981 -* UN-8: % users who view multiple scenarios
982 -* UN-15: % users who check evolution timeline
983 -* UN-17: % users who enable in-article highlighting; avg. time spent on highlighted vs. non-highlighted articles
984 -
985 985  == 13. Requirements Traceability ==
986 986  
987 987  For full traceability matrix showing which requirements fulfill which user needs, see:
... ... @@ -1065,19 +1065,145 @@
1065 1065  
1066 1066  **Milestone:** V1.0
1067 1067  
1068 -=== FR50-FR53: Future Enhancements (V2.0+) ===
914 +=== FR50: OSINT Toolkit Integration ===
1069 1069  
1070 -* **FR50:** OSINT Toolkit Integration
1071 -* **FR51:** Video Verification System
1072 -* **FR52:** Interactive Detection Training
1073 -* **FR53:** Cross-Organizational Sharing
1074 1074  
1075 -**Milestone:** V2.0+ (12-18 months post-launch)
1076 1076  
1077 -== Enhanced Existing Requirements ==
918 +**Fulfills:** Advanced media verification
919 +**Phase:** V1.1
1078 1078  
1079 -=== FR7: Automated Verdicts (Enhanced with Quality Gates) ===
921 +**Purpose:** Integrate open-source intelligence tools for advanced verification.
1080 1080  
923 +**Tools to Integrate:**
924 +* InVID/WeVerify (video verification)
925 +* Bellingcat toolkit
926 +* Additional TBD based on V1.0 learnings
927 +
928 +=== FR51: Video Verification System ===
929 +
930 +
931 +
932 +**Fulfills:** UN-27 (Visual claims), advanced media verification
933 +**Phase:** V1.1
934 +
935 +**Purpose:** Verify video-based claims.
936 +
937 +**Specification:**
938 +* Keyframe extraction
939 +* Reverse video search
940 +* Deepfake detection (AI-powered)
941 +* Metadata analysis
942 +* Acoustic signature analysis
943 +
944 +=== FR52: Interactive Detection Training ===
945 +
946 +
947 +
948 +**Fulfills:** Media literacy education
949 +**Phase:** V1.5
950 +
951 +**Purpose:** Teach users to identify misinformation.
952 +
953 +**Specification:**
954 +* Interactive tutorials
955 +* Practice exercises
956 +* Detection quizzes
957 +* Gamification elements
958 +
959 +=== FR53: Cross-Organizational Sharing ===
960 +
961 +
962 +
963 +**Fulfills:** Collaboration with other fact-checkers
964 +**Phase:** V1.5
965 +
966 +**Purpose:** Share findings with IFCN/EFCSN members.
967 +
968 +**Specification:**
969 +* API for fact-checking organizations
970 +* Structured data exchange
971 +* Privacy controls
972 +* Attribution requirements
973 +
974 +
975 +== Summary ==
976 +
977 +**V1.0 Critical Requirements (Must Have):**
978 +
979 +* FR44: ClaimReview Schema ✅
980 +* FR45: Corrections Notification ✅
981 +* FR46: Image Verification ✅
982 +* FR47: Archive.org Integration ✅
983 +* FR48: Contributor Safety ✅
984 +* FR49: A/B Testing ✅
985 +* FR54: Evidence Deduplication ✅
986 +* NFR11: Quality Assurance Framework ✅
987 +* NFR12: Security Controls ✅
988 +* NFR13: Quality Metrics Dashboard ✅
989 +
990 +**V1.1+ (Future):**
991 +
992 +* FR50: OSINT Integration
993 +* FR51: Video Verification
994 +* FR52: Detection Training
995 +* FR53: Cross-Org Sharing
996 +
997 +
998 +**Total:** 10 critical requirements for V1.0
999 +
1000 +=== FR54: Evidence Deduplication ===
1001 +
1002 +
1003 +
1004 +**Fulfills:** Accurate evidence counting, quality metrics
1005 +**Phase:** POC2, Beta 0, V1.0
1006 +
1007 +**Purpose:** Avoid counting the same source multiple times when it appears in different forms.
1008 +
1009 +**Specification:**
1010 +
1011 +**Deduplication Logic:**
1012 +
1013 +1. **URL Normalization:**
1014 + * Remove tracking parameters (?utm_source=...)
1015 + * Normalize http/https
1016 + * Normalize www/non-www
1017 + * Handle redirects
1018 +
1019 +2. **Content Similarity:**
1020 + * If two sources have >90% text similarity → Same source
1021 + * If one is subset of other → Same source
1022 + * Use fuzzy matching for minor differences
1023 +
1024 +3. **Cross-Domain Syndication:**
1025 + * Detect wire service content (AP, Reuters)
1026 + * Mark as single source if syndicated
1027 + * Count original publication only
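
The URL normalization and content similarity rules above can be sketched as follows. This is a minimal illustration under stated assumptions, not the FactHarbor implementation: `normalize_url`, `is_same_source`, the list of tracking parameters beyond `utm_*`, and the word-level comparison are all hypothetical choices. Redirect handling and wire-service detection need network access or publisher metadata and are left out.

```python
from difflib import SequenceMatcher
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Assumption: the spec only names utm_source; the other entries are illustrative.
TRACKING_PREFIXES = ("utm_",)
TRACKING_PARAMS = {"fbclid", "gclid"}
SIMILARITY_THRESHOLD = 0.90  # from the ">90% text similarity" rule


def normalize_url(url: str) -> str:
    """Return a canonical form of a URL for duplicate detection (no redirects followed)."""
    parts = urlsplit(url.strip())
    host = (parts.hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]  # treat www and non-www as the same host
    if parts.port is not None:
        host = f"{host}:{parts.port}"
    query = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if not key.lower().startswith(TRACKING_PREFIXES)
        and key.lower() not in TRACKING_PARAMS
    ]
    path = parts.path.rstrip("/") or "/"
    # Force https so http/https variants compare equal; drop the fragment.
    return urlunsplit(("https", host, path, urlencode(sorted(query)), ""))


def is_same_source(text_a: str, text_b: str) -> bool:
    """Apply the subset rule and the >90% similarity rule on word sequences."""
    words_a, words_b = text_a.lower().split(), text_b.lower().split()
    if not words_a or not words_b:
        return False
    shorter, longer = sorted((words_a, words_b), key=len)
    # Subset rule: the shorter text appears as a contiguous run in the longer one.
    if f" {' '.join(shorter)} " in f" {' '.join(longer)} ":
        return True
    ratio = SequenceMatcher(None, words_a, words_b, autojunk=False).ratio()
    return ratio > SIMILARITY_THRESHOLD
```

Two URLs are then treated as duplicates when `normalize_url` returns the same string for both. The subset rule as written also matches very short excerpts, so a real system would likely add a minimum length before applying it.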
1028 +
1029 +**Display:**
1030 +
1031 +{{code}}
1032 +Evidence Sources (3 unique, 5 total):
1033 +
1034 +1. Original Article (NYTimes)
1035 + - Also appeared in: WashPost, Guardian (syndicated)
1036 +
1037 +2. Research Paper (Nature)
1038 +
1039 +3. Official Statement (WHO)
1040 +{{/code}}
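
One way to produce the display above is to group evidence items by a canonical source key and list the remaining members of each group as syndicated copies. The sketch below assumes the deduplication step has already assigned that key; the `Evidence` class, its `source_key` field, and `render_sources` are hypothetical names used only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Evidence:
    title: str
    outlet: str
    source_key: str  # hypothetical: canonical ID assigned by the deduplication step


def render_sources(items: list[Evidence]) -> str:
    """Group items by source_key and render unique vs. total counts."""
    groups: dict[str, list[Evidence]] = {}
    # Dicts preserve insertion order, so the first item seen in a group
    # is treated as the original publication (an assumption of this sketch).
    for item in items:
        groups.setdefault(item.source_key, []).append(item)

    lines = [f"Evidence Sources ({len(groups)} unique, {len(items)} total):"]
    for number, members in enumerate(groups.values(), start=1):
        original, *copies = members
        lines.append("")
        lines.append(f"{number}. {original.title} ({original.outlet})")
        if copies:
            outlets = ", ".join(copy.outlet for copy in copies)
            lines.append(f" - Also appeared in: {outlets} (syndicated)")
    return "\n".join(lines)


items = [
    Evidence("Original Article", "NYTimes", "article-1"),
    Evidence("Original Article", "WashPost", "article-1"),
    Evidence("Original Article", "Guardian", "article-1"),
    Evidence("Research Paper", "Nature", "paper-1"),
    Evidence("Official Statement", "WHO", "statement-1"),
]
print(render_sources(items))
```

Keeping the grouping separate from the rendering means the same groups can also feed the unique and total counts used by the quality metrics.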
1041 +
1042 +**Acceptance Criteria:**
1043 +
1044 +* ✅ URL normalization works
1045 +* ✅ Content similarity detected
1046 +* ✅ Syndicated content identified
1047 +* ✅ Unique vs. total counts accurate
1048 +* ✅ Improves evidence quality metrics
1049 +
1050 +
1051 +== Additional Requirements (Lower Priority) ==
+=== FR7: Automated Verdicts (Enhanced with Quality Gates) ===
1052 +
1081 1081  **POC1+ Enhancement:**
1082 1082  
1083 1083  After AKEL generates verdict, it passes through quality gates: