Changes for page Requirements
Last modified by Robert Schaub on 2025/12/23 11:03
Summary
-
Page properties (1 modified, 0 added, 0 removed)
Details
- Page properties
-
- Content
-
... ... @@ -1617,26 +1617,216 @@ 1617 1617 1618 1618 === FR47: Archive.org Integration === 1619 1619 1620 -Auto-save evidence sources to Wayback Machine. 1620 +**Priority:** CRITICAL 1621 +**Fulfills:** Evidence persistence, FR5 (Evidence linking) 1622 +**Phase:** V1.0 1621 1621 1622 -** Milestone:**Beta01624 +**Purpose:** Ensure evidence remains accessible even if original sources are deleted. 1623 1623 1624 - === FR48:Safety Frameworkfor Contributors ===1626 +**Specification:** 1625 1625 1626 - Protect contributorsfromharassment and legal threats.1628 +**Automatic Archiving:** 1627 1627 1628 -**Milestone:** V1.1 1630 +When AKEL links evidence: 1631 +1. Check if URL already archived (Wayback Machine API) 1632 +2. If not, submit for archiving (Save Page Now API) 1633 +3. Store both original URL and archive URL 1634 +4. Display both to users 1629 1629 1630 - === FR49:A/B Testing Framework===1636 +**Archive Display:** 1631 1631 1632 -Test AKEL approaches and UI designs systematically. 1638 +{{code}} 1639 +Evidence Source: [Original URL] 1640 +Archived: [Archive.org URL] (Captured: [date]) 1633 1633 1634 -**Milestone:** V1.0 1642 +[View Original] [View Archive] 1643 +{{/code}} 1635 1635 1636 - ===FR50: OSINT ToolkitIntegration ===1645 +**Fallback Logic:** 1637 1637 1647 +* If original URL unavailable → Auto-redirect to archive 1648 +* If archive unavailable → Display warning 1649 +* If both unavailable → Flag for manual review 1638 1638 1651 +**API Integration:** 1639 1639 1653 +* Use Wayback Machine Availability API 1654 +* Use Save Page Now API (SPNv2) 1655 +* Rate limiting: 15 requests/minute (Wayback limit) 1656 + 1657 +**Acceptance Criteria:** 1658 + 1659 +* ✅ All evidence URLs auto-archived 1660 +* ✅ Archive links displayed to users 1661 +* ✅ Fallback to archive if original unavailable 1662 +* ✅ API rate limits respected 1663 +* ✅ Archive status visible in evidence display 1664 + 1665 + 1666 +== Category 4: Community Safety ===== FR48: Contributor Safety Framework === 1667 + 1668 +**Priority:** CRITICAL 1669 +**Fulfills:** UN-28 (Safe contribution environment) 1670 +**Phase:** V1.0 1671 + 1672 +**Purpose:** Protect contributors from harassment, doxxing, and coordinated attacks. 1673 + 1674 +**Specification:** 1675 + 1676 +**1. Privacy Protection:** 1677 + 1678 +* **Optional Pseudonymity:** Contributors can use pseudonyms 1679 +* **Email Privacy:** Emails never displayed publicly 1680 +* **Profile Privacy:** Contributors control what's public 1681 +* **IP Logging:** Only for abuse prevention, not public 1682 + 1683 +**2. Harassment Prevention:** 1684 + 1685 +* **Automated Toxicity Detection:** Flag abusive comments 1686 +* **Personal Information Detection:** Auto-block doxxing attempts 1687 +* **Coordinated Attack Detection:** Identify brigading patterns 1688 +* **Rapid Response:** Moderator alerts for harassment 1689 + 1690 +**3. Safety Features:** 1691 + 1692 +* **Block Users:** Contributors can block harassers 1693 +* **Private Contributions:** Option to contribute anonymously 1694 +* **Report Harassment:** One-click harassment reporting 1695 +* **Safety Resources:** Links to support resources 1696 + 1697 +**4. Moderator Tools:** 1698 + 1699 +* **Quick Ban:** Immediately block abusers 1700 +* **Pattern Detection:** Identify coordinated attacks 1701 +* **Appeal Process:** Fair review of moderation actions 1702 +* **Escalation:** Serious threats escalated to authorities 1703 + 1704 +**5. Trusted Contributor Protection:** 1705 + 1706 +* **Enhanced Privacy:** Additional protection for high-profile contributors 1707 +* **Verification:** Optional identity verification (not public) 1708 +* **Legal Support:** Resources for contributors facing legal threats 1709 + 1710 +**Acceptance Criteria:** 1711 + 1712 +* ✅ Pseudonyms supported 1713 +* ✅ Toxicity detection active 1714 +* ✅ Doxxing auto-blocked 1715 +* ✅ Harassment reporting functional 1716 +* ✅ Moderator tools implemented 1717 +* ✅ Safety policy published 1718 + 1719 + 1720 +== Category 5: Continuous Improvement ===== FR49: A/B Testing Framework === 1721 + 1722 +**Priority:** CRITICAL 1723 +**Fulfills:** Continuous system improvement 1724 +**Phase:** V1.0 1725 + 1726 +**Purpose:** Test and measure improvements to AKEL prompts, algorithms, and workflows. 1727 + 1728 +**Specification:** 1729 + 1730 +**Test Capabilities:** 1731 + 1732 +1. **Prompt Variations:** 1733 + * Test different claim extraction prompts 1734 + * Test different verdict generation prompts 1735 + * Measure: Accuracy, clarity, completeness 1736 + 1737 +2. **Algorithm Variations:** 1738 + * Test different source scoring algorithms 1739 + * Test different confidence calculations 1740 + * Measure: Audit accuracy, user satisfaction 1741 + 1742 +3. **Workflow Variations:** 1743 + * Test different quality gate thresholds 1744 + * Test different risk tier assignments 1745 + * Measure: Publication rate, quality scores 1746 + 1747 +**Implementation:** 1748 + 1749 +* **Traffic Split:** 50/50 or 90/10 splits 1750 +* **Randomization:** Consistent per claim (not per user) 1751 +* **Metrics Collection:** Automatic for all variants 1752 +* **Statistical Significance:** Minimum sample size calculation 1753 +* **Rollout:** Winner promoted to 100% traffic 1754 + 1755 +**A/B Test Workflow:** 1756 + 1757 +{{code}} 1758 +1. Hypothesis: "New prompt improves claim extraction" 1759 +2. Design test: Control vs. Variant 1760 +3. Define metrics: Extraction accuracy, completeness 1761 +4. Run test: 7-14 days, minimum 100 claims each 1762 +5. Analyze results: Statistical significance? 1763 +6. Decision: Deploy winner or iterate 1764 +{{/code}} 1765 + 1766 +**Acceptance Criteria:** 1767 + 1768 +* ✅ A/B testing framework implemented 1769 +* ✅ Can test prompt variations 1770 +* ✅ Can test algorithm variations 1771 +* ✅ Metrics automatically collected 1772 +* ✅ Statistical significance calculated 1773 +* ✅ Results inform system improvements 1774 + 1775 + 1776 +=== FR54: Evidence Deduplication === 1777 + 1778 +**Priority:** CRITICAL (POC2/Beta) 1779 +**Fulfills:** Accurate evidence counting, quality metrics 1780 +**Phase:** POC2, Beta 0, V1.0 1781 + 1782 +**Purpose:** Avoid counting the same source multiple times when it appears in different forms. 1783 + 1784 +**Specification:** 1785 + 1786 +**Deduplication Logic:** 1787 + 1788 +1. **URL Normalization:** 1789 + * Remove tracking parameters (?utm_source=...) 1790 + * Normalize http/https 1791 + * Normalize www/non-www 1792 + * Handle redirects 1793 + 1794 +2. **Content Similarity:** 1795 + * If two sources have >90% text similarity → Same source 1796 + * If one is subset of other → Same source 1797 + * Use fuzzy matching for minor differences 1798 + 1799 +3. **Cross-Domain Syndication:** 1800 + * Detect wire service content (AP, Reuters) 1801 + * Mark as single source if syndicated 1802 + * Count original publication only 1803 + 1804 +**Display:** 1805 + 1806 +{{code}} 1807 +Evidence Sources (3 unique, 5 total): 1808 + 1809 +1. Original Article (NYTimes) 1810 + - Also appeared in: WashPost, Guardian (syndicated) 1811 + 1812 +2. Research Paper (Nature) 1813 + 1814 +3. Official Statement (WHO) 1815 +{{/code}} 1816 + 1817 +**Acceptance Criteria:** 1818 + 1819 +* ✅ URL normalization works 1820 +* ✅ Content similarity detected 1821 +* ✅ Syndicated content identified 1822 +* ✅ Unique vs. total counts accurate 1823 +* ✅ Improves evidence quality metrics 1824 + 1825 + 1826 +== Additional Requirements (Lower Priority) ===== FR50: OSINT Toolkit Integration === 1827 + 1828 + 1829 + 1640 1640 **Fulfills:** Advanced media verification 1641 1641 **Phase:** V1.1 1642 1642