Changes for page Requirements

Last modified by Robert Schaub on 2025/12/23 11:03

From version 6.1
edited by Robert Schaub
on 2025/12/23 08:03
Change comment: Imported from XAR
To version 7.1
edited by Robert Schaub
on 2025/12/23 09:47
Change comment: Imported from XAR

Summary

Details

Page properties
Content
... ... @@ -1617,26 +1617,216 @@
1617 1617  
1618 1618  === FR47: Archive.org Integration ===
1619 1619  
1620 -Auto-save evidence sources to Wayback Machine.
1620 +**Priority:** CRITICAL
1621 +**Fulfills:** Evidence persistence, FR5 (Evidence linking)
1622 +**Phase:** V1.0
1621 1621  
1622 -**Milestone:** Beta 0
1624 +**Purpose:** Ensure evidence remains accessible even if original sources are deleted.
1623 1623  
1624 -=== FR48: Safety Framework for Contributors ===
1626 +**Specification:**
1625 1625  
1626 -Protect contributors from harassment and legal threats.
1628 +**Automatic Archiving:**
1627 1627  
1628 -**Milestone:** V1.1
1630 +When AKEL links evidence:
1631 +1. Check if URL already archived (Wayback Machine API)
1632 +2. If not, submit for archiving (Save Page Now API)
1633 +3. Store both original URL and archive URL
1634 +4. Display both to users
1629 1629  
1630 -=== FR49: A/B Testing Framework ===
1636 +**Archive Display:**
1631 1631  
1632 -Test AKEL approaches and UI designs systematically.
1638 +{{code}}
1639 +Evidence Source: [Original URL]
1640 +Archived: [Archive.org URL] (Captured: [date])
1633 1633  
1634 -**Milestone:** V1.0
1642 +[View Original] [View Archive]
1643 +{{/code}}
1635 1635  
1636 -=== FR50: OSINT Toolkit Integration ===
1645 +**Fallback Logic:**
1637 1637  
1647 +* If original URL unavailable → Auto-redirect to archive
1648 +* If archive unavailable → Display warning
1649 +* If both unavailable → Flag for manual review
1638 1638  
1651 +**API Integration:**
1639 1639  
1653 +* Use Wayback Machine Availability API
1654 +* Use Save Page Now API (SPNv2)
1655 +* Rate limiting: 15 requests/minute (Wayback limit)
1656 +
1657 +**Acceptance Criteria:**
1658 +
1659 +* ✅ All evidence URLs auto-archived
1660 +* ✅ Archive links displayed to users
1661 +* ✅ Fallback to archive if original unavailable
1662 +* ✅ API rate limits respected
1663 +* ✅ Archive status visible in evidence display
1664 +
1665 +
1666 +== Category 4: Community Safety ===== FR48: Contributor Safety Framework ===
1667 +
1668 +**Priority:** CRITICAL
1669 +**Fulfills:** UN-28 (Safe contribution environment)
1670 +**Phase:** V1.0
1671 +
1672 +**Purpose:** Protect contributors from harassment, doxxing, and coordinated attacks.
1673 +
1674 +**Specification:**
1675 +
1676 +**1. Privacy Protection:**
1677 +
1678 +* **Optional Pseudonymity:** Contributors can use pseudonyms
1679 +* **Email Privacy:** Emails never displayed publicly
1680 +* **Profile Privacy:** Contributors control what's public
1681 +* **IP Logging:** Only for abuse prevention, not public
1682 +
1683 +**2. Harassment Prevention:**
1684 +
1685 +* **Automated Toxicity Detection:** Flag abusive comments
1686 +* **Personal Information Detection:** Auto-block doxxing attempts
1687 +* **Coordinated Attack Detection:** Identify brigading patterns
1688 +* **Rapid Response:** Moderator alerts for harassment
1689 +
1690 +**3. Safety Features:**
1691 +
1692 +* **Block Users:** Contributors can block harassers
1693 +* **Private Contributions:** Option to contribute anonymously
1694 +* **Report Harassment:** One-click harassment reporting
1695 +* **Safety Resources:** Links to support resources
1696 +
1697 +**4. Moderator Tools:**
1698 +
1699 +* **Quick Ban:** Immediately block abusers
1700 +* **Pattern Detection:** Identify coordinated attacks
1701 +* **Appeal Process:** Fair review of moderation actions
1702 +* **Escalation:** Serious threats escalated to authorities
1703 +
1704 +**5. Trusted Contributor Protection:**
1705 +
1706 +* **Enhanced Privacy:** Additional protection for high-profile contributors
1707 +* **Verification:** Optional identity verification (not public)
1708 +* **Legal Support:** Resources for contributors facing legal threats
1709 +
1710 +**Acceptance Criteria:**
1711 +
1712 +* ✅ Pseudonyms supported
1713 +* ✅ Toxicity detection active
1714 +* ✅ Doxxing auto-blocked
1715 +* ✅ Harassment reporting functional
1716 +* ✅ Moderator tools implemented
1717 +* ✅ Safety policy published
1718 +
1719 +
1720 +== Category 5: Continuous Improvement ===== FR49: A/B Testing Framework ===
1721 +
1722 +**Priority:** CRITICAL
1723 +**Fulfills:** Continuous system improvement
1724 +**Phase:** V1.0
1725 +
1726 +**Purpose:** Test and measure improvements to AKEL prompts, algorithms, and workflows.
1727 +
1728 +**Specification:**
1729 +
1730 +**Test Capabilities:**
1731 +
1732 +1. **Prompt Variations:**
1733 + * Test different claim extraction prompts
1734 + * Test different verdict generation prompts
1735 + * Measure: Accuracy, clarity, completeness
1736 +
1737 +2. **Algorithm Variations:**
1738 + * Test different source scoring algorithms
1739 + * Test different confidence calculations
1740 + * Measure: Audit accuracy, user satisfaction
1741 +
1742 +3. **Workflow Variations:**
1743 + * Test different quality gate thresholds
1744 + * Test different risk tier assignments
1745 + * Measure: Publication rate, quality scores
1746 +
1747 +**Implementation:**
1748 +
1749 +* **Traffic Split:** 50/50 or 90/10 splits
1750 +* **Randomization:** Consistent per claim (not per user)
1751 +* **Metrics Collection:** Automatic for all variants
1752 +* **Statistical Significance:** Minimum sample size calculation
1753 +* **Rollout:** Winner promoted to 100% traffic
1754 +
1755 +**A/B Test Workflow:**
1756 +
1757 +{{code}}
1758 +1. Hypothesis: "New prompt improves claim extraction"
1759 +2. Design test: Control vs. Variant
1760 +3. Define metrics: Extraction accuracy, completeness
1761 +4. Run test: 7-14 days, minimum 100 claims each
1762 +5. Analyze results: Statistical significance?
1763 +6. Decision: Deploy winner or iterate
1764 +{{/code}}
1765 +
1766 +**Acceptance Criteria:**
1767 +
1768 +* ✅ A/B testing framework implemented
1769 +* ✅ Can test prompt variations
1770 +* ✅ Can test algorithm variations
1771 +* ✅ Metrics automatically collected
1772 +* ✅ Statistical significance calculated
1773 +* ✅ Results inform system improvements
1774 +
1775 +
1776 +=== FR54: Evidence Deduplication ===
1777 +
1778 +**Priority:** CRITICAL (POC2/Beta)
1779 +**Fulfills:** Accurate evidence counting, quality metrics
1780 +**Phase:** POC2, Beta 0, V1.0
1781 +
1782 +**Purpose:** Avoid counting the same source multiple times when it appears in different forms.
1783 +
1784 +**Specification:**
1785 +
1786 +**Deduplication Logic:**
1787 +
1788 +1. **URL Normalization:**
1789 + * Remove tracking parameters (?utm_source=...)
1790 + * Normalize http/https
1791 + * Normalize www/non-www
1792 + * Handle redirects
1793 +
1794 +2. **Content Similarity:**
1795 + * If two sources have >90% text similarity → Same source
1796 + * If one is subset of other → Same source
1797 + * Use fuzzy matching for minor differences
1798 +
1799 +3. **Cross-Domain Syndication:**
1800 + * Detect wire service content (AP, Reuters)
1801 + * Mark as single source if syndicated
1802 + * Count original publication only
1803 +
1804 +**Display:**
1805 +
1806 +{{code}}
1807 +Evidence Sources (3 unique, 5 total):
1808 +
1809 +1. Original Article (NYTimes)
1810 + - Also appeared in: WashPost, Guardian (syndicated)
1811 +
1812 +2. Research Paper (Nature)
1813 +
1814 +3. Official Statement (WHO)
1815 +{{/code}}
1816 +
1817 +**Acceptance Criteria:**
1818 +
1819 +* ✅ URL normalization works
1820 +* ✅ Content similarity detected
1821 +* ✅ Syndicated content identified
1822 +* ✅ Unique vs. total counts accurate
1823 +* ✅ Improves evidence quality metrics
1824 +
1825 +
1826 +== Additional Requirements (Lower Priority) ===== FR50: OSINT Toolkit Integration ===
1827 +
1828 +
1829 +
1640 1640  **Fulfills:** Advanced media verification
1641 1641  **Phase:** V1.1
1642 1642