Changes for page Requirements

Last modified by Robert Schaub on 2025/12/23 11:03

From version 7.1
edited by Robert Schaub
on 2025/12/23 09:47
Change comment: Imported from XAR
To version 6.1
edited by Robert Schaub
on 2025/12/23 08:03
Change comment: Imported from XAR

Summary

Details

Page properties
Content
... ... @@ -1617,216 +1617,26 @@
1617 1617  
1618 1618  === FR47: Archive.org Integration ===
1619 1619  
1620 -**Priority:** CRITICAL
1621 -**Fulfills:** Evidence persistence, FR5 (Evidence linking)
1622 -**Phase:** V1.0
1620 +Auto-save evidence sources to Wayback Machine.
1623 1623  
1624 -**Purpose:** Ensure evidence remains accessible even if original sources are deleted.
1622 +**Milestone:** Beta 0
1625 1625  
1626 -**Specification:**
1624 +=== FR48: Safety Framework for Contributors ===
1627 1627  
1628 -**Automatic Archiving:**
1626 +Protect contributors from harassment and legal threats.
1629 1629  
1630 -When AKEL links evidence:
1631 -1. Check if URL already archived (Wayback Machine API)
1632 -2. If not, submit for archiving (Save Page Now API)
1633 -3. Store both original URL and archive URL
1634 -4. Display both to users
1628 +**Milestone:** V1.1
1635 1635  
1636 -**Archive Display:**
1630 +=== FR49: A/B Testing Framework ===
1637 1637  
1638 -{{code}}
1639 -Evidence Source: [Original URL]
1640 -Archived: [Archive.org URL] (Captured: [date])
1632 +Test AKEL approaches and UI designs systematically.
1641 1641  
1642 -[View Original] [View Archive]
1643 -{{/code}}
1634 +**Milestone:** V1.0
1644 1644  
1645 -**Fallback Logic:**
1636 +=== FR50: OSINT Toolkit Integration ===
1646 1646  
1647 -* If original URL unavailable → Auto-redirect to archive
1648 -* If archive unavailable → Display warning
1649 -* If both unavailable → Flag for manual review
1650 1650  
1651 -**API Integration:**
1652 1652  
1653 -* Use Wayback Machine Availability API
1654 -* Use Save Page Now API (SPNv2)
1655 -* Rate limiting: 15 requests/minute (Wayback limit)
1656 -
1657 -**Acceptance Criteria:**
1658 -
1659 -* ✅ All evidence URLs auto-archived
1660 -* ✅ Archive links displayed to users
1661 -* ✅ Fallback to archive if original unavailable
1662 -* ✅ API rate limits respected
1663 -* ✅ Archive status visible in evidence display
1664 -
1665 -
1666 -== Category 4: Community Safety ===== FR48: Contributor Safety Framework ===
1667 -
1668 -**Priority:** CRITICAL
1669 -**Fulfills:** UN-28 (Safe contribution environment)
1670 -**Phase:** V1.0
1671 -
1672 -**Purpose:** Protect contributors from harassment, doxxing, and coordinated attacks.
1673 -
1674 -**Specification:**
1675 -
1676 -**1. Privacy Protection:**
1677 -
1678 -* **Optional Pseudonymity:** Contributors can use pseudonyms
1679 -* **Email Privacy:** Emails never displayed publicly
1680 -* **Profile Privacy:** Contributors control what's public
1681 -* **IP Logging:** Only for abuse prevention, not public
1682 -
1683 -**2. Harassment Prevention:**
1684 -
1685 -* **Automated Toxicity Detection:** Flag abusive comments
1686 -* **Personal Information Detection:** Auto-block doxxing attempts
1687 -* **Coordinated Attack Detection:** Identify brigading patterns
1688 -* **Rapid Response:** Moderator alerts for harassment
1689 -
1690 -**3. Safety Features:**
1691 -
1692 -* **Block Users:** Contributors can block harassers
1693 -* **Private Contributions:** Option to contribute anonymously
1694 -* **Report Harassment:** One-click harassment reporting
1695 -* **Safety Resources:** Links to support resources
1696 -
1697 -**4. Moderator Tools:**
1698 -
1699 -* **Quick Ban:** Immediately block abusers
1700 -* **Pattern Detection:** Identify coordinated attacks
1701 -* **Appeal Process:** Fair review of moderation actions
1702 -* **Escalation:** Serious threats escalated to authorities
1703 -
1704 -**5. Trusted Contributor Protection:**
1705 -
1706 -* **Enhanced Privacy:** Additional protection for high-profile contributors
1707 -* **Verification:** Optional identity verification (not public)
1708 -* **Legal Support:** Resources for contributors facing legal threats
1709 -
1710 -**Acceptance Criteria:**
1711 -
1712 -* ✅ Pseudonyms supported
1713 -* ✅ Toxicity detection active
1714 -* ✅ Doxxing auto-blocked
1715 -* ✅ Harassment reporting functional
1716 -* ✅ Moderator tools implemented
1717 -* ✅ Safety policy published
1718 -
1719 -
1720 -== Category 5: Continuous Improvement ===== FR49: A/B Testing Framework ===
1721 -
1722 -**Priority:** CRITICAL
1723 -**Fulfills:** Continuous system improvement
1724 -**Phase:** V1.0
1725 -
1726 -**Purpose:** Test and measure improvements to AKEL prompts, algorithms, and workflows.
1727 -
1728 -**Specification:**
1729 -
1730 -**Test Capabilities:**
1731 -
1732 -1. **Prompt Variations:**
1733 - * Test different claim extraction prompts
1734 - * Test different verdict generation prompts
1735 - * Measure: Accuracy, clarity, completeness
1736 -
1737 -2. **Algorithm Variations:**
1738 - * Test different source scoring algorithms
1739 - * Test different confidence calculations
1740 - * Measure: Audit accuracy, user satisfaction
1741 -
1742 -3. **Workflow Variations:**
1743 - * Test different quality gate thresholds
1744 - * Test different risk tier assignments
1745 - * Measure: Publication rate, quality scores
1746 -
1747 -**Implementation:**
1748 -
1749 -* **Traffic Split:** 50/50 or 90/10 splits
1750 -* **Randomization:** Consistent per claim (not per user)
1751 -* **Metrics Collection:** Automatic for all variants
1752 -* **Statistical Significance:** Minimum sample size calculation
1753 -* **Rollout:** Winner promoted to 100% traffic
1754 -
1755 -**A/B Test Workflow:**
1756 -
1757 -{{code}}
1758 -1. Hypothesis: "New prompt improves claim extraction"
1759 -2. Design test: Control vs. Variant
1760 -3. Define metrics: Extraction accuracy, completeness
1761 -4. Run test: 7-14 days, minimum 100 claims each
1762 -5. Analyze results: Statistical significance?
1763 -6. Decision: Deploy winner or iterate
1764 -{{/code}}
1765 -
1766 -**Acceptance Criteria:**
1767 -
1768 -* ✅ A/B testing framework implemented
1769 -* ✅ Can test prompt variations
1770 -* ✅ Can test algorithm variations
1771 -* ✅ Metrics automatically collected
1772 -* ✅ Statistical significance calculated
1773 -* ✅ Results inform system improvements
1774 -
1775 -
1776 -=== FR54: Evidence Deduplication ===
1777 -
1778 -**Priority:** CRITICAL (POC2/Beta)
1779 -**Fulfills:** Accurate evidence counting, quality metrics
1780 -**Phase:** POC2, Beta 0, V1.0
1781 -
1782 -**Purpose:** Avoid counting the same source multiple times when it appears in different forms.
1783 -
1784 -**Specification:**
1785 -
1786 -**Deduplication Logic:**
1787 -
1788 -1. **URL Normalization:**
1789 - * Remove tracking parameters (?utm_source=...)
1790 - * Normalize http/https
1791 - * Normalize www/non-www
1792 - * Handle redirects
1793 -
1794 -2. **Content Similarity:**
1795 - * If two sources have >90% text similarity → Same source
1796 - * If one is subset of other → Same source
1797 - * Use fuzzy matching for minor differences
1798 -
1799 -3. **Cross-Domain Syndication:**
1800 - * Detect wire service content (AP, Reuters)
1801 - * Mark as single source if syndicated
1802 - * Count original publication only
1803 -
1804 -**Display:**
1805 -
1806 -{{code}}
1807 -Evidence Sources (3 unique, 5 total):
1808 -
1809 -1. Original Article (NYTimes)
1810 - - Also appeared in: WashPost, Guardian (syndicated)
1811 -
1812 -2. Research Paper (Nature)
1813 -
1814 -3. Official Statement (WHO)
1815 -{{/code}}
1816 -
1817 -**Acceptance Criteria:**
1818 -
1819 -* ✅ URL normalization works
1820 -* ✅ Content similarity detected
1821 -* ✅ Syndicated content identified
1822 -* ✅ Unique vs. total counts accurate
1823 -* ✅ Improves evidence quality metrics
1824 -
1825 -
1826 -== Additional Requirements (Lower Priority) ===== FR50: OSINT Toolkit Integration ===
1827 -
1828 -
1829 -
1830 1830  **Fulfills:** Advanced media verification
1831 1831  **Phase:** V1.1
1832 1832