Changes for page Requirements
Last modified by Robert Schaub on 2025/12/23 11:03
From version 7.2
edited by Robert Schaub
on 2025/12/23 11:03
on 2025/12/23 11:03
Change comment:
Update document after refactoring.
Summary
-
Page properties (2 modified, 0 added, 0 removed)
Details
- Page properties
-
- Parent
-
... ... @@ -1,1 +1,1 @@ 1 -Test.FactHarbor V0\.9\.76.Specification.WebHome1 +Test.FactHarbor.Specification.WebHome - Content
-
... ... @@ -1617,216 +1617,26 @@ 1617 1617 1618 1618 === FR47: Archive.org Integration === 1619 1619 1620 -**Priority:** CRITICAL 1621 -**Fulfills:** Evidence persistence, FR5 (Evidence linking) 1622 -**Phase:** V1.0 1620 +Auto-save evidence sources to Wayback Machine. 1623 1623 1624 -** Purpose:** Ensure evidence remains accessibleeven if originalsourcesaredeleted.1622 +**Milestone:** Beta 0 1625 1625 1626 - **Specification:**1624 +=== FR48: Safety Framework for Contributors === 1627 1627 1628 - **AutomaticArchiving:**1626 +Protect contributors from harassment and legal threats. 1629 1629 1630 -When AKEL links evidence: 1631 -1. Check if URL already archived (Wayback Machine API) 1632 -2. If not, submit for archiving (Save Page Now API) 1633 -3. Store both original URL and archive URL 1634 -4. Display both to users 1628 +**Milestone:** V1.1 1635 1635 1636 - **ArchiveDisplay:**1630 +=== FR49: A/B Testing Framework === 1637 1637 1638 -{{code}} 1639 -Evidence Source: [Original URL] 1640 -Archived: [Archive.org URL] (Captured: [date]) 1632 +Test AKEL approaches and UI designs systematically. 1641 1641 1642 -[View Original] [View Archive] 1643 -{{/code}} 1634 +**Milestone:** V1.0 1644 1644 1645 - **FallbackLogic:**1636 +=== FR50: OSINT Toolkit Integration === 1646 1646 1647 -* If original URL unavailable → Auto-redirect to archive 1648 -* If archive unavailable → Display warning 1649 -* If both unavailable → Flag for manual review 1650 1650 1651 -**API Integration:** 1652 1652 1653 -* Use Wayback Machine Availability API 1654 -* Use Save Page Now API (SPNv2) 1655 -* Rate limiting: 15 requests/minute (Wayback limit) 1656 - 1657 -**Acceptance Criteria:** 1658 - 1659 -* ✅ All evidence URLs auto-archived 1660 -* ✅ Archive links displayed to users 1661 -* ✅ Fallback to archive if original unavailable 1662 -* ✅ API rate limits respected 1663 -* ✅ Archive status visible in evidence display 1664 - 1665 - 1666 -== Category 4: Community Safety ===== FR48: Contributor Safety Framework === 1667 - 1668 -**Priority:** CRITICAL 1669 -**Fulfills:** UN-28 (Safe contribution environment) 1670 -**Phase:** V1.0 1671 - 1672 -**Purpose:** Protect contributors from harassment, doxxing, and coordinated attacks. 1673 - 1674 -**Specification:** 1675 - 1676 -**1. Privacy Protection:** 1677 - 1678 -* **Optional Pseudonymity:** Contributors can use pseudonyms 1679 -* **Email Privacy:** Emails never displayed publicly 1680 -* **Profile Privacy:** Contributors control what's public 1681 -* **IP Logging:** Only for abuse prevention, not public 1682 - 1683 -**2. Harassment Prevention:** 1684 - 1685 -* **Automated Toxicity Detection:** Flag abusive comments 1686 -* **Personal Information Detection:** Auto-block doxxing attempts 1687 -* **Coordinated Attack Detection:** Identify brigading patterns 1688 -* **Rapid Response:** Moderator alerts for harassment 1689 - 1690 -**3. Safety Features:** 1691 - 1692 -* **Block Users:** Contributors can block harassers 1693 -* **Private Contributions:** Option to contribute anonymously 1694 -* **Report Harassment:** One-click harassment reporting 1695 -* **Safety Resources:** Links to support resources 1696 - 1697 -**4. Moderator Tools:** 1698 - 1699 -* **Quick Ban:** Immediately block abusers 1700 -* **Pattern Detection:** Identify coordinated attacks 1701 -* **Appeal Process:** Fair review of moderation actions 1702 -* **Escalation:** Serious threats escalated to authorities 1703 - 1704 -**5. Trusted Contributor Protection:** 1705 - 1706 -* **Enhanced Privacy:** Additional protection for high-profile contributors 1707 -* **Verification:** Optional identity verification (not public) 1708 -* **Legal Support:** Resources for contributors facing legal threats 1709 - 1710 -**Acceptance Criteria:** 1711 - 1712 -* ✅ Pseudonyms supported 1713 -* ✅ Toxicity detection active 1714 -* ✅ Doxxing auto-blocked 1715 -* ✅ Harassment reporting functional 1716 -* ✅ Moderator tools implemented 1717 -* ✅ Safety policy published 1718 - 1719 - 1720 -== Category 5: Continuous Improvement ===== FR49: A/B Testing Framework === 1721 - 1722 -**Priority:** CRITICAL 1723 -**Fulfills:** Continuous system improvement 1724 -**Phase:** V1.0 1725 - 1726 -**Purpose:** Test and measure improvements to AKEL prompts, algorithms, and workflows. 1727 - 1728 -**Specification:** 1729 - 1730 -**Test Capabilities:** 1731 - 1732 -1. **Prompt Variations:** 1733 - * Test different claim extraction prompts 1734 - * Test different verdict generation prompts 1735 - * Measure: Accuracy, clarity, completeness 1736 - 1737 -2. **Algorithm Variations:** 1738 - * Test different source scoring algorithms 1739 - * Test different confidence calculations 1740 - * Measure: Audit accuracy, user satisfaction 1741 - 1742 -3. **Workflow Variations:** 1743 - * Test different quality gate thresholds 1744 - * Test different risk tier assignments 1745 - * Measure: Publication rate, quality scores 1746 - 1747 -**Implementation:** 1748 - 1749 -* **Traffic Split:** 50/50 or 90/10 splits 1750 -* **Randomization:** Consistent per claim (not per user) 1751 -* **Metrics Collection:** Automatic for all variants 1752 -* **Statistical Significance:** Minimum sample size calculation 1753 -* **Rollout:** Winner promoted to 100% traffic 1754 - 1755 -**A/B Test Workflow:** 1756 - 1757 -{{code}} 1758 -1. Hypothesis: "New prompt improves claim extraction" 1759 -2. Design test: Control vs. Variant 1760 -3. Define metrics: Extraction accuracy, completeness 1761 -4. Run test: 7-14 days, minimum 100 claims each 1762 -5. Analyze results: Statistical significance? 1763 -6. Decision: Deploy winner or iterate 1764 -{{/code}} 1765 - 1766 -**Acceptance Criteria:** 1767 - 1768 -* ✅ A/B testing framework implemented 1769 -* ✅ Can test prompt variations 1770 -* ✅ Can test algorithm variations 1771 -* ✅ Metrics automatically collected 1772 -* ✅ Statistical significance calculated 1773 -* ✅ Results inform system improvements 1774 - 1775 - 1776 -=== FR54: Evidence Deduplication === 1777 - 1778 -**Priority:** CRITICAL (POC2/Beta) 1779 -**Fulfills:** Accurate evidence counting, quality metrics 1780 -**Phase:** POC2, Beta 0, V1.0 1781 - 1782 -**Purpose:** Avoid counting the same source multiple times when it appears in different forms. 1783 - 1784 -**Specification:** 1785 - 1786 -**Deduplication Logic:** 1787 - 1788 -1. **URL Normalization:** 1789 - * Remove tracking parameters (?utm_source=...) 1790 - * Normalize http/https 1791 - * Normalize www/non-www 1792 - * Handle redirects 1793 - 1794 -2. **Content Similarity:** 1795 - * If two sources have >90% text similarity → Same source 1796 - * If one is subset of other → Same source 1797 - * Use fuzzy matching for minor differences 1798 - 1799 -3. **Cross-Domain Syndication:** 1800 - * Detect wire service content (AP, Reuters) 1801 - * Mark as single source if syndicated 1802 - * Count original publication only 1803 - 1804 -**Display:** 1805 - 1806 -{{code}} 1807 -Evidence Sources (3 unique, 5 total): 1808 - 1809 -1. Original Article (NYTimes) 1810 - - Also appeared in: WashPost, Guardian (syndicated) 1811 - 1812 -2. Research Paper (Nature) 1813 - 1814 -3. Official Statement (WHO) 1815 -{{/code}} 1816 - 1817 -**Acceptance Criteria:** 1818 - 1819 -* ✅ URL normalization works 1820 -* ✅ Content similarity detected 1821 -* ✅ Syndicated content identified 1822 -* ✅ Unique vs. total counts accurate 1823 -* ✅ Improves evidence quality metrics 1824 - 1825 - 1826 -== Additional Requirements (Lower Priority) ===== FR50: OSINT Toolkit Integration === 1827 - 1828 - 1829 - 1830 1830 **Fulfills:** Advanced media verification 1831 1831 **Phase:** V1.1 1832 1832