Document Registry
What happens here
The system begins with a declarative registry of financial documents.
Each document is defined by:
- company
- fiscal year
- report type
- source URL
No document is processed unless it is explicitly declared.
Why this matters
This creates reproducibility, auditability, and clear provenance. Nothing "just appears" in the system.
Declarative registry
What we chose: YAML-based document registry listing company, fiscal year, report type, and source URL.
Alternatives: Hardcoding URLs, dynamic crawling, manual uploads
Why: Explicit declaration equals reproducibility. Prevents accidental ingestion and mimics production data contracts.