Data sources
Every signal on FilingIQ traces back to a primary public filing. This page is the canonical inventory of where the data comes from, how often it refreshes, and under what license it can be redistributed.
Authored by Thomas Kraaibeek, Co-founder and Head of Engineering.
Primary sources
| Source | Coverage | License | Last verified |
|---|---|---|---|
| SEC EDGAR Form 4 | All US-listed issuers | Public domain | 2026-05-14 |
| SEC EDGAR Form 13F-HR | Roughly 13,000 13F filers | Public domain | 2026-05-14 |
| SEC EDGAR Forms 3 / 5 / SC 13D / SC 13G | Insider initial, annual, and 5% owner filings | Public domain | 2026-05-14 |
| House Office of the Clerk PTR | US Representatives under STOCK Act | Public domain | 2026-05-14 |
| Senate Office of Public Records PTR | US Senators under STOCK Act | Public domain | 2026-05-14 |
| Bioguide (bioguide.congress.gov) | Member identity normalization | Public domain | 2026-05-14 |
| Yahoo Finance (company profile enrichment) | Company metadata, market cap, exchange | Terms of service | 2026-05-14 |
How we verify each source
FilingIQ ingests directly from primary regulatory portals. We do not resell third-party data feeds. The ingestion pipeline runs continuously against the SEC EDGAR full-text search index for Form 4 and Form 13F, and against the House Office of the Clerk and Senate Office of Public Records portals for Periodic Transaction Reports filed under the STOCK Act.
Each row stored in our production database carries a provenance pointer to the underlying filing: the SEC accession number for EDGAR-sourced rows, and the canonical PDF URL for Congressional disclosures. Where the upstream document is amended (Form 4 amendments are common), we keep both the original and the amendment, mark the original superseded, and rescore on the amendment payload.
Company-level metadata (legal name, exchange, sector, market cap, ISIN) is enriched nightly from Yahoo Finance to support entity disambiguation. The enrichment is treated as supplementary; primary identity is always the SEC CIK or the bioguide ID for a Member of Congress.
Field-level provenance
For every transaction surfaced on the platform, the following fields are traceable to a primary source:
- Insider name, role, and CIK: derived from the Form 4 reporting owner block.
- Issuer name, CIK, and ticker: derived from the Form 4 issuer block, joined to our company table on CIK.
- Transaction date, price, shares, and code: parsed from the Form 4 Table I non-derivative rows or Table II derivative rows.
- 10b5-1 plan flag: parsed from the Form 4 footnote text using the standard footnote markers SEC publishes for trading plan disclosures.
- Politician name, chamber, party, state, and district: joined from the bioguide reference dataset on the politician identifier extracted from the PTR filing.
- Congressional transaction security and amount range: parsed from the PTR PDF; the amount field is a discrete range bucket (for example $1,001 to $15,000) per the STOCK Act disclosure schedule.
Derived signals (FilingIQ-published)
- FilingIQ Composite Score: multi-factor Form 4 model, defined at /methodology. Proprietary to FilingIQ.
- Institutional Flow Score (IFS): 13F-derived institutional accumulation signal. Proprietary to FilingIQ.
- Congressional Political Score (CPS): STOCK Act trade signal. Proprietary to FilingIQ.
Attribution
SEC EDGAR and Congressional disclosure data is US Government public-domain. No attribution to FilingIQ is required for the raw filings. When citing FilingIQ-derived scores (Composite, IFS, CPS) in research, please link back to filingiq.io/methodology.
Methodology
For the full scoring methodology and known data quality caveats, see methodology. Author bios at /about/team. Not investment advice; see disclaimer.