scripts(otc): dedupe by CIK; commit the 861-company lead list

The master file lists warrants/units as separate tickers under one CIK, so the
pull now dedupes to one row per company (other tickers kept in all_tickers).

data/otc_leads.csv: 861 unique active US-domestic microcap OTC issuers
(<$75M float, all actively filing, 100% with business address + phone). By
incorporation: DE 365, NV 325 (DE+NV=690 = the reincorporation targets), WY 44,
FL 39, MD 38. Dropped from the 2,771 OTC universe: 1,672 foreign, 62
accelerated/large filers, 73 delinquent/dark. EDGAR has no email -> phone +
address captured for enrichment / direct mail / call.
This commit is contained in:
justin 2026-06-09 07:10:54 -05:00
parent 1b3cbf2fbf
commit 37393e5bbc
3 changed files with 2684 additions and 0 deletions

1808
data/otc_leads_rejected.csv Normal file

File diff suppressed because it is too large Load diff