new-site

Author	SHA1	Message	Date
justin	1eb29f80be	fix(verifier): mx_unreachable was mislabeling live big-ISP mailboxes The verifier returned (True, 'mx_unreachable') when it couldn't complete a port-25 probe to ANY MX — marking 438,163 addresses email_verified=TRUE. But these are NOT dead: they're dominated by Comcast (13.7k), AT&T/SBCGlobal (13.5k), Verizon, Cox, Charter, Frontier, etc. — major ISPs that deliberately tarpit/refuse probes from unknown IPs. Confirmed from prod: comcast MX connects + returns 220. The probe failure ≠ undeliverable. Fix: return (False, 'mx_probe_blocked') — MX exists, deliverability UNKNOWN, must be confirmed by a real send. Excluded from PW campaigns; prime burner-verification target (burner_list_verify upgrades it to send_confirmed on delivery). Existing 438,163 mx_unreachable rows reclassified in prod to mx_probe_blocked / verified=FALSE.	2026-06-17 05:48:08 -05:00
justin	0b7a35a58e	trucking campaigns: daily builder + MX verifier concurrency + tracking column - build_trucking_campaigns.py: nightly script that creates 8 Listmonk campaigns per day (4 TZ x 2 types: MCS-150 overdue 2k/TZ, inactive USDOT 1k/TZ) at 4AM ET / 5AM ET (CT) / 6AM ET (MT) / 7AM ET (PT). Deduplicates via listmonk_sent_at column. - migration 083: add listmonk_sent_at + listmonk_campaign_type to fmcsa_carriers - email_verifier.py: bump max_workers from 5 to 20 for 4x faster throughput - cron: daily pw-trucking-campaigns at 08:00 UTC (3 AM EST) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-31 10:07:44 -05:00
justin	72d1b336c5	Add --where filter to email verifier for targeted scrubs Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-05-30 15:06:02 -05:00
justin	97f6a08183	Bind email verifier to secondary IP (.72) for SMTP probes Campaign emails send from .71 via Postfix (now explicitly bound). Verification RCPT TO probes go from .72 to protect sending reputation. Configurable via VERIFY_SOURCE_IP env var. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-05-30 14:52:22 -05:00
justin	ed0a44645e	Email verifier: add catch-all domain detection via random probe Before checking the real address, sends a random 20-char address to the domain. If the server accepts it (250), the domain is catch-all and individual verification is meaningless. Result cached per domain. Existing known catch-all list (gmail, outlook, etc.) still bypassed. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-05-29 14:34:23 -05:00
justin	e67df3c4c3	DOT intake: review-only (no telecom entity step), email verifier, updated flagger excluding 4+ year stale carriers - Intake manifest: DOT services use ["review"] only, skipping the telecom entity step with FRN/USAC fields - Flagger: excludes 4+ year overdue carriers from campaign (spam trap risk). 18,277 safe targets from 100K records. - Email verifier: self-hosted MX + SMTP verification tool Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-05-29 00:29:28 -05:00

6 commits