hc refresh: run Mon/Wed/Fri instead of weekly to shrink CMS data-lag

The 'already revalidated' replies come from the CMS data-lag window: a provider
completes their revalidation, but CMS's public Due Date List still shows them
overdue for weeks. Running the refresh 3x/week instead of weekly shrinks the
part of that window caused by our own refresh interval from up to 7 days to
~2-3, so a provider who has just completed revalidation stops being targeted
sooner. No change to the overdue window or audience size -- this is the lever
that reduces stale-data complaints without losing prospects.
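
The "up to 7 days to ~2-3" figure can be checked with a small calculation. The
sketch below is illustrative only: max_gap_days is a hypothetical helper, not
part of this repository, and it assumes the day-of-week field is a plain
comma-separated list (no ranges, steps, or "*"). It measures only the delay the
refresh schedule adds, not CMS's own publication lag.

```python
# Hypothetical helper (not in the repo): worst-case number of days between
# two consecutive refresh runs, given a crontab day-of-week field.

def max_gap_days(cron_dow: str) -> int:
    """Largest gap in days between consecutive runs.

    cron_dow is a comma-separated list of day numbers as used in the fifth
    crontab field (0 or 7 = Sunday, 1 = Monday, ...). Ranges and steps are
    deliberately not handled in this sketch.
    """
    days = sorted({int(d) % 7 for d in cron_dow.split(",")})
    # Distance from each run day to the next one, wrapping around the week.
    # A single run day wraps onto itself, giving a gap of 7.
    gaps = [
        (days[(i + 1) % len(days)] - day - 1) % 7 + 1
        for i, day in enumerate(days)
    ]
    return max(gaps)


if __name__ == "__main__":
    print("weekly (1):", max_gap_days("1"))           # Monday only
    print("Mon/Wed/Fri (1,3,5):", max_gap_days("1,3,5"))
```

With the old field "1" the worst case is a full week; with "1,3,5" the gaps are
2, 2 and 3 days (the Fri-to-Mon gap being the longest).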
justin 2026-06-08 10:53:36 -05:00
parent a78d60a127
commit 2156a5e05f

@@ -1,10 +1,10 @@
-# Healthcare data refresh: weekly re-check of every emailable NPI against the
-# live government sources (CMS Revalidation list, OIG LEIE) + MX re-classification
-# (Google-host detection) so warmup sends never go stale. Runs Mon 06:00 Central,
-# ~1h before the 07:00 weekday send, propagating fresh status into the channel
-# CSVs the campaign cron reads. Takes ~8 min. SAM is opt-in (--sam-pages); SAM
-# exclusions rarely carry an NPI, so OIG LEIE is the NPI-bearing exclusion source.
-# Then prune-only: remove any subscriber whose domain newly became Google-hosted
-# from the warmup lists (deliverability safety net; removes only likely-bouncers,
-# never evicts for audience reasons).
-0 6 * * 1 deploy cd /opt/performancewest && python3 -u scripts/hc_data_refresh.py >> /var/log/pw-hc-refresh.log 2>&1 && python3 -u scripts/build_healthcare_campaigns_cron.py --prune-only >> /var/log/pw-hc-refresh.log 2>&1
+# Healthcare data refresh: re-check every emailable NPI against the live
+# government sources (CMS Revalidation list, OIG LEIE) + MX re-classification
+# (Google-host detection) so warmup sends never go stale. Runs Mon/Wed/Fri 06:00
+# Central, ~1h before the 07:00 weekday send. Mon/Wed/Fri (vs weekly) shrinks the
+# CMS data-lag window to ~2-3 days, so a provider who just completed their
+# revalidation stops being targeted faster (fewer "already done" replies).
+# Takes ~8 min. SAM is opt-in (--sam-pages); SAM exclusions rarely carry an NPI,
+# so OIG LEIE is the NPI-bearing exclusion source. Then prune-only removes newly-
+# Google-hosted and suppressed subscribers from the warmup lists.
+0 6 * * 1,3,5 deploy cd /opt/performancewest && python3 -u scripts/hc_data_refresh.py >> /var/log/pw-hc-refresh.log 2>&1 && python3 -u scripts/build_healthcare_campaigns_cron.py --prune-only >> /var/log/pw-hc-refresh.log 2>&1
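
Because the refresh now runs on three of the five send days, it may help to see
how old the refreshed data is at each 07:00 weekday send. The following is a
rough sketch, not code from this repository: data_age_hours is a hypothetical
name, and it assumes the ~8 min refresh always finishes before the send and
ignores DST shifts. Weekdays use Python's numbering (Monday=0), so the crontab
field 1,3,5 corresponds to {0, 2, 4}.

```python
# Hypothetical sketch (not in the repo): age of the refreshed data, in hours,
# at each Mon-Fri 07:00 send for a given set of refresh weekdays.

REFRESH_HOUR = 6  # 06:00 Central, from the crontab entry
SEND_HOUR = 7     # 07:00 weekday send, from the crontab comment


def data_age_hours(refresh_weekdays: set) -> dict:
    """Hours between the latest refresh and each weekday send.

    refresh_weekdays uses Python numbering: Monday=0 ... Sunday=6.
    """
    if not refresh_weekdays:
        raise ValueError("at least one refresh weekday is required")
    ages = {}
    for send_day, name in enumerate(["Mon", "Tue", "Wed", "Thu", "Fri"]):
        # Walk backwards one day at a time until a refresh day is found.
        days_back = 0
        while (send_day - days_back) % 7 not in refresh_weekdays:
            days_back += 1
        ages[name] = days_back * 24 + (SEND_HOUR - REFRESH_HOUR)
    return ages


if __name__ == "__main__":
    print("weekly:     ", data_age_hours({0}))
    print("Mon/Wed/Fri:", data_age_hours({0, 2, 4}))
```

Under these assumptions the Tue and Thu sends read data from the previous day's
refresh, while the weekly schedule let the Friday send run on data about four
days old.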