title: Pollster slant vs customer mix status: superseded by AN-017 / AN-018 (2026-06-02)

Pollster slant vs customer mix

Test of the reputation-equilibrium theory in docs/theory.md § "Pollster reputation: volume vs customer mix": per-pollster β should be increasing in the firm's share of candidate-sponsored polls and decreasing in their share of media-sponsored polls.

Update (2026-06-02). The first-pass test below (11 institutes from AN-007) was extended to 31 firms in AN-017; AN-018 then showed firm size dominates customer mix. On the 31-firm sample, customer-share is still positively signed (γ = +7.6 unweighted / +4.3 WLS) but explains < 5% of β variance (p = 0.27 / 0.54). Log firm volume gives δ = −4.28 unweighted (p = 0.017, R² = 0.18) and δ = −5.74 WLS (p = 0.0005, R² = 0.35); joint regression keeps size significant (δ = −7.09, p = 0.0002, R² = 0.40) and drops customer- share to p = 0.13. Monotone tertile split: small firms (n_total median ~13) β = +11.98; medium (~41) β = +8.64; large (~118) β = −0.93. The IIP/Census high-volume / near-zero-β cluster identified in the original scatter below was the volume-discipline mechanism showing through, not a customer-mix-sorting failure. Theory.md § "Pollster reputation" has been rewritten to lead with volume; customer-mix sorting is now the secondary refinement.

The original first-pass test is preserved below for historical context.

Method

For each of the 462 distinct pollster institutes in the 2024 mayoral sample, classify their polls into customer categories:

Join with build/table/per_pollster_beta.csv (the per-pollster β from the heterogeneity battery). Restrict to institutes with ≥5 self-sponsored polls AND non-degenerate cluster-robust SE (SE > 0.1) — leaves 11 institutes.

Regress: β = α + γ · candidate_share + ε.

Results

Spec Slope γ SE p n
OLS (unweighted) +13.58 15.39 0.40 0.08 11
WLS (weighted by n_self) +6.28 8.34 0.47 0.06 11

Both slopes have the predicted positive sign, but neither is statistically significant. n=11 is too thin for a sharp test.

Auxiliary specs:

Spec Coefficient SE p
β ~ media_share γ_m = -17.50 13.52 0.27
β ~ pollster_self_share γ_p = -22.16 16.80 0.18

Both negative, as the theory predicts (more media → less slant). Still underpowered.

Scatter

Pollster β vs candidate-share

The visual picture qualitatively supports the theory at the low-end of candidate-share but breaks down at the high-end:

The IIP/Census pattern at the high end is the most interesting empirical finding. It's consistent with one of the theory's secondary predictions: high-volume firms are more disciplined than low-volume ones, because more reputation is at stake. IIP (412 polls) and Census (263 polls) have larger reputation investments to protect than the smaller mid-volume pollsters showing larger β.

Interpretation

Direction is right; magnitude is plausible (a 100-point candidate- share difference predicts a 6-14 pp difference in β, comparable to the headline β = +8). But the test is underpowered — 11 institutes is too few to discriminate the slope from zero.

The bimodality prediction (firms cluster toward poles, not the middle) is harder to test with this n but doesn't obviously hold in the scatter — there are institutes at every share level.

Caveats

Next steps (queued in todo)

Reproduce

# In /workspace/projects/poll-sponsor-bias:
python source/analysis/heterogeneity.py            # produces per_pollster_beta.csv
python source/analysis/pollster_customer_mix.py    # first-pass: 11-institute slope + figure
python source/analysis/an-017-customer-mix-refresh.py   # 31-firm refresh (γ on candidate-share)
python source/analysis/an-018-firm-size-discipline.py   # log(n_total) discipline; supersedes