Theory

Theoretical apparatus motivating the empirical design. The paper is primarily empirical — measuring sponsor bias in TSE-registered polls — but the why does this matter hinges on how polls actually shape voter behavior. This file collects the framework theories that make poll content consequential. As additional theories enter the project (e.g., bandwagon dynamics, candidate-quality signaling, fundraising-cascade models), they get sibling ## sections.

Polls as coordination devices (strategic voting)

The cleanest motivation for why polls matter at all: in plurality systems, instrumentally-rational voters do not want to "waste their vote" on candidates with no chance of finishing in the seat-winning set. Polls are the common-knowledge signal that lets voters coordinate on who is viable. Multiple equilibria collapse onto one.

Setup

M-seat election with N ≥ M+1 candidates. By Cox's "M+1 rule" [cite:cox1997making], strategic voting concentrates support on at most M+1 candidates in equilibrium — those plausibly contesting the marginal seat. Candidates beyond the (M+1)-th get strategic abandonment.
Brazilian municipal mayoral elections use plurality (M=1) in munis with fewer than 200,000 registered voters → 2 viable candidates by Cox's rule.
Larger munis (≥200,000 registered voters) use a two-round runoff system: an absolute-majority winner takes the seat in round 1; otherwise the top 2 advance to round 2. The first-round seat-winning set is effectively the top 2 → 3 viable candidates by the M+1 rule applied to the first round.
Cutoff: top-2 viability in small munis, top-3 viability in runoff-eligible munis.

Brazilian institutional detail

The 200,000-registered-voters threshold is set by Constitution Art. 29 II combined with Lei 9.504/1997 Art. 3. Computed each cycle by TSE; the 2024 cycle had 102 munis (out of 5,568) crossing the threshold and using runoffs.
The threshold is sharp and pre-determined (eligibility published ahead of registration), so the institutional variation is exogenous to candidate quality.

Predictions

Strategic-voting + polls-as-coordination jointly imply that the marginal viability threshold is the political quantity that matters:

Polls move voters most when the marginal-viability identity is uncertain. A candidate clearly in 4th in a small-muni poll has little to gain from strategic-vote shifts no matter what; a candidate near the 2nd-place threshold (small munis) or 3rd-place threshold (large munis) has the most to gain from appearing viable.
Sponsor-bias incentive is heterogeneous in the same direction. Sponsors of candidates near the viability cutoff have the largest marginal return to slanted polls (move voters across the wasted-vote frontier); sponsors of front-runners or hopeless candidates have weaker incentives. This is the testable coordination-theory prediction in the sponsor-bias setting: β should be larger for candidates near the cutoff than for the very top or very bottom of the field.
The cutoff itself shifts at the 200k-voter institutional boundary. In small munis, the cutoff is between 2nd and 3rd place; in large munis, between 3rd and 4th. If the coordination theory is right, the position of the largest β should shift accordingly across the institutional discontinuity. This is the cleanest design-level test the project can support — RD-style at the 200,000-voters threshold, with the position of peak β across final-rank quintiles as the heterogeneity object.
Polls matter more in small munis (in absolute consequence). Coordination problems are more acute when N is large relative to M+1 viable spots; smaller plurality elections (M+1=2) have a sharper coordination cliff than runoff-eligible ones (M+1=3). This is in tension with the slant-incentive prediction — the demand for slanted polls (from sponsors) and the effect of slanted polls (on voters) are predicted to peak at different places.

Mapping to the empirical design

Final-rank position is a usable proxy for ex-ante viability rank: every candidate's poll appears in the data with their realized final share, which we already use to construct error. Final-rank-quintile interactions with sponsored_by give the heterogeneity test directly.
The runoff-eligibility threshold (200,000 registered voters) is a pre-determined institutional cutoff — fuzzy-RD or sharp-RD estimates of β at the boundary are feasible if enough mass sits near it. Eligibility is binary, exogenous to candidate quality, and publicly known at registration time.

Open testability concern

Per the project framing: every candidate has a strategic-vote incentive to commission slanted polls regardless of their position, because slant could in principle move voters across any viability threshold the voter is using. The coordination-theory prediction is not that "only borderline candidates slant" but that "the voter-side return to slant peaks at the threshold." Whether sponsors internalize this in their commissioning decisions is itself an empirical question — which limits the within-paper test of coordination theory to the heterogeneity-in-β across final-rank position. See docs/thinking.md for sketches on tighter tests.

Why this theory belongs in the motivation

The descriptive finding — Brazilian sponsor-paid polls overstate the sponsor by 6–7 pp on average — gains its policy weight only if those polls then shape voter behavior. The coordination-devices theory is the most prosaic and well-evidenced channel through which poll content moves votes. The companion empirical literature is rich and agrees on the qualitative claim:

Granzier et al. (2023) documents that past poll rankings shape voter behavior in French two-round elections — the closest institutional analog to Brazilian runoff-eligible munis.
Andonie & Kuzmics (2012) formalizes pre-election polls as game-theoretic coordination devices — the formal model whose predictions we're invoking informally above.
Faas et al. (2008) and its successors document the bandwagon-and-strategic-vote mechanism in multi-party plurality systems (Germany).

The Brazilian-context piece Lloyd et al. (2016) documents non-random poll error operating through strategic voting specifically — the closest substantive predecessor for the claim that poll-induced voter coordination is consequential in Brazil.

References

cox1997making: Cox (1997) Making Votes Count — the M+1 rule and the canonical formalization of strategic voting under plurality.
duverger1954political: Duverger (1954) Political Parties — the original two-party tendency under plurality.
myersonweber1993theory: Myerson & Weber (1993) "A theory of voting equilibria" (APSR) — the formal pivotal-voter model behind Cox.
granzier2019coordination: French two-round elections, the closest institutional analog to the Brazilian runoff-eligible munis.
andonie2012preelection: the formal coordination-devices model.
schmittbeck2008polls: bandwagon + strategic-vote in multi-party plurality (Germany).
lloyd2016vote: poll-induced strategic voting in Brazil — closest substantive predecessor.

All entries in paper/references.bib; expand docs/literature.md with the three classics (Cox, Duverger, Myerson-Weber) next time the literature pass runs.

Polls as bandwagon triggers

Related to coordination but mechanistically distinct: voters do not explicitly compute who's pivotal but are nonetheless pulled toward the candidate who appears to be leading. Mechanisms in the literature:

Conformity / social desirability — desire to be on the winning side ex post; psychological, not instrumental.
Expressive utility from voting for a winner — utility payoff from the act of supporting the eventual winner, independent of marginal pivotal probability.
Mobilization asymmetry — supporters of the apparent leader exert more effort because the win looks within reach; supporters of trailers stay home (a turnout channel rather than a vote-choice channel).

The underdog effect is the counter-prediction: sympathy for the trailing candidate, pulling votes the other way. Documented in lab experiments (less so in field data); the net behavioral pattern in any given setting is empirical. The fact that we observe sponsors overstating (rather than understating) their candidates is itself weak evidence the underdog effect is not the dominant force in the Brazilian 2024 mayoral setting — sponsors would slant down if underdog dominated bandwagon.

Setup

Voters' utility = u(vote_choice) + ψ × P(c wins) for c chosen, with ψ > 0 capturing the bandwagon premium. As polls move P(c wins) for the leading c upward, more voters with marginal preferences cross over to c.
Mechanism does not depend on the M+1 viability cutoff. The attractor is the apparent front-runner, not the marginal-viability candidate.

Predictions

β should be largest for sponsors of candidates just below the front-runner. These are the candidates with the most to gain from appearing to be leading — slant them up by ~7 pp and they cross from "close 2nd" to "apparent 1st," triggering the bandwagon premium. Sponsors of a clear 4th-place candidate gain less from slant (they can't realistically be made to look like leaders); sponsors of clear 1st-place candidates also gain less (they already appear leading).
Bandwagon prediction differs from coordination prediction sharply in runoff-eligible munis. In large munis (≥200k voters), coordination predicts β peaks near final rank 3 (the M+1 cutoff for the runoff first round); bandwagon predicts β peaks near final rank 2 (candidates trying to look like front-runner). This is the cleanest discriminating test the project can run — the same heterogeneity analysis distinguishes the two voter-side theories.
β should be larger in tighter races. Where front-runners are not visibly far ahead, slant has more room to flip the apparent leader identity. Predicts a competitiveness × sponsored_by interaction with positive sign. (Coordination would predict the same sign through a different channel — tight races also have more strategic-vote movement.)

Mapping to the empirical design

Same final-rank-quintile heterogeneity machinery as for coordination, but the predicted position of peak β differs. Plot β estimates by final-rank-quintile, separately for small and runoff-eligible munis: coordination and bandwagon predict different peak positions in the latter.
Race competitiveness measured as either (a) the final vote-share gap between 1st and 2nd, or (b) the polling lead at the start of the field period.

Open testability concern

The two voter-side theories can coexist — bandwagon and coordination operate through different cognitive channels and may both shape behavior. The within-paper test is which one dominates the heterogeneity pattern, not which is "real." A null discriminating result is consistent with both forces operating.

References

mcallister1991bandwagon: McAllister & Studlar (1991) — classic on bandwagon/underdog/projection in British polls.
araujo2021casting: Brazil-specific natural experiment from the 2018 election (voters casting after partial results saw a bandwagon effect) — the cleanest Brazilian evidence for the mechanism.
granzier2019coordination: distinguishes the two effects empirically (French two-round elections).
chatterjee2020voting: bandwagon vs underdog evidence from the Indian exit poll ban.
farjam2020bandwagon: field-experimental bandwagon evidence with real political organizations.
simon1954bandwagon: Simon (1954) — the original bandwagon / underdog conceptual treatment.

Polls as quality cues (Bayesian information)

A third voter-side mechanism: voters take a candidate's poll standing as a Bayesian signal about candidate quality, distinct from both strategic coordination (who's viable) and bandwagon (conformity). Closest framing: "if many voters seem to like candidate X, then X likely has qualities I might not have observed directly." This is the low-information-rationality / cue-taking tradition.

Setup

Voter has a noisy prior over candidate c's quality θ_c. Sees poll standing s_c as a noisy public signal of θ_c (a summary statistic of others' assessments of θ_c). Bayes-updates from (prior, s_c) to posterior over θ_c, then votes by maximizing expected utility over the posterior.
The strength of belief-updating from a poll signal scales inversely with the strength of the prior. A well-known incumbent has a sharp prior — the poll moves the posterior little. An unknown challenger has a diffuse prior — same poll moves the posterior a lot.

Predictions

β should be largest for sponsors of new / lesser-known candidates. First-time candidates, non-incumbents, candidates without prior elected office — these are the candidates for whom voters have weak priors, so a poll signal has the most posterior-shifting weight. This contrasts with both coordination (predicts peak at viability cutoff regardless of candidate experience) and bandwagon (predicts peak just below front-runner regardless of candidate experience).
Candidates with strong endorsement / media coverage should show smaller β. Their quality priors are already informed from other sources, so a slanted poll has less posterior weight.
Repeat candidates (returning to a previous office) should show smaller β than newcomers — voters have direct prior experience to anchor the quality prior.

Mapping to the empirical design

Heterogeneity by candidate-level priors — most accessible observables:
- First-time candidate vs incumbent
- Prior elected office count (TSE registry carries past candidacies → easy to compute)
- Years in office before this race
Two-way heterogeneity: β by (rank-position × first-time-candidate) separates information-cue from coordination/bandwagon predictions. Information cue says β is largest in new candidates near any rank, while the other two say β depends on rank regardless of experience.

Open testability concern

"Quality signal" is a posited cognitive process — we don't directly observe voter inference. The heterogeneity test (β by candidate experience) is the indirect test; alternative explanations exist for any difference observed (e.g., new candidates may also have more volatile sponsors).

References

lupia1994shortcuts: Lupia (1994) — voters using shortcuts / cues in low-information settings (California insurance reform).
popkin1991reasoning: Popkin (1991) The Reasoning Voter — the low-information-rationality framework where polls serve as cognitive shortcuts.

Why bias survives voter discounting (information frictions)

A standard objection threatens the whole enterprise: if every voter observes who paid for a poll, and sponsor identity is a public field on every TSE registration, rational voters can subtract the expected slant and the sponsor gets no return from biasing. In equilibrium sponsors should pay for unbiased polls and β should be zero. The data say β ≈ +7 pp. So something in the rational-discount chain is broken.

This section walks through the obvious resolutions, eliminates the ones that don't survive scrutiny, and lands on information frictions on the voter side as the load-bearing assumption that ties the four voter-side / supply-side theories above into a coherent picture.

Obvious resolutions that don't survive

"Voters can't see the methodology so they can't discount Channel A." Voters don't need to parse the 1,600-char DS_PLANO_AMOSTRAL to discount. They could just run the regression this paper runs — observe historical sponsor-poll error and apply a sponsor- conditional discount. Or update anecdotally from past episodes of sponsor-poll error. So design opacity per se does not protect the slant. The bias produces an observable historical pattern that a sufficiently motivated voter could in principle estimate.
"Sponsor's payoff is on rank, not level — rank can't be discounted away." Once sponsor identity is observed, voters can apply the discount to whichever target the sponsor cares about, including rank. "This poll says X is in 2nd, sponsored by X, probably X is in 4th" is a rank discount, not a level discount.
"Sponsors are short-horizon so the unraveling discipline doesn't bite." Pollsters are not — they run hundreds of polls across cycles and do face long-run reputational stakes. So the intermediary in the contract has skin in the game even if the principal does not.

What survives: voters are not fully informed about β

What the three eliminations leave is a single resolution: voters do not have, and do not acquire, the information they would need to discount the slant precisely. This is the information-friction story. Five operational sub-mechanisms make it concrete and (mostly) testable:

Rational inattention. Acquiring a precise estimate of sponsor-specific bias is costly — it requires aggregating across thousands of polls and final results. The marginal voter's instrumental stake in any single race is small, so the cost-benefit calculation does not justify the acquisition. Voters end up with diffuse priors over β and under-discount in expectation. [cite:sims2003implications]
Anecdotal updating is biased. What voters do instead is update from salient episodes — the 2018 polls miss, a pundit's accusation that a particular firm slants. Anecdotal samples are small, selected on what got media attention, and weighted toward recent/vivid cases. The resulting estimate of β can be off in either direction and is unlikely to coincide with the academic-regression estimate. The data set for the rational discount does not, in practice, exist in any voter's head.
Channel A produces heterogeneous bias across polls. Even a voter who has internalized "sponsored polls overstate by 7 pp on average" cannot back out this poll's bias, because Channel A bias depends on design choices that differ across sponsored polls (urban-only vs full-coverage, education-quota vs not, etc.). The variance of per-poll bias around the mean is large; a constant-discount rule overcorrects for honest sponsored polls and undercorrects for heavily-designed ones. This refines the eliminated argument (1) above: design opacity does not protect the average β, but it does protect per-poll dispersion of β that an ideal voter would need to discount well.
The marginal pivotal voter is less informed than the average voter. Sponsors care about the voter who tips the race. In close mayoral races, that voter is disproportionately low-engagement, low-political-knowledge, and unlikely to be the one who computes a sponsor-conditional discount. The sponsor's marginal return is on the marginal voter, not the average.
Pollster long-run reputation is multi-dimensional. Pollsters are long-run players, but their reputation function rewards aggregate accuracy on final-week, headline-race polls — the ones used in the post-election "who called it right" press — far more than per-poll accuracy on sponsored mid-cycle polls. A firm can sustain sponsor-poll inflation in early/mid-cycle polls and still close the cycle with a clean final-week poll that defends its public reputation. This is consistent with the customer-mix-sorting prediction in the pollster-reputation section below: the reputation incentive bites where it is observable to the headline-poll audience, not where it isn't.

Mapping to the empirical design

The information-friction story has within-paper consequences but is not itself fully testable here:

Channel A vs B decomposition (already queued). If Channel A dominates, mechanism (3) — heterogeneous per-poll bias — immediately gains weight: the voter's discount problem is high-dimensional, not just a constant. If Channel B dominates, the unraveling puzzle is sharper, because voters could in principle discount a constant residual — and we have to lean harder on (1), (2), (4).
β by race competitiveness (heterogeneity battery). Mechanism (4) implies β should be larger in close races, where the marginal pivotal voter is the least informed and the sponsor's return to that voter is highest. The bandwagon section above predicts the same sign through a different channel; the within- paper test cannot distinguish them.
β by pollster customer mix (already implemented). Mechanism (5) is operationalized in the customer-mix-sorting section below; the within-paper test is n=11 and underpowered, but the sign is right.

Open testability concerns

Voter sophistication is unobserved. The information-friction story rests on a population distribution of discount factors that we don't measure. Within-paper, we observe only the equilibrium-level β, which is jointly determined by sponsor slant choice and voter discount. We cannot decompose β into "how much sponsors slant" vs "how much voters fail to discount" without an exogenous shock to voter information.
A direct test would need a voter-information experiment. E.g., a treatment that informs a random subset of voters about sponsor-conditional historical bias (or even just about who sponsored the poll they are reading) and measures whether their vote intentions update less than control voters' do. This is outside this paper but is a natural sequel.
Anecdotal evidence on voter awareness. Even a small survey asking "did you know the headline poll in race X was paid for by candidate X" would speak to mechanism (1) — if a substantial share of voters did not know, the rational-discount premise was never operative in the first place. Worth flagging in the paper's discussion.

References

sims2003implications: Sims (2003) "Implications of rational inattention" (JME) — the canonical formalization of costly attention. Likely not in paper/references.bib; add when this section is paper-ready.
crawford1982strategic: Crawford & Sobel (1982) "Strategic Information Transmission" (ECMA) — partial-revelation cheap-talk equilibrium when sender bias is known to receiver. Likely not in paper/references.bib; add when paper-ready.
dellavigna2010persuasion: DellaVigna & Gentzkow (2010) "Persuasion: empirical evidence" (ARE) — survey of why audiences fail to discount sender bias in practice.
mullainathan2005market: Mullainathan & Shleifer (2005) "The market for news" (AER) — partial-discounting equilibrium with heterogeneous-prior receivers; closest formal analog for this setting.

All four are candidates for a literature-pass addition; the information-friction argument is foundational enough that the paper should cite the standard references when writing it up.

Polls as Bayesian persuasion (supply-side / Channel A)

Why a sponsor would commission a slanted poll in the first place: the sponsor (sender) chooses a signal structure to maximize the receivers' posterior belief about their candidate, subject to a disclosure constraint. In the Brazilian setting, the disclosure constraint is the mandatory PesqEle registration regime — every methodology choice is publicly filed before the poll's results can be released.

Setup

Sender (the sponsor + chosen pollster) commits to a signal structure σ — operationalized as the poll's methodology (sample design, quota variables, population frame, geographic coverage).
Receivers (voters, media, donors) observe the methodology and the realized poll result. Posterior over the candidate's true standing is computed from both.
The disclosure regime makes the methodology public — receivers can in principle penalize a poll whose methodology obviously favors the sponsor. But quota sampling with multi-stage selection gives substantial latitude in declared methodology that does not trigger obvious red flags (e.g., urban-only DS_DADO_MUNICIPIO is common and not by itself disqualifying).
Sender's objective: maximize the receivers' posterior probability that the candidate is viable / leading. Choice variable: the declared methodology (Channel A) plus residual fabrication (Channel B, outside the disclosed signal structure).

Predictions

Channel A vs Channel B decomposition is the core supply-side prediction. Total β = β^A (movement through methodology) + β^B (residual). After conditioning on the structured methodology features (sample size, days-to-election, ST_PESQUISA_PROPRIA) and the LLM-extracted methodology features (coverage_class, quota variables, population frame), β should shrink to β^B. If β shrinks substantially → Channel A dominates (Bayesian persuasion through disclosed methodology). If β stays roughly constant → Channel B dominates (residual / fabrication, outside the disclosure regime).
β^A should be larger where methodology flexibility is greater. Rural-heavy munis (where coverage_class choice matters more), races with demographic skew (where quota choice matters more) should show larger Channel-A contribution.
β^B should be larger where verifiability is weaker. Polls far from the election (less ex-post verification) should show larger Channel-B contribution. Polls in races with weaker media scrutiny should also.

Mapping to the empirical design

Channel A vs B decomposition: regression-based, comparing β with and without LLM-extracted methodology controls (queued behind the poll_methodology extractor — see pipelines/politica/docs/todo.md).
Heterogeneity by methodology-flexibility proxies: rural share of muni population; demographic Gini; race competitiveness.

Refined Channel-A lever inventory

The canonical lever-inventory table — concrete-design differences, opacity differences, ruled-out alternatives — lives in docs/source-of-bias.md. That doc also documents the size-mismatch problem (the measured magnitudes of the documented levers do not add up to +7 pp) and the open agenda of probes (non-response handling, weighting structure, mode, question-order priming, interviewer-supervision detail) that would either identify a sharp design lever or leave "opacity is genuinely the answer" as the well-earned default.

The empirical highlights flowing into the inventory:

Concrete design-choice differences (real but small): population reference frame (findings-paired), coverage class (AN-019), census-setor cluster usage (findings-paired).
Opacity differences (loud, but not a mechanism themselves): audit % (AN-021), methodology completeness (AN-022), coverage deferral (AN-024).
Ruled out: crude per-row Channel B (AN-013), partisan bairro/stronghold selection (AN-032).

Open testability concern

The decomposition is identified only up to the observable methodology features. Residual β^B contains both "fabrication" (actual numerical manipulation) and "design slant via dimensions we didn't extract." Sharpening the dictionary of extracted methodology features matters — see docs/todo.md § Mechanism decomposition for the queued schema.

References

kamenica2011bayesian: Kamenica & Gentzkow (2011) "Bayesian Persuasion" (AER) — the canonical formal model.
andonie2012preelection: pre-election polls as strategic coordination devices, the formal model whose receivers' problem is the natural lens here.
jo2022informational: informational roles of pre-election polls — game-theoretic, focused on welfare cost of slanted polls.

Pollster reputation: volume vs customer mix (supply-side)

Pollsters' equilibrium slant choice is disciplined by reputation, which has two distinct components: (i) absolute volume — the aggregate stake a firm has in being publicly seen as accurate, and (ii) customer-mix sorting — which segment of clients the firm primarily serves. AN-018 establishes that volume dominates customer mix as the empirical predictor of within-firm sponsor bias. The section is organized accordingly: volume-discipline first, with customer-mix sorting as the secondary refinement.

Setup

Reputation is the firm's product. Three mechanisms shape how much slant a firm tolerates when paid for a sponsored poll:

Volume discipline. A firm producing many polls per cycle has a public name that is constantly being judged against final outcomes. The marginal reputational cost of a visibly slanted poll scales with the firm's total exposure: a single bad call against a 1,500-poll book costs more aggregate reputation than the same call against a 50-poll book. This is a Holmström-style career-concerns argument applied to the firm rather than the individual.
Customer-mix sorting. Different client segments demand different reputations. Candidates pay for favorable numbers to coordinate voters and donors around viability; media outlets sell credibility to their audience and need polls that earn trust ex post. A firm whose customer base is mostly candidates can sustain a slant- friendly reputation; a firm whose customer base is mostly media cannot, because the next media customer observes the lapse.
Public-marketing exposure. Firms that primarily produce their own marketing polls (rather than client-paid polls) sell their brand directly; their public reputation IS the product, and visible slant on a client job poisons the brand. This is a reputation-as-certification mechanism distinct from the customer-mix-sorting story.

The first two of these mechanisms can co-move: small firms tend to attract candidate-heavy customers, big firms tend to attract media-heavy customers. The empirical question AN-018 settled is which axis is load-bearing once both are observable.

Predictions

P1 — Volume discipline (the dominant axis). Per-firm β within the within-candidate FE design should decrease in log(firm's total poll volume in the cycle). Mechanism: aggregate reputation cost scales with exposure; big firms face higher absolute stakes from a public miss. Confirmed: AN-018 univariate OLS δ = −4.28 (p = 0.017, R² = 0.18); WLS δ = −5.74 (p = 0.0005, R² = 0.35). Each doubling of firm volume associates with β falling by ~4-5 pp. The relationship is monotone across firm-size tertiles: small (n_total ~13) β = +11.98; medium (~41) β = +8.64; large (~118) β = −0.93.

P2 — Pollster-self share (reputation-as-product). Firms that primarily publish their own marketing polls (the pollster_self customer type) face the sharpest reputation sanction from slant, because their brand is the product. Their β when paid by a sponsor should be lower than firms with mixed customer composition. Confirmed (marginally): AN-017 γ_pollster_self = −19.9 (p = 0.10, n = 31) in the regression of β on customer-mix shares.

P3 — Customer-mix sorting (the secondary, weaker, axis). Conditional on volume, β should increase in the firm's candidate-share of customer mix. The customer-mix axis is real but is empirically subordinate to volume. Univariate AN-017 γ_candidate_share = +7.6 (p = 0.27, R² = 0.04) — right direction but not significant on its own. Joint AN-018 regression with log_n_total: γ_candidate becomes statistically indistinguishable from zero (and flips sign — see "Open issues" below). The candidate-share/slant correlation that AN-007 first reported appears to be primarily volume confounding.

P4 — Equilibrium bimodality. The pollster market should be bimodal in either size or customer mix rather than uniformly distributed, because firms cannot sustain both reputations simultaneously. The current 31-firm sample is not large enough to test this sharply, but AN-016's forest plot shows a visible cluster near β = 0 for the large-firm tertile and a cluster near β = +12 for the small-firm tertile, with relatively thin mass between them.

Empirical mapping

The size-discipline mechanism is the cleanest first-order empirical story for the cross-firm β heterogeneity surfaced by the robustness battery. The cascade of findings:

AN-016 (within-firm β refit on 31 firms with ≥ 5 self- sponsored polls each). β range across firms = [−10.95, +35.20]; sd 10.3. 19 of 31 firms individually significant. PDF style and LLM extraction held strictly fixed within firm, so the cross-firm dispersion cannot be a data-quality artifact.
AN-017 (refresh of AN-007's per-firm β vs candidate-share on the AN-016 sample). γ_candidate_share = +7.6 unweighted (R² = 0.04) — right direction but statistically weak. pollster_self coefficient γ = −19.9 (p = 0.10) is the cleanest single piece of mixed-customer evidence.
AN-018 (firm size vs customer mix). log(n_total) coefficient dominates: univariate R² leaps to 0.18–0.35; joint WLS regression gives δ_log_n_total = −7.09 (p = 0.0002) and γ_candidate becomes non-significant (p = 0.13). R² in the joint WLS is 0.40.

Tertile pattern (AN-018): small firms (n_total median ~13) β mean = +11.98 (9 of 12 individually significant); medium (~41) β = +8.64 (8 of 10 significant); large (~118) β = −0.93 (only 2 of 9 significant). The "large" tertile contains every big-name 2024 pollster (CENSUS, IIP, INSTITUTO PARANÁ, Verita, AR7, AGILI); the "small" tertile contains METHODUS, CAMARGO, INTENÇÃO, DATA SC, VISÃO, RADAR, BRASLOPES, SEND, and analogous niche firms.

Substantive read for the paper

The headline +7.85 pp sponsor bias is not a uniform industry property. It is concentrated in small, low-volume firms; the big-name Brazilian polling industry appears to self-discipline at the within-candidate level. The headline is a weighted cross-firm average; the variance is the substantively interesting fact.

This sharpens the policy reading two ways:

Targeting "the polling industry" generically misses the point. The publicly-visible big-name firms (those whose results show up in news cycles) are not the source of the bias problem.
Regulatory leverage exists, but only by addressing volume-based reputation accountability at the small-firm tail — for example, by requiring small firms to publish their cumulative final-outcome accuracy across all registered polls (not just the sponsored ones), so that small firms accumulate the same kind of cumulative reputation cost that big firms already face naturally.

Open issues

Why does customer-share flip sign in the joint regression? Univariate (AN-017): γ_candidate_share = +7.6 (right direction). Joint with size: γ = −10.21 (p = 0.13). The flip suggests that conditional on firm size, higher candidate-share is associated with lower β — possibly because at fixed size, firms that specialize narrowly in candidate work are more often single-candidate vehicles (low marginal reputation cost from one extra slant; but also low incentive to slant if the firm exists only for the candidate's purposes). The sign-flip is statistically marginal so could be sampling noise; worth a multi-cycle replication.
Cross-section vs causal: AN-018's per-firm β is a panel cross-section. The relationship is between firms that happen to be small in 2024 and high β in 2024, not between changes in firm size and changes in β. A within-firm longitudinal design (multi-cycle panel) would identify whether growing firms reduce slant over time.
Selection of sponsors into firms: candidates may observably sort into firms based on expected slant. The cross-firm dispersion in β reflects both supply-side reputation discipline and demand-side sponsor preferences. Decomposing these requires within-firm-over-time variation in the customer-mix at fixed size.

References

holmstrom1999career: Holmström (1999) "Managerial Incentive Problems: A Dynamic Perspective" (RES) — the canonical career-concerns model. Applied at the firm rather than individual level for the volume-discipline argument. Not yet in paper/references.bib; add when this theory is formalized in the paper.
mathios1998adoptive: classic reputation-as-product / certification disclosure argument. Backstops the pollster-self-share prediction (P2).
andonie2012preelection: pre-election polls as coordination devices (demand side).
klein1981role: Klein & Leffler (1981) "The Role of Market Forces in Assuring Contractual Performance" (JPE) — repeat- player reputation as quality-assurance, the foundation for the size-discipline argument. Add to paper/references.bib.

Empirical anchors:

AN-007 (source/analysis/pollster_customer_mix.py) — first-pass customer-mix slope on 11 institutes.
AN-016 (source/analysis/an-016-within-firm-beta.py) — per-firm β on 31 institutes with PDF-style held fixed.
AN-017 (source/analysis/an-017-customer-mix-refresh.py) — customer-mix sorting refresh on the AN-016 sample.
AN-018 (source/analysis/an-018-firm-size-discipline.py) — firm-size dominates customer mix; the load-bearing finding for this section.

Polls as verifiable disclosure

A supply-side complement to ## Polls as Bayesian persuasion: the sender faces a future cost of slanting that scales with how verifiable the claim is at the time the receiver consumes it. In the canonical Bayesian-persuasion setup the sender commits a signal structure and bears no ex-post cost. Brazilian electoral polls violate that assumption: the election day eventually arrives and returns a public, granular check on the poll's headline numbers.

Status: drafted 2026-06-02.

Setup

Sender (sponsor + chosen pollster) chooses slant magnitude τ ∈ ℝ subject to a cost c(τ) × P(detection), where P(detection) is decreasing in time-to-election.
Detection cost is multi-channel: reputational (a sued or contradicted pollster loses future media commissions), regulatory (LE.33.§3 UFIR 50k–100k multa for unregistered divulgation; LE.33.§4 criminalizes fabrication with detenção 6 mo – 1 yr + multa — [institutions.md §"Compliance and sanctions"]), commercial (the sponsor's own party / coligação loses control of the narrative if a poll is publicly refuted).
This is the cheap-talk-with-verification logic of Crawford-Sobel (1982) sharpened by Milgrom-Roberts (1986): when receivers obtain an informative ex-post signal of the underlying state, sender exaggeration is bounded.
Channel A (theory.md §"Polls as Bayesian persuasion") is essentially time-invariant because methodology choices are committed at registration. The verifiability channel acts on Channel B (residual / fabrication) — the part of slant that the disclosed signal structure does not justify.

Predictions

β^B declines with proximity to election day. After Channel A controls, the residual sponsor bias should be monotonically decreasing in proximity to election day. Polls in the final 1–2 weeks should show β^B ≈ 0; the action is at the longer horizon, where detection is far off and discounted.
β is larger where scrutiny is weaker. Munis with no major-firm poll in the race (less cross-firm cross-check), low media density, or low EJ injunction activity — i.e., low P(detection) — should show higher β. Operationalizable via coarse proxies (presence of a Datafolha / Quaest / Ipec poll in the race; muni population; race-level injunction count).
Enforcement events suppress β. A TSE / MPF investigation into a pollster firm should suppress β in that firm's subsequent polls — a Bayesian update on enforcement intensity. Testable in principle if an enforcement-event panel materializes; flagged conjectural for now.

Mapping to the empirical design

Headline test: the sponsored_by × days_to_election interaction after Channel A controls. Prediction: negative interaction. This is the same interaction thinking.md flagged as verifiable-disclosure / future cost ([thinking.md §"Possible directions"]).
Cross-race heterogeneity: sponsored_by × major-firm-present; sponsored_by × muni-population-tercile.
Channel B test (3): out of scope without an enforcement-event panel.

Open testability concern

The verifiable-disclosure prediction opposes the bandwagon-incentive prediction (theory.md §"Polls as bandwagon triggers", prediction 3): near-election polls have larger voter-side bandwagon return → sponsor's demand for slant is higher near the election; verifiable disclosure says the sponsor's cost of slant is also higher near the election. The two effects pull in opposite directions on the days_to_election × sponsored_by interaction. The observed sign discriminates between them, but a null is consistent with both operating and roughly cancelling — a perennial cost/demand-decomposition problem.

References

Crawford & Sobel (1982): Crawford & Sobel (1982) "Strategic Information Transmission" — canonical cheap-talk model; sender's lie shrinks when verification noise shrinks.
Milgrom & Roberts (1986): Milgrom & Roberts (1986) "Price and Advertising Signals of Product Quality" — full disclosure as the equilibrium under cheap verification.
Dziuda (2011): Dziuda (2011) "Strategic Argumentation" (JET) — partial-verifiability model; pollster discloses a subset of methodology elements while receivers cannot audit the rest. The empirically relevant case for Channel A.
Kamenica & Gentzkow (2011): Bayesian persuasion baseline (already cited in theory.md §"Polls as Bayesian persuasion").
Institutional facts on LE.33.§3 / §4 / LE.34 sanctions: [institutions.md §"Compliance and sanctions"]. ABEP industry critique of auto-financiamento as caixa-dois cover documented in the Paraná Pesquisas reporting [stories.csv #077].

Polls as career-concerns games

Second supply-side complement to ## Polls as Bayesian persuasion: the pollster firm is a repeat player whose long-run revenue depends on accuracy reputation among future clients. Slanting any individual poll trades short-run sponsor revenue against long-run accuracy reputation; the trade-off varies with how much reputational capital the firm has accumulated.

Status: drafted 2026-06-02.

Setup

Pollster firm f has reputational capital R_f accumulated from past poll-vs-result accuracy. Firm chooses a slant level τ_f for each commission given (sponsor offer w, reputational depreciation function δ(τ, detection_probability)).
Standard career-concerns model (Holmström 1999): an agent with private effort and a public output signal balances accumulated reputation against current-period payoff. Newcomer firms underprovide effort early (no reputation to protect); established firms protect their stock.
Brazilian polling market satisfies the model's structural conditions: firms have observable accuracy histories (TSE registry
- final election results), clients verify ex post, the market is repeated across cycles.
Distinction from Bayesian persuasion (theory.md §"Polls as Bayesian persuasion"): the sender's identity is now the firm, not the sponsor. The mechanism is about who the firm agrees to slant for and how much, not about why the sponsor demands slant.

Predictions

β is smaller for major national firms. Pollster fixed effects in the headline spec already absorb each firm's average house effect; the prediction here is on the interaction sponsored_by × major_firm — slant per unit of sponsorship is smaller for firms with more reputational capital. Predicted sign: negative.
Newcomer firms show the largest β. First-electoral-cycle pollsters have no reputation to protect. Operationalize as years-since-first-TSE-registration of the pollster CNPJ.
Visible methodology defense as reputation maintenance. When a major firm is challenged on methodology (e.g., Datafolha responded to Russomanno's 2020 censorship attempt with a detailed 35-year-old-methodology defense [stories.csv #131]), the defense itself is reputation-maintenance behavior the model predicts. Small firms with sponsorship-heavy portfolios should defend less and settle more.

Mapping to the empirical design

Headline test: triple interaction sponsored_by × pollster_volume_tercile on the spec-3 regression. Volume tercile is a coarse proxy for reputation stock.
Sharper version: years-since-first-TSE-registration of the pollster, computed from the multi-cycle TSE poll registry.
Qualitative complement: the LE.34 audit-request record and LE.33.§3 multa history (where available) gives a firm-by-firm cross-section of visible reputational stakes.

Open testability concern

Pollster volume mixes reputation stock and capacity: a major firm has many polls because clients trust it AND because it has the field operations to deliver them. Both predict the same heterogeneity sign through different channels. A sharper test would need a pollster-level shock that changes reputation without changing capacity (loss of a major media contract, statistician-level audit finding) — the data don't support that yet.

A second concern: Brazilian polling firms are often vertically linked to media outlets (Datafolha ↔ Folha; Globo's polling partnerships) [institutions.md §"Major firms"]. The media outlet's reputation may be the binding constraint rather than the polling firm's own — the model's predictions hold but the unit of accumulation is the outlet-pollster pair, not the pollster CNPJ.

References

Holmstr\"om (1999): Holmström (1999) "Managerial Incentive Problems: A Dynamic Perspective" (REStud) — the canonical career-concerns model.
Dewatripont et al. (1999): Dewatripont, Jewitt & Tirole (1999) "The Economics of Career Concerns, Part I: Comparing Information Structures" (REStud) — characterizes how the precision of the public output signal shapes career-concerns incentives. Directly relevant: pollster output is verified more precisely close to election day.
Meirowitz (2005): Meirowitz (2005) "Polling Games and Information Revelation in the Downsian Framework" (GEB) — formal model of pre-election polls as strategic games. The most directly poll-specific theoretical reference.
Gentzkow & Shapiro (2006): Gentzkow & Shapiro (2006) "Media Bias and Reputation" (JPE) — reputation-bias model in media markets; structurally identical to the polling-firm problem with media outlet → polling firm, consumers → media clients + sponsors.
Industry context on the major-firm landscape and the academic- industry overlap (Quaest / Felipe Nunes; Datafolha methodology defense practice): [institutions.md §"Brazilian polling industry"]; Folha (2025) profile of Felipe Nunes' Quaest-Globo collaboration [stories.csv #001].

Discriminating among the voter-side theories

The three voter-side theories above (coordination, bandwagon, information-cue) predict different positions of peak β across final-rank quintiles and candidate-experience bins. Summarized in one table:

Theory	Where β peaks	Discriminating heterogeneity
Coordination	At M+1 viability cutoff (rank 2 in small munis, rank 3 in large)	Position of peak shifts at 200k institutional cutoff
Bandwagon	Just below front-runner (rank 2, regardless of muni size)	Same peak position in small and large munis
Information cue	Among lesser-known / first-time candidates, regardless of rank	β larger for non-incumbents

The β-by-(rank × runoff-eligibility × first-time-candidate) cube is the within-paper test that distinguishes them. The Bayesian persuasion theory operates on the supply side and is orthogonal: its decomposition (Channel A vs B) is testable independently of the voter-side discriminating test.