How are polling averages weighted?

Polling averages are constructed by aggregating multiple individual polls and applying weighting rules that reflect poll quality and recency. The main weighting factors are: (1) Pollster quality grade — based on historical accuracy, methodology transparency, and sample size. Higher-grade pollsters (A+, A) receive more weight than lower-grade pollsters (C, D). (2) Recency — more recent polls receive more weight because they better reflect current conditions. The weighting decay function varies by aggregator; FiveThirtyEight historically used a half-life of approximately 30 days. (3) Sample size — larger samples receive modestly more weight, though the effect is smaller than quality and recency after a threshold of ~800 respondents. (4) Likely voter vs. registered voter — likely voter screens are considered more predictive close to elections and may receive additional weight in final weeks.

What is a pollster house effect?

A house effect is a systematic bias in a pollster's results relative to the aggregate. If a pollster consistently shows candidates 2-3 points more Republican than the polling average across many races and cycles, it has a Republican house effect of approximately +2 to +3. House effects arise from methodological choices: sample composition (who gets called, who responds), weighting targets (which demographic composition the pollster assumes), question wording, and interviewer effects. Sophisticated polling aggregators like FiveThirtyEight calculate each pollster's historical house effect and correct for it when incorporating their polls into the average. This prevents a single methodological bias from dominating the average even when a pollster releases many polls.

How accurate are polling averages typically?

Polling averages are significantly more accurate than individual polls, but they still have documented failure modes. In recent U.S. presidential elections: the 2016 national polling average was accurate to within 1 point in the popular vote, but state-level averages had a systematic Republican underestimate in upper Midwest states. 2020 national averages underestimated Trump by 3-4 points in key states despite accurately tracking Biden's national popular vote lead. The 2022 Senate polling average underestimated Republican performance in several states. The pattern of Republican underestimation in recent cycles is the most significant systematic failure mode in current polling averages. Aggregators have responded with methodological changes, but the root cause — likely differential non-response among certain Republican-leaning voter groups — has not been fully resolved.

How Polling Averages Work: Weighting, Aggregation, and What to Trust

3–4x

More accurate than individual polls

30 days

Typical recency half-life in major aggregators

A–F

Pollster grade range; A-rated polls weighted ~3x C-rated

800

Minimum respondent threshold for reliable polls

Key Findings

Quality-weighted averaging uses 5+ adjustment factors: pollster historical accuracy, sample size (larger = less random error), recency (more recent = more predictive), methodology transparency, and house-effect correction for each pollster's known partisan lean.
House effects — the systematic partisan lean of individual pollsters — typically range from D+3 to R+3 for established organizations, are measurable across prior election cycles, and are real enough that the same race will poll differently at the same moment depending on which firm conducted the survey.
Transparency weighting punishes polls that won't disclose full methodology: pollsters that hide sample composition, question wording, or response rates get downweighted versus those that publish complete technical specifications.
The five-factor approach outperforms simple averaging most significantly in low-polling-frequency races (House primaries, smaller Senate contests) where the available pool is dominated by partisan internals with known and directional bias.
The fundamental limit of any weighting methodology: if systematic error is shared across all polls (non-response bias in 2020/2024), quality weighting produces false precision — a precisely calculated average that is still systematically wrong in the same direction as every component poll.

The Five Weighting Factors

Factor	What It Does	Impact on Weight	Who Uses It
Pollster grade	Historical accuracy vs. final election results	High (2–4x difference A vs. C)	FiveThirtyEight, RCP (implicitly)
Recency decay	More recent polls get higher weight	High (6-week-old poll may get half weight)	FiveThirtyEight, Economist model
House effect	Corrects for partisan lean in pollster method	Moderate (1–3 pt adjustment)	FiveThirtyEight, Decision Desk
Sample size	Larger samples marginally higher weight	Low beyond 800n threshold	Most aggregators
LV vs. RV screen	Likely voter screens more predictive near election	Moderate close to election	FiveThirtyEight
Methodology type	Live caller vs. online vs. IVR	Moderate (live callers historically more accurate)	FiveThirtyEight

Polling data visualized on electoral map

Understanding House Effects

A house effect is the most important concept for reading individual polls critically. If Pollster X consistently shows Republican candidates 3 points higher than the polling average across many races and cycles, the intelligent reader discounts those results by approximately 3 points, not because the pollster is dishonest but because their methodology systematically captures a different version of the electorate.

House effects arise from genuine methodological choices. A pollster that calls landlines heavily will reach an older, more Republican-leaning sample. A pollster that conducts online opt-in surveys may reach a more politically engaged, atypically partisan sample. A pollster that weights to 2020 voter turnout proportions may produce results that look different from one weighting to registered voter proportions. None of these choices is necessarily wrong — pollsters are making judgments about what the actual electorate will look like. But they produce systematic differences that aggregators should account for.

Republican-Leaning Pollsters

Several pollsters with documented Republican house effects are among the most prolific releasers of public polling. When they flood a competitive race with polls, simple averages can be pushed significantly right of the true state of the race. House effect correction prevents this distortion.

Herding Problem

Pollsters who release results close to the final election have incentives to “herd” toward the average to protect their track record. This produces a false precision where polls converge in the final week, potentially hiding a systematic error that all polls are making in the same direction.

Simple vs. Weighted Average

RealClearPolitics uses a simple unweighted recent average with no quality adjustment or house effect correction. This is easy to understand but academically inferior. Analyses comparing simple to weighted averages show weighted averages reduce average error by 15–25% in competitive races.

Related Analysis

Generic Ballot Tracker — Democrats +7.0 as of June 2026 → Senate Majority Math 2026 — Democrats Need Net +4 to Flip → House Majority Math 2026 — Republicans Hold 4-Seat Margin → 2026 Election Forecast — Senate Tipping-Point Races →

What Polling Averages Can and Cannot Tell You

Polling averages are the best available tool for tracking candidate standing in real time. They are far better than any individual poll at filtering out noise from house effects, question wording variations, and sampling fluctuations. For tracking trends — whether a candidate’s support is rising or falling, how approval responds to events — a well-constructed polling average is genuinely informative.

What polling averages cannot tell you: whether there is a systematic polling accuracy in the same direction as recent cycles (all polls may be wrong in the same direction); whether turnout assumptions embedded in likely voter screens will prove accurate; whether undecided voters will break in a predictable pattern; and whether late-deciding voters (those who make up their minds in the final 2 weeks) will follow the same pattern as voters polled months earlier. The 2016 and 2020 experience shows that even a near-accurate polling average at the national level can mask significant state-level errors when systematic non-response creates correlated errors across most state polls simultaneously.