Focaldata
On the eve of the election, Focaldata in partnership with Prolific can reveal our final seat forecast of the 2024 general election. We find Labour on course for a 238-seat majority, a record-breaking result that sees the party winning 444 seats, based on a 40% share of the national vote (-1.4pp since our last drop). The Conservatives are expected to win in 108 constituencies (-2 since our last drop), based on a 23.2% vote share (+0.2pp).
On a seat-by-seat basis, these final estimates are quite similar to our previous release last week. Across the 5,048 constituency cells (i.e. 631 constituencies * 8 parties) there are less than 6% of occurrences where the vote share estimates show a change of 3 percentage points or more. 11% of our estimates show a change of 2 points or more.
Looking at the changes in more detail, we find that:
Data for all constituencies is available here.
#1 Unwinding adjustment is too aggressive
We deploy an adjustment – which YouGov coined as “unwinding”. This adjustment seeks to cater for some of the historical problems with MRP relating to regularisation and attenuation. We think it’s best to be really transparent about how material this adjustment is. Our model learns from past election distributions to suppress MRP tendencies to move beyond proportional swing. Our adjustment results in significant change in seat count of around 40 seats for the Conservatives, meaning that without it, we would be towards the lower end of MRP forecasts for the Conservatives. For transparency, our ‘raw’ seat counts (i.e. without the unwinding algorithm) would be Labour on 476, Conservatives on 68, Liberal Democrats on 62, SNP on 17, Reform UK on 4, Green on 2 and Plaid Cymru on 2.
Our political judgement remains that beyond proportional swing is unlikely – but we accept entirely that we don’t know where the “slope” is likely to be between uniform national swing and proportional. The evidence from the 2024 local elections was firmly towards (if not entirely) proportional. Please see a long-form discussion of this here.
However, even with unwinding, looking at the marginality of seats, the Conservatives only hold 65 seats with a margin of over 5 percentage points, and are acutely vulnerable to changes in vote share up to election day. Labour by contrast lead in 398 seats by a margin of over 5 points. This is a staggering difference. To clarify, we are not forecasting c.60 seats for the Conservatives but it is entirely reasonable to assume this could happen if our unwinding adjustment has been too aggressive.
#2 Fieldwork is too late
The other potential source of error is our fieldwork dates. Interviewing people from the 10 June – 1 July means we may miss the latest political movements.
#3 Tactical voting
We have tactical voting effects in the model, but acknowledge that there is great uncertainty over the Liberal Democrat specific seat gains.
#4 Change in turnout patterns
Our validated turnout model uses historic voting patterns going back to 2015. Should the turnout patterns of the electorate change in a material and drastic manner - particularly around youth turnout and older voter abstention we could be overestimating the Conservative vote.
#5 Minor party vote distribution patterns
MRP can struggle with minor parties - and particularly can suppress the vote shares of smaller parties at the top end of the scale. There is a strong possibility that our model may be doing this for Liberal Democrats, Greens and in particular for Reform. Should this be the case we can envisage a scenario where the Liberal Democrats exceed our 57-seat forecast, the Greens pick up seats and Reform ends up with a handful of 4-7 MPs. An extension of this is where there are prominent independents. Our model could underplay significant vote shares for independents running in more diverse constituencies challenging Labour from the left.
What is multilevel regression with post-stratification (MRP)?
Multi-level regression with post-stratification (MRP) is a statistical technique for estimating public opinion in small geographic areas or sub-groups using national opinion surveys. It originated in America, and was used by academics to estimate state-level opinion cheaply, given the expense of doing polls throughout the country.
MRP has two main elements. The first is to use a survey to build a multi-level regression model that predicts opinion (or any quantity of interest) from certain variables, normally demographics. The second is to weight (post-stratify) your results by the relevant population frequency, to get population level (or constituency level) estimates.
At the end of this process the aim is to get more accurate, more granular (thus more actionable) estimates of public opinion than traditional polling. There are, however, significant technical challenges to implementing it effectively. These include large data requirements, dedicated cloud computing resources, and an understanding of Bayesian statistics.
Why are everyone’s forecasts so different?
Thankfully we cover that here in a long-form essay. TL/DR is that every pollster has different vote shares, and the distribution of each party’s vote share - specifically its slope is drastically different between pollsters
Who built the model?
This new UK model was built by Dr Ben Lobo and Dr Adam Higgins. Our modelers received technical support from Dr Pete Logg who led our successful 2019 UK MRP efforts, and Dr Matt Chennells and James Alster and the wider technical team, and domain support from our Chief Executive Justin Ibbett and Chief Research Officer James Kanagasooriam.
Focaldata was set up to provide high quality and rapid turnaround MRP in 2017. Since then we have conducted thousands of models for hundreds of clients - corporate and political ones. For this election we built up an entirely new MRP model vs our 2019 model - which strongly performed and captured the dynamics of the race and voting shapes efficiently of each party. For this election we have partnered with the firm Prolific to supplement our sample to provide our model with the largest possible sample in a restrained time period given campaign vote share change - to forecast the general election.
The component parts of our UK 2024 “MRP” model was built from scratch include:
Other notes about the model
This MRP uses ‘Rallings and Thrasher’ 2019 notional results parliamentary boundaries.
Technical notes
Focaldata interviewed 36,726 British adults along with our survey partners Prolific from 10 June to 1 July 2024
We provide estimates of each party vote share in each constituency. The estimates show point estimates along with credible intervals i.e. the high and low estimates. To calculate these, following estimation of the multilevel model, we draw 500 samples from the posterior, using the poststratification frame as new data. We then calculate the median, low (5%) and high (95%) confidence intervals. The intervals indicate that according to our model, there is a 90% chance of the outcome lying between the low and high estimate - while the point estimate is the median value across all our 500 draws.
For each constituency, we provide probabilities of a party winning a given seat. These probabilities are the percentage of the 500 draws that the party wins a seat. Our final seat count is calculated by summing the probabilities for each party across all constituencies. This means our seat count takes into account the uncertainty around our estimates for each party in each constituency.
We include a time parameter in our model so that our estimates take into account any temporal changes in vote choice and ensure that our estimates are weighted to the most recent time period.
We're on a mission to power the world’s understanding of what people think and do.
We research public opinion for data-driven organisations. With Focaldata, decision makers and researchers get the most rigorous data and analysis at 5x the speed — reducing the time to decision while delivering uncommonly actionable insights. We are a team of engineers, data scientists, product specialists and researchers building outstanding technology and next-generation services. Our services are global. We are non-political and non-ideological.
Prolific is a platform that enables researchers to collect high-quality human-powered data at scale.
Using the Prolific platform researchers can target, contact and manage research participants from Prolific’s diverse, vetted and fairly-treated pool – to deliver world-changing research and the next generation of AI.