Aug 23 2016

What 7.5m tweets taught us about the Brexit campaign

By Stefan Bauchowitz and Max Hänska

How did Eurosceptic (leave) and pro-European (remain) activity compare on social media in the run-up to the EU referendum, and was there a relationship between social media users and votes? To find out how leave and remain compared, we collected more than 7.5 million Brexit related tweets during the 23 days leading up to the referendum through twitter’s streaming API. We used a support vector machine to identify which tweets clearly supported the leave or remain camp (and manually coded a random sub-sample of those to ensure our allocation was reliable). Given the polarity of the issue this worked well, and the model correctly identified most tweets. We used the result of this exercise to assign each user in our sample to one of the two camps.

We collected tweets containing the terms ‘Brexit’, ‘EUref’ and ‘EU Referendum’, which were all frequently used to refer to the referendum. While the term Brexit has great currency across both camps, it was used more often by users who wanted to leave the EU as it lends itself more easily to positive slogans (e.g. “Can’t wait for #Brexit to win!” or “Brexit to save Europe”, also echoed by “Brexit means Brexit”). Even though EURef and EU Referendum are more neutral terms, in both sub-samples we find that support for leaving, measured by number of tweets, outstripped support for remaining by a factor of 2.3 and 1.75 respectively. The margins confirm a slight bias in the term ‘brexit’ where the strength of leave over remain was more pronounced. Overall it is clear that the army of leave users was larger in numbers and more active in tweeting their cause (see Figure 1).

level of tweet activity by keyword.

level of tweet activity by keyword.


Other researchers examining google search trends, Instagram posts, and Facebook found a similar tendency of Eurosceptic views being communicated with greater intensity by a greater number of users. Researchers from Loughborough University revealed that, weighted for circulation, 82% of newspaper articles were pro-Leave. Both in print and on social media Brits had greater exposure to Eurosceptic than pro-European opinions.

We also mapped twitter activity to local authority districts. To do this, we used Google’s and Bing’s geo coding services to translate user-provided location information to geographical coordinates which we then matched with local authority districts. This is not an exact science, both because many users provide no or fictitious location information in their profiles, and because the finer the granularity of geo-location required, the more error-prone the result (see here and here). As many users specify their location as ‘London’ rather than its constituent boroughs, we aggregated all tweets from users located there. We plotted the share of users supporting remain against share of the remain vote. We excluded districts where we identified fewer than 200 users, giving us usable data for 100 local authorities.

Share of remain tweets by share of remain vote.

Share of remain tweets by share of remain vote.


There is clearly a pattern in the way the referendum campaign unfolded on twitter, with those wanting to leave communicating in greater numbers and with greater intensity. Districts with a greater share of twitter users supporting leave also tended to vote for leaving the EU, so that twitter activity correlates with voting in the Referendum.

Yet, we must be cautious to avoid over-interpretation, in particular regarding claims that social media can predict election outcomes, the problems of which have been pointedly enumerated. Finding a pattern in the data post hoc is quite a different thing than confidently identifying and interpreting the pattern ex ante—leave lead on social media by a much larger margin than it did in the vote, so it is not at all clear how one should have interpreted results from a twitter analysis before the vote. The problem is, we lack demographic descriptors of social media users according to which we may weight or interpret results.

Nevertheless, given that twitter users are generally thought to be younger and young people tended to vote remain, the result is surprising either way. It seems plausible that leave voters were more motivated, and consequently more active on twitter. It also seems likely that slogans such as vote leave, take control, or even Brexit better lent themselves to a simple message (particularly useful given the constraints of a tweet), and allowed different interpretations such that users could project their desired meaning onto the slogan. Whether in the press or on social media, British voters were more likely to encounter messages that favored leaving the EU, than those that favored remaining.

This post originally appeared on the LSE Brexit blog

Note: This article gives the views of the author, and not the position of the Euro Crisis in the Press blog nor of the London School of Economics.

Stefan Bauchowitz holds a PhD from the London School of Economics.

Max Hänska is a lecturer (Assistant Prof.) at De Montfort University (UK) where his research interests center on social media, political communication and collective decision-making.

Related articles on LSE Euro Crisis in the Press:

The battle lines have been etched

‘We want our country back’ – stop sneering, start listening

On Brexit & Control

The UK is Reaping What the British Media Have Been Sowing for a Long Time


Print Friendly, PDF & Email
This entry was posted in EU ref, Max Hänska, United Kingdom and tagged , , . Bookmark the permalink.