Please could you answer all the questions seen in the attached picture. Please send in the answers on a word document (no references or citations needed) along with the R file so I can see the codes you used. I will provide the dataset. The answers in the word file don’t need to be super long just a few short sentences will do.
Part III: Application – 33 marks
Please answer the following questions based on your analysis of the “BES_2017_summative.dta” dataset, which you
can download from Moodle. The dataset is a subset of the British Election Study 2017, which is a representative
sample of British voters conducted after the 2017 General Election. You can find more information here: https:
//www.britishеlectionstudy.com/data-object/2017-face-to-face/. Please make sure to number your tables
and figures, and add the appropriate labels.
Exercise 1: 10 marks
a Create a table which summarises how political interest varies by university education. University education
is a binary variable coded 1 (went to university) or 0 (did not go to university). Display the means and a
meaningful measure of variability (or spread) by education level. Interpret the results you see in your sample.
(4 marks)
b You would like to make an inference about the relationship between university education and political interest
in the population of British voters. Formulate the appropriate null hypothesis and test whether you can reject
the null hypothesis using linear regression. (6 marks)
2
Exercise 2: 10 marks
a Create a scatter plot which shows how the vote share for the Liberal Democrats in 2017 varied by age. You
might have to adjust the y-axis to account for the range of the data. (4 points)
b Reproduce the same plot, but this time include a linear (OLS) regression line in your plot. Is the association
between age and LibDem vote in your sample positive or negative? Why is the OLS regression line also called
the “line of best fit”? (6 points)
Exercise 3: 13 marks
a At the individual level, regress voting LibDem on age (use linear regression). Write down the OLS regression
model that you are estimating, and define all terms in the model. Display and interpret the substantive results
of your model. (5 marks)
b Calculate by hand: What is the predicted LibDem vote for a 23 year old? Display all steps in your calculation.
(2 marks)
c Calculate by hand: What is the predicted LibDem vote for 16 year olds if the government decided to lower
the voting age? Display all steps in your calculation. (2 marks)
d What can you say about the relationship between voting LibDem and age in the population of British citizens?
(4 marks)