Understanding the public's willingness to be vaccinated against Covid-19, based on survey data.

1. The original visualisation

For this DataViz Makeover, I have used data from Imperial College London YouGov Covid 19 Behaviour Tracker Data Hub, which collects global insights on people’s behaviours in response to Covid-19.

Before making over the selected visualisation, it is important to have a clear understanding of the context of the visualisation and its key takeaways which are dependent on:

2. Critiques and suggestions for current visualisation

My critique is based on Ben Jones's four quadrants of data visualisation, which are defined by clarity and aesthetics.

Ben Jones defined clarity as “Does it make sense” and aesthetics as “Does it look good”.

Based on this four-quadrant concept, I have classified the original visualisation as belonging to quadrant III. See below for the explanation of my classification.

2.1 Clarity

S/N | Critiques | Improvement points
1 | Title is not helpful and has a spelling error - no definition of “Pro-vaccine”. The writer does not explain the concept of pro-vaccine: does it count ‘strongly agree’ only, or does it include ‘agree’ as well? | Change the title. Rank countries by percentage of agrees (strongly agree and agree).
2 | The visualisation on the right is misleading - it shows only the “strongly agreed %” with no explanation of the basis for selection. For example, in terms of ‘strongly agreed’ to vaccinate, the visualisation ranks France higher than Japan, although France has a high number of people who strongly disagree, whilst a large proportion of Japanese respondents are ‘neutral’. | Remove the chart. Use statistical values to showcase survey results.
3 | Legends not fully labeled - values 2 to 4 are not labeled and may confuse the reader. | Remove numbers and label legends: Strongly Agree, Agree, Neutral, Disagree, Strongly Disagree.
4 | Margin of error is not shown - as the data comes from a survey (a sample), the results are estimates subject to sampling error and do not exactly represent the whole population's opinion. | Use statistical values and charts (i.e. error bars) to show the margin of error for the survey results.
5 | No call-outs to provide context - since this is a static chart, call-outs should be used to provide more context. | If the chart is static, include call-outs, text or images to enhance it. If interactive, include tooltip details and the ability to filter.
6 | No source data included. | Include the data source at the bottom of the visualisation.
7 | A 100% stacked bar chart may not be the most appropriate chart - with 16 countries and 5 opinions, which is a lot of variables, it is difficult to gauge agree / disagree / neutral trends using a stacked bar chart. | Given that the data is survey data and the main idea is to analyse trends in the percentages of agree, disagree and neutral opinions, a Likert (diverging) chart would be more appropriate.

2.2 Aesthetics

S/N | Critiques | Improvement points
8 | Colour palette is glaring and not meaningful, as it does not help convey information. | Use grey for neutral, reddish colours for disagree and bluish colours for agree. Or use the “traffic light” colours of Red (Disagree), Yellow (Neutral) and Green (Agree).
9 | Redundant chart - the right chart is a subset of the left chart. | If the visualiser wants to highlight the “% strongly agree” field, tone down the colours of the other fields and use a striking colour for the “% strongly agree” field. Alternatively, use a gradual intensity of colour to match the corresponding change in the intensity of the view.
10 | Location of legends - there is a lot of unused whitespace below the legend. | Align the legend horizontally below the chart.
11 | Decimal points of axes are not consistent - one chart uses no decimals and the other uses 1 decimal point. Formatting should be consistent throughout the visualisation. | Use consistent formatting - 1 decimal point.
12 | Country axis title is not necessary - country names are self-explanatory, so the axis title can be removed. | Remove the country axis title.

3. Proposed Design

3.1 Sketch

  • Use reddish colours for disagree, grey for neutral and bluish colours for agree.
  • Calculate the margin of error from a proportion rather than a mean, since the aim is to know the proportion of respondents who are positive about / willing to take the vaccine.
  • Make the chart interactive – toggle the margin of error between positive, negative and neutral proportions; add tooltip details and filter on click.
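
As a rough illustration of the margin of error for a proportion mentioned above, the sketch below uses the normal approximation, z * sqrt(p * (1 - p) / n). The 95% confidence level (z = 1.96), the function name and the sample figures are assumptions for illustration only; the dashboard itself may use a different level or method.

```python
import math


def margin_of_error(successes: int, n: int, z: float = 1.96) -> float:
    """Normal-approximation margin of error for a sample proportion.

    z = 1.96 corresponds to a 95% confidence level (an assumption here).
    """
    if n <= 0:
        raise ValueError("n must be positive")
    p = successes / n
    return z * math.sqrt(p * (1 - p) / n)


# Hypothetical figures: 520 of 1,000 respondents are positive.
p_hat = 520 / 1000
moe = margin_of_error(520, 1000)
print(f"{p_hat * 100:.1f}% +/- {moe * 100:.1f} percentage points")
```

Because the margin shrinks with the square root of the sample size, a subgroup with fewer respondents (for example a single age band in one country) will show a noticeably wider error bar than the country as a whole.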

3.2 Advantages of Proposed Design

1) Margin of error chart
Readers can see how much precision to expect in the percentages of people who agree with, are neutral about or disagree with taking the vaccine.

2) Ranks countries using agreed opinions
With this method, all agreeing opinions are taken into consideration when ranking countries, as opposed to a single opinion (strongly agree) as in the original visualisation. Since there are only two outcomes in reality - vaccinate or do not vaccinate - public health authorities would have an indication of the percentage of the population who would opt to vaccinate.
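
The ranking described above can be sketched as follows. This is only an illustration: the country names, the responses and the function names are made up, and the real ranking is done inside Tableau.

```python
from collections import Counter

# Opinions counted as "agree" for the ranking (strongly agree + agree).
AGREE = {"Strongly agree", "Agree"}


def agree_share(opinions):
    """Share of responses that are 'Strongly agree' or 'Agree'."""
    counts = Counter(opinions)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return sum(counts[label] for label in AGREE) / total


def rank_countries(responses):
    """Sort countries by agree share, highest first.

    responses maps a country name to a list of opinion labels.
    """
    shares = {country: agree_share(ops) for country, ops in responses.items()}
    return sorted(shares.items(), key=lambda item: item[1], reverse=True)


# Hypothetical responses, for illustration only.
sample = {
    "Country A": ["Strongly agree", "Agree", "Neutral", "Disagree"],
    "Country B": ["Strongly agree", "Neutral", "Neutral", "Strongly disagree"],
}
for country, share in rank_countries(sample):
    print(f"{country}: {share:.0%}")
```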

3) Neutrals by the side
Given that a large proportion of respondents answered neutral, the trend is easier to observe with neutrals placed to the side rather than in the usual diverging chart, in which neutrals sit between positive and negative.

4) Interactive charts
Readers can toggle survey question, see demographic snapshots by country and see breakdown of opinions by demographic.

5) Labeled opinion legend
Legends are labeled with proper text instead of numbers, making it immediately clear to the reader.

6) Meaningful colour palette
Colours are consistent across both dashboards and are visually pleasing.

4. Final visualisation

4.1 Dashboard

For the purpose of this exercise, I have included additional survey questions and demographic data compared to the original visualisation.

The dashboards are interactive - demographic and country data change according to the selection.

Due to the limited width of the webpage, the visualisation may appear compressed. To enjoy the full experience, please view it in Full Screen mode. This visualisation is also available on Tableau Public.

4.2 Main observations

1) The French are not at all positive about the Covid-19 vaccine

Although France has the 7th highest death toll globally as of Feb-2021, the French are not very positive about being vaccinated against Covid-19, with 52% of respondents negative about having the vaccination.
Those who are particularly negative about the vaccine are in the 30 to 39 age band. French women also tend to be more negative than men.

2) Females are less positive about the vaccine
Females are less positive about taking the vaccine should it be available within the week - at a global level only 45% of females are positive about taking the vaccine, compared to 52% of males. Furthermore, the margin of error for females is narrower than that for males, indicating a more precise estimate.
French females are particularly hesitant, with 60.3% of respondents voting against taking the vaccine.
This vaccine hesitancy among women may be linked to pregnancy, as Covid-19 vaccine trials were not conducted on pregnant women.

3) Asian Countries - high proportion of neutral response
The only 3 Asian countries in this survey (Singapore, Japan and South Korea) lead in the proportion of neutrals, with a relatively wide margin of error of 4-5 percentage points.
Looking deeper into the age bands for each country, neutrality appears quite consistent across all age bands. The exception is older respondents aged 80 and above, who are either more receptive or neutral to being vaccinated, apart from Korean 90 to 99 year olds, of whom 100% were negative.

4) More than half of the respondents (those in the positive and neutral group) are worried about the potential side effects of Covid-19 vaccine.
Although France was the least positive about the Covid-19 vaccine (as seen in observation 1), the 3 Asian countries (South Korea, Singapore and Japan) are the most worried about its side effects. Further investigation reveals that younger respondents (those below 59 years old) tend to worry more about the side effects of the Covid-19 vaccine than older respondents aged 60 and above. Again, females tend to be more worried than males.

5) Respondents are more positive about taking the vaccine in the future (in a year).
Respondents across all countries are more receptive (those in positive group) to taking the Covid-19 vaccine in a year. Hence it would be interesting to observe how sentiment changes across time.

5. Data visualisation steps

Of the 30 files, 14 were excluded as the "vac_1" field was not available, i.e. the vaccine survey was not conducted in these countries.

S/N | Country | Included? | Pre-processing in Excel | Comments
3 | Denmark | Yes | Yes | Employment details split into 7 columns
4 | Finland | Yes | Yes | Employment details split into 7 columns
7 | Israel | Yes | Yes | Values for household_size, household_children and employment_status were missing
11 | Norway | Yes | Yes | Employment details split into 7 columns
15 | Sweden | Yes | Yes | Employment details split into 7 columns

5.1 Data Cleaning and Preparation

Data cleaning with Excel

1) Denmark, Finland, Norway, Sweden
Employment status was split into 7 columns, see figure below.

As such, an Excel formula was used to combine the 7 columns into a single field, "employment_status".

Figure 9

2) Israel
Values for household_size, household_children and employment_status were missing. I created 3 new columns and filled these fields with “NA” values.

Figure 10

Extraction with R
The following script was used to

  1. extract the 11 selected columns - age, gender, household_size, household_children, employment_status, vac_1, vac2_1, vac2_2, vac2_3, vac2_6 and vac_3.
  2. add new column with country name
  3. merge all 16 files into 1
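
The R script itself is not reproduced here. As a stand-in, the three steps above can be sketched in Python using only the standard library. This is not the author's script: the "age" and "gender" column names, the one-CSV-per-country file naming (e.g. "denmark.csv") and the function name are assumptions made for illustration.

```python
import csv
from pathlib import Path

# The 11 selected columns. "age" and "gender" are assumed spellings.
COLUMNS = [
    "age", "gender", "household_size", "household_children",
    "employment_status", "vac_1", "vac2_1", "vac2_2", "vac2_3",
    "vac2_6", "vac_3",
]


def merge_country_files(paths):
    """Keep the selected columns, add a country column, merge all files.

    Files without a "vac_1" column are skipped, mirroring the exclusion
    of countries where the vaccine survey was not conducted.
    """
    merged = []
    for path in map(Path, paths):
        # Assumption: the file name (without extension) is the country.
        country = path.stem.replace("-", " ").title()
        with path.open(newline="", encoding="utf-8") as handle:
            reader = csv.DictReader(handle)
            if "vac_1" not in (reader.fieldnames or []):
                continue
            for row in reader:
                record = {col: row.get(col, "NA") for col in COLUMNS}
                record["country"] = country
                merged.append(record)
    return merged


# Example usage with a hypothetical folder layout:
# rows = merge_country_files(sorted(Path("countries").glob("*.csv")))
```

Columns that are absent from a file are filled with "NA", which matches the way the missing Israel fields were handled in Excel.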

Data Cleaning with Excel

  1. Employment status field
    Upon inspection of the data, I observed that the employment data had a mix of strings and numeric data.

    Hence I have replaced the numbers with strings according to the codebook provided by YouGov:
    Numeric | Changed to
    1 | Full time employment
    2 | Part time employment
    3 | Full time student
    6 | Not working

I used the Excel Find and Replace function to replace the numeric values.
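
For readers who prefer a scripted alternative to Find and Replace, the same recoding can be sketched as a lookup. Only the four codes listed above are included; any other value is left unchanged, and the function name is made up for illustration.

```python
# Only the codes listed in the table above; other values pass through.
EMPLOYMENT_LABELS = {
    "1": "Full time employment",
    "2": "Part time employment",
    "3": "Full time student",
    "6": "Not working",
}


def recode_employment(value: str) -> str:
    """Replace a numeric employment code with its text label."""
    key = value.strip()
    return EMPLOYMENT_LABELS.get(key, key)


print(recode_employment("1"))
print(recode_employment("Not working"))
```

A lookup on the whole cell value avoids one risk of a loose Find and Replace, where replacing "1" could also alter other cells that merely contain the digit.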
  2. Survey questions: Vac fields (6 total)
    As with the employment status field above, I have replaced the values as follows:
    Original | Changed to
    1 - Strongly agree | Strongly agree
    5 – Strongly disagree | Strongly disagree

I used the Excel Find and Replace function for the replacement.
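
The same clean-up can be sketched with a regular expression that strips a leading number and dash. The pattern and function name are illustrative assumptions; values without a dash, such as the unlabeled 2 to 4, are left untouched.

```python
import re

# Leading digits followed by a hyphen or an en dash, e.g. "1 - " or "5 – ".
_NUMERIC_PREFIX = re.compile(r"^\s*\d+\s*[-\u2013]\s*")


def clean_opinion(value: str) -> str:
    """Drop a leading 'N - ' prefix from a survey answer."""
    return _NUMERIC_PREFIX.sub("", value).strip()


print(clean_opinion("1 - Strongly agree"))
print(clean_opinion("5 \u2013 Strongly disagree"))
```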
Save the Excel file as clean_countries v2.

My data is now clean and ready to be loaded into Tableau.

Data Preparation with Tableau
1) Data Source Tab
i) Load file
On the Tableau landing page, go to Connect > To a File > More and select the “clean_countries v2” file.

ii) Pivot and rename survey questions.
Pivot data
iii) Rename headers to “Questions” and “Opinion”
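
To make the effect of the pivot concrete, the sketch below reshapes one wide row (one column per survey question) into several long rows with "Questions" and "Opinion" columns. It illustrates the reshaping only and is not what Tableau runs internally; the sample row is hypothetical.

```python
# The six survey question columns that get pivoted.
QUESTION_COLUMNS = ["vac_1", "vac2_1", "vac2_2", "vac2_3", "vac2_6", "vac_3"]


def pivot_questions(rows):
    """Turn each question column into its own (Questions, Opinion) row."""
    long_rows = []
    for row in rows:
        # Demographic fields are repeated on every pivoted row.
        keep = {k: v for k, v in row.items() if k not in QUESTION_COLUMNS}
        for question in QUESTION_COLUMNS:
            if question in row:
                long_rows.append(
                    {**keep, "Questions": question, "Opinion": row[question]}
                )
    return long_rows


# Hypothetical respondent, for illustration only.
wide = [{"country": "France", "age": "35", "vac_1": "Agree", "vac_3": "Neutral"}]
for item in pivot_questions(wide):
    print(item)
```

After the pivot, one respondent contributes up to six rows, so counts of respondents should be taken per question rather than over the whole table.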

2) Employment status
Group “NA”, “Not working” and “Other” together as “Others”

Figure 17

3) Age groups
Group ages into 8 bands:

  • 18 to 29
  • 30 to 39
  • 40 to 49
  • 50 to 59
  • 60 to 69
  • 70 to 79
  • 80 to 89
  • 90 to 99
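
The eight bands above can be expressed as a small function. In Tableau this is done with a group; the sketch below only shows the band boundaries, and the "Other" label for ages outside 18 to 99 is an assumption.

```python
def age_band(age: int) -> str:
    """Map an age to one of the eight bands listed above."""
    if age < 18 or age > 99:
        return "Other"  # assumed label for out-of-range ages
    if age <= 29:
        return "18 to 29"
    lower = (age // 10) * 10
    return f"{lower} to {lower + 9}"


for sample_age in (18, 35, 99):
    print(sample_age, "->", age_band(sample_age))
```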
4) Household children
Some responses were not in accordance with the codebook. For numeric values that were not in the codebook, I have assumed that the figure indicates the number of children.
Group number of children into 3 bands:

Values | Group to
1, 0 | No Kids
2, 3, 4, 5, 6, 7, 8 | Kids

Figure 19

5) Household size
Group households into 4 bands:

Values | Group to

Figure 20

6) Survey
Apply the following Aliases

Figure 21

7) Score

Apply the following Aliases

Figure 23

5.2 Data visualisation steps

1) Likert Scale

  1. Create calculated fields
