2.1. Data
In order to identify barriers and motivations for tap and bottled water consumption, we conducted a questionnaire survey in the Czech Republic. The questionnaire included questions that elicited attitudes, norms, perceived barriers, habits, socioeconomic characteristics, consumption behaviour, and values.
Data collection took place 4th October to 8th November 2022. The recruitment of respondents for the survey was carried out by a contractor from an actively managed online access panel, i.e. a panel in which the identity of respondents was verified during recruitment and, where applicable, during the respondent's participation in the panel (e.g. survey questions that check one of the indicators listed in the registration questionnaire, telephone checks or checks on the uniqueness of the bank account number provided), and in which each respondent was controlled to take a maximum of 2 surveys per week, up to a maximum of 36 surveys per year.
Participation in the survey posed a risk to respondents comparable to normal internet surfing. The questionnaire survey did not ask controversial questions or questions that would be traumatic for the respondents to answer. The questionnaires were anonymous. The panel operator assigned a random identification number to the respondent at the beginning of the questionnaire, and only this random identification number is available to the researchers analysing the survey data. Respondents gave informed consent before participating in the survey. The ethical approval was obtained from the Institutional Review Board of the Charles University Environment Centre (Number 1/2022).
The sample of respondents was selected by quota sampling with quotas for region, size of place of residence, age, gender, and education. The resulting sample is representative of the Czech population aged 18 to 69 years based on socio-demographic characteristics, see
Table 1 in the Appendix.
In the data cleaning process, incomplete records, records with logical inconsistencies, and „speeders“, i.e. questionnaires below 48% of the median completion time (217 questionnaires in total), were discarded. The pilot survey was conducted approximately one week before the main data collection, with approximately 200 valid observations. The final total number of valid questionnaires in the dataset is 3411.
2.2. Measures
In this section, we describe the questions and response scales utilised to construct variables used in our models.
Bottled and tap water consumption was recorded as frequency of consumption. Respondents answered the questions "How often do you personally drink tap water (water from the public tap)?" and "How often do you drink bottled water (sparkling or still, flavoured or unflavoured, mineral, drinking, spring, or baby water)?" and had a total of 6 choices indicating the extent of their consumption ("Several times a day," "Once a day," "Several times a week," "Several times a month," "Less frequently," and "Never").
The variable "taste" was measured by letting respondents indicate either the option "Bad taste" for the question "For what reasons do you not drink tap water, or do you drink it infrequently?" or "Taste" for the question "What are your reasons for drinking bottled water?". Health concern was measured using identical questions to "taste", i.e. when respondents reported having health concerns about tap water (response options "Health concern" or "The water in our house is defective") or perceiving bottled water to be beneficial to their health (response options "It is healthier than tap" or "Safety"). The question "For what reasons do you not drink tap water, or do you drink it infrequently?" was close-ended with a response option “Other”, which was followed by an open-ended question allowing to provide further reasons. The responses to the open-ended question were coded and counted. The percentages in
Figure 2 include also responses to the open-ended question.
The habit of drinking tap water was constructed based on the question "How often do the following situations happen to you?" which takes into account respondents' answers to only the following two statements: "If tap water is available, I prefer it to bottled water without thinking." and "I drink tap water automatically." [
29]. The frequency of situations was measured on a 7-point Likert scale, where 1 represented "Rarely" and 7 represented "Very often." The resulting variable was created by adding two items (statements) and dividing them by the appropriate number (in this case two). The internal consistency of the scale was measured using Cronbach's alpha and has a value of 0.80, which is a result above 0.70 and indicates good scale consistency [
27]. The calculated value indicates the average of the scores on the items included in the scale.
Environmental concern was measured by incorporating several statements related to biospheric values [
30]. Participants were presented with descriptions of some people and were asked to evaluate how similar the persons described are to them (“It is important to him/her to…””...prevent pollution”, “...live in harmony with nature”, “...protect the environment”). Responses to these are measured by a Likert scale containing six points ranging from one ('Not at all like me') to six ('Very much like me') [
31]. The variable was constructed as in the previous case (Cronbach's
α=0.88).
The construct called behavioural beliefs in the TPB [
24] is labelled “attitudesf” in our regression models to shorten the variable name. The variable “attitudesf” was constructed by multiplying measures of behavioural belief strength by their corresponding measures of evaluation and by averaging the multiplication products (Cronbach's
α=0.74). The behavioural belief strength was measured using a seven-point Likert scale with response anchors "Strongly Disagree" and "Strongly Agree". The following statements were used: "If I drink mostly tap water for the next 3 months...": ...I will save money on bottled water, ...I will save time shopping and taking plastic bottles to the dumpster, ...I will have a mineral deficiency, ...I will get much less microplastics into my body, ...I will get harmful substances from chlorinating water, ...I will drink fresh water. Evaluations of beliefs were also measured using a Likert scale format, but with a rating of 1 indicating "Very Bad" and a 7 indicating "Very Good" in response to the question "How good or bad do you think these situations are?". Not all situations presented to respondents were in the same direction, so reverse coding was needed in some cases.
The construct of normative beliefs [
24] was measured as a combination of a question to elicit injunctive normative belief strength ("Do the following people think you should drink mostly tap water over the next 3 months?" with items "My parents", "My spouse", "My children", "Good friends") and motivation to comply ("To what extent do you care about what the following people think you should do?" with the same items). In both cases, a seven-point Likert scale was used. For the injunctive normative belief strength, response anchors are "Definitely not" and "Definitely yes". For the motivation to comply, response anchors are "I don't care at all" and "I care a lot". The variable “norm” was constructed by multiplying the injunctive normative belief strength by the motivation to comply and by averaging the multiplication products. The internal consistency was acceptable (Cronbach's
α=0.78).
The construct of control beliefs is composed of control belief strength and power of control factor [
24]. To measure the control belief strength, participants reported on frequency with which they face barriers to getting tap water when on-the-go (question: "How often do the following situations happen to you?"; statements: “In restaurants or cafes, it is not possible to order tap water”; “At petrol stations, it is not possible to turn on or buy tap water.”; “In public places, it is not possible to turn on tap water”.; scored 1 = "Rarely" and 7 ="Very often"). To elicit the power of control factor, respondents were asked how much they agreed or disagreed with the following statements: "Drinking mostly tap water over the next 3 months will be easier for me..." "... if it can be ordered at more restaurants and cafes", "... if it can be bought at petrol stations in own bottle and its price is lower than price of bottled water", and "... if there are more drinking fountains in public places". A 7-point Likert scale with a rating of one corresponding to "Strongly disagree" and a rating of seven corresponding to "Strongly agree" was utilised. The variable “behave control” was constructed in an identical manner to variable “norm” and “attitudesf” (Cronbach's alpha
α=0.75).
Respondents were asked to rate the statements related to the TPB in context of the possibility they will predominantly (90%) drink tap water over the next three months, either in general or outside the home, in case their households are not connected to a public water supply.
Quota and several socio-demographic questions were also included in the questionnaire. In the regression models, we use dummy variables for gender, age, education, income, employment status, marital status, and number of household members. In our sample, 49% of participants are female and 51% male. There was also an option "Other" as the preferred gender. Only 0.12% of the respondents gave this answer, therefore this category is merged together with the female gender. Respondents were asked to indicate their age in years. The median age of the participants was 45 years. Survey participants were also asked for their highest level of education, choosing from a list of 11 possible levels. To compare groups, age was divided into three categories: younger people (18 to 34 years old), middle-aged (35 to 50 years old) and older people (51 years old and above). Education was also divided into three categories: primary and lower secondary (maximum of vocational secondary without a high school diploma), secondary (those who had not reached higher education than grammar school) and tertiary (education level higher than grammar school), see
Table 1 in the Appendix.
The income was measured as the respondent's total net monthly household income from all sources after deducting taxes and levies using 12 income categories, ranging from "Less than 13,000 CZK" to "More than 70,000 CZK", and options "Don't know" or "Don't want to answer". To create continuous variable, we assigned the midpoint of each income category to all observations in that category. We computed per capita adjusted household income by dividing this continuous variable by the respective number of household members. As the per capita adjusted household income was not significant in the regression models, we present only models with dummy income variables in this paper. Dummy income variables were created by recoding per capita adjusted household categories. Three categories were created: income per household member less than or equal to 12,250 CZK (low income), income greater than 12,250 CZK and less than or equal to 21,000 CZK (medium), and income higher than 21,000 CZK (high). The fourth dummy variable was created for missing information on income.
To measure the employment status, 10 options were offered in the survey. The employment status was categorized as follows: working (full-time, part-time and self-employed), unemployed (unemployed, retired, studying or disabled) and at home (maternity leave, full-time caring). In our sample, the most represented were full-time employed (50%) and retired (15%).
Respondents were asked to mark one of seven marital status categories or "Don't want to answer". Most of respondents are married (43%). Again, 3 categories were constructed for modelling. The first category represents people who are not single (married, in a partnership or living together without being married), the second is single (single and never been in a formal relationship), and the third category is lost partner (people who are widowed, divorced or separated from their partner).
Survey participants were also asked to state the number of household members, including themselves. Most of them (34%) lived in a two-person household. Those who indicated a household size of 2 or more were also asked about the number of children under the age of 18 and retired persons in their household. The most common number was 1 child (20%) or 1 retiree (25%) in the respondent's household.