Overview

For thousands of years human civilization has relied on dams for flood control, water supply, irrigation, navigation, and hydropower. Dams and reservoirs have long been in use, however the past century has seen a dramatic increase in their number as populations have grown and development has spread around the globe.

While dams and reservoirs can provide significant benefits to society, they nonetheless disrupt the natural flow regime of rivers and can have a host of negative environmental and social impacts. These negative impacts include (but are not limited to) significant methane emissions from reservoirs, disruption of stream and riparian ecosystems, changes in sediment delivery downstream, and displacement of communities around the dam/reservoir.

Given their potential for both positive and negative environmental and societal impacts, the careful study of dams and reservoirs is important to ensuring optimal outcomes. With tens of thousands of large dams around the globe, an important step in their study has been the mapping and cataloging of existing dams. In today’s lab you will work with data from the United Nations Food and Agricultural Organization (FAO) Georeferenced dams database1 to gain insight into the patterns, features, and trends in global dams.


Background reading

For some helpful background you should read the Executive Summary section of the following report. This document will give you a useful overview and context of the issues and will help guide your analysis. In addition to this document, you may find it helpful to search out other sources to help you make sense of your analysis.


Dams and Development: A new framework for decision-making


Objectives

In the lab that you submit you should discuss your observations/findings related the objectives below. Do not simply make the figures/tables without providing a discussion.

Required

0. Examine the data.

First you will need to download the data. The data can be downloaded from our class site using the following code

library(tidyverse)

dams_data <- read_csv("https://stahlm.github.io/ENS_215/Data/Dams_FAO_SelectCols_LabData.csv")


You will also need to look at the readme file associated with the above dataset. The readme file describes the variables in the dataset.

Once you’ve loaded in the data, make sure the data looks ok and familiarize yourself with the different columns. You can examine/get familiar with the dataset in many ways (e.g. head(), summary(), View(),…). Remember that View() should be run in your CONSOLE not your notebook.

After looking at the data you may find it helpful to reorder the columns. You can easily move columns of interest to the front of the dataset using the select() function. You can do this by specifying the columns you want up front and then using the everything() function to specify that you would like to put all the remaining columns after the ones you’ve moved up front. For instance,

select(Dataset, column to move up front, another column to move up front, everything())


In your submitted lab write-up/discussion, you should provide a short summary/description of the dataset (e.g. number of variables and observations, general content, key variables, period of record,…).

IMPORTANT NOTE: some of the variables will have missing data, which is recorded as NA. NA values can create issues with some functions (e.g. mean()), however these issues can be dealt with by telling the function to ignore observations with NA values. This is done by specifying na.rm = TRUE as an input to a function. For instance mean(data here, na.rm = TRUE).


1. Create a scatter plot showing the relationship between reservoir storage and reservoir area

Do you notice any interesting features? Comment on your observations. You may want to try coloring the points by continent to see if there are regional patterns/trends.


2. Create a scatter plot showing the relationship between reservoir storage and dam height

Do you notice any interesting features? Comment on your observations. Do you observe any outlier or notable points. You may want to try coloring the points by continent to see if there are regional patterns/trends.

Hint: setting your axes to log scale using the scale_x_log10() and scale_y_log10() can make the plots easier to read/interpret


3. Calculate height, area, and storage averages by continent and display these results to your notebook.

How do these statistics differ by continent? Any thoughts on why they may differ?

Hint: Use the functionality of dplyr to do this efficiently


4. Create a box plot to show how the distribution of reservoir storage capacities varies between continents

Once you’ve made you plot, take and look and note what you observe. Are the distributions similar or different between continents?

Hint: setting your axes to log scale using the scale_x_log10() and/or scale_y_log10() can make the plots easier to read/interpret


Additional analysis

The task above were intended to get you started. Once you finish those above tasks you will need to continue with your own analysis. Below are some additional ideas, however you are more than free (and encouraged) to pursue your own ideas.


1. Create a summary table that has additional info about dams and reservoirs

You could make a table with averages, total, minimums, and maximums for dam height, reservoir area, reservoir storage,… This table could be for all of the data or broken down by continent (or by country, or by year,…)


2. Create a plot showing the total number of dams over time


3. Select a country or countries of interest and examine the data


4. Examine the data on the purposes of the reservoirs (e.g. irrigation, water supply, navigation,…)


Prepare your final lab report

Advice/guidance

Remember that your lab should be a nicely formatted and organized report that includes your code, output, and discussion. You should have an introduction and conclusion section and you should clearly delineate different sections with headers. Also remember to cite any sources that your rely/reference.

At this point in the term you should start to think about the organizational logic of your final lab report (i.e. order your analysis sections and discussion in a sensible manner). You don’t need to get bogged down on this point when working in lab, but it is good to consider when you are putting together your final report for submission.

The “additional analysis” section of your lab is a critical compenent and should be given careful thought and effort.

It is important to realize that my expectations for each of the graded criteria increase as the term progresses and you gain more skills and are better able to develop your own analyses.

Make sure you are familiar with the expectations and evaluation criteria presented in the links below:


Submitting the lab

Your lab is due prior to the start of next week’s lab. Once you are finished and satisfied with your work you should Knit it to an .html file and submit both your html and Rmd file to Nexus.

Please make sure that the .html file you submit is .html and not nb.html.