Learning more
- Our class (ENS-215) website: All of the content will remain up and available to you.
- ModernDive: For those of you interested in learning more about statistics and data modeling, the later chapters of the book are a great starting point.
- R for Data Science: As you already know, this book is a comprehensive resource for using R in data science applications.
- An Introduction to Statistical Learning: An excellent and freely available textbook that introduces machine learning concepts and applications. This book is a bit advanced (aimed at upper-level undergraduates and graduate students) but is still reasonably accessible.
- Two of the authors (Profs. Hastie and Tibshirani of Stanford) have made 15 hours of video lectures that go along with the book. The lectures are excellent and pretty entertaining too (they have great banter with each other). You can access the lectures here.
- Geocomputation with R: Excellent and freely available text on geographic data analysis, visualization, and modeling. The book implements these concepts and techniques with R. The chapter on making maps in R is likely to be of particular interest to many of you.
- DataCamp: There are many additional interactive classes available here covering a wide range of topics (e.g. machine learning, geographic/spatial analysis, statistics) and their implementation in R. Your free access lasts for six months, so you will still have access for some time after the term ends.
Data resources
- USGS National Water Information System: Web interface that provides access to USGS water data (e.g. flows, water levels, chemical and biological conditions) for nearly 2 million sites across the US and US territories. You can also query and download the data directly in R using USGS R packages.
- NOAA Climate Data: Climate datasets for the US and the world.
- Natural Earth Data: Site with freely available geospatial datasets (e.g. country borders, river shapefiles, land surface elevation data).
- US Census Data: Access to US census data (e.g. population, income, demographics).
- Data.gov: Has tons of datasets from US government agencies, covering a wide range of topics.
- USDA data: Data from the US Department of Agriculture (e.g. crop yields, irrigated acres, fertilizer usage).
- UN FAO Aquastat Data: UN data on water resources.
- NYC Open Data: Excellent site with datasets for NYC, many of which are related to the environment.