In a world that’s full of data, we have many questions. Luckily, all this data can also help us answer those questions. This course will walk through the basics of statistical thinking and will teach the correct statistical tool to help answer our questions of interest. Ultimately our goal is to assist you in learning how to interpret our findings and develop meaningful conclusions.
This course is intended to have the same “punch” as a typical introductory undergraduate statistics course, with an added twist of modeling. This course is also intentionally devised to be sequential, with each new piece building on the previous topics. Once completed, students should feel comfortable using basic statistical techniques to answer their own questions about their own data, using a widely available statistical software package (R).
Week One: Introduction to Data
- Why study statistics?
- Variables and data
- Getting to know R and RStudio
Week Two: Univariate Descriptive Statistics
- Graphs and distribution shapes
- Measures of center and spread
- The Normal distribution
Week Three: Bivariate Distributions
- The scatterplot
Week Four: Bivariate Distributions (Categorical Data)
- Contingency tables
- Conditional probability
- Examining independence
Week Five: Linear Functions
- What is a function?
- Least squares
- The Linear function – regression
Week Six: Exponential and Logistic Function Models
- Exponential data
- The Logistic function model
- Picking a good model
Week Seven: Sampling
- The sampling distribution
- Central limit theorem
- Confidence intervals
Week Eight: Hypothesis Testing (One Group Means)
- What makes a hypothesis test?
- Errors in testing
- Alpha and critical values
- Single sample test
Week Nine: Hypothesis Testing (Two Group Means)
- Independent t-test
- Dependent t-test
Week Ten: Hypothesis Testing (Categorical Data)
- The chi-square test
Week Eleven: Hypothesis Testing (More Than Two Group Means)
- The ANOVA
- One-way ANOVA
Videos, Readings, and Comprehension Checks are worth 20% of your grade. Pre-Labs are worth 15%; Labs are worth 30%; and Problem Sets are worth 35% of your grade. A final score of 70% or more is a passing grade to receive the course certificate.
There are no formal due dates for course assignments. However, to receive the maximum benefit from the course, we suggest that you complete each chapter’s coursework by noon on the Tuesday on which the next chapter opens.
In order to receive yourcertificate of completion, you must submit all coursework (and receive a passing grade) by midnight on Tuesday, February 10.
Texts & Software
We will be using a custom created open source text that will be embedded into the edX course as PDF readings. Your reading assignments will be released weekly with links to the appropriate PDF readings. The current reading assignments (including the optional reading assignments) are also available under the "Readings" tab.
You will need to install R and RStudio onto the computer you will be using for the course. Instructions for downloading the software can be found here (Mac users) or here (Windows users) as well as on the sidebar of the "Course Info" page and the "Download R and RStudio" page.
Pre-Labs & Labs
Before each Lab, there will be a Pre-Lab that offers guided instruction processes to help you answer the questions. Each Pre-Lab and Lab is focused around a particular question. You will (1) Reflect on the Question, (2) Analyze the Data, and (3) Draw Conclusions.