Day | Section | Topic |
---|---|---|
Mon, Jan 13 | 1.2 | Data tables, variables, and individuals |
Wed, Jan 15 | 2.1.3 | Histograms & skew |
Fri, Jan 17 | 2.1.5 | Boxplots |
Today we covered data tables, individuals, and variables. We also talked about the difference between categorical and quantitative variables.
In the data table in the example above, who or what are the individuals? What are the variables and which are quantitative and which are categorical?
If we want to compare states to see which are safer, why is it better to compare the rates instead of the total fatalities?
What is wrong with this student’s answer to the previous question?
Rates are better because they are more precise and easier to understand.
I like this incorrect answer because it is a perfect example of bullshit. This student doesn’t know the answer so they are trying to write something that sounds good and earns partial credit. Try to avoid writing bullshit. If you catch yourself writing B.S. on one of my quizzes or tests, then you can be sure that you a missing a really simple idea and you should see if you can figure out what it is.
Today we did our first in-class workshop:
Before that, we talked about how to summarize quantitative data. We started by reviewing the mean and median. We saw how to find the average in Excel, and we talked about how to find the position of the median in a long list of numbers (assuming they are sorted).
Then we used the class data we collected last time to introduce histograms and stem-and-leaf plots (also known as stemplots). We also talked about how to tell if data is skewed left or skewed right. One important concept is that the median is not affected by skew, but the average is pulled in the direction of the skew, so the average will be bigger than the median when the data is skewed right.
Until recently, Excel did not have an easy way to make histograms, but Google Sheets does. If you need to make a histogram, I recommend using Google Sheets or this histogram plotter tool.
Which is greater, the mean or the median household income?
Can you think of a distribution that is skewed left?
Why isn’t this bar graph from the book a histogram?
We introduced the five number summary and box-and-whisker plots (boxplots). We also talked about the interquartile range (IQR) and how to use the rule to determine if data is an outlier.
We started with this simple example:
An 8 man crew team actually includes 9 men, the 8 rowers and one coxswain. Suppose the weights (in pounds) of the 9 men on a team are as follows:
120 180 185 200 210 210 215 215 215
Find the 5-number summary and draw a box-and-whisker plot for this data. Is the coxswain who weighs 120 lbs. an outlier?
Day | Section | Topic |
---|---|---|
Mon, Jan 20 | Martin Luther King day, no class | |
Wed, Jan 22 | 2.1.4 | Standard deviation |
Fri, Jan 24 | 4.1 | Normal distribution |
Day | Section | Topic |
---|---|---|
Mon, Jan 27 | 4.1.5 | 68-95-99.7 rule |
Wed, Jan 29 | 4.1.4 | Normal distribution computations |
Fri, Jan 31 | 2.1, 8.1 | Scatterplots and correlation |
Day | Section | Topic |
---|---|---|
Mon, Feb 3 | 8.2 | Least squares regression introduction |
Wed, Feb 5 | 8.2 | Least squares regression practice |
Fri, Feb 7 | 1.3 | Sampling: populations and samples |
Day | Section | Topic |
---|---|---|
Mon, Feb 10 | 1.3 | Bias versus random error |
Wed, Feb 12 | Review | |
Fri, Feb 14 | Midterm 1 |
Day | Section | Topic |
---|---|---|
Mon, Feb 17 | 1.4 | Randomized controlled experiments |
Wed, Feb 19 | 3.1 | Defining probability |
Fri, Feb 21 | 3.1 | Multiplication and addition rules |
Day | Section | Topic |
---|---|---|
Mon, Feb 24 | 3.4 | Weighted averages & expected value |
Wed, Feb 26 | 3.4 | Random variables |
Fri, Feb 28 | 7.1 | Sampling distributions |
Day | Section | Topic |
---|---|---|
Mon, Mar 3 | 5.1 | Sampling distributions for proportions |
Wed, Mar 5 | 5.2 | Confidence intervals for a proportion |
Fri, Mar 7 | 5.2 | Confidence intervals for a proportion - con’d |
Day | Section | Topic |
---|---|---|
Mon, Mar 17 | Review | |
Wed, Mar 19 | Midterm 2 | |
Fri, Mar 21 | 5.3 | Hypothesis testing for a proportion |
Day | Section | Topic |
---|---|---|
Mon, Mar 24 | 6.1 | Inference for a single proportion |
Wed, Mar 26 | 5.3.3 | Decision errors |
Fri, Mar 28 | 6.2 | Difference of two proportions (hypothesis tests) |
Day | Section | Topic |
---|---|---|
Mon, Mar 31 | 6.2.3 | Difference of two proportions (confidence intervals) |
Wed, Apr 2 | 7.1 | Introducing the t-distribution |
Fri, Apr 4 | 7.1.4 | One sample t-confidence intervals |
Day | Section | Topic |
---|---|---|
Mon, Apr 7 | 7.2 | Paired data |
Wed, Apr 9 | 7.3 | Difference of two means |
Fri, Apr 11 | 7.3 | Difference of two means |
Day | Section | Topic |
---|---|---|
Mon, Apr 14 | Review | |
Wed, Apr 16 | Midterm 3 | |
Fri, Apr 18 | 7.4 | Statistical power |
Day | Section | Topic |
---|---|---|
Mon, Apr 21 | 6.3 | Chi-squared statistic |
Wed, Apr 23 | 6.4 | Testing association with chi-squared |
Fri, Apr 25 | Choosing the right technique | |
Mon, Apr 28 | Last day, recap & review |