Where are we and what should I be doing?


9. August 12, 2023:

Finished the course!!

To prepare for the final finish and review all of the homework on go over the two old finals on the webpage.

The final will be open noon Friday August 18 to midnight August 19.

There are review sessions scheduled Sunday August 13 at 7pm
and Thursday August 17 at 7pm.

Of course you can email questions to Percy or Rob.


8. August 5 2023:

On Friday night we got into the section on trees.
On Saturday morning we just got to the beginning of the section of trees.
The Friday night folks should do all of the section 5 homework.
The Saturday morning folks should do all of the section 5 homework except the last question
which is on tree.

Old Finals

Old finals from 2012 and 2007 are posted.
You might want to start working through these finals to prepare for our final.
07 final: skip Question 9 parts 9,15,17.
For parts Q7 (c), (d) you would need to use R, prop.test as in Section 2 homework, Q10 (a).
12 final: skip Question 2, Question 7 parts 3,4 would need R:prop.test.


7. July 30 2023:

In both sections we finished the notes on Multiple regression and started the topics notes
on understanding Multiple regression.

Do all the homework for multiple regression (Section 4).

Note that for excel I have a video of doing the multiple regression with a dummy variable
and the excel file in the Section 4 part of the webpage.

In R, the link "Simple Multiple Regression in R" does multiple regression with categorical variables in R.

There is also a link to the R script plot-midcity-with-N-and-B.R which
does the plotting with categorical B=Brick, N=neighborhood that we looked at in the notes.
The plotting is done in the R-package ggplot2, but you could do the same thing in base R.


5. July 15, 2023:

On Friday we finished the Section Section 3, Simple Linear Regression Notes.
On Saturday we finished all but 5. Back to portfolios of the Section Section 3, Simple Linear Regression Notes.
So, the midterm covers up to and including 4. Simple Linear Regression and Correlation in the Section 3 notes.

Homework
Finish the section 2 homework and problems 1,2,4,5, and 6 from the Section 3 homework.
R users might also look at problem 3 which is on the predictive interval.

We have the midterm this week, no class July 21-22!!!
The midterm exam will be open from noon Friday July 21 to 11:59 pm Saturday July 22, 2 hour time limit.

Review Sessions on zoom
Tuesday, July 18, 7pm-9pm.
Thurday, July 20, 7pm-9pm.

To prepare for the midterm:
Do all the homework suggested above and review all the past homework.

Review Quiz1 and Quiz2.

From the old midterms:
(Note that we are not doing things in the same order as some of the old classes so don't look at all of the problems!)
2020 midterm: all problems.
2016 midterm: problems 2,4,5,6,7.


Note:
You do not need to be able to run a regression for the midterm,
but you do need to be able to run regressions to do the homework.
Examples of running simple linear regressions and doing scatter plots are on the webpage in both R and excel.
In excel check out the video "Simple Regression and Scatter plot in Excel" in the excel section of the webpage.
In R, check out the link "simple-data-analysis.R" in the R section of the webpage and "Simple Linear Regression in R"
in the Section 3 notes part of the webpage.
If you have questions about running regressions in Excel, contact Percy.
If you have questions about running regressions in R, contact Percy or Rob.



4. July 8, 2023:

We got to the notes "4. The Improved Cereal Process" in the section 2 notes on Confidence Intervals and Hypothesis Tests.

Homework

Problems 1.1 - 1.9 from Section 2.


quiz 2: Tuesday morning - Thursday midnight , July 11 - 13.

Quiz
To prepare for the quiz, the following homework problems are particularly useful:
Section 1:
1.12 Ford and Tesla
1.15 Working with the Normal Distribution
Section 2:
1.3 The Audit
1.5 USA Mean Return
1.7 CI-for-proportion-of-up-ticks

Additional Reading Stine and Foster: Chapters 15 and 16
openintro: 5.1, 5.2.

Old Tests
2020 midterm: problems 1-5.


3. July 2, 2023:

In the Friday class we got into the first part of the section 2 notes, IID Normal Models.


On Saturday morning, we finished the section 1 notes and looked at the R script Normal-distribution-in-R.R.

Note that in the Excel section of the webpage there is a video on
plotting the normal density in excel.
Don't forget to check out NORMIDIST and NORMINV in the excel functions.

Homework

All problems from Section 1.
Problem 1.18 involved more advanced R so have a look at it but you don't have to do it.

Old tests
07 final: problems 3,4, and 5.
12 final: problems 3,4,5, and 6.
17 midterm: problems, 3,4, and 5.

Additional Optional Reading:
Stine and Foster: Chapters 12 and 14
openintro: 4.1, 5.1

Optional R
Check out the R scripts at the bottom of the Section 1 notes area on the webpage.
Maybe start with the Normal-distribution-in-R.R script.


2. June 24, 2023:

In the Friday class we got to the beginning of continous disrtributions.

On Saturday morning, we got to the mean and variance of the sum of IID random variables (slide 86).

In the homework file ( Section 1 Homework) you should now be able to do the first 12 problems.
Solutions to all problems are provided.

Note that the solutions to the problems are done in R. Checking the details of the R is a great way to get going in R!

Quiz Preparation:

Quiz 1 will be available from June 27 at noon to June 29 at midnight.
Once you start, you will have a one hour to complete it.
Tests are open book, internet, R or whatever software you want to use.
You just can't use a person or chatGPT.

To prepare for the quiz, make sure you do problems 3 and 5 from the Section 1 homework.
So, note that the quiz only covers the basic probability stuff we did in week 1.

Additional Optional Reading:

Stine and Foster: Chapters 9 and 10.
OpenIntro Statistics: section 3.2.


Old Tests

have a look at:
2015 Quiz 2: question 1
2013 Quiz 2: question 2


Optional R
Have look at the second set of notes on R on the webpage, see:
Data in R, Vectors, Lists, and Data Frames
R and Data, html

1. June 17, 2023:

In both the Friday and Saturday class we finished up at sampling with and without replacement in the Section 1 notes.
This is about 6 slides into section 4, More Than Two Random Variables.

We also went into R and played with variables and functions a bit.
Remember, R is optional.

Homework
(remember, you don't have to hand the homework in)

Try problems 1-8 of the Section 1 homework.

Old tests
A good question to try on the old tests would be: 07 final, 3 (a)-(e).
You could also have a look at problem 7 of the 2016 midterm.

Excel

See the video Introduction to statistics in Excel for excel stats basics.
Make sure you:
(i) do tools/excel add ins/Analysis ToolPak.
(ii) can download the file midcity.csv from the data webpage (data to a directory (folder) and then read it into excel.
(iii) see where statistical functions are in /formulas/More functions.
(iv) see where more stats is in data/data analysi.

Optional R

Read through A first look at R, which is also on the webpage.
Then, see if you can pick up in R where we left off by continuing to work through the R code in
simple-data-analysis.R.

Have a look at the second link on R from the webpage under Notes on R, R and Data

Optional Reading:

OpenIntro Statistics: sections 2.1-2.3.
Stine and Foster: Chapters 7, 8, 9.1


See the syllabus for references to these books.


0. June 12, 2023:

There are no pre first class assignments.

You may want to look at the materials on excel and R on the webpage.