Computational Statistics, STP 540, Spring 2025

sgd

Basic Course Information

Final Project is due 5/10/2025, 9am:
Just email me a pdf (or, if you have to, html) of the project.
Make sure all group member names are clearly indicated!!

The final project should just be a 5 to 10 page write-up of what you did.
Explain to me what you did and what simulations and/or data you used.
Then show me some nice plots and tables showing your results.
HAVE FUN!!
Don't include code, except as an appendix.

Class time and place

Tu Th 	12:00 PM - 1:15 PM 	1/9/23  - 4/28/23 	Tempe WXLR A302

Instructor: Robert McCulloch, robert.mcculloch@asu.edu

TA: Enya Kuo, ekuo2@asu.edu
TA office hours:??

How-to-use-Canvas-Discussions.pdf

Syllabus: Syllabus

Some useful books: books

Where we are and what I should be doing?

where and what

R and Python

Information on R

Information on Python

Suggested Projects

Inference for the parameters of a Gaussian Process
See Murphy chapter 15.1 to 15.2.5. Murphy.
See also Rasmussen-and-Williams.pdf.
See also chapter 5 of the book "Surrogates" by Robert Gramacy.

Learning a single layer neural network
   see section 10.7 Fitting a Neural Network in "An Introduction to Statistical Learning", second edition
    by James, Witten, Hastie, and Tibshirani.
   Simple Chain Rule Gradient Computation for a Single Layer
   Single Layer Neural networks, complete notes from Applied Machine Learning

EM algorithm for a mixture of normals

Some old projects:
Comparing the EM algorithm with the Gibbs sampler for univariate normal mixtures
Gaussian Processes
Comparing the EM algorithm with Gibbs for univariate mixtures Monte Carlo EM algorithm

Homework

How_to_Submit_Homework_in_Canvas.pdf

Homework 1
Due February 17.

Homework 2
Due March 10.

Homework 3
logit-funs.R
Due April 4.

Notes

A first look at simple logistic regression

Let's review a basic nonlinear model in statistics: simple logistic regression.
We will write simple code to compute the likelihood.
We will look at the idea of vectorization, which applies in both R and Python.
Later we will go into more details on how the likelihood is optimized.
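
As a minimal sketch of the vectorization idea (not the course scripts linked below), here is the simple logistic regression log-likelihood written twice in Python: once with an explicit loop and once with NumPy array operations. The simulated data, the parameter values 0.5 and 1.5, and the function names loglik_loop and loglik_vec are all made up for illustration.

```python
import math
import numpy as np

def loglik_loop(beta0, beta1, x, y):
    """Log-likelihood of simple logistic regression, one observation at a time."""
    total = 0.0
    for xi, yi in zip(x, y):
        eta = beta0 + beta1 * xi
        # log P(y_i | x_i) = y_i * eta - log(1 + exp(eta))
        total += yi * eta - math.log1p(math.exp(eta))
    return total

def loglik_vec(beta0, beta1, x, y):
    """Same quantity, computed on whole arrays at once."""
    eta = beta0 + beta1 * x
    # np.logaddexp(0, eta) equals log(1 + exp(eta)), computed stably
    return float(np.sum(y * eta - np.logaddexp(0.0, eta)))

# Simulated data (an assumption for this sketch, not the ISLR Default data)
rng = np.random.default_rng(0)
x = rng.normal(size=200)
p = 1.0 / (1.0 + np.exp(-(0.5 + 1.5 * x)))
y = rng.binomial(1, p)

print(loglik_loop(0.5, 1.5, x, y), loglik_vec(0.5, 1.5, x, y))
```

The two functions should agree up to floating point error; the vectorized one pushes the loop down into compiled code.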

Simple vectorized summing in python
jupyter notebook version

Basic notes on Logistic regression:
Simple Logistic Regression Likelihood, script

Simple logit in R and python:
Simple example of logit in R, Rmd
Simple example of logit in Python, notebook
The default data is available at:
ISLR-Default.csv

Scripts to compute the log-likelihood:
R code: Logit Example in R
Python code: Logit Example in Python
Simple Logistic Regression Likelihood, html
Simple Logistic Regression Likelihood, ipynb

Plot logit likelihood using color palettes (e.g. viridis) in R

Advanced R, Wickham, Section 24.5.
".. vectorization means finding the existing R function that is implemented in C
and most closely applies to your problem."

Of course, if you code directly in a lower level language like C++ you get the speed:
Calling C++ out of R using Rcpp, a Makefile, and SHLIB
Calling C++ out of R using Rcpp using rstudio

More detail on Rcpp in rstudio:
tarball of R package: mll1_1.0.tar.gz
steps to make the R package: steps.txt
R script to test: do.R
output from do.R: output-from-do.txt

Files to compare pure C++ with vectorized R: in-cpp.zip

Matrix Decompositions in Statistics

Quick Review of Some Key Ideas in Linear Algebra
   Simple python script to compare sklearn's LinearRegression with (X'X)^{-1} X'y
   Simple R script to compare lm with (X'X)^{-1} X'y
   What Really IS a Matrix Determinant?
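
The comparison of a library fit with (X'X)^{-1} X'y can be sketched roughly as follows. This is not the linked script: it uses np.linalg.lstsq as a stand-in for the library regression routine, and the simulated design matrix and coefficients are assumptions for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 100
# Design matrix with an intercept column and two simulated predictors
X = np.column_stack([np.ones(n), rng.normal(size=(n, 2))])
beta_true = np.array([1.0, 2.0, -0.5])  # hypothetical coefficients
y = X @ beta_true + 0.1 * rng.normal(size=n)

# Textbook formula: solve (X'X) b = X'y rather than forming the inverse
beta_normal = np.linalg.solve(X.T @ X, X.T @ y)

# Library least-squares solver
beta_lstsq, *_ = np.linalg.lstsq(X, y, rcond=None)

print(beta_normal)
print(beta_lstsq)
```

On a well-conditioned problem like this one the two answers should match to many digits; the differences show up when X'X is close to singular.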

QR Matrix Factorization Least Squares and Computation (with R and C++)

The Multivariate Normal and the Choleski and Eigen Decompositions
   Look at cholesky and spectral in R
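
A rough Python counterpart, assuming a made-up mean vector and covariance matrix: both the Cholesky factor and the spectral (eigen) decomposition give a matrix A with A A' = Sigma, and either one turns standard normal draws into draws from N(mu, Sigma).

```python
import numpy as np

rng = np.random.default_rng(2)
mu = np.array([1.0, -1.0])            # hypothetical mean
Sigma = np.array([[2.0, 0.8],
                  [0.8, 1.0]])        # hypothetical covariance

# Cholesky: Sigma = L L' with L lower triangular
L = np.linalg.cholesky(Sigma)

# Spectral: Sigma = P diag(lam) P', so A = P diag(sqrt(lam)) has A A' = Sigma
lam, P = np.linalg.eigh(Sigma)
A = P * np.sqrt(lam)

# Rows of Z are iid N(0, I); each transformed row is N(mu, Sigma)
Z = rng.standard_normal(size=(50000, 2))
X_chol = mu + Z @ L.T
X_eig = mu + Z @ A.T

print(np.cov(X_chol, rowvar=False))
print(np.cov(X_eig, rowvar=False))
```

Both sample covariance matrices should be close to Sigma, even though L and A are different matrices.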

Singular Value Decomposition

simple example of svd in python
simple example of svd in python, html
simple example of svd in python, notebook

do_image-svd-approx.py
image approximation with SVD in R, thanks to Andrew Ritchey.
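
The idea behind the image scripts can be sketched as follows. This is not do_image-svd-approx.py: to stay self-contained it uses a synthetic 60 by 80 array in place of a real image, and the helper name rank_k is made up.

```python
import numpy as np

rng = np.random.default_rng(3)
# Synthetic "image": a smooth pattern plus a little noise
rows = np.linspace(0, 1, 60)[:, None]
cols = np.linspace(0, 1, 80)[None, :]
img = np.sin(4 * rows) * np.cos(6 * cols) + 0.05 * rng.normal(size=(60, 80))

# Thin SVD: img = U diag(s) Vt
U, s, Vt = np.linalg.svd(img, full_matrices=False)

def rank_k(k):
    """Best rank-k approximation in Frobenius norm: keep the k largest singular values."""
    return (U[:, :k] * s[:k]) @ Vt[:k, :]

for k in (1, 5, 20):
    err = np.linalg.norm(img - rank_k(k)) / np.linalg.norm(img)
    print(k, round(err, 4))
```

The relative error drops as k grows, and storing a rank-k approximation takes k(60 + 80 + 1) numbers instead of 60 times 80.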

Optimization

Optimization

See chapters 4 and 8 of "Deep Learning" by Goodfellow, Bengio, and Courville.

Simple notes on single layer neural net: Single Layer

Section 3 recording, logit derivatives: recording
Section 4 recording, Taylor's Theorem: recording
Sections 8 and 9 recording, Momentum and Newton's method: recording

An Overview of Gradient Descent Algorithms
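
As a minimal sketch of plain (full batch) gradient descent, here it is applied to the average negative log-likelihood of simple logistic regression. The simulated data, the step size 0.5, and the iteration count are assumptions chosen for illustration, not values from the notes.

```python
import numpy as np

rng = np.random.default_rng(6)
n = 500
x = rng.normal(size=n)
X = np.column_stack([np.ones(n), x])          # intercept and one predictor
b_sim = np.array([-0.5, 1.0])                 # hypothetical coefficients used to simulate
y = rng.binomial(1, 1.0 / (1.0 + np.exp(-(X @ b_sim))))

def neg_loglik(b):
    """Average negative log-likelihood."""
    eta = X @ b
    return np.sum(np.logaddexp(0.0, eta) - y * eta) / n

def grad(b):
    """Gradient of neg_loglik: X'(p - y) / n."""
    p = 1.0 / (1.0 + np.exp(-(X @ b)))
    return X.T @ (p - y) / n

b = np.zeros(2)
step = 0.5
for it in range(2000):
    b = b - step * grad(b)    # move against the gradient

print(b, neg_loglik(b))
```

Stochastic gradient descent replaces grad(b) with the same formula evaluated on a random subset of the rows at each iteration.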

Mixture Models and the EM Algorithm

The EM Algorithm
   See Chapter 11 of Murphy.
   See Chapter 4 of Givens and Hoeting.
   See Chapter 8.5 of Hastie, Tibshirani, and Friedman.

The Bootstrap

(Efron and Hastie, chapters 10 and 11)
The Bootstrap
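
A minimal sketch of the nonparametric bootstrap, assuming a simulated exponential sample and the sample median as the statistic of interest (both are illustrative choices, not taken from the notes).

```python
import numpy as np

rng = np.random.default_rng(4)
data = rng.exponential(scale=2.0, size=100)   # simulated sample

B = 2000                                      # number of bootstrap resamples
medians = np.empty(B)
for b in range(B):
    # Resample n observations from the data with replacement
    resample = rng.choice(data, size=data.size, replace=True)
    medians[b] = np.median(resample)

se_boot = medians.std(ddof=1)                          # bootstrap standard error
ci_low, ci_high = np.percentile(medians, [2.5, 97.5])  # percentile interval

print(np.median(data), se_boot, (ci_low, ci_high))
```

The same loop works for any statistic: only the np.median call changes.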

Thompson Sampling

Tutorial on Thompson Sampling
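
A minimal sketch of Thompson sampling for a Bernoulli bandit with independent Beta(1, 1) priors. The three success probabilities in true_p and the horizon of 2000 rounds are made up for illustration.

```python
import numpy as np

rng = np.random.default_rng(5)
true_p = np.array([0.2, 0.5, 0.7])   # hypothetical arm success probabilities
alpha = np.ones(3)                   # Beta posterior parameters, starting at Beta(1, 1)
beta = np.ones(3)
pulls = np.zeros(3, dtype=int)

for t in range(2000):
    theta = rng.beta(alpha, beta)             # one posterior draw per arm
    arm = int(np.argmax(theta))               # play the arm whose draw is largest
    reward = int(rng.random() < true_p[arm])  # observe a 0/1 reward
    alpha[arm] += reward                      # conjugate Beta update
    beta[arm] += 1 - reward
    pulls[arm] += 1

print(pulls, alpha / (alpha + beta))
```

Early on the draws are spread out, so every arm gets tried; as the posteriors sharpen, most pulls should go to the arm with the highest success probability.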

BART

Introduction to BART
Old BART Talk
Bayesian Additive Regression Trees, Computational Approaches
chapter in Computational Statistics in Data Science

Short course on BART given at BYU, June 2023