Stat 470/670: Exploratory Data Analysis

Logistics

Meetings: Tuesdays and Thursdays, 4-5:15pm, Fine Arts 102

Website: http://jfukuyama.github.io/teaching/stat670

Instructor: Prof. Julia Fukuyama             jfukuyam at iu dot edu
Office hours: Wednesday 1-3pm             Office: Informatics East, Room 201

Associate Instructor: Ms. Fatma Parlak             fparlak at iu dot edu
Office hours: Thursday 9-11am             Office: Informatics East, Room 103

Course Overview

Graphical and modeling techniques for exploring data, with an emphasis on visualization, interpretation, and clear communication of findings. Use of modern software tools for data manipulation and visualization. Connections to traditional statistical methods.

Textbooks

The primary textbook for the course will be Cleveland’s Visualizing Data.

Also useful will be Hadley Wickham’s ggplot2: Elegant Graphics for Data Analysis, available for free through the IU library as an ebook, and R for Data Science by Wickham and Grolemund, available online.

Readings and notes for topics not covered in the textbooks will be posted to the course website and to canvas.

Class Structure

Classes will be a combination of lecture and tool demonstration. It will generally be helpful for you to bring a laptop with R installed so you can follow along. Slides or notes, with R code, will be posted to the class website the day before each lecture.

Assessment

Grades will be assigned based on:

There will be no final exam; the last responsibility for the course will be the report for the final project due December 7.

All the assignments will be graded on how well the material is presented in addition to accuracy. This means there should be no extraneous material, plots should be readable, and text and figures should be formatted nicely.