This is the course website for 326.621A: “Introduction to Data Science” at Seoul National University in Fall 2018. Assignments, lecture notes, and open source code will all be available on this website.
Joong-Ho (Johann) Won
Email: wonj AT stats DOT snu DOT ac DOT kr
Sungdong Lee
Email: clan1212 AT snu DOT ac DOT kr
Class Time: Mondays and Wednesdays, 11:00 AM - 12:15 PM in 25-203
Office Hours: MW 12:15 - 01:15 PM
Textbook: This course has no required textbook.
References:
This course introduces computing skills and software tools for handling potentially big data. By the end of this course, you will have acquired hands-on experience with
The course will be graded based on the following components:
The following schedule is tentative and subject to change during the semester.
| Week | Topic | Assignment | Due Date |
|---|---|---|---|
| 1 (9/3, 9/5) | Introduction, Unix Basics I | ||
| 2 (9/10, 9/12) | No class – will make up if necessary | Homework 1 | 9/30/2018 |
| 3 (9/17, 9/19) | Unix Basics II, Git and Github, R Markdown | ||
| 4 (9/24, 9/26) | Chusok recess | ||
| 5 (10/1, 10/3) | R Markdown, Reproducible Research | ||
| 6 (10/8, 10/10) | More Git | Homework 2 | 10/28/2018 |
| 7 (10/15, 10/17) | Data Visualization | ||
| 8 (10/22, 10/24) | Data Transformation | ||
| 9 (10/29, 10/31) | Tidy Data, Strings, Databases part 1 | Homework 3 | 11/25/2018 |
| 10 (11/5, 11/7) | Databases part 2 | ||
| 11 (11/12, 11/14) | Cloud Computing, Distributed Data Analysis part 1 | ||
| 12 (11/19, 11/21) | Distributed Data Analysis part 2 | Homework 4 | |
| 13 (11/26, 11/28) | Distributed Data Analysis tips, Docker | ||
| 14 (12/3, 12/5) | Advanced R Programming part 1, part 2 | ||
| 15 (12/10, 12/12) |