This is the course website for 326.621A: “Introduction to Data Science” at Seoul National University in Fall 2018. Assignments, lecture notes, and open source code will all be available on this website.
Joong-Ho (Johann) Won
Email: wonj AT stats DOT snu DOT ac DOT kr
Sungdong Lee
** Email ** clan1212 AT snu DOT ac DOT kr
Class Time: Mondays Wednesdays 11:00 AM - 12:15 PM in 25-203
Office Hours: MW 12:15 - 01:15 PM
Textbook: This course has no required textbook.
References:
This course introduces some computing skills and software tools for handling potentially big data. By the end of this course, you will be able to acquire hands-on experience with
The course will be graded based on the following components:
The following schedule is tentative, and is subject to change over the course.
Week | Topic | Assignment | Due Date |
---|---|---|---|
1 (9/3, 9/5) | Introduction, Unix Basics I | ||
2 (9/10, 9/12) | No class – will makeup if necessary | Homework 1 | 9/30/2018 |
3 (9/17, 9/19) | Unix Basics II, Git and Github, R Markdown | ||
4 (9/24, 9/26) | Chusok recess | ||
5 (10/1, 10/3) | R Markdown, Reproducible Research | ||
6 (10/8, 10/10) | More Git | Homework 2 | 10/28/2018 |
7 (10/15, 10/17) | Data Visualization | ||
8 (10/22, 10/24) | Data Transformation | ||
9 (10/29, 10/31) | Tidy Data, Strings, Databases part1 | Homework 3 | 11/25/2018 |
10 (11/5, 11/7) | Databases part2 | ||
11 (11/12, 11/14) | Cloud Computing, Distributed Data Analysis part 1 | ||
12 (11/19, 11/21) | Distributed Data Analysis part 2 | Homework 4 | |
13 (11/26, 11/28) | Distributed Data Analysis tips, Docker | ||
14 (12/3, 12/5) | Advanced R Programming part 1, part 2 | ||
15 (12/10, 12/12) |