326_621a-2018fall

326.621A Introduction to Data Science @ SNU

This is the course website for 326.621A: “Introduction to Data Science” at Seoul National University in Fall 2018. Assignments, lecture notes, and open source code will all be available on this website.

Announcements

Instructor

Joong-Ho (Johann) Won

Email: wonj AT stats DOT snu DOT ac DOT kr

TA

Sungdong Lee

** Email ** clan1212 AT snu DOT ac DOT kr

Class Time: Mondays Wednesdays 11:00 AM - 12:15 PM in 25-203

Office Hours: MW 12:15 - 01:15 PM

Textbook: This course has no required textbook.

References:

Course Objectives

This course introduces some computing skills and software tools for handling potentially big data. By the end of this course, you will be able to acquire hands-on experience with

Course Overview

Assessment

The course will be graded based on the following components:

Late policy

Schedule

The following schedule is tentative, and is subject to change over the course.

Week Topic Assignment Due Date
1 (9/3, 9/5) Introduction, Unix Basics I    
2 (9/10, 9/12) No class – will makeup if necessary Homework 1 9/30/2018
3 (9/17, 9/19) Unix Basics II, Git and Github, R Markdown    
4 (9/24, 9/26) Chusok recess    
5 (10/1, 10/3) R Markdown, Reproducible Research    
6 (10/8, 10/10) More Git Homework 2 10/28/2018
7 (10/15, 10/17) Data Visualization    
8 (10/22, 10/24) Data Transformation    
9 (10/29, 10/31) Tidy Data, Strings, Databases part1 Homework 3 11/25/2018
10 (11/5, 11/7) Databases part2    
11 (11/12, 11/14) Cloud Computing, Distributed Data Analysis part 1    
12 (11/19, 11/21) Distributed Data Analysis part 2 Homework 4 12/09/2018 12/16/2018
13 (11/26, 11/28) Distributed Data Analysis tips, Docker    
14 (12/3, 12/5) Advanced R Programming part 1, part 2    
15 (12/10, 12/12) Shiny