Digging the Digital Era: A Data Science Primer
QAC 211
Spring 2017
| Section:
01
|
This course may be repeated for credit. |
Certificates: Applied Data Science |
The course introduces students to the practice of what has come to be known as data science. Using a multidisciplinary approach and data from a variety of sources that cover any aspect of everyday life--from credit card transactions to social media interactions and Web searches--data scientists try to analyze and predict events and behavior. The first part of the course defines the area and introduces basic concepts, tools, and emerging applications. We describe how big data analysis affects both business practices and public policy and discuss applications in different areas/disciplines. We also discuss the ethical, legal, and privacy dimensions of big-data analysis. In part two of the course, we work on data acquisition and management and introduce appropriate programming and data-management tools. In part three, we concentrate on basic analytical and visualization techniques as we explore and understand the emerging patterns. Using a learning-by-doing approach in a computing laboratory, students will learn how to write computer programs in R--programming in R is a significant part of the course work--to access, organize, and analyze data through a series of small projects designed to illustrate the application of the techniques we develop for a variety of data sets and situations. Students will also engage in a semester-long project where they will access and use data from social media (Twitter) to address their own research questions. |
Credit: 1 |
Gen Ed Area Dept:
NSM QAC, SBS QAC |
Course Format: Laboratory Course | Grading Mode: Graded |
Level: UGRD |
Prerequisites: None |
|
Fulfills a Requirement for: (CADS)(DATA-MN)(HRAD-MN)(PSYC) |
|
Past Enrollment Probability: 50% - 74% |
SECTION 01 | Special Attributes: CQC |
Major Readings: Wesleyan RJ Julia Bookstore
Textbook:
Pathak, Manas, Beginning Data Science with R, Springer, 2014, available online through Wesleyan library: http://link.springer.com/book/10.1007/978-3-319-12066-9
Lovelace, Robin and James Cheshire, Introduction to visualising spatial data in R, online: https://cran.r-project.org/doc/contrib/intro-spatial-rl.pdf
Articles:
Spasojevic, Nemanja et al., When-to-post on Social Networks, ArXiv e-prints, http://adsabs.harvard.edu/abs/2015arXiv150602089S
Eichstaedt, Johannes et al., Psychological Language on Twitter Predicts County-Level Heart Disease Mortality, Psychological Science, Vol. 26(2), 2015, http://pss.sagepub.com/content/26/2/159.abstract
Nofer, Michael, The Value of Social Media for Predicting Stock Returns: Preconditions, Instruments and Performance Analysis, Springer, 2015,
Mayer-Schoenberger, Viktor and Kenneth Cukier, The Rise of Big Data: How It's Changing the Way We Think About the World, Foreign Affairs, Vol. 92(3), 2013
Tene, Omar and Jules Polonetsky, Big Data for All: Privacy and User Control in the Age of Analytics, Northwestern Journal of Technology and Intellectual Property, Vol. 11(5), April 2013
|
Examinations and Assignments: Several homework assignments, two take-home midterms, and a course project Presentation &report). Part of the grade depends on in-class participation/preparedness. |
Instructor(s): Oleinikov,Pavel V Times: ..T.R.. 01:20PM-02:40PM; Location: ALLB204; |
Total Enrollment Limit: 26 | | SR major: 0 | JR major: 0 |   |   |
Seats Available: 1 | GRAD: X | SR non-major: 6 | JR non-major: 7 | SO: 7 | FR: 6 |
Drop/Add Enrollment Requests | | | | | |
Total Submitted Requests: 1 | 1st Ranked: 1 | 2nd Ranked: 0 | 3rd Ranked: 0 | 4th Ranked: 0 | Unranked: 0 |
|
|