Many of you have heard of studies that analyzed Twitter messages and predicted some phenomena--spread of flu in New York, consumer confidence index, and so on. Behind the success of these studies are the systems for data storage and retrieval. A regular user can access only the latest nine days of tweets. Any study that aspires to analyze longer periods has to deal with the issues of storing the observations and retrieving them later for analysis. The goal of this course is to show you how to do that--how to connect to various types of databases and how to retrieve and update your data. We will start with relational databases; learn SQL, the language used to query and update the data; and explore the latest developments in the database field--Hadoop and MapReduce.
||Gen Ed Area Dept:
NSM QAC, SBS QAC|
|Course Format: Laboratory Course||Grading Mode: Student Option|
||Fulfills a Major Requirement for: (CIM)
||Past Enrollment Probability: 75% - 89%
|SECTION 01 - 2nd Quarter|
|Special Attributes: CQC|
|Major Readings: Wesleyan RJ Julia Bookstore
Darmawikarta, Djoni, SQL FOR MYSQL : A BEGINNER'S TUTORIAL (accessible online through the Wesleyan library)
Reference Manuals, on-line custom materials and tutorials.
|Examination and Assignments: |
Weekly homeworks and a final project
|Additional Requirements and/or Comments: |
While there are no formal prerequisites for the course, you may find the course rather challenging if you have never worked with data using statistical analysis software, Excel etc.
Please Note: The course meets the second half of the semester while QAC 151 meets at the same time during the first half of the term.
|Instructor(s): Oleinikov,Pavel V Times: .M..... 07:10PM-10:00PM; Location: TBA|
|Total Enrollment Limit: 19||SR major: 0||JR major: 0|| || |
|Seats Available: 4||GRAD: X||SR non-major: 5||JR non-major: 5||SO: 5||FR: 4|