Many of you heard of studies that analyzed Twitter messages and predicted some phenomena--spread of flu in New York, consumer confidence index, etc. Behind the success of these studies are the systems for data storage and retrieval. A regular user can access only the latest 9 days of tweets. Any study that aspires to analyze longer periods has to deal with the issues of storing the observations and retrieving them later for analysis. The goal of this course is to show you how to do that--how to connect to various types of databases and how to retrieve and update your data. We will start with relational databases; learn SQL, the language used to query and update the data; and will explore the latest developments in the database field--Hadoop and MapReduce.
||Gen Ed Area Dept:
NSM QAC, SBS QAC|
|Course Format: Laboratory Course||Grading Mode: Student Option|
||Fulfills a Major Requirement for: (INFO-MN)
||Past Enrollment Probability: Not Available
|SECTION 01 - 2nd Quarter|
|Special Attributes: CQC|
|Major Readings: Wesleyan RJ Julia Bookstore
Darmawikarta, Djoni, SQL FOR MYSQL : A BEGINNER'S TUTORIAL (accessible online through the Wesleyan library)
Reference Manuals, on-line custom materials and tutorials.
|Examinations and Assignments: |
Weekly homeworks and a final project
|Additional Requirements and/or Comments: |
While there are no formal prerequisites for the course, you may find the course rather challenging if you have never worked with data using statistical analysis software, Excel etc.
|Instructor(s): Oleinikov,Pavel V Times: ..T.... 07:00PM-09:50PM; Location: ALLB204; |
|Total Enrollment Limit: 24||SR major: 0||JR major: 0|| || |
|Seats Available: 11||GRAD: 2||SR non-major: 5||JR non-major: 6||SO: 6||FR: 5|
|Drop/Add Enrollment Requests|
|Total Submitted Requests: 0||1st Ranked: 0||2nd Ranked: 0||3rd Ranked: 0||4th Ranked: 0||Unranked: 0|