Prerequisite(s): CSIT 528. This course provides fundamental exploratory techniques for summarizing and visualizing data sets. Exploratory Data Analysis (EDA), which usually comes before formal hypothesis testing, can identify interesting patterns and eliminate ideas that are not worth pursuing. The R statistical programming language will be used to learn how to manage data sets, use the plotting system, and apply various clustering methods and dimension reduction techniques for high-dimensional data. Methods for visualizing data sets of one, two, and multiple variables will also be presented with examples.