en:learning:schools:s01:worksheets:ba-ws-07-1

This worksheet summarizes descriptive statistic functions which can be used to describe the properties of a data set. After completing this worksheet you should know how to use basic descriptive statistic functions and produce box whisker plots.

R Studio — we recommend to use R Studio for (interactive) programming with R. You can download R Studio from the official web page.

- your script and data from W06-1 Leave-one-out validation

This time, the following analysis is build on top of your script from W06-1. Please copy your script “W06-1.R”, rename the copy to “W07-1.R” and use it for the programming tasks of this worksheet.

Please figure out the functions for some general descriptive statistics: mean, median, standard deviation, maximum and minimum. Let's try those on the observed animal activity data set.

Let's compare the descriptive values with the predicted values of the animal activity from W06-1.

While those values give us a good impression of the differences between the observed and predicted animal activity data set, a visualization might be more intuitive.

Please visualize the data set properties of the observed and predicted animal activity using a boxplot (e.g. function boxplot()).

Please identify the outliers by computing a scatter plot first and then starting the identify() function. Now you can click on a point of the plot to get the meta information for it.

To terminate the identify() function, just hit Escape

en/learning/schools/s01/worksheets/ba-ws-07-1.txt · Last modified: 2015/09/22 16:22 (external edit)

Except where otherwise noted, content on this wiki is licensed under the following license: CC Attribution-Noncommercial-Share Alike 4.0 International