Inconsistencies of Fundamental Summary Functions

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

Inconsistencies of Fundamental Summary Functions

Dario Strbenac-2
Good day,

Functions such as min and max work on a data frame of numbers, but other functions, like median do not. For example,

database <- data.frame(Name = LETTERS[1:3], Amount = 1:3, Price = 4:6)
> min(database[, 2:3])
[1] 1
> mean(database[, 2:3])
[1] NA
> median(database[, 2:3])
Error in median.default(database[, 2:3]) : need numeric data

This observation was made when teaching university students in the first year of their degrees.  Some functions require coercion by using as.matrix but others don't. Surely these simple commands should be consistent.

Also, the documentation for min and max is incomplete. It states "‘Numeric’ arguments are vectors of type integer and numeric, and logical." It should also mention that it works on data frames, although not when data frames are represented as lists, which can easily be coerced back to data frames.

> min(as.list(database[, 2:3]))
Error in min(as.list(database[, 2:3])) :
  invalid 'type' (list) of argument

--------------------------------------
Dario Strbenac
PhD Student
University of Sydney
Camperdown NSW 2050
Australia
______________________________________________
[hidden email] mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel