Calculate basic summary statistics by a grouping variable. This function is inspired by the
psych::describeBy() function but has been implemented to always return a data frame using
the tidyverse. The funs parameter provides additional flexibility regarding what summary
statistics are calculated.
Value
a data frame with a column group corresponding to the group variable and column
variable corresponding to the names of the data frame. Additional columns for each
function in the funs parameter.
Examples
data(pisa2015, package = 'clav')
describe_by(df = pisa2015[,c('science_score', 'efficacy', 'motivation')], group = pisa2015$country)
#>           group      variable     n       mean         sd          se   median
#> 1 UNITED STATES      efficacy  4637   1.870309  0.6662445 0.009783969   2.0000
#> 2 UNITED STATES    motivation  4637   1.926785  0.7561965 0.011104937   2.0000
#> 3 UNITED STATES science_score  4637 506.398045 91.8862771 1.349373286 504.8727
#> 4        CANADA      efficacy 15934   1.875698  0.6669602 0.005283692   2.0000
#> 5        CANADA    motivation 15934   2.046865  0.7794135 0.006174553   2.0000
#> 6        CANADA science_score 15934 526.652038 84.6720478 0.670776218 529.0316
#>        min      max    range
#> 1   0.0000   3.0000   3.0000
#> 2   0.0000   3.0000   3.0000
#> 3 234.1381 817.7677 583.6296
#> 4   0.0000   3.0000   3.0000
#> 5   0.0000   3.0000   3.0000
#> 6 225.5391 795.2875 569.7484