dplyr Aggregation

Remember to

Load surveys.csv data into surveys

group_by(surveys, species_id)

Different looking kind of data.frame
- Source, grouping, and data type information

surveys_by_species <- group_by(surveys, species_id)

summarize(surveys_by_species, abundance = n())

surveys_by_species_plot <- group_by(surveys, species_id, plot_id)
summarize(surveys_by_species, abundance = n())

species_weight <- summarize(surveys_by_species_plot, avg_weight = mean(weight))

Open table
Why did we get NA?
- mean(weight) returns NA when weight has missing values (NA)
Can fix using mean(weight, na.rm = TRUE)

species_weight <- summarize(surveys_by_species,
                            avg_weight = mean(weight, na.rm = TRUE))

na.omit(surveys_weight)

Do Shrub Volume Aggregation.

Data Science Skills in R