The “group_by” Function in R
Package: dplyr
Purpose: Groups a data frame by one or more variables.
General Class: Data Manipulation
Required Argument(s):
.data: A data frame or tibble.
...: Grouping variables (column names or expressions).
Notable Optional Arguments:
None specific to group_by, but it is commonly used in conjunction with other functions from dplyr like summarize, mutate, etc.
Example:
# Example data for grouping a data frame using dplyr
library(dplyr)
df <- data.frame(
ID = c(1, 2, 3, 4, 5, 6),
Category = c("A", "B", "A", "B", "A", "B"),
Value = c(10, 15, 20, 25, 30, 35)
)
# Group data frame by the "Category" column
grouped_df <- group_by(df, Category)
# Display the grouped data frame
print(grouped_df)In this example, the group_by function from the dplyr package is used to group a data frame (df) by the values in the “Category” column. The result is a grouped data frame (grouped_df). Note that you need to have the dplyr package installed and loaded to use the group_by function.