The “randomForest” Function in R

Jul 29

mtry: Number of variables randomly sampled as candidates at each split. With p predictors, the default is floor(√p) for classification and max(floor(p/3), 1) for regression.
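To make the mtry defaults concrete, here is a small sketch in base R. The helper default_mtry is hypothetical (it is not part of the randomForest package); it only mirrors the documented default rule, under the assumption that every column except the response is used as a predictor.

```r
# Hypothetical helper (not part of randomForest): mirrors the documented
# default for mtry given p predictors.
default_mtry <- function(p, classification = TRUE) {
  if (classification) floor(sqrt(p)) else max(floor(p / 3), 1)
}

# iris: 5 columns, so 4 predictors when Species is the response
p_iris <- ncol(iris) - 1
mtry_iris <- default_mtry(p_iris, classification = TRUE)

# mtcars: 11 columns, so 10 predictors when mpg is the response
p_mtcars <- ncol(mtcars) - 1
mtry_mtcars <- default_mtry(p_mtcars, classification = FALSE)

print(c(iris = mtry_iris, mtcars = mtry_mtcars))
```

For iris this gives 2 candidate variables per split, and for a regression on mtcars it gives 3. Passing mtry explicitly to randomForest overrides these defaults.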

Example:
# Load the required library
library(randomForest)

# Load the iris dataset
data(iris)

# Set a seed so the randomly grown forest is reproducible
set.seed(123)

# Fit a random forest model to predict species based on other variables
rf_model <- randomForest(Species ~ ., data = iris, ntree = 100, importance = TRUE)

# Print the model summary
print(rf_model)

# View the importance of each predictor variable
print(importance(rf_model))

In this example, randomForest() builds a model that predicts the species of iris flowers from the four measurement variables in the iris dataset. The ntree argument sets the number of trees to 100 (the default is 500), and importance = TRUE tells the function to compute importance measures for each predictor. Printing the model shows the out-of-bag (OOB) error estimate and the confusion matrix, while importance() returns the mean decrease in accuracy and the mean decrease in Gini impurity for each variable. Random forests are widely used for classification and regression because they scale to large datasets and capture complex interactions between predictors without those interactions being specified in advance.
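As a follow-up, the sketch below shows one way to check such a model on data it has not seen. The 70/30 split, the seed value, and the variable names (train_idx, rf_fit, and so on) are illustrative assumptions, not part of the original example.

```r
library(randomForest)

set.seed(42)  # arbitrary seed, used only for reproducibility

# Hold out 30% of the rows as a test set (the split ratio is an assumption)
train_idx <- sample(nrow(iris), size = round(0.7 * nrow(iris)))
train <- iris[train_idx, ]
test  <- iris[-train_idx, ]

# Fit the forest on the training rows only
rf_fit <- randomForest(Species ~ ., data = train, ntree = 100)

# Out-of-bag error estimate after the final tree
oob_error <- rf_fit$err.rate[rf_fit$ntree, "OOB"]

# Predict on the held-out rows and compare with the true labels
pred <- predict(rf_fit, newdata = test)
accuracy <- mean(pred == test$Species)

print(oob_error)
print(accuracy)
print(table(Predicted = pred, Actual = test$Species))
```

The OOB error is computed from the training data alone, so comparing it with the held-out accuracy gives a rough sanity check that the two estimates agree.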

The “imap” Function in R