The “h2o.importFile” Function in R

  • Package: h2o

  • Purpose: To import a file into H2O’s distributed key-value store.

  • General Class: Data Import

  • Required Argument(s):

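    • path: Character. The path or URL of the file to be imported. The path is resolved by the H2O cluster, not by the local R session, so the file must be reachable from the machine(s) running H2O.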

  • Notable Optional Arguments:

    • destination_frame: Character. The key (ID) under which the imported frame is stored in H2O. If not specified, H2O generates a key automatically.

    • parse: Logical. If TRUE, parse the data into an H2OFrame. The default is TRUE.

    • header: Logical. Whether the first line of the file contains column names. The default is NA, which means autodetect.

    • sep: Character. The field separator character. The default is "" (an empty string), which means the parser detects the separator automatically.

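    • col.types: Vector or list. Column types (for example "numeric", "enum", or "string") to force during parsing instead of relying on H2O's type detection. The default is NULL.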

    • na.strings: Character vector. Strings to interpret as missing values. The default is NULL.

  • Example (with Explanation):

    # Load necessary package
    library(h2o)

    # Initialize H2O
    h2o.init()

    # Import a CSV file into the H2O cluster
    data <- h2o.importFile(path = "data.csv")

    # Display a summary of the imported data
    summary(data)

  • In this example, the h2o.importFile function from the h2o package imports a CSV file named “data.csv” into H2O’s distributed key-value store. By default, the function parses the data into an H2OFrame and autodetects the separator, the column types, and whether the first line is a header row. The summary function then displays per-column summary statistics for the imported frame. h2o.importFile is commonly used to load large datasets into H2O for distributed data analysis and machine learning tasks in R.
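
The optional arguments listed above can override H2O's autodetection. The following is a minimal sketch rather than a definitive recipe: the file name, its contents, and the frame key "importfile_demo" are made up for illustration, and it assumes the H2O cluster runs on the same machine as the R session so that a path under tempdir() is visible to it.

```r
library(h2o)

# Start (or connect to) a local H2O cluster
h2o.init()

# Write a small semicolon-separated file so the example is self-contained.
# The file name and contents are hypothetical.
csv_path <- file.path(tempdir(), "importfile_demo.csv")
writeLines(
  c("id;score;group",
    "1;3.5;a",
    "2;missing;b",
    "3;4.25;a"),
  csv_path
)

# Import with explicit parsing options instead of relying on autodetection
demo <- h2o.importFile(
  path = csv_path,
  destination_frame = "importfile_demo",       # hypothetical key name
  header = TRUE,                               # first line holds column names
  sep = ";",                                   # fields are semicolon-separated
  col.types = c("numeric", "numeric", "enum"), # force one type per column
  na.strings = c("missing")                    # treat "missing" as NA
)

# Inspect the dimensions and per-column types of the imported frame
dim(demo)
h2o.describe(demo)
```

Setting these arguments explicitly is mainly useful when autodetection guesses wrong, for example when a numeric identifier should be treated as a categorical ("enum") column.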
