match and %in% return values similar to the base R generic
functions.
h2o.match(x, table, nomatch = NA_integer_, start_index = 1)
match.H2OFrame(x, table, nomatch = NA_integer_, start_index = 1)
x %in% tablea vector from an H2OFrame object with values to be matched.
an R object to match x against. Duplicates are ignored. The index of the first occurrence
will be used.
the value to be returned in the case when no match is found. Numeric value or NaN, default is NaN.
index from which this starts the indexing of the table list, numeric value >=0, default is 1.
Returns a new H2OFrame containing a vector where the index of value from the table is returned
if the value matches; returns nomatch value otherwise.
match for base R implementation.
if (FALSE) { # \dontrun{
h2o.init()
data <- as.h2o(iris)
match_col <- h2o.match(data$Species, c("setosa", "versicolor", "setosa"))
iris_match <- h2o.cbind(data, match_col)
sample <- h2o.splitFrame(iris_match, ratios=0.05, seed=1)[[1]]
sample
# Sepal.Length Sepal.Width Petal.Length Petal.Width Species C1
#1 5.2 3.5 1.5 0.2 setosa 1
#2 5.0 3.5 1.3 0.3 setosa 1
#3 7.0 3.2 4.7 1.4 versicolor 2
#4 4.9 2.4 3.3 1.0 versicolor 2
#5 5.5 2.4 3.8 1.1 versicolor 2
#6 5.8 2.7 5.1 1.9 virginica NaN
} # }