Skip to contents

It applies to any data type whereas caret::nearZeroVar() is only for numeric columns.

Usage

rmNZV(df1, minUniPerc = 0.05, minUniCount = 5)

Arguments

df1

a data.frame or matrix

minUniPerc, minUniCount

criteria to remove columns unique values are all the values except the most common value, e.g. 1,4,2 in c(1,2,3,3,4). uniCount and uniPerc are the count and percentage of samples having the unique values failure to match either leads to removal

Value

a cleaned df1; data type is kept even if 0 or 1 column remains

Examples

# none