Skip to main content
The Encoding component transforms categorical text columns into numbers so ML models can work with them.

Configuration

OptionDescription
MethodEncoding strategy (see table below).
FeaturesCategorical columns to encode.

Methods

MethodHow it worksBest for
Ordinal EncodingAssigns an integer (0, 1, 2, …) to each category. Preserves order if categories have a natural ranking.Ordinal categories (e.g. Low / Medium / High)
One Hot EncodingCreates a new binary column for each unique category value.Nominal categories with no inherent order (e.g. Color, City)

Input / Output

Type
InputDataFrame
OutputDataFrame
One Hot Encoding expands the number of columns. For columns with many unique values (high cardinality), this can make the DataFrame very wide. Consider Feature Selection afterward.