Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.clickml.app/llms.txt

Use this file to discover all available pages before exploring further.

The Missing Values component extends basic cleaning with more imputation strategies, including KNN-based imputation and interpolation.

Configuration

OptionDescription
ColumnsColumns to apply the strategy to. Supports All Numerical Features and All Categorical Features shortcuts. Leave empty to apply to all columns.
StrategyImputation method (see table below).
Fill ValueThe constant value to use when Strategy is Fill Constant.
N NeighborsNumber of neighbors for KNN Imputation (default: 5).

Strategies

StrategyDescription
Drop RowsRemove rows with any missing value in selected columns
Drop ColumnsRemove the selected columns
Fill ConstantFill missing values with a fixed value
Fill MeanFill with column mean
Fill MedianFill with column median
Fill ModeFill with most frequent value
Fill ForwardPropagate the last valid value forward
Fill BackwardPropagate the next valid value backward
Interpolate LinearLinear interpolation between known values (numerical only)
KNN ImputationImpute using K-nearest neighbors based on other features (numerical only)

Input / Output

Type
InputDataFrame
OutputDataFrame
KNN Imputation produces the most accurate results but is slower on large datasets. For quick fixes, Fill Mean or Fill Median is usually sufficient.