Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.clickml.app/llms.txt

Use this file to discover all available pages before exploring further.

The EDA component runs statistical analyses on your DataFrame and returns charts and summaries you can explore in the canvas. Use it to understand your data’s structure, spot problems, and decide what preprocessing to apply.

Configuration

OptionDescription
Analysis TypeThe analysis to run (see below).
FeaturesColumns to include in the analysis (used by Distribution and Outlier Detection).

Analysis types

TypeWhat you get
Correlation MatrixHeatmap of Pearson correlations between all numerical features. Shows the top 10 most correlated pairs.
Missing ValuesBar chart of missing value counts and percentages per column. Flags completely missing columns.
StatisticsDescriptive statistics table: count, mean, std, min, quartiles, max, skewness, kurtosis for numerical columns; top value, frequency, and unique count for categorical columns.
DistributionHistogram with summary stats (mean, median, std, skewness) for each selected feature.
Outlier DetectionBox-plot style outlier summary using the IQR method (1.5× IQR rule). Shows outlier count and percentage per feature.

Input / Output

Type
InputDataFrame
OutputAnalytics data (displayed in the canvas — not passed downstream)
Run EDA right after loading your data to get a quick overview before deciding which preprocessing components to add.