emmi.pipeline.sample_processors.drop_outliers

Classes

DropOutliersSampleProcessor

Drops all outliers from fields in a batch.

Module Contents

class emmi.pipeline.sample_processors.drop_outliers.DropOutliersSampleProcessor(item, affected_items=None, min_value=None, max_value=None, min_quantile=None, max_quantile=None)

Bases: ksuit.data.pipeline.SampleProcessor

Drops all outliers from fields in a batch.

Parameters:
  • item (str) – The item to drop outliers from.

  • affected_items (set[str] | None) – Set of item keys that are also affected by outlier removal. Defaults to None.

  • min_value (float | None) – Drop outliers below min_value. Defaults to None.

  • max_value (float | None) – Drop outliers above max_value. Defaults to None.

  • min_quantile (float | None) – Drop outliers in or below the min_quantile quantile. Defaults to None.

  • max_quantile (float | None) – Drop outliers in or above the max_quantile quantile. Defaults to None.

item
affected_items = None
min_value = None
max_value = None
min_quantile = None
max_quantile = None
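
The following is a minimal, library-independent sketch of the kind of filtering the parameters above describe. It is not the emmi implementation: the function drop_outliers, the helper _quantile, the use of plain Python lists, and the sample keys are all hypothetical. It also assumes that values exactly on a bound are kept and that quantiles are computed by linear interpolation, neither of which this page specifies.

```python
from __future__ import annotations


def _quantile(values, q):
    """Hypothetical helper: linear-interpolation quantile, q in [0, 1]."""
    ordered = sorted(values)
    pos = q * (len(ordered) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)


def drop_outliers(sample, item, affected_items=None, min_value=None,
                  max_value=None, min_quantile=None, max_quantile=None):
    """Illustrative stand-in mirroring the documented parameter names."""
    values = sample[item]

    # Combine the absolute and quantile-based limits into one interval.
    lower, upper = float("-inf"), float("inf")
    if min_value is not None:
        lower = max(lower, min_value)
    if min_quantile is not None:
        lower = max(lower, _quantile(values, min_quantile))
    if max_value is not None:
        upper = min(upper, max_value)
    if max_quantile is not None:
        upper = min(upper, _quantile(values, max_quantile))

    # Assumption: entries exactly on a bound are kept.
    keep = [lower <= v <= upper for v in values]

    # Apply the same mask to `item` and to every affected item,
    # leaving all other entries of the sample untouched.
    out = dict(sample)
    for key in {item} | set(affected_items or ()):
        out[key] = [x for x, k in zip(sample[key], keep) if k]
    return out


sample = {
    "pressure": [0.0, 1.0, 2.0, 3.0, 100.0],
    "position": ["a", "b", "c", "d", "e"],
    "name": "run-1",
}
filtered = drop_outliers(sample, "pressure",
                         affected_items={"position"}, max_value=10.0)
print(filtered["pressure"])  # [0.0, 1.0, 2.0, 3.0]
print(filtered["position"])  # ['a', 'b', 'c', 'd']
```

Listing "position" in affected_items keeps it aligned with "pressure" after filtering; keys that are not listed, such as "name", pass through unchanged.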