Description
Generates a list of all distinct values in the specified <table>.<column>
column starting from <start>
days to <end>
days. When used in the target field, the time period should be positive (in the future). When used in a temporal filter field, the time period should be negative (in the past).
This operator can be applied to columns of any type.
For an optimal user experience, Kumo recommends limiting the use of LIST_DISTINCT
to cases where the number of possible return values is less than 1000.
Example
LIST_DISTINCT(LOAN.AMOUNT, 0, 30)
Training Considerations
- Kumo runs your pQuery over many historic slices of data and across all entities that meet your filter criteria to generate training examples.
- If for a given entity and historic data slice there are no fact rows to aggregate over, this training example will be dropped. Thus, the operator makes the implicit assumption that for every entity a prediction is generated for, that entity has at least one fact row in the prediction horizon.
Updated 11 months ago