Predict

Trend, seasonality, and noise estimation for metric streams.

predict - reducer

Predict is a subgraph which consumes a metric stream and emits a prediction stream based on estimated trend, seasonality, and a smoothed version of the input. As any sub, it can only be used in stream context.

Prediction errors (the difference between predict's output and the input) can signal anomalies in the stream.

Usage

... | predict [-field field] [-over duration] [-every duration] [-pct percentile] [-nonneg true/false] [-detrend true/false] [-deseason true/false] [-denoise true/false]

Option	Description	Required?
`-field`	name of metric field to predict	No, default: `value`
`-over`	period: the length of one repeating cycle in the input metric.	No, default: `:1 week:`
`-every`	interval between emitted prediction points	No, default: is based on period
`-pct`	percentile to retain during initial reduction to every-spaced values	No, default: `0.5`
`-nonneg`	do not allow predictions to become negative.	No, default: `true`
`-detrend`	if false, do not remove estimated trend	No, default: `true`
`-deseason`	if false, do not remove estimated cyclic effect	No, default: `true`
`-denoise`	if false, do not smooth the detrended/deseasoned value	No, default: `true`

predict is intended to be used with a historic read or superquery (combination of historic and live data stream) that includes enough history to initialize its estimators. predict can begin emitting prediction points almost immediately, though the early points will simply be de-meaned. After 2 periods have gone by, trend and seasonality (at the resolution of -every) will switch on, and after 3 periods have passed all estimators have full windows of data.

Output

Each output point contains the following fields:

Field	Description
fieldname (the value of the `-field`option)	average value of field over `-every`interval
`T`	portion of field predicted by the trend estimator
`S`	portion of field predicted by seasonality
`Y`	portion of field predicted by smoothing
`P`	predicted value of field, `T + S + Y`
`E`	prediction error, `P`- field
`Z`	normalized error based on the sample `stdev`over trailing periods

Example: using predict to predict the next value of Math.sin(x) function

// Example showing how the `predict` sub can be used to predict the output of a
// `Math.sin` function over time. Looking closely at the output you'll see
// how the `E` (error) value gets quite small after about 2 cycles through,
// which matches the natural cyclical behavior of the `sin(x)` function.
// Once the * `predict` sub has seen "enough" cycles and figured out
// the seasonality it will start to produce more accurate predictions.

import "predict" as predict;

emit -from :1 minute ago: -to :now: -every :1s:
| put x=count(), value = Math.sin(50*x)
// noneg=false, because sin(x) obviously has negative values
| predict.predict -over :1s: -every :1s: -nonneg false
| keep time, name, value, P, E
| view table -columnOrder 'time', 'name', 'value', 'P', 'E'