This example reads data from a stream, sorts it alphabetically using an OrderBy plugin, and then uses systematic sampling to sample the input records, based on the sample size and oversampling percentage given in the inputs below:

| Property                 | Value      |
|--------------------------|------------|
| Sample Size              | 3          |
| Sampling Type            | Systematic |
| Over Sampling Percentage | 20         |
| Total Records            | 10         |

If the aggregator receives the following input records:

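| id | name   | salary | occupation |
|----|--------|--------|------------|
| 1  | John   | 1000   | Artist     |
| 2  | Kelly  | 2000   | Singer     |
| 3  | Kiara  | 3000   | Scientist  |
| 4  | Phoebe | 2500   | Farmer     |
| 5  | Mike   | 4000   | Baker      |
| 6  | Avril  | 4300   | Banker     |
| 7  | Miley  | 8700   | Actress    |
| 8  | Katy   | 6500   | Chef       |
| 9  | Seth   | 2300   | Miner      |
| 10 | Ben    | 9800   | Director   |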

After applying systematic sampling, the plugin emits 4 randomly selected records, based on the sample size and oversampling percentage provided in the inputs (a sample size of 3 increased by 20% gives 3.6, which rounds to 4).
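
The following is a minimal Python sketch of how systematic sampling with oversampling could work on the records above. It is not the plugin's actual implementation: the function name `systematic_sample`, its parameters, the round-up rule for the oversampled size, and the fractional-interval selection are all assumptions made for illustration.

```python
import random

# Input records from the example table above.
RECORDS = [
    {"id": 1, "name": "John", "salary": 1000, "occupation": "Artist"},
    {"id": 2, "name": "Kelly", "salary": 2000, "occupation": "Singer"},
    {"id": 3, "name": "Kiara", "salary": 3000, "occupation": "Scientist"},
    {"id": 4, "name": "Phoebe", "salary": 2500, "occupation": "Farmer"},
    {"id": 5, "name": "Mike", "salary": 4000, "occupation": "Baker"},
    {"id": 6, "name": "Avril", "salary": 4300, "occupation": "Banker"},
    {"id": 7, "name": "Miley", "salary": 8700, "occupation": "Actress"},
    {"id": 8, "name": "Katy", "salary": 6500, "occupation": "Chef"},
    {"id": 9, "name": "Seth", "salary": 2300, "occupation": "Miner"},
    {"id": 10, "name": "Ben", "salary": 9800, "occupation": "Director"},
]


def systematic_sample(records, sample_size, oversampling_pct, rng=None):
    """Hypothetical helper: pick evenly spaced records from a random start."""
    rng = rng or random.Random()
    total = len(records)

    # Assumption: oversampling inflates the requested size, rounded up.
    # Integer arithmetic avoids floating-point surprises in the rounding.
    target = (sample_size * (100 + oversampling_pct) + 99) // 100
    target = min(target, total)  # cannot emit more records than exist
    if target <= 0:
        return []

    # Fractional sampling interval, e.g. 10 / 4 = 2.5 for the example inputs.
    interval = total / target
    # Random starting offset within the first interval.
    start = rng.random() * interval

    # Take one record per interval; clamp the index as a safety guard.
    indices = [min(int(start + i * interval), total - 1) for i in range(target)]
    return [records[i] for i in indices]


if __name__ == "__main__":
    for record in systematic_sample(RECORDS, sample_size=3, oversampling_pct=20):
        print(record)
```

With the example inputs (sample size 3, oversampling 20%, 10 records), this sketch returns 4 records. Which 4 are chosen depends on the random starting offset, so repeated runs can differ.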