In this lab, we will be set up a population analysis job to find unusual customers using the eCommerce sample data.
In the case of the population jobs, the analyzed data is split by the distinct values of a field. This field defines what is called a population. The splits are analyzed in the context of all the splits to find unusual values in the population. In other words, the population analysis is a comparison of an individual entity against a collective model of all members in the population as witnessed over time.
Navigate to Machine Learning > Anomaly Detection > Jobs.
Click Create job.
Select the “Kibana Sample Data eCommerce” data view.
4. Select the Population job wizard.
5. Click on Use full data, then click Next.
6. Use "customer_full_name.keyword" as the Population field.
7. Add Metric: use "sum" on the field "taxful_total_price".
8. Use 15m bucket span and click Next.
9. Name the job "lab3a_unusually_big_orders" and place it under “mylabs” group, then click Next.
10. Click Next after passing the job validation.
11. Review the Job Settings and click on Create job to start the ML job.
12. Wait around half a minute while the job completes. Click on the View Results.
13. Customer "Wagdi Shaw" is the highest anomalous customer.
14. Explore Anomalous User - Click on red tile for user "Wagdi Shaw".
Note:
Wagdi Shaw had a purchase of $2250, much more than the typical user purchase of $62.44
(Optional) You could repeat the process of creating custom URL to raw data (as performed in Lab 1d) to see what Wagdi Shaw ordered