Skip to main content

Benford's Law

Synopsis

This operator tests, if columns in your data set are following Benford's Law

Description

Benford's Law states, that for real world applications often the distribution of leading digits of numbers is not uniform. Smaller digits (like 1 or 2) appear more often than larger digits. For more information please see: https://en.wikipedia.org/wiki/Benford's_law

This operator checks, if the first digit of the values ina numerical column follow this law. It first counts how often a leading digit occurs and calculates the frequency. This frequency is returned at the frequency port. Also, the expected frequency according to Benford's law is reported.

In a second step a p-value is calculated. This value indicates how likely it is, that the observed distribution of leading digits is compatible with Benford's law. Low numbers mean, that the distribution is different to what's expected.

Input

exa

The input example set

Output

frequency

A table with the frequency of the first digit in the data set and the expected frequency if the data set would follow Benford's law.

pvalue

The table with the p-value ("probability") that a column follows Benford's Law

ori

The original data set

Parameters

See Also