median, mean, mode as minimizers

2024年5月9日

Median, mean, and mode are the most common measures of central tendency.

Norms

In mathematics, a norm measures the “distance” from a point to another. A norm is denoted by double vertical lines: $||x||$.

Absolute-value norm is a norm on a single value, see $\eqref{absolute}$.

$$
\begin{equation}
||x||=|x|
\label{absolute}
\end{equation}
$$

Euclidean norm is a norm on a list, see $\eqref{euclidean}$.

$$
\begin{equation}
||\mathbf{x}||= \sqrt{x_1^2 + \cdots + x_n^2}
\label{euclidean}
\end{equation}
$$

p-norm is a generalization of Euclidean norm. It is a norm on a list.

Let $p \geq 1$ be a real number. The p-norm of list $\mathbf{x} = (x_1, \ldots, x_n)$ is

$$ ||\mathbf{x}||_p := \left(\sum_{i=1}^n \left|x_i\right|^p\right)^{1/p}. $$

When p=1, p-norm reduces to $||\mathbf{x}||_1 := \sum_{i=1}^n \left|x_i\right| $. It takes absolute values, but it is not the absolute-value norm due to the $\sum$ symbol.

When p=2, we get Euclidean norm.

As p approaches $\infty$, the p-norm approaches the infinity norm or maximum norm: $||\mathbf{x}||_\infty := \max_i \left|x_i\right|$ .

When p=0, 0-norm is not defined.

Statistical dispersions

In statistics, dispersion is the extent to which a distribution is stretched or squeezed.

For a given (finite) data set $\mathbf{x}$, the dispersion about a point c is the “distance” from $x_i$ to the c in the p-norm:

$$ f_p(c) = \left\| \mathbf{x} – \mathbf{c} \right\|_p = \left( \frac{1}{n} \sum_{i=1}^n \left| x_i – c\right| ^p \right) ^{1/p} $$

p	dispersion	central tendency
0	variation ratio	mode
1	average absolute deviation	median
2	standard deviation	mean
$\infty$	maximum deviation	mid-range

Average Absolute Deviation has the identical form of Mean Absolute Error (MAE) which is from the viewpoint of prediction.

The 2-norm is Mean Squared Error (MSE) in addition with the power of $\frac{1}{2}$. MSE is $\frac{1}{n} \sum_{i=1}^n \left(x_i-c \right)^2$.

The 0-norm counts the number of unequal points. Mode minimizes this 0-norm.

For $ p = \infty$ the largest number dominates, and thus mid-range $M=\frac{\max x + \min x}{2} $ minimizes $\infty$-norm.

median, mean, mode as minimizers

Norms

Statistical dispersions

See also

《median, mean, mode as minimizers》有一个想法