Loss Functions for Detecting Outliers in Panel Data

Charles D. Coleman, Thomas Bryan

Published: 2025/9/6

Abstract

The detection of outliers is of critical importance in the assurance of data quality. Outliers may exist in observed data or in data derived from these observed data, such as estimates and forecasts. An outlier may indicate a problem with its data generation process or may simply be a true statement about the world. Without making any distributional assumptions, this paper proposes the use of loss functions to detect these outliers in panel data. An unsigned loss function is derived axiomatically. A signed loss function is developed to account for positive and negative outliers separately. In the case of nominal time an exact parametrization of the loss function is obtained. A time-invariant loss function permits the comparison of data at multiple times on the same basis. Several examples are provided, including an example in which the outliers are classified by another variable.

Loss Functions for Detecting Outliers in Panel Data | SummarXiv | SummarXiv