Machine Learning-Enhanced Colorimetric Sensing: Achieving over 5700-fold Accuracy Improvement via Full-Spectrum Modeling

Majid Aalizadeh, Chinmay Raut, Ali Tabartehfarahani, Xudong Fan

Published: 2025/9/3

Abstract

Conventional colorimetric sensing methods typically rely on signal intensity at a single wavelength, often selected heuristically based on peak visual modulation. This approach overlooks the structured information embedded in full-spectrum transmission profiles, particularly in intensity-based systems where linear models may be highly effective. In this study, we experimentally demonstrate that applying a forward feature selection strategy to normalized transmission spectra, combined with linear regression and ten-fold cross-validation, yields significant improvements in predictive accuracy. Using food dye dilutions as a model system, the mean squared error was reduced from over 22,000 with a single wavelength to 3.87 using twelve selected features, corresponding to a more than 5,700-fold enhancement. These results validate that full-spectrum modeling enables precise concentration prediction without requiring changes to the sensing hardware. The approach is broadly applicable to colorimetric assays used in medical diagnostics, environmental monitoring, and industrial analysis, offering a scalable pathway to improve sensitivity and reliability in existing platforms.