Machine Learning-Enabled NIR Spectroscopy. Part 3: Hyperparameter by Design (HyD) Based ANN-MLP Optimization, Model Generalizability, and Model Transferability

Hussain Ali, Prakash Muthudoss, Chirag Chauhan, Ilango Kaliappan, Dinesh Kumar, Amrit Paudel*, Gobi Ramasamy*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Data variations, library changes, and poorly tuned hyperparameters can cause failures in data-driven modelling. In such scenarios, model drift, a gradual shift in model performance, can lead to inaccurate predictions. Monitoring and mitigating drift are vital to maintain model effectiveness. USFDA and ICH regulate pharmaceutical variation with scientific risk-based approaches. In this study, the hyperparameter optimization for the Artificial Neural Network Multilayer Perceptron (ANN-MLP) was investigated using open-source data. The design of experiments (DoE) approach in combination with target drift prediction and statistical process control (SPC) was employed to achieve this objective. First, pre-screening and optimization DoEs were conducted on lab-scale data, serving as internal validation data, to identify the design space and control space. The regression performance metrics were carefully monitored to ensure the right set of hyperparameters was selected, optimizing the modelling time and storage requirements. Before extending the analysis to external validation data, a drift analysis on the target variable was performed. This aimed to determine if the external data fell within the studied range or required retraining of the model. Although a drift was observed, the external data remained well within the range of the internal validation data. Subsequently, trend analysis and process monitoring for the mean absolute error of the active content were conducted. The combined use of DoE, drift analysis, and SPC enabled trend analysis, ensuring that both current and external validation data met acceptance criteria. Out-of-specification and process control limits were determined, providing valuable insights into the model’s performance and overall reliability. This comprehensive approach allowed for robust hyperparameter optimization and effective management of model lifecycle, crucial in achieving accurate and dependable predictions in various real-world applications. Graphical Abstract: [Figure not available: see fulltext.].

Original languageEnglish
Article number254
JournalAAPS PharmSciTech
Volume24
Issue number8
DOIs
Publication statusPublished - Dec 2023

Keywords

  • Artificial Neural Network Multilayer Perceptron (ANN-MLP)
  • data-driven modelling
  • design of experiments (DoE)
  • hyperparameter optimization
  • model generalizability
  • model lifecycle management
  • model transferability
  • near infrared (NIR)
  • process monitoring
  • statistical process control (SPC)
  • target drift detection

ASJC Scopus subject areas

  • Pharmaceutical Science

Cite this