Identifiability analysis

From Wikipedia the free encyclopedia

Identifiability analysis is a group of methods found in mathematical statistics that are used to determine how well the parameters of a model are estimated by the quantity and quality of experimental data.[1] Therefore, these methods explore not only identifiability of a model, but also the relation of the model to particular experimental data or, more generally, the data collection process.

Introduction[edit]

Assuming a model is fit to experimental data, the goodness of fit does not reveal how reliable the parameter estimates are. The goodness of fit is also not sufficient to prove the model was chosen correctly. For example, if the experimental data is noisy or if there is an insufficient number of data points, it could be that the estimated parameter values could vary drastically without significantly influencing the goodness of fit. To address these issues the identifiability analysis could be applied as an important step to ensure correct choice of model, and sufficient amount of experimental data. The purpose of this analysis is either a quantified proof of correct model choice and integrality of experimental data acquired or such analysis can serve as an instrument for the detection of non-identifiable and sloppy parameters, helping planning the experiments and in building and improvement of the model at the early stages.

Structural and practical identifiability analysis[edit]

Structural identifiability analysis is a particular type of analysis in which the model structure itself is investigated for non-identifiability. Recognized non-identifiabilities may be removed analytically through substitution of the non-identifiable parameters with their combinations or by another way. The model overloading with number of independent parameters after its application to simulate finite experimental dataset may provide the good fit to experimental data by the price of making fitting results not sensible to the changes of parameters values, therefore leaving parameter values undetermined. Structural methods are also referred to as a priori, because non-identifiability analysis in this case could also be performed prior to the calculation of the fitting score functions, by exploring the number degrees of freedom (statistics) for the model and the number of independent experimental conditions to be varied.

Practical identifiability analysis can be performed by exploring the fit of existing model to experimental data. Once the fitting in any measure was obtained, parameter identifiability analysis can be performed either locally near a given point (usually near the parameter values provided the best model fit) or globally over the extended parameter space. The common example of the practical identifiability analysis is profile likelihood method.

See also[edit]

  • Curve fitting – Process of constructing a curve that has the best fit to a series of data points
  • Estimation theory – Branch of statistics to estimate models based on measured data
  • Identifiability – Statistical property which a model must satisfy to allow precise inference
  • Parameter identification problem – Parameter estimation technique in statistics, particularly econometrics
  • Regression analysis – Set of statistical processes for estimating the relationships among variables

Notes[edit]

References[edit]

  • Brun, Roland; Reichert, Peter; Künsch, Hans R. (2001). "Practical identifiability analysis of large environmental simulation models". Water Resources Research. 37 (4): 1015–1030. Bibcode:2001WRR....37.1015B. doi:10.1029/2000WR900350.
  • Cobelli, C.; DiStefano, J. (1980). "Parameter and structural identifiability concepts and ambiguities: a critical review and analysis". Am. J. Physiol. Regul. Integr. Comp. Physiol. 239 (239): 7–24. doi:10.1152/ajpregu.1980.239.1.R7. PMID 7396041.
  • Gutenkunst, Ryan N.; Waterfall, Joshua J.; Casey, Fergal P.; Brown, Kevin S.; Myers, Christopher R.; Sethna, James P. (2007). "Universally Sloppy Parameter Sensitivities in Systems Biology Models". PLOS Computational Biology. 3 (10): –189. arXiv:q-bio/0701039. doi:10.1371/journal.pcbi.0030189. PMC 2000971. PMID 17922568.
  • Lavielle, M.; Aarons, L. (2015), "What do we mean by identifiability in mixed effects models?", Journal of Pharmacokinetics and Pharmacodynamics, 43: 111–122; doi:10.1007/s10928-015-9459-4.
  • Myasnikova, E.; Samsonova, A.; Kozlov, K.; Samsonova, M.; Reinitz, J. (2001-01-01). "Registration of the expression patterns of Drosophila segmentation genes by two independent methods". Bioinformatics. 17 (1): 3–12. doi:10.1093/bioinformatics/17.1.3. PMID 11222257.
  • Raue, A.; Kreutz, C.; Maiwald, T.; Bachmann, J.; Schilling, M.; Klingmuller, U.; Timmer, J. (2009-08-01). "Structural and practical identifiability analysis of partially observed dynamical models by exploiting the profile likelihood". Bioinformatics. 25 (15): 1923–1929. doi:10.1093/bioinformatics/btp358. PMID 19505944.
  • Stanhope, S.; Rubin, J. E.; Swigon D. (2014), "Identifiability of linear and linear-in-parameters dynamical systems from a single trajectory", SIAM Journal on Applied Dynamical Systems, 13: 1792–1815; doi:10.1137/130937913.
  • Vandeginste, B.; Bates, D. M.; Watts, D. G. (1988). "Nonlinear regression analysis: Its applications". Journal of Chemometrics. 3 (3) (published 1989): 544–545. doi:10.1002/cem.1180030313. ISBN 0471-816434.