Pré-Publication, Document De Travail Année : 2024

A new iterative framework for simulation-based population genetic inference with improved coverage properties of confidence intervals

Résumé

Abstract Simulation-based methods such as approximate Bayesian computation (ABC) are widely used to infer the evolutionary history of populations from molecular genetic data. We describe and evaluate a new iterative method of statistical inference about model parameters, which revisits the idea of inferring a likelihood surface using simulation when the likelihood function cannot be evaluated. In addition to the traditional assessment of precision in terms of bias and mean square error, we also evaluate the coverage of confidence intervals. It is based on combining the random forest machine learning method, and multivariate Gaussian mixture (MGM) models, in an effective inference workflow, here used to fit models with up to 15 variable parameters. Masked autoregressive flows, a deep learning technique, is also tested as an alternative to MGM models. The method is compared to that of approximate Bayesian computation (ABC) with random forests, with which it shares some technical features, on scenarios of inference of historical demography from population genetic data. These comparisons highlight the importance of an iterative workflow for exploring the parameter space efficiently. For equivalent simulation effort of the data-generating process, the new summary-likelihood method provides intervals whose coverage is better controlled than the marginal coverage of intervals provided by ABC with random forests, and than generally reported for ABC methods. The iterative workflow can also yield greater improvements in estimator precision when larger datasets are used.

Dates et versions

hal-05370937 , version 1 (18-11-2025)

Identifiants

Citer

François Rousset, Raphaël Leblois, Arnaud Estoup, Jean-Michel Marin. A new iterative framework for simulation-based population genetic inference with improved coverage properties of confidence intervals. 2025. ⟨hal-05370937⟩
84 Consultations
0 Téléchargements

Altmetric

Partager

  • More