Judy Wang (Organizer) |
Daniela Castro-Camilo |
Sebastian Engelke |
Tiandong Wang |
Chen Zhou |
A Bayesian multivariate extreme value mixture model |
Daniela Castro-Camilo Lecturer in Statistics, School of Mathematics and Statistics, University of Glasgow, Glasgow G12 8QQ Daniela.CastroCamilo@glasgow.ac.uk |
Impact assessment of natural hazards requires the consideration of both extreme and non-extreme events. Extensive research has been conducted on the joint modelling of bulk and tail in univariate settings; however, the corresponding body of research in the context of multivariate analysis is comparatively scant. This study extends the univariate joint modelling of bulk and tail to the multivariate framework. Specifically, it pertains to cases where multivariate observations exhibit extremity in at least one component.
We propose a multivariate extreme value mixture model that assumes a parametric model to capture the bulk of the distribution, which is in the max-domain of attraction of a multivariate extreme value distribution. The multivariate tail is described by the asymptotically justified multivariate generalized Pareto distribution. Bayesian inference based on multivariate random-walk Metropolis-Hastings and the automated factor slice sampler allows us to easily incorporate uncertainty from the threshold selection. The performance of our model is tested using different simulation scenarios, and the applicability of our model is illustrated using temperature records in the UK that show the need to accurately describe the joint tail behaviour. |
Machine learning beyond the data range: an extreme value perspective |
Sebastian Engelke Associate Professor, Research Center for Statistics, University of Geneva Daniela.CastroCamilo@glasgow.ac.uk |
Machine learning methods perform well in prediction tasks within the range of the training data. These methods typically break down when interest is in (1) prediction in areas of the predictor space with few or no training observations; or (2) prediction of quantiles of the response that go beyond the observed records. Extreme value theory provides the mathematical foundation for extrapolation beyond the range of the training data, both in the dimension of the predictor space and the response variable. In this talk we present recent methodology that combines this extrapolation theory with flexible machine learning methods to tackle the out-of-distribution generalization problem (1) and the extreme quantile regression problem (2). We show the practical importance of prediction beyond the training observations in environmental and climate applications, where domain shifts in the predictor space occur naturally due to climate change and risk assessment for extreme quantiles is required. |
Testing for Strong VS Full Dependence |
Tiandong Wang, Ph.D. Shanghai Center for Mathematical Sciences, Fudan University td_wang@fudan.edu.cn |
Preferential attachment models of network growth are bivariate heavy tailed models for in- and out-degree with limit measures which either concentrate on a ray of positive slope from the origin or on all of the positive quadrant depending on whether the model includes reciprocity or not. Concentration on the ray is called full dependence. If there were a reliable way to distinguish full dependence from not-full, we would have guidance about which model to choose. This motivates investigating tests that distinguish between (i) full dependence; (ii) strong dependence (limit measure concentrates on a proper subcone of the positive quadrant; (iii) concentration on positive quadrant. We give two test statistics and discuss their asymptotically normal behavior under full and not-full dependence.
This is a joint work with Prof. Sidney Resnick at Cornell University. |
Tail copula estimation for heteroscedastic extremes |
Chen Zhou Erasmus University Rotterdam zhou@ese.eur.nl |
Consider independent multivariate random vectors which follow the same copula, but where each marginal distribution is allowed to be non-stationary. This non-stationarity is for each marginal governed by a scedasis function (see Einmahl et al. (2016)) that is the same for all marginals. We establish the asymptotic normality of the usual rank-based estimator of the stable tail dependence function, or, when specialized to bivariate random vectors, the corresponding estimator of the tail copula. Remarkably, the heteroscedastic marginals do not affect the limiting process. Next, under a bivariate setup, we develop nonparametric tests for testing whether the scedasis functions are the same for both marginals. Detailed simulations show the good performance of the estimator for the tail dependence coefficient as well as that of the new tests. In particular, novel asymptotic confidence intervals for the tail dependence coefficient are presented and their good finite-sample behavior is shown. Finally an application to the S&P500 and Dow Jones indices reveals that their scedasis functions are about equal and that they exhibit strong tail dependence. |