Multivariate Methods

Objectives: Multivariate distributions from shared mixing variables; 2 dimensional FFT techniques.

Audience: Users interested in directly estimating multivariate distributions.

Prerequisites: DecL, general use of aggregate, probability.

See also:

Contents:

Helpful References

Two Shortcomings

aggregate methods are fundamentally univariate. Only one loss is tracked through each calculation. As a result, there are several programs it is impossible to model directly:

  • The net of an occurrence cat program with an aggregate limit (need to track the net and cession to know when the aggregate limit is exhausted).

  • The total cession from a specific and agg program (need to track two ceded results).

These are dual problems. It is possible to model them using two dimensional FFTs.

Correlated Aggregate Distributions

Here we extend some of the ideas in Section 1.7.3 from plain frequency distributions to aggregate distributions. Begin with bivariate aggregate distributions. There are two different situations which commonly arise. First we could model a bivariate severity distribution and a univariate count distribution:

\[(A,B)=(X_1,Y_1)+\cdots+(X_N, Y_N).\]

Equation ([modelone]) arises naturally as the distribution of losses and allocated expense, ceded and retained losses, reported and ultimate claims, and in many other situations. Secondly we could model

\[(A,B)=(X_1+\cdots+X_M, Y_1+\cdots+Y_N)\]

where \(X_i\) and \(Y_j\) are independent severities and \((M,N)\) is a bivariate frequency distribution. ([modeltwo]) could be used to model losses in a clash policy.

We will use the following notation. \(A=X_1+\cdots+X_M\) and \(B=Y_1+\cdots+Y_N\) are two aggregate distributions, with \(X_i\) iid and \(Y_j\) iid, but neither \(X\) and \(Y\) nor \(M\) and \(N\) necessarily independent. Let \(\mathsf{E}(X)=x\) and \(\mathsf{E}(Y)=y\), \(\mathsf{var}(X)=v_x\) and \(\mathsf{var}(Y)=v_y\). Let \(\mathsf{E}(M)=m\), \(\mathsf{E}(N)=n\), \(c\) be the contagion of \(M\) and \(d\) that of \(N\). Hence \(\mathsf{var}(M)=m(1+cm)\) and \(\mathsf{var}(N)=n(1+dn)\).

Will now calculate the correlation coefficient between \(A\) and \(B\) in four situations.

Correlated Severities, Single Frequency

Assume that the bivariate severity distribution \((X,Y)\) has moment generating function \(M_{(X,Y)}(\zeta,\tau)\). Also assume that the claim count distribution \(N\) is a \(G\)-mixed Poisson. Then, just as for univariate aggregate distributions, the MGF of the bivariate aggregate \((A,B)\) is

\[M_{(A,B)}(\zeta,\tau)= M_G( n(M_{(X,Y)}(\zeta,\tau)-1)).\label{bivMGF}\]

Therefore, since \(\mathsf{E}(G)=1\) and \(\mathsf{E}(G^2)=1+c\),

\[\begin{split}E(AB) &= \frac{\partial^2 M_{(A,B)}}{\partial\zeta\partial\tau}\Big\vert_{(0,0)} \notag \\ &= M_G''(0)n^2\frac{\partial M_{(X,Y)}}{\partial\zeta} \frac{\partial M_{(X,Y)}}{\partial\zeta} + M_G'(0)n \frac{\partial^2 M_{(X,Y)}}{\partial\zeta\partial\tau} \notag \\ &=(1+c)n^2xy + n\mathsf{E}(XY) \notag \\ &=(1+c)n^2xy + n\mathsf{cov}(X,Y) + nxy.\end{split}\]

The value of \(\mathsf{cov}(X,Y)\) will depend on the particular bivarate severity distribution.

For example, suppose that \(Z\) represents ground up losses, \(X\) represents a retention to \(a\) and \(Y\) losses excess of \(a\) (per ground up claim), so \(Z=X+Y\). Then \((X,Y)\) is a bivariate severity distribution. Since \(Y\) is zero when \(Z\le a\) we have \(\mathsf{cov}(X,Y)=(a-x)y\).

Bivariate Frequency

The second method for generating correlated aggregate distributions is to use a bivariate frequency distribution. So, suppose \((M,N)\) has a \(G\)-mixed bivariate Poisson distribution. The variance of \(A\) is given by Equation ([varAgg]). To compute the covariance of \(A\) and \(B\) write the bivariate MGF of \((A,B)\) as

\[M_{(A,B)}(\zeta,\eta)=M(\zeta,\eta)=M_G(m(M_X(\zeta)-1) +n(M_Y(\eta)-1))=M_G(\psi(\zeta,\eta))\]

where the last equality defines \(\psi\). Then, evaluating at the partial derivatives at zero, we get

\[\begin{split}\mathsf{E}(AB) &= \frac{\partial^2 M}{\partial\zeta\partial\eta} \notag \\ &= \frac{\partial^2 M_G}{\partial t^2} \frac{\partial \psi}{\partial\zeta} \frac{\partial \psi}{\partial\eta} + \frac{\partial M_G}{\partial t} \frac{\partial^2 \psi}{\partial\zeta\partial\eta} \notag \\ &= (1+c)mxny.\end{split}\]

Hence

\[\mathsf{cov}(A,B)=\mathsf{E}(AB)-\mathsf{E}(A)\mathsf{E}(B)=cmnxy.\]

Parameter Uncertainty

It is common for actuaries to work with point estimates as though they are certain. In reality there is a range around any point estimate. We now work through one possible implication of such parameter uncertainty. We will model \(\mathsf{E}[A]=R\) and \(\mathsf{E}[B]=S\) with \(R\) and \(S\) correlated random variables, and \(A\) and \(B\) conditionally independent given \(R\) and \(S\). We will assume for simplicity that the severities \(X\) and \(Y\) are fixed and that the uncertainty all comes from claim counts. The reader can extend the model to varying severities as an exercise. \(R\) and \(S\) pick up uncertainty in items like the trend factor, tail factors and other economic variables, as well as the natural correlation induced through actuarial methods such as the Bornheutter-Ferguson.

Suppose \(\mathsf{E}[R]=r\), \(\mathsf{E}[S]=s\), \(\mathsf{var}(R)=v_r\), \(\mathsf{var}(S)=v_s\) and let \(\rho\) be the correlation coefficient between \(R\) and \(S\).

By ([varAgg]) the conditional distribution of \(A \mid R\) is a mixed compound Poisson distribution with expected claim count \(R/x\) and contagion \(c\). Therefore the conditional variance is

\[\begin{split}\mathsf{var}(A \mid R) &= \mathsf{E}[M \mid R]\mathsf{var}(X)+\mathsf{var}(M \mid R)\mathsf{E}[X]^2 \\ &= R/x v_x + R/x(1+cR/x) x^2 \\ &= xR(1+ v_x/x^2) + cR^2,\end{split}\]

and the unconditional variance of \(A\) is

\[\begin{split}\mathsf{var}(A) &= \mathsf{E}[\mathsf{var}(A \mid R)] + \mathsf{var}(\mathsf{E}[A \mid R]) \\ &= \mathsf{E}[xR(v_x/x^2+1)+cR^2] + \mathsf{var}(R) \\ &= xr(v_x/x^2+1)+c(v_r+r^2) + v_r.\end{split}\]

Next, because \(A\) and \(B\) are conditionally independent given \(R\) and \(S\),

\[\begin{split}\mathsf{cov}(A,B) &= \mathsf{E}[\mathsf{cov}(A,B \mid R,S)] + \mathsf{cov}(\mathsf{E}[A \mid R], \mathsf{E}[B \mid S]) \\ &= \mathsf{cov}(R, S).\label{simpleCov}\end{split}\]

Note Equation ([simpleCov]) is only true if we assume \(A\not=B\).

Parameter Uncertainty and Bivariate Frequency

Finally, suppose \(\mathsf{E}[A]=R\), \(\mathsf{E}[B]=S\) with \(R\) and \(S\) correlated parameters and conditional on \((R,S)\) suppose that \((M,N)\) has a \(G\)-mixed bivariate Poisson distribution. By ([covMNM]) \(\mathsf{cov}(A,B \mid R,S)=cRS\). The unconditional variances are as given in ([varA]). The covariance term is

\[\begin{split}\mathsf{cov}(A,B) &= \mathsf{E}[\mathsf{cov}(A,B \mid R,S)] + \mathsf{cov}(\mathsf{E}[A \mid R], \mathsf{E}[B \mid S]) \\ &= c\mathsf{E}[RS] + \mathsf{cov}(R,S) \\ &= (1+c)\mathsf{cov}(R,S) + crs \\ &= \rho \sqrt{v_rv_s}(1+c)+crs.\end{split}\]