15 September 2024
Fabricated data:
\[ g(E(y)) = \beta_0 + x_1\beta_1 \]
\[ g(E(y)) = \beta_0 + x_1\beta_1 \]
\[g(E(y)) = \beta_0 + x_1\beta_1 + x_1^2\beta_2 + x_1^3\beta_3 \]
\[g(E(y)) = \beta_0 + x_1\beta_1 + x_1^2\beta_2 + x_1^3\beta_3 \]
Can we:
Splines defined by basis functions.
\[ g(E(y)) = \beta_0 + \sum^k_{j=1} b_j(x_1)\beta_j \]
\[ g(E(y)) = \beta_0 + \sum^k_{j=1} b_j(x_1)\beta_j \]
\[ g(E(y)) = \beta_0 + \sum^k_{j=1} b_j(x_1)\beta_j \]
# A tibble: 5 × 5
term estimate std.error statistic p.value
<chr> <dbl> <dbl> <dbl> <dbl>
1 (Intercept) 1.71 0.300 5.71 8.45e- 7
2 bs(x, knots = k, degree = 3)1 2.95 0.600 4.92 1.21e- 5
3 bs(x, knots = k, degree = 3)2 -7.49 0.515 -14.5 1.27e-18
4 bs(x, knots = k, degree = 3)3 -0.199 0.562 -0.354 7.25e- 1
5 bs(x, knots = k, degree = 3)4 -4.66 0.392 -11.9 1.65e-15
\[ g(E(y)) = \beta_0 + \sum^k_{j=1}{b_j(x_1)\beta_j} \]
\[ g(E(y)) = \beta_0 + \sum^k_{j=1}{b_j(x_1)\beta_j} \]
Smoothing penalty (maximize penalized likelihood): \[ 2L(\boldsymbol{\beta}) - \boldsymbol{\lambda} (\boldsymbol{\beta^T}\boldsymbol{S}\boldsymbol{\beta}) \]
mgcv
Basis function | Name | Properties |
---|---|---|
tp |
Thin plate spline | Low rank (far fewer parameters than data), isotropic (equal smoothing in any direction) regression splines |
ts |
Thin plate spline | Thin plate spline with penalties on the null space such that it can be shrunk to zero. Useful in combination with by= as this does not penalize the null space. |
cr |
Cubic regression spline | Splines with knots spread evenly throughout the covariate domain |
cc |
Cyclic cubic regression spline | Cubic regression splines with ends that meet (have the same second order derivatives) |
ps |
Penalized B-spline | Allow the distribution of knots to be based on data distribution |
cp |
Cyclic penalized B-spline | Penalized B-splines with ends that meet |
gp |
Gaussian process | Gaussian process smooths with five sets of correlation structures |
mrf() |
Markov random fields | Useful for modelling of discrete space. Uses penalties based on neighbourhood matrices (pairwise distances between discrete locations) |
mgcv
Basis function | Name | Properties |
---|---|---|
re |
Random effects | Parametric terms penalized via identity matrices. Equivalent to i.i.d in mixed effects models |
fs |
Factor smooth interaction | For single dimension smoothers. Duplicates the basis functions for each level of the categorical covariate, yet uses a single smoothing parameter across all. |
sos |
Splines on the sphere | Isotropic 2D splines in spherical space. Useful for large spatial domains. |
so |
Soap film smooths | Smooths within polygon boundaries. These are useful for modelling complex spatial areas. |
mgcv
Smoother definition functions | Type | Properties |
---|---|---|
s() |
General spline smoothers | - for multidimensional smooths, assumes that each component are on the same scale as there is only a single smoothing parameter for the smooth |
te() |
Tensor product smooth | - smooth functions of numerous covariates that are built as the tensor product of the comprising smooths (and penalties) |
- the interaction between numerous terms, each with their own smoothing parameter that penalizes the average wiggliness of that term. | ||
ti() |
Tensor product interaction smooth | - smooths that include only the highest order interactions (exclude the basis functions associated with the main effects of the comprising smooths) |
t2() |
Alternative tensor product smooth with non-overlapping … | - creates basis functions and penalties for paired combinations of separate penalized and unpenalized components of each term |