Appendix 2
Regressions

Very often, we have a set of n measurements (xi, yi) of a parameter, y, in different x points. However, describing the parameter using n measurement points is generally not sufficient, especially if we wish to make estimates at points where no measurements are available.

To make it possible to have an estimate of y at every possible point defined by x, a mathematical model can be developed from the knowledge of the measurements available. This model is noted y = f(x) and is called an estimator. It allows us to calculate the y value of the considered parameter, whatever point x retains.

A2.1. Determining a linear estimator

By adopting a linear regression, the sought model is of the form:

math

where:

  • – yM is the y value estimated by the model for x; and
  • – a and b are the model parameters. They are inferred from the n data points (xi, yi) so that the estimations yiM are as close as possible to the measurement. yi; that is, the difference between the actual measurements and the estimations is as small as possible.

Indeed, for the available n measurements (xi, yi), the model error at point i is defined by the difference, at this point, between the measured value of y and the model’s value, yM:

math

Coefficients a and b are determined by minimizing the sum, S, of squared errors:

math

or, substituting for ε(i):

math

and replacing yM(Xi) by its expression: math

By developing the squared expression, we obtain:

math

S is minimal for math and math:

math

Likewise: math.

Hence: math.

We can then infer the expression of a:

math

A2.2. Performance of the estimator

One way to give an idea about the performance of an estimator is to compute the sum of the squared errors obtained.

Recall that in the case of a linear regression, the model is:

math

with:

math

and:

math

Once the parameters, a and b, of the model are calculated, we can determine, at every measurement point defined by (xi, yi), the difference between the value of y estimated by the model and that obtained through measurements. This difference is given by:

math

The performance of the model is then measured by the sum of squared differences:

math