Likelihood and Regression
1. Likelihood Function
To keep things simple and the notation as light as possible, we only consider unconditional distributions of univariate responses from independent observations in this chapter.
Note: We assume an underlying probability space exists and write for and for the push-forward measure .
1.1 Probabilities and Distribution Functions
Note: The response we are interested in follows a distribution and we write understanding that the probability space exists and that is a -measurable function.
Note: It is important to note that for non-finite sample spaces an observation is conceptually always an event, i.e. a set . We never observe outcomes, i.e. elements , directly. For discrete sample spaces with finite cardinality however, events might very well be observed.
Definition 1.1 (Cumulative distribution function): Let have an ordering. The cumulative distribution function is defined as
Note: The cdf is a monotonically, but not necessarily strictly, increasing function with .
1.2 Sample Spaces and Measurement Scales
The choice of an appropriate sample space very much depends on the measurement scale of the response . Most situations can be classified into binary, ordered or unordered categorical, count and bounded or unbounded absolutely continuous responses .
1.2.1 Binary Response
Definition 1.2 (Binary Response): The sample space is , i.e. the response can only take two outcomes or .
Note: We understand these two outcomes as being truely categorical and explicitly exclude dichotomisation. E.g. binary variables like “younger than 65 years” vs. “older than 65 years” should be modelled by a sample space appropriate for age as a numeric variable.