Independent, dependent, and hidden variables

Next: Energies, free energies, and Up: Basic model and notations Previous: Basic model and notations Contents

Independent, dependent, and hidden variables

Constructing theories means introducing concepts which are not directly observable. They should, however, explain empirical findings and thus have to be related to observations. Hence, it is useful and common to distinguish observable (visible) from non-observable (hidden) variables. Furthermore, it is often convenient to separate visible variables into dependent variables, representing results of measurements the theory is aiming to explain, and independent variables which are under explicit control and specify the kind of measurements performed.

Hence, we will consider the following three groups of variables

1.: observable (visible) independent variables ,
2.: observable (visible) dependent variables ,
3.: not directly observable (hidden, latent) variables .

This characterization of variables translates to the following factorization property, defining the model we will study,

$\begin{displaymath} p(x,y,{h}) = p(y\vert x,{h}) \, p(x) \, p({h}). \end{displaymath}$

(1)

In particular, we will be interested in scenarios where

= $(x_1,x_2,\cdots )$ and analogously

= $(y_1,y_2,\cdots )$ are decomposed into independent components, meaning that $p(y\vert x,{h})$ = $\prod_i p(y_i\vert x_i,{h})$ and

= $\prod_i p(x_i)$ factorize. Then,

$\begin{displaymath} p(x,y,{h}) = p({h})\, \prod_i p(y_i\vert x_i,{h}) \,p(x_i) . \end{displaymath}$

(2)

Fig.1 shows a graphical representation of the factorization model (2) as a directed acyclic graph [185,125,107,200]. The

and/or

itself can also be vectors.

The interpretation will be as follows: Variables ${h}\in {H}$ represent possible states of (the model of) Nature, being the invisible conditions for dependent variables . The set defines the space of all possible states of Nature for the model under study. We assume that states are not directly observable and all information about comes from observed variables (data) , . A given set of observed data results in a state of knowledge numerically represented by the posterior density $p({h}\vert f)$ over states of Nature.

Independent variables $x\in X$ describe the visible conditions (measurement situation, measurement device) under which dependent variables (measurement results) have been observed (measured). According to Eq. (1) they are independent of , i.e., $p(x\vert{h})$ = . The conditional density $p(y\vert x,{h})$ of the dependent variables is also known as likelihood of (under given ). Vector-valued can be treated as a collection of one-dimensional with the vector index being part of the variable, i.e., $y_\alpha (x) = y(x,\alpha) = y(\tilde x)$ with $\tilde x = (x,\alpha)$ .

In the setting of empirical learning available knowledge is usually separated into a finite number of training data = $\{(x_i,y_i)\vert 1\le i\le n\}$ = $\{(x_D,y_D)$ and, to make the problem well defined, additional a priori information . For data $D\cup D_0$ we write $p({h}\vert f) = p({h}\vert D,D_0)$ . Hypotheses represent in this setting functions = $p(y\vert x,{h})$ of two (possibly multidimensional) variables , . In density estimation is a continuous variable (the variable may be constant and thus be skipped), while in classification problems takes only discrete values. In regression problems on assumes $p(y\vert x,{h})$ to be Gaussian with fixed variance, so the function of interest becomes the regression function ${h} (x) = \int \,dy\, y p(y\vert x,{h})$ .

**Figure 1:** Directed acyclic graph for the factorization model (2).
$\begin{figure}\begin{center} \setlength{\unitlength}{1mm}\begin{picture}(50,37) ... ...(27.8,6.8){\vector(1,1){11.2}} %\{h\}->y^n \end{picture}\end{center}\end{figure}$

Next: Energies, free energies, and Up: Basic model and notations Previous: Basic model and notations Contents

Joerg_Lemm 2001-01-21