Visual fatigue measurement model in stereoscopy based on Bayesian network

Zhongyun Yuan; Jong Hak Kim; Jun Dong Cho

doi:10.1117/1.OE.52.8.083110

28 August 2013


Optical Engineering, Vol. 52, Issue 8, 083110 (August 2013). https://doi.org/10.1117/1.OE.52.8.083110

Abstract

A stereoscopic visual fatigue measurement model based on Bayesian networks (BNs) is presented. Our approach focuses on the interdependencies between contextual and environmental factors and the phenomena of visual fatigue in stereoscopy. Specifically, implementing a BN over multiple features provides a systematic way to project and evaluate visual fatigue. Compared with other measurement models, our BN-based scheme is more comprehensive. The test validation also indicates that the proposed model can serve as a reliable method for inferring visual fatigue in stereoscopy.

1. Introduction

Recently, as various stereoscopy technologies have been commercialized, more three-dimensional (3-D) applications have become a part of modern life. Three-dimensional televisions (3-DTVs) and 3-D movie theaters are also becoming popular. However, the development of 3-D technology faces some critical barriers, specifically stereoscopic visual fatigue. Visual fatigue caused by the conflict between accommodation and convergence is unavoidable in most stereoscopic applications. As described in Refs. 1 and 2, although viewers are able to perceive a smooth 3-D watching experience after resolving the visual conflicts, a range of fatigue symptoms (such as eyestrain and headaches) can be incurred, usually after about 20 min of viewing on 3-D displays. To ensure the safety of 3-D applications, it is essential to measure the visual fatigue caused by stereoscopic images. Thus, many studies have investigated visual fatigue in stereoscopy.³–⁶

Figure 1(a) describes the main measurement schemes in 3-D visual fatigue research: the mean opinion score (MOS)-based scheme, and the contact physiological feature (CPF)-based and contactless physiological feature (CLPF)-based schemes [such as electroencephalogram (EEG), electrocardiograph (ECG), and eye movement (EM) detection]. As noted by Kim and Cho,⁷ the MOS measures subjective 3-D visual fatigue using questionnaires that correlate highly with the subjective 3-D visual fatigue, for example the question “How much do you feel visual fatigue?” with the answers “comfortable, a little uncomfortable, uncomfortable.” The CLPF-based scheme, as shown by Kim et al.⁸ and Chae et al.,⁹ builds a visual fatigue measurement model from the eyes’ response curve and blink frequency; the level of visual fatigue in stereoscopy is determined from the eye-tracking results. The CPF-based scheme, as described by Gomarus et al.¹⁰ and Fang et al.,¹¹ is a measurement model based on recordings of electrical activity related to visual fatigue; the level of stereoscopic visual fatigue is determined from the bio-signals of the human body.

Fig. 1

(a) Describes the main existing measurement schemes in recent research of three-dimensional (3-D) visual fatigue and (b) describes our proposed measurement model based on Bayesian networks (BNs).

However, both subjective and objective measurements have their own advantages and defects. Unfortunately, most studies ignore the influence of extraneous state variables (e.g., the state of the human body and the testing environment). For this reason, the same test method applied to different subjects may give significantly different results. Therefore, we develop a measurement model based on a strong correlation structure (the BN structure), as depicted in Fig. 1(b), that can reliably recognize stereoscopic visual fatigue.

Figure 1(b) shows our proposed measurement model on a BN structure. The feature vectors (nodes) make up the BN tree. The results of the nodes are fused with a BN inference algorithm, and the final result is then inferred from the probability values of the different variable states. To the best of our knowledge, this is the first adaptation of a probabilistic framework on the BN structure for inferring the 3-D viewer’s state of visual fatigue. As opposed to the previous works described in Refs. 4, 5, 8, 10, and 12, our proposed model does not employ a single physiological feature as a decision factor, but deals with the probability values of different variables’ states that arise from the interdependencies between observation and contextual features.

The organization of this article is as follows. After a brief introduction in Sec. 1, Sec. 2 introduces the background and related work for this study. Section 3 describes the BN-based 3-D visual fatigue measurement framework. Section 4 presents the experimental results. Finally, Sec. 5 summarizes the article.

2. Background and Related Work

2.1. Visual Fatigue Description in Stereoscopy

Binocular vision is produced when two separate, slightly different images corresponding to the left and right eyes are merged in the viewer’s brain to build a common impression.¹³ Hodges and McAllister¹⁴ describe the method of right and left perspective views in a 3-D display. A 3-D screen based on binocular parallax relies on the format of the image presented and on the viewing format. Figure 2 illustrates how the viewer experiences a stereoscopic sensation when the appropriate view is presented to each eye on a 3-D screen. The improved depth perception also adds realism to stereoscopy. Although stereoscopic imagery can be presented on 3-D displays, it violates the relationship that holds in natural viewing of the real world. As shown in Fig. 2, when the viewer observes a real object or an image on a two-dimensional (2-D) device, the eyes accommodate (focus on) and converge to the same point, so the accommodation distance matches the convergence distance. Conversely, when a viewer watches a stereoscopic image on a 3-D display, the focus point remains on the plane of the screen, while the eyes converge on the image at a different distance. This breakdown of the relationship between accommodation and convergence causes visual discomfort.

Fig. 2

Comparison between stereogram viewing and natural viewing.

For 3-D comfort evaluation, Choi et al.¹⁵ identify some factors that capture the spatiotemporal characteristics of disparity; visual comfort is then predicted by fusing these factors. Figure 3 illustrates the types of disparity during stereoscopic viewing. Two disparities are indicated on the coordinate plane: positive (uncrossed) and negative (crossed) disparities, shown by the blue and red zones.¹³ In Fig. 3, the horizontal gray line at the position of the display represents the zero disparity plane. The zero disparity plane is the converged domain of stereoscopic imaging, and the zero disparity area is commonly referred to as the comfortable zone of stereoscopic imaging.¹⁶,¹⁷ Depending on the stereoscopic disparity, different 3-D imaging positions can be implemented, such as in front of or behind the screen. Stereoscopic disparity refers to the difference in image location of one object viewed by the left and right eyes. When a 3-D camera captures a stereoscopic image, each lens separately converges on the main object and generates stereoscopic disparity. The main object is seen as a single image, but the background is seen as double images with disparity.

Fig. 3

Relationship between (a) positive disparity and (b) negative disparity; (c) is natural scene.

In Fig. 3(a), the positive disparity in the stereoscopic image corresponds to the uncrossed line. In Fig. 3(b), the negative disparity corresponds to the crossed line. The negative disparity exhibits crosstalk that occurs between accommodations of each eye. In addition, the negative disparity shows a larger disparity and object size than the positive disparity, since the image in negative disparity is closer to the viewer than in positive disparity. This phenomenon is related to the geometry of binocular viewing. Therefore, negative disparity can incur more visual fatigue than positive disparity.⁷,¹⁷ Yilmaz and Gudukbay¹⁸ point out that crosstalk (or the ghosting effect) is the faded image viewed by the untargeted eye. This effect is undesirable because it may cause visual fatigue and other problems. Gudukbay and Yilmaz¹⁹ indicate that a more comfortable stereo view can be achieved by reducing crosstalk.

2.2. Visual Fatigue Measurement Model Description

Body fatigue can be easily tracked from observable physiological features.²⁰,²¹ This scheme is considered a relatively objective method for visual fatigue measurement. Physiological features may be classified into contactless and contact features. Contactless features include EMs, head movement, etc., and can be easily detected by a real-time monitor. Contact features include brain activity, heart rate variability, etc., and can be detected by EEG, ECG, and other bio-sensor systems.

The CLPF-based scheme focuses on inferring fatigue from the contactless features. Ji et al.²² demonstrate that a fatigued person exhibits certain visual cues in long-duration visual experiments. Horng et al.²³ present a fatigue measurement algorithm that depends on eye tracking and dynamic matching. Kim et al.²⁴ construct a neural network-based scheme for fatigue recognition that detects the movements of the mouth and eyes.

The CPF-based scheme focuses on inferring fatigue from the contact features. For example, the EEG carries abundant information about human cognitive states, which can be detected in the major EEG bands ($δ$, $θ$, $α$, and $β$). Lal et al.²⁵ present a fatigue recognition algorithm based on different levels of the EEG bands. Also, Jung et al.²⁶ and Wilson and Bracewell²⁷ propose methods to estimate and predict the fatigue level based on EEG power spectrum estimation and a fuzzy neural network model. Based on the main EEG activities ($δ$, $θ$, $α$, and $β$) of 52 subjects (36 males and 16 females) during fatigue measurement, Budi et al.²¹ found that the $δ$ and $θ$ activities are stable over time, whereas the $α$ activity decreases slightly and the $β$ activity decreases significantly. For the ECG signal, the other important CPF, fatigue recognition in Refs. 28 and 29 refers to the heart’s activity at low frequency (LF), very low frequency (VLF), and high frequency (HF), and to the LF/HF ratio.

Previous physiological feature-based schemes focus on only a single specific aspect. This may lead to inaccurate results because fatigue is not directly observable and can only be inferred from the information available. There are a number of reasons for the inaccuracies of the schemes mentioned above: (1) Contextual factor. Fatigue recognition involves considerable subjectivity, which does not always reflect the objective state. (2) Environment factor. For example, when a person is in an unfamiliar environment,³⁰ facial expressions (such as eye and mouth movements) may be interpreted inaccurately, especially for introverted persons. Therefore, fusing as many features as possible from uncertain events is a better way to make an accurate inference.³¹ Further, Picard et al.³² pointed out that it is necessary to fuse the contextual and physiological features with human performance in order to make fatigue measurement more reliable.

By considering the evidence and beliefs of the contextual information and the physiological features from measurement, Ji et al.²² construct a BN-based algorithm to infer and predict human fatigue, enhancing the reliability of fatigue detection. Yang et al.²⁰ develop a BN-based fatigue recognition model for use in systems that evolve over time. However, the fatigue networks in Refs. 20, 22, and 33–36 mostly apply to driving, visual display terminal monitoring, and the marine industry. To the best of our knowledge, there is no related work on stereoscopic visual fatigue based on a probabilistic framework or BN. Therefore, considering the states and beliefs of contextual information and physiological features, a novel probabilistic framework-based (BN-based) measurement model for stereoscopic visual fatigue is proposed in this article.

2.3. Bayesian Networks Method Description

Hubbard³⁷ describes uncertainty as the lack of certainty, a state of having limited knowledge where it is difficult to infer precisely the existing state or future outcome. Decision making is generally recognized by engineers as an indispensable part of the whole engineering design process. As with most fatigue recognition tasks, stereoscopic visual fatigue measurement involves a number of uncertain factors. Because uncertainty has a significant impact on judgment, engineers try to manage it with compound methods and intelligent systems. The most reliable tool for modeling uncertainty is probability theory.³⁵

One of the most prevalent and effective graphical models for managing uncertainty is the BN.³⁸ A BN, also called a belief network or directed acyclic graphical model, is a probabilistic graphical model that represents the conditional dependencies among a set of random variables by means of a directed acyclic graph (DAG). A DAG is a directed graph with no directed cycles: it consists of vertices and directed edges, each edge connecting one vertex to another, such that no cyclic route can occur. Figure 4 shows the DAG used in our application.

Fig. 4

The detailed BN structure used to measure visual fatigue in stereoscopy.
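
As a minimal sketch of the DAG property, the Python snippet below encodes one plausible reading of the structure in Fig. 4 and checks that it contains no directed cycle. The edge list is an assumption inferred from the node descriptions in the text (contextual nodes as parents of the fatigue node, observation nodes as its children); it is not a reproduction of the figure, and the names `EDGES` and `is_acyclic` are illustrative only.

```python
from collections import deque

# Hypothetical edge list: contextual nodes -> fatigue node -> observation nodes.
EDGES = [
    ("SQ", "Fatigue"), ("CR", "Fatigue"), ("EE", "Fatigue"),
    ("BD", "Fatigue"), ("DQ", "Fatigue"),
    ("Fatigue", "EEG"), ("Fatigue", "EM"),
]

def is_acyclic(edges):
    """Return True if the directed graph has no directed cycle (Kahn's algorithm)."""
    nodes = {n for edge in edges for n in edge}
    indegree = {n: 0 for n in nodes}
    children = {n: [] for n in nodes}
    for parent, child in edges:
        children[parent].append(child)
        indegree[child] += 1
    queue = deque(n for n in nodes if indegree[n] == 0)
    visited = 0
    while queue:
        node = queue.popleft()
        visited += 1
        for child in children[node]:
            indegree[child] -= 1
            if indegree[child] == 0:
                queue.append(child)
    # Every node is removed exactly once only when there is no cycle.
    return visited == len(nodes)

print(is_acyclic(EDGES))                     # the assumed structure
print(is_acyclic(EDGES + [("EEG", "SQ")]))   # a back edge closes a cycle
```

Kahn's algorithm is used here only because it is short; any topological-sort routine would serve the same purpose.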

The basic concept in the Bayesian treatment of certainties in causal networks is conditional probability. Whenever a statement of the probability $P (A)$ of an event $A$ is given, it is given conditioned on other known factors. Therefore, according to the feature vector mentioned above and conditional probability, the probability of estimated fatigue is obtained through Bayes’ theorem, as in Refs. 20 and 39:

Eq. (1)

$$P (Z = z | E) = \frac{P (Z = z | e^{c}) P (e^{o} | Z = z)}{\sum_{j = 1}^{2} P (Z = z_{j} | e^{c}) P (e^{o} | Z = z_{j})}$$

• $Z$ represents the fatigue node, and $z$ represents the fatigue state value.
• $E$ represents the evidence set $\{e^{c}, e^{o}\}$, where $e^{c}$ represents the contextual evidence and $e^{o}$ represents the observations.
• $P (Z = z | E)$ represents the posterior probability of $Z$ given $E$, and hence it is the new estimate of the probability that the hypothesis $Z$ is true, taking the evidence $E$ into account.
• $P (e^{o} | Z = z)$ represents the conditional probability of the observable evidence $e^{o}$ if the hypothesis $Z$ turns out to be true.
• $P (Z = z | e^{c})$ represents the prior probability of the hypothesis given the contextual evidence, before the observable evidence is taken into account.
• $\sum_{j = 1}^{2} P (Z = z_{j} | e^{c}) P (e^{o} | Z = z_{j})$ represents the marginal probability of the observable evidence, summed over all possible fatigue hypotheses; it normalizes the posterior.
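
The arithmetic of Eq. (1) can be sketched in a few lines of Python. This is an illustration under stated assumptions rather than the authors' implementation: the function name `posterior` and the numerical inputs are hypothetical and chosen only to show how the prior and the likelihood are multiplied and then normalized over the two fatigue states.

```python
def posterior(prior_given_context, likelihood_of_observation):
    """Eq. (1): normalize prior * likelihood over the fatigue states."""
    joint = {z: prior_given_context[z] * likelihood_of_observation[z]
             for z in prior_given_context}
    evidence = sum(joint.values())  # denominator of Eq. (1)
    if evidence == 0:
        raise ValueError("evidence has zero probability under every state")
    return {z: value / evidence for z, value in joint.items()}

# Hypothetical inputs, chosen only to illustrate the arithmetic.
prior = {"fatigue": 0.3, "normal": 0.7}        # P(Z = z | e^c)
likelihood = {"fatigue": 0.8, "normal": 0.1}   # P(e^o | Z = z)
result = posterior(prior, likelihood)
print(result)
```

With these made-up numbers the joint terms are 0.24 and 0.07, so the posterior probability of fatigue is 0.24/0.31, or roughly 0.77.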

3. BN-Based Visual Fatigue Measurement Implementation

To set up a fatigue recognition model based on the discrete BN, the first step is to specify the nodes of the discrete BN. In other words, we need to specify the contextual, contactless and contact physiological variables that are used to construct the discrete BN. The second step is to determine the values that are used to represent the discrete variables. The third step is to configure the states of the variables, to calculate the conditional probability, and to evaluate the visual fatigue in stereoscopy. In the following, these steps are described.

3.1. Specifying the Nodes of the Discrete Bayesian Networks

As remarked in Fig. 4, there are many contextual and physiological features related to fatigue. Among these features, some contribute more to fatigue than others. For the sake of simplicity but without any loss of generality, we select only those contextual and physiological features that have immediate relations with the fatigue measurement. In particular, the following features are described in step 1. For the contextual, hidden, and observable variables selected in Fig. 4, the fuzzy method is used to determine the discrete values of each variable based on a set of heuristic knowledge rules.⁴⁰

3.1.1. Stereoscopic contextual features node

Binocular disparity (BD) node. Lambooij et al.⁴¹ noted that the conflict between accommodation and vergence experienced by the human eye is what mostly affects visual fatigue in stereoscopy. Ohzawa et al.¹³ classified disparity as positive or negative. Kim and Cho⁷ suggested a simplified relative visual fatigue metric that considers the “accommodation and vergence” factors and can be calculated from the disparities in stereoscopy. We are motivated by Ohzawa et al.¹³ and Kim and Cho.⁷ As shown in Fig. 5, several sets of stereoscopic instances were provided to evaluate visual fatigue. Different sample images in the negative and positive disparity zones were shown in the experiment for 3-D fatigue measurement.

Fig. 5

(a) Test images to evaluate visual fatigue and (b) the graph of the mean opinion score results for various average converged-object disparities, with the comfort zone in stereoscopy, from Ref. 7. Note: The evaluation is based on five grades (1 to 5); 1: very comfortable, 2: comfortable, 3: a little uncomfortable, 4: uncomfortable, 5: very uncomfortable.

Display quality (DQ) node. As Michel et al.¹² described, with 3-DTV and 3-D cinema at the extremes of the screen size spectrum, comfort zone issues for stereoscopy differ when the two are used to present the same content. Resolution and luminance are also key elements of a display; for example, an unsuitable resolution or luminance also causes visual discomfort. However, among these features, the screen size has the most immediate relation with the DQ issues that concern us, as mentioned in Refs. 12 and 42. Therefore, the display size is taken as the main contextual feature corresponding to the DQ node.

3.1.2. Nonstereoscopic contextual features (NSCF) node

Sleeping quality (SQ) node. SQ is directly associated with fatigue.²⁹ Therefore, we take the SQ as a nonstereoscopic node in the BN DAG (Fig. 4). Gomarus et al.¹⁰ noted that the SQ is related to such quantities as the duration of sleep, difficulty in falling asleep at night, the sleeping environment, and so on. Among them, the sleeping time and the sleeping satisfaction were taken as the key contributors to the SQ, since a certain minimum sleep time is necessary for everyone and sleep satisfaction depends on the person’s subjective judgment.

Circadian rhythm (CR) node. CR is also a cardinal factor in fatigue measurement. Lal and Craig⁴³ identified that the CR plays an important role in the study of fatigue recognition. There are two sleep peaks each day: one appears after midnight, and the other appears approximately after lunch time. Humans are easily fatigued during these peak periods.

Experiment environment (EE) node. EE is the last factor selected in the proposed method. Light, noise, temperature, and other EE factors are strongly related to fatigue measurement, especially the influence of light on the viewer watching the screen. Therefore, we take the EE as a nonstereoscopic node in the BN graph.

3.1.3. Observation state node

EEG node. In the frequency domain, the EEG mainly includes the $δ$ band (0.5 to 4 Hz) corresponding to sleep activity, the $θ$ band (4 to 7 Hz) related to drowsiness, the $α$ band (8 to 13 Hz) corresponding to relaxation and creativity, and the $β$ band (13 to 25 Hz) corresponding to activity and alertness. Budi et al.²¹ note that the $β$ band is strongly related to visual fatigue. In the EEG tracing, the power of the $β$ frequencies increases as the watching duration increases, and it is much stronger in 3-D than in 2-D conditions, as shown in Fig. 6(a). Li et al.⁴⁴ identified that 3-D content affected the power of the brain waves in the $β$ frequency: the $β$ power was stronger when viewing 3-D content. Subjective results also showed stronger visual fatigue in the 3-D condition than in the 2-D condition. Therefore, we take the waveband magnitude of the EEG spectrum in the $β$ band as an observable variable node in the BN diagram.

Fig. 6

Physiological response: 3-D and 2-D compared, 3-D viewed first. ( $x$ -axis is time, $y$ -axis is magnitude).

EM node. EM-based visual fatigue measurement is related to quantities such as eye gaze, eye blink, and eyelid closure. These manifestations are described in Ref. 45 for fatigue detection. Zhu and Lan²² pointed out that EM is a reliable and valid indicator of fatigue. Reference 46 introduces the percentage of eyelid closure over the pupil in a given time (PERCLOS). It indicates that the viewer is possibly in a state of fatigue if the eyes are at least 80% closed during a period of 1 min. Thus, the proportion of eye-closed time was taken in this article as one of the observable variables corresponding to the nodes of the BN diagram.
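
A PERCLOS-style measure can be sketched as follows. The sketch assumes that an eye tracker supplies, for each sample in the window, the fraction of the pupil covered by the eyelid; the function name `perclos`, the sampling rate, and the sample values are hypothetical, and only the 80% closure threshold comes from the text.

```python
def perclos(closure_fractions, closed_threshold=0.8):
    """Fraction of samples in the window where the eyelid covers at least
    `closed_threshold` of the pupil."""
    if not closure_fractions:
        raise ValueError("need at least one sample")
    closed = sum(1 for c in closure_fractions if c >= closed_threshold)
    return closed / len(closure_fractions)

# Hypothetical 1-min window sampled once per second (60 samples):
# 45 samples with open eyes, 15 samples with nearly closed eyes.
window = [0.1] * 45 + [0.9] * 15
print(perclos(window))  # 15 of the 60 samples count as closed
```

The resulting proportion of eye-closed time is the kind of quantity that would then be discretized into the EM node states.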

3.2. Determining Discrete Variables in Each Node

The construction of a BN has two tasks: one is the determination of the nodes, and the other is the determination of the parent discrete variables and their states for each node. The previous step determined the related nodes. In this section (step 2), we describe the discrete variables and their states, which indicate the likelihood that a particular feature contributes to fatigue.

Visual fatigue node: $Z = [Z_{1}, Z_{2}]$ in which $Z_{1}$ and $Z_{2}$ represent the fatigue and no-fatigue states, respectively.

Contextual features node: $X = [X_{1}, X_{2}, X_{3}]$ represents the nonstereoscopic factor node state, in which $X_{1}$ , $X_{2}$ , and $X_{3}$ represent the sleep quality, CR and EE, respectively. Here, $X_{1} = [X_{11}, X_{12}]$ in which $X_{11}$ and $X_{12}$ represent the sleep parameters, including the sleep time and sleep satisfaction. $Y = [Y_{1}, Y_{2}]$ represents the stereoscopic factor node, in which $Y_{1}$ and $Y_{2}$ represent the binocular disparity and DQ, respectively.

Observation features node: $O = [O_{1}, O_{2}]$ represents the observation features node, in which $O_{1}$ represents the CLPF (e.g., EM), and $O_{2}$ represents the CPF (e.g., EEG).

As remarked in Fig. 4, $z_{k}$, $x_{i}^{j}$, $y_{i}^{j}$, and $o_{i}^{j}$ denote the specific values taken by $Z = [Z_{1}, Z_{2}]$, $X = [X_{1}, X_{2}, X_{3}]$, $Y = [Y_{1}, Y_{2}]$, and $O = [O_{1}, O_{2}]$, respectively. In Fig. 4 the variables, together with the directed edges, form the DAG. $P (x_{i}^{j})$ represents the probability of the sleep quality node states $\{x_{1}^{1} = \text{good}, x_{1}^{2} = \text{bad}\}$, CR node states $\{x_{2}^{1} = \text{active}, x_{2}^{2} = \text{drowsy}\}$, and EE node states $\{x_{3}^{1} = \text{comfortable}, x_{3}^{2} = \text{uncomfortable}\}$; $P (y_{i}^{j})$ represents the probability of the binocular disparity node states $\{y_{1}^{j} = \text{disparity zone}\}$ and DQ node states $\{y_{2}^{1} = \text{small}, y_{2}^{2} = \text{large}, y_{2}^{3} = \text{ex-large}\}$; $P (o_{i}^{j})$ represents the probability of the contact physiological node states (EEG node) $\{o_{2}^{1} = \text{decrease}, o_{2}^{2} = \text{no-change}, o_{2}^{3} = \text{increase}\}$ and contactless physiological node states (EM node) $\{o_{1}^{1} = \text{large}, o_{1}^{2} = \text{medium}, o_{1}^{3} = \text{small}\}$.

3.3. Calculating Bayesian Networks

Assume that the evidence from the contextual nodes is represented as $e^{X, Y} = \{e_{X Y}^{i j}\}$ and the evidence from the observable nodes as $e^{O} = \{e_{O}^{i j}\}$, where $e_{X Y}^{i j}$ represents the evidence of the $i$’th contextual node with the $j$’th state value ($x_{i}^{j}$ and $y_{i}^{j}$), and $e_{O}^{i j}$ represents the evidence of the $i$’th observable node with the $j$’th state value ($o_{i}^{j}$). The full evidence $e = \{e^{X, Y}, e^{O}\}$ combines the evidence from the contextual factor nodes and the observable feature nodes. In Eqs. (2) and (3), $P (Z = z_{k} | e^{X, Y})$ is the prior probability of visual fatigue $Z$ given the contextual evidence from its parent nodes, that is, before the observable evidence is taken into account, and $P (e^{O} | Z = z_{k})$ is the conditional probability of the observable evidence $e^{O}$ if the parent visual fatigue state $Z = z_{k}$ turns out to be true.

Then the conditional probability of $Z$ given the occurrence of the $e^{X Y}$ node can be written as in Ref. 39

Eq. (2)

$$P (Z = z_{k} | e^{X, Y}) \propto P (Z = z_{k} | e_{X}^{i, j}) P (Z = z_{k} | e_{Y}^{i, j}) = \left[\sum_{i = 1}^{2} \sum_{j = 1}^{2} \sum_{l = 1}^{2} P (Z = z_{k} | x_{1}^{i}, x_{2}^{j}, x_{3}^{l}) P (x_{1}^{i}) P (x_{2}^{j}) P (x_{3}^{l})\right] \times \left[\sum_{i = 1}^{17} \sum_{j = 1}^{3} P (Z = z_{k} | y_{1}^{i}, y_{2}^{j}) P (y_{1}^{i}) P (y_{2}^{j})\right], \quad k = 1, 2.$$

The conditional probability of $e^{O}$ given the occurrence of node $Z$ can be written as in Ref. 39

Eq. (3)

$$P (e^{O} | Z = z_{k}) \propto P (e_{O}^{1, j} | Z = z_{k}) P (e_{O}^{2, j} | Z = z_{k}) = \left[\sum_{m = 1}^{3} P (e_{O}^{1, j} | o_{1}^{m}) P (o_{1}^{m} | Z = z_{k})\right] \times \left[\sum_{n = 1}^{3} P (e_{O}^{2, j} | o_{2}^{n}) P (o_{2}^{n} | Z = z_{k})\right], \quad k = 1, 2 \text{ and } j = 1, 2, 3.$$

According to Bayes’ theorem,⁴ the conditional probability of node $Z$ given the evidence can be obtained by combining Eqs. (2) and (3); it can be written as in Ref. 39:

Eq. (4)

$$P (Z = z_{k} | e) = \frac{P (Z = z_{k} | e^{X, Y}) P (e^{O} | Z = z_{k})}{\sum_{i = 1}^{2} P (Z = z_{i} | e^{X, Y}) P (e^{O} | Z = z_{i})}, \quad k = 1, 2,$$

where $\sum_{i = 1}^{2} P (Z = z_{i} | e^{X, Y}) P (e^{O} | Z = z_{i})$ is the marginal probability, which is the prior probability under all possible hypotheses of visual fatigue $Z$.

4. Simulation Results and Discussion

In this work, to acquire the conditional probability information for each node, we draw on methods from previous studies. For example, the conditional probabilities for the BD and DQ nodes are obtained from Refs. 7 and 12, those for the CR, SQ, and EE nodes from Refs. 20, 22, 29, 32, and 47–50, and those for the EEG and EM nodes from Refs. 5, 8, and 20. However, some probabilities cannot be obtained directly from these studies, so we adopted similar acquisition methods based on our own experiments. For instance, the binocular disparity comfort judgment is based mainly on personal satisfaction, because visual sensing differs from person to person. Here, the contribution of subjective feeling (as in the MOS) is considered to be relatively high. To obtain this data set, we adopt a statistical analysis scheme based on Ref. 7. With these efforts, all probabilities in the BN model have been acquired, as shown in the following tables. Table 1 gives the conditional probability of the fatigue node for each BD node state, the main factor of visual fatigue in stereoscopy. Table 2 gives the conditional probability of visual fatigue for the states of CR, SQ, and EE. Table 3 gives the conditional probabilities of the EEG and EM states, respectively, given the visual fatigue state.

Table 1

Conditional probability for fatigue node with BD.

BD negative	P(Normal)	P(Fatigue)	BD positive	P(Normal)	P(Fatigue)
$-80$	0.05	0.95	0	0.98	0.02
$-70$	0.11	0.89	10	0.95	0.05
$-60$	0.38	0.62	20	0.94	0.06
$-50$	0.57	0.43	30	0.91	0.09
$-40$	0.69	0.31	40	0.91	0.09
$-30$	0.81	0.19	50	0.89	0.11
$-20$	0.87	0.13	60	0.86	0.14
$-10$	0.93	0.07	70	0.82	0.18
0	0.98	0.02	80	0.75	0.25
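
As a small illustration of how Table 1 might be used in code, the sketch below stores the fatigue column as a lookup keyed by disparity. The 17 distinct disparity values match the upper limit of the BD sum in Eq. (2). The linear interpolation between tabulated disparities is an added assumption for illustration, not something the paper specifies, and the names `P_FATIGUE_GIVEN_BD` and `fatigue_probability` are hypothetical.

```python
# P(fatigue | BD) copied from Table 1, keyed by disparity value.
P_FATIGUE_GIVEN_BD = {
    -80: 0.95, -70: 0.89, -60: 0.62, -50: 0.43, -40: 0.31,
    -30: 0.19, -20: 0.13, -10: 0.07, 0: 0.02,
    10: 0.05, 20: 0.06, 30: 0.09, 40: 0.09,
    50: 0.11, 60: 0.14, 70: 0.18, 80: 0.25,
}

def fatigue_probability(disparity):
    """Linearly interpolate P(fatigue | BD) between the tabulated disparities
    (interpolation is an illustrative assumption)."""
    keys = sorted(P_FATIGUE_GIVEN_BD)
    if not keys[0] <= disparity <= keys[-1]:
        raise ValueError("disparity outside the tabulated range")
    for low, high in zip(keys, keys[1:]):
        if low <= disparity <= high:
            weight = (disparity - low) / (high - low)
            return ((1 - weight) * P_FATIGUE_GIVEN_BD[low]
                    + weight * P_FATIGUE_GIVEN_BD[high])

print(len(P_FATIGUE_GIVEN_BD))    # number of distinct disparity states
print(fatigue_probability(-35))   # halfway between the -40 and -30 entries
```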

Table 2

Conditional probability for fatigue node with CR, SQ, and EE.

CR node	SQ node	EE node	P(Normal)	P(Fatigue)
Active	Good	Comfortable	0.95	0.05
Active	Good	Uncomfortable	0.85	0.15
Active	Bad	Comfortable	0.73	0.27
Active	Bad	Uncomfortable	0.49	0.51
Drowsy	Good	Comfortable	0.23	0.77
Drowsy	Good	Uncomfortable	0.12	0.88
Drowsy	Bad	Comfortable	0.11	0.89
Drowsy	Bad	Uncomfortable	0.02	0.98

Table 3

Conditional probabilities for EM and EEG given fatigue.

Fatigue node	EEG: Decrease	EEG: No-change	EEG: Increase	EM: Large	EM: Medium	EM: Small
Fatigue	0.90	0.08	0.02	0.94	0.05	0.01
Normal	0.02	0.08	0.90	0.01	0.05	0.94

With the help of the System Neuroscience Laboratory at Sungkyunkwan University, we obtained the EEG and EM data sets. We used an EM tracking system called Eyelink II, which measures at a temporal resolution of 500 Hz. Twenty students from Sungkyunkwan University volunteered to participate in the experiments. Each participant was asked to watch the test 3-D images at different disparities on a 3-DTV, and no break or rest was permitted during the 25 min experiment. Due to display limitations (our research focuses only on the 3-D-HDTV application), we could not include a variety of DQ conditions. The EEG and EM signals of each participant were collected at a rate of 1 sample/min. The results were then processed according to their statistical properties to form the evidence data sets needed to infer the viewer’s fatigue. For example, according to the statistical properties of the contactless physiological data from the participants, if the PERCLOS value of the EM is equal to 85, then $P (e_{O}^{1, 1}) = 0.89$, $P (e_{O}^{1, 2}) = 0.42$, and $P (e_{O}^{1, 3}) = 0.18$; for the contact physiological data, if the EEG signal indicates that the decrease of the $β$ rhythm is large, then $P (e_{O}^{2, 1}) = 0.90$, $P (e_{O}^{2, 2}) = 0.20$, and $P (e_{O}^{2, 3}) = 0.10$.
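
The observation term of Eq. (3) can be sketched with the numbers from Table 3 and the example evidence values quoted above. The sketch rests on one interpretive assumption: the three evidence values per sensor are read as the likelihoods $P (e | o^{m})$ of the measurement under each node state, in the column order of Table 3. The function and variable names are hypothetical.

```python
# P(o | Z) from Table 3; state order follows the table columns.
P_EM_GIVEN_Z = {"fatigue": [0.94, 0.05, 0.01],   # large, medium, small
                "normal":  [0.01, 0.05, 0.94]}
P_EEG_GIVEN_Z = {"fatigue": [0.90, 0.08, 0.02],  # decrease, no-change, increase
                 "normal":  [0.02, 0.08, 0.90]}

# Evidence values quoted in the text, read here as P(e | o^m) (an assumption).
EM_EVIDENCE = [0.89, 0.42, 0.18]
EEG_EVIDENCE = [0.90, 0.20, 0.10]

def evidence_likelihood(evidence, p_state_given_z):
    """One bracket of Eq. (3): sum over m of P(e | o^m) * P(o^m | Z = z_k)."""
    return sum(e * p for e, p in zip(evidence, p_state_given_z))

def observation_likelihood(z):
    """Unnormalized P(e^O | Z = z): product of the EM and EEG brackets."""
    return (evidence_likelihood(EM_EVIDENCE, P_EM_GIVEN_Z[z])
            * evidence_likelihood(EEG_EVIDENCE, P_EEG_GIVEN_Z[z]))

for state in ("fatigue", "normal"):
    print(state, round(observation_likelihood(state), 4))
```

Under this reading the observation term strongly favors the fatigue state, which is the expected direction for a high PERCLOS value combined with a large change in the $β$ rhythm.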

To obtain the probabilities for CR, SQ, and EE, we adopted a statistical analysis-based questionnaire that mainly concerned the CR, SQ, and EE states. The questionnaires were distributed among the twenty students before the simulation experiment. There are two groups of probabilities for CR and SQ. For the first group simulation, we required the 20 students, none of whom had any kind of sleep disorder, to maintain a relatively good SQ state before the test day, so the probabilities for SQ were $P (x_{1}^{1}) = 0.87$ and $P (x_{1}^{2}) = 0.13$. We asked the volunteers to participate in the simulation test from 8:30 to 11:30 AM, so the probabilities for CR were $P (x_{2}^{1}) = 0.85$ and $P (x_{2}^{2}) = 0.15$. For the second group simulation, some of the volunteers were deprived of a good sleep during the previous night (e.g., sleep time was less than 6 h), and we asked them to participate in the simulation test from 1:00 to 2:30 PM the next day. The probabilities for SQ and CR were then $P (x_{1}^{1}) = 0.37$, $P (x_{1}^{2}) = 0.63$, $P (x_{2}^{1}) = 0.25$, and $P (x_{2}^{2}) = 0.75$. In our experiment, the EE was relatively good, and the probabilities for EE were $P (x_{3}^{1}) = 0.80$ and $P (x_{3}^{2}) = 0.20$.
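
To make the role of these probabilities concrete, the following sketch evaluates the first bracket of Eq. (2), the nonstereoscopic part, by marginalizing the fatigue column of Table 2 over the CR, SQ, and EE state probabilities of the two groups. It is an illustration only: the stereoscopic bracket (BD and DQ) is left out, the final Drowsy/Uncomfortable row of Table 2 is read as SQ = Bad, and the names `P_FATIGUE` and `fatigue_from_context` are hypothetical.

```python
from itertools import product

# P(Z = fatigue | CR, SQ, EE) from Table 2.
P_FATIGUE = {
    ("active", "good", "comfortable"): 0.05,
    ("active", "good", "uncomfortable"): 0.15,
    ("active", "bad", "comfortable"): 0.27,
    ("active", "bad", "uncomfortable"): 0.51,
    ("drowsy", "good", "comfortable"): 0.77,
    ("drowsy", "good", "uncomfortable"): 0.88,
    ("drowsy", "bad", "comfortable"): 0.89,
    ("drowsy", "bad", "uncomfortable"): 0.98,
}

def fatigue_from_context(p_cr, p_sq, p_ee):
    """First bracket of Eq. (2): marginalize Table 2 over CR, SQ, and EE."""
    total = 0.0
    for (cr, p1), (sq, p2), (ee, p3) in product(p_cr.items(), p_sq.items(), p_ee.items()):
        total += P_FATIGUE[(cr, sq, ee)] * p1 * p2 * p3
    return total

EE = {"comfortable": 0.80, "uncomfortable": 0.20}
group1 = fatigue_from_context({"active": 0.85, "drowsy": 0.15},
                              {"good": 0.87, "bad": 0.13}, EE)
group2 = fatigue_from_context({"active": 0.25, "drowsy": 0.75},
                              {"good": 0.37, "bad": 0.63}, EE)
print(round(group1, 4), round(group2, 4))
```

With these inputs the context-only fatigue probability comes out at about 0.21 for the first group and about 0.71 for the second, which shows how strongly the SQ and CR states shift the prior before any stereoscopic or physiological evidence is considered.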

Some of the test images are shown in Fig. 5(a). We adopted pairwise comparison of different parallaxes in stereoscopy for a fair evaluation. Figure 5(b) plots the MOS results for various averages of the converged-object disparity. We obtained a relatively accurate measure of visual fatigue from the validated MOS evaluation in Ref. 7. MOS is a common evaluation method for stereoscopic visual fatigue. Therefore, we fitted a curve to these results to serve as the reference database for comparison in our simulation. From Fig. 5(b) we can observe that the comfortable zone lies between disparities of $-30$ and 70.

In Fig. 7(a), the measurement results are calculated for various converged-object disparities, based on the SQ, CR, and EE probabilities $P (x_{1}^{1}) = 0.87$, $P (x_{1}^{2}) = 0.13$, $P (x_{2}^{1}) = 0.85$, $P (x_{2}^{2}) = 0.15$, $P (x_{3}^{1}) = 0.80$, and $P (x_{3}^{2}) = 0.20$. In Fig. 7(b), the results are based on the different SQ and CR probabilities $P (x_{1}^{1}) = 0.37$, $P (x_{1}^{2}) = 0.63$, $P (x_{2}^{1}) = 0.25$, and $P (x_{2}^{2}) = 0.75$. From Fig. 7(b) we can observe that when the SQ and CR factors are in a worse state, the estimation shows a large deviation in the measured stereoscopic fatigue. To quantify this, we use the mean absolute error (MAE): $\mathrm{MAE}_{7(a)} = 0.0848$ and $\mathrm{MAE}_{7(b)} = 0.2782$. Thus, the measurement of visual fatigue in stereoscopy is influenced by other (nonstereoscopic) factors. If the nonstereoscopic contextual features are ignored, the measurement of visual fatigue in stereoscopy is unreliable, as the gap between the two MAE values shows.
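
For reference, the MAE is the average absolute difference between paired estimates and reference values. The sketch below shows the computation on made-up numbers; the two lists are hypothetical and are not the data behind Fig. 7, so the result is unrelated to the reported values of 0.0848 and 0.2782.

```python
def mean_absolute_error(predicted, reference):
    """Average of |predicted - reference| over paired samples."""
    if len(predicted) != len(reference) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(p - r) for p, r in zip(predicted, reference)) / len(predicted)

# Hypothetical fatigue scores on a common 0-to-1 scale (not data from the paper).
bn_estimate = [0.90, 0.60, 0.20, 0.05, 0.10]
mos_reference = [0.85, 0.50, 0.25, 0.05, 0.20]
print(mean_absolute_error(bn_estimate, mos_reference))
```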

Fig. 7

(a) Visual fatigue measurement results in stereoscopy based on the BN model with good sleeping quality (SQ) and circadian rhythm (CR) states; and (b) with relatively bad SQ and CR states.
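The MAE values quoted for Fig. 7 can be computed as in the sketch below, which assumes the error is taken between the BN output and the MOS-fitted reference at the same set of disparities. The function name and the sample arrays are hypothetical and are not the paper's data; they only show the calculation.

```python
def mean_absolute_error(predicted, reference):
    """Average of |predicted_i - reference_i| over paired samples."""
    if len(predicted) != len(reference) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(p - r) for p, r in zip(predicted, reference)) / len(predicted)


# HYPOTHETICAL values for illustration only (not measurements from the paper):
# fatigue levels at four disparities, normalized to [0, 1].
mos_reference = [0.10, 0.20, 0.40, 0.70]
bn_good_state = [0.15, 0.25, 0.35, 0.80]  # stands in for a Fig. 7(a)-style run
bn_bad_state = [0.40, 0.50, 0.70, 0.95]   # stands in for a Fig. 7(b)-style run

if __name__ == "__main__":
    print("MAE, good SQ/CR:", mean_absolute_error(bn_good_state, mos_reference))
    print("MAE, bad SQ/CR: ", mean_absolute_error(bn_bad_state, mos_reference))
```

A larger MAE for the bad-state run, as reported for Fig. 7(b), means the inferred fatigue sits further from the reference on average across the tested disparities.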

5. Conclusion

We proposed a BN-based measurement model for stereoscopic visual fatigue estimation. Two important conclusions can be drawn from this study. (1) Multiple features, including the stereoscopic contextual features, nonstereoscopic contextual features, CPFs, and CLPFs, were used to infer the viewer’s fatigue, providing wide coverage of the feature categories. Covering more fatigue-related nodes in the BN helps to infer fatigue more reliably and accurately. In particular, most previous studies have ignored the influence of condition variables such as CR, SQ, and EE. (2) The CLPFs and CPFs are two important observation features for fatigue recognition. The test validation indicates that visual fatigue in stereoscopy can be accurately measured based on the EM and EEG model. It would be of significant interest to extend the current measurement model to handle more practical situations across various 3-D devices. We are also interested in how to improve the handling of subjective factors when determining the probabilities.

Acknowledgments

This work was supported by the Ministry of Trade, Industry and Energy (MOTIE) Foundation of the World-Class300 Project: Development of automated manufacturing robot system technology integrating the 6 degree of freedom (DOF) robot mechanism and the S/W platform for assembling mobile information technology (IT) products (10043213).

References

1. D. M. Hoffman et al., “Vergence-accommodation conflicts hinder visual performance and cause visual fatigue,” J. Vision, 8(3), 1–30 (2008). http://dx.doi.org/10.1167/8.3.33

2. K. Ukai and P. Howarth, “Visual fatigue caused by viewing stereoscopic motion images: background, theories, and observations,” Displays, 29(2), 106–116 (2008). http://dx.doi.org/10.1016/j.displa.2007.09.004

3. S. Yano et al., A Study of Visual Fatigue and Visual Comfort for 3D HDTV/HDTV Images, Displays, 191–201, Elsevier, Amsterdam (2002).

4. H. C. O. Li et al., “Method of measuring subjective 3D visual fatigue: a five-factor model,” in Digital Holography 2008, (2008).

5. J. H. Yu, B. H. Lee, and D. H. Kim, “EOG based eye movement measure of visual fatigue cause by 2D and 3D displays,” in IEEE-EMBS Int. Conf. Biomedical and Health Informatics (BHI2012), 305–308 (2012).

6. J. S. Choi et al., “Visual fatigue modeling and analysis for stereoscopic video,” Opt. Eng., 51(1), 017206 (2012). http://dx.doi.org/10.1117/1.OE.51.1.017206

7. J. G. Kim and J. D. Cho, “Simplified relative model to measure visual fatigue in a stereoscopy,” in IEICE Trans. Fundamentals of Electronics, Communications and Computer Sciences, 2830–2831 (2011).

8. D. Y. Kim et al., “Stereoscopic visual fatigue measurement based on fusional response curve and eye-blinks,” in 17th Int. Conf. Digital Signal Processing (DSP), 1–6 (2011).

9. H. B. Chae et al., “Three-dimensional display system using a variable parallax barrier and eye tracking,” Opt. Eng., 50(8), 087401 (2011). http://dx.doi.org/10.1117/1.3607962

10. K. Gomarus et al., “The effects of memory load and stimulus relevance on the EEG during a visual selective memory search task: an ERP and ERD/ERS study,” Clin. Neurophysiol., 117(4), 871–884 (2006). http://dx.doi.org/10.1016/j.clinph.2005.12.008

11. G. Fang et al., “NeuroGlasses: a neural sensing healthCare system for 3D vision technology,” IEEE Trans. Inf. Technol. Biomed., 6(2), 1–7 (2011).

12. B. Michel, “Production issues with 3D content targeting cinema, TV, and mobile devices,” European digital cinema forum, (2009). http://www.edcf.net/3d.html

13. I. Ohzawa, G. Deangelis, and R. Freeman, “Stereoscopic depth discrimination in the visual cortex: neurons ideally suited as disparity detectors,” Science, 249(4972), 1037–1041 (1990). http://dx.doi.org/10.1126/science.2396096

14. L. F. Hodges and D. F. McAllister, “Stereo and alternating-pair techniques for display of computer-generated images,” IEEE Comput. Graphics Appl., 5(9), 38–45 (1985). http://dx.doi.org/10.1109/MCG.1985.276523

15. J. H. Choi et al., “Visual comfort measurement for 2D/3D converted stereo video sequence,” in 3DTV-Conference: The True Vision Capture, Transmission and Display of 3D Video (3DTV-CON), 1–4 (2012).

16. J. G. Kim and J. D. Cho, “Optimizing a virtual re-convergence system to reduce visual fatigue in stereoscopic camera,” IEICE Trans. Inf. Syst., E95D(5), 1238–1247 (2012). http://dx.doi.org/10.1587/transinf.E95.D.1238

17. J. G. Kim et al., “A real-time virtual re-convergence hardware platform,” J. Semicond. Technol. Sci., 12(2), 127–138 (2012).

18. T. Yilmaz and U. Gudukbay, “Stereoscopic urban visualization based on graphics processor unit,” Opt. Eng., 47(9), 097005 (2008). http://dx.doi.org/10.1117/1.2978948

19. U. Gudukbay and T. Yilmaz, “Stereoscopic view-dependent visualization of terrain height fields,” IEEE Trans. Visualization Comput. Graphics, 8(4), 330–345 (2002). http://dx.doi.org/10.1109/TVCG.2002.1044519

20. G. S. Yang, Y. Z. Lin, and P. Bhattaharya, “A driver fatigue recognition model based on information fusion and dynamic Bayesian network,” Inf. Sci., 180(10), 1942–1954 (2010). http://dx.doi.org/10.1016/j.ins.2010.01.011

21. T. J. Budi et al., “Using EEG spectral components to assess algorithms for detecting fatigue,” Expert Syst. Appl., 36(2), 2352–2359 (2009). http://dx.doi.org/10.1016/j.eswa.2007.12.043

22. Q. Ji, Z. Zhu, and P. Lan, “Real-time nonintrusive monitoring and prediction of driver fatigue,” IEEE Trans. Veh. Technol., 53(4), 1052–1068 (2004). http://dx.doi.org/10.1109/TVT.2004.830974

23. W. Horng et al., “Driver fatigue detection based on the eye tracking and dynamic template matching,” in Proc. IEEE Int. Conf. Networking, Sensing and Control Taiwan, 7–12 (2004).

24. D. Kim, Z. Bie, and K. Park, “Fuzzy neural network-based approach for personal facial expression recognition with novel feature selection method,” in Proc. 12th IEEE Int. Conf. Fuzzy System, 908–913 (2003).

25. K. L. S. Lal et al., “Development of an algorithm for an EEG-based driver fatigue countermeasure,” J. Safety Res., 34(3), 321–328 (2003). http://dx.doi.org/10.1016/S0022-4375(03)00027-6

26. T. P. Jung et al., “Estimating alertness from the EEG power spectrum,” IEEE Trans. Biomed. Eng., 44(1), 60–69 (1997). http://dx.doi.org/10.1109/10.553713

27. B. J. Wilson and T. D. Bracewell, “Alertness monitor using neural networks from EEG analysis,” in Proceedings of the 2000 IEEE Signal Processing Society Workshop on Neural Networks for Signal Processing X, 814–820 (2000).

28. T. H. Linh, M. Stodolski, and S. Osowski, “On-line heart beat recognition using Hermite polynomials and neuro-fuzzy network,” IEEE Trans. Instrum. Meas., 52(4), 1224–1231 (2003). http://dx.doi.org/10.1109/TIM.2003.816841

29. O. G. Tal and S. David, “Driver fatigue among military truck drivers,” Transp. Res. Part F, 3(4), 195–209 (2000). http://dx.doi.org/10.1016/S1369-8478(01)00004-3

30. C. Conati, “Probabilistic assessment of user’s emotions in educational games,” Appl. Artif. Intell., 16(7–8), 555–575 (2002). http://dx.doi.org/10.1080/08839510290030390

31. H. Chen and P. Meer, “Robust fusion of uncertain information,” IEEE Trans. Syst. Man Cybernet. Part B, 35(3), 578–586 (2005). http://dx.doi.org/10.1109/TSMCB.2005.846659

32. R. W. Picard, E. Vyzas, and J. A. Healey, “Toward machine emotional intelligence: analysis of affective physiological state,” IEEE Trans. Pattern Anal. Mach. Intell., 23(10), 1175–1191 (2001). http://dx.doi.org/10.1109/34.954607

33. P. Vysoky, “Changes in car driver dynamics caused by fatigue,” Neural Network World, 14(1), 109–117 (2004).

34. Q. Ji, P. Lan, and C. Looney, “A probabilistic framework for modeling and real-time monitoring human fatigue,” IEEE Trans. Syst. Man Cybernet. Part A, 36(5), 862–875 (2006). http://dx.doi.org/10.1109/TSMCA.2005.855922

35. N. Vagias, “A Bayesian Network Application for the Prediction of Human Fatigue in the Maritime Industry,” (2010).

36. Nikolaos et al., “Human fatigue: evaluation with the usage of Bayesian networks,” Computational Intelligence Systems in Industrial Engineering, 651–676, Springer (2012).

37. D. W. Hubbard, How to Measure Anything, 2nd ed., Tantor Audio, New Jersey (2010).

38. J. Pearl, Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference, 2nd ed., Morgan Kaufmann, San Francisco (1988).

39. R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification, 2nd ed., Wiley, New York (2001).

40. J. C. Principe, S. K. Gala, and T. G. Chang, “Sleep staging automaton based on the theory of evidence,” IEEE Trans. Biomed. Eng., 36(5), 503–509 (1989). http://dx.doi.org/10.1109/10.24251

41. M. Lambooij, W. IJsselsteijn, and I. Heynderickx, “Visual discomfort and visual fatigue of stereoscopic displays: a review,” J. Imaging Sci. Technol., 53(3), 030201 (2009). http://dx.doi.org/10.2352/J.ImagingSci.Technol.2009.53.3.030201

42. C. W. Tyler, “Spatial limitations of human stereoscopic vision,” in 21st Annual Technical Symposium, 36–42 (1977). http://dx.doi.org/10.1117/12.955731

43. K. L. S. Lal and A. Craig, “A critical review of the psychophysiology of driver fatigue,” Biol. Psychol., 55(3), 173–194 (2001). http://dx.doi.org/10.1016/S0301-0511(00)00085-5

44. H. C. O. Li et al., “Measurement of 3D Visual Fatigue Using Event-Related Potential (ERP): 3D Oddball Paradigm,” in 3DTV Conference: The True Vision Capture, Transmission and Display of 3D Video, 213–216 (2008).

45. Y. Lin, W. J. Zhang, and L. G. Watson, “Using eye movement parameters for evaluating human–machine interface frameworks under normal control operation and fault detection situations,” Int. J. Human Computer Stud., 59(6), 837–873 (2003). http://dx.doi.org/10.1016/S1071-5819(03)00122-8

46. W. W. Wierwille et al., “Research on vehicle-based driver status/performance monitoring: development, validation, and refinement of algorithms for detection of driver drowsiness,” (1994).

47. X. Li and Q. Ji, “Active affective State detection and user assistance with dynamic Bayesian networks,” IEEE Trans. Syst. Man Cybernet. Part A, 35(1), 93–105 (2005). http://dx.doi.org/10.1109/TSMCA.2004.838454

48. J. A. Healey, “Wearable and Automotive Systems for Affective Recognition from Physiology,” Massachusetts Institute of Technology, (2000).

49. T. Pierre and B. Jacques, “Monotony of road environment and driver fatigue: a simulator study,” Accid. Anal. Prev., 35, 381–391 (2003). http://dx.doi.org/10.1016/S0001-4575(02)00014-3

50. Y. Zhang and Q. Ji, “Active and dynamic information fusion for facial expression understanding from image sequences,” IEEE Trans. Pattern Anal. Mach. Intell., 27(5), 699–714 (2005). http://dx.doi.org/10.1109/TPAMI.2005.93

Biography

Zhongyun Yuan received his BS and MS degrees from the Department of Electronic and Electrical Engineering, North University of China, in 2005 and 2008, respectively. He is currently pursuing a PhD degree in the Department of Electrical and Computer Engineering, Sungkyunkwan University (SKKU), Suwon, Korea. His interests include measurement, 3-D vision, data compression, data acquisition, and compressive sampling.

Jong Hak Kim received a BS degree in radio communication engineering from Kyunghee University, Suwon, Korea, in 2009, and an MS degree from the Department of Electrical and Computer Engineering, Sungkyunkwan University, in 2012. He is studying for a PhD degree at Sungkyunkwan University. He is interested in efficient low-power and real-time processing systems for mobile equipment and currently studies image processing algorithms and hardware implementation for stereo systems.

Jun Dong Cho received a BS degree in electronic engineering from Sungkyunkwan University, Seoul, Korea, in 1980, an MS degree from Polytechnic University, Brooklyn, NY, in 1989, and a PhD degree from Northwestern University, Evanston, IL, in 1993, both in computer science. He was a senior CAD engineer at Samsung Electronics, Co., Ltd. He is now a professor in the Department of Electronic Engineering, Sungkyunkwan University, Korea. He was a visiting scientist at the IBM T.J. Watson Research Center from 2000 to 2001. He has been an IEEE senior member since April 1996. His research interests are in the area of VLSI/SoC CAD and low-power design of multimedia and communication.

CC BY: © The Authors. Published by SPIE under a Creative Commons Attribution 4.0 Unported License. Distribution or reproduction of this work in whole or in part requires full attribution of the original publication, including its DOI.

Zhongyun Yuan, Jong Hak Kim, and Jun Dong Cho "Visual fatigue measurement model in stereoscopy based on Bayesian network," Optical Engineering 52(8), 083110 (28 August 2013). https://doi.org/10.1117/1.OE.52.8.083110

Published: 28 August 2013
