Visually Induced Motion Sickness in Stereoscopic 3D

VIMS is evoked by conflicting motion sensory signals within the brain. Quantifying one’s VIMS experience during an experiment using the simulator sickness questionnaire (SSQ) or postural stability measures only measure the changes between pre- and post-experiment. The motion sickness susceptibility questionnaire (MSSQ) is widely used to provide individual’s sensitivity to motion sickness, but its applicability to the VIMS has not been proven. We are introducing a novel VIMS susceptibility measure from VIMS level rating during the experiment by combining measures of the subject’s ‘sensitivity’ and ‘endurance’ to the VIMS.

Motion in a distorted virtual 3D space may cause visually induced motion sickness. Geometric distortions in stereoscopic 3D can be caused by mismatches between image capture and display parameters. The geometric model is expressed as a function of camera convergence distance and the ratios of the three parameter pairs. We analyze the impacts of these four variables separately and their interactions on geometric distortions. Our model facilitates insights into the various distortions and leads us to a method whereby the user can correct geometric distortions by adjusting some variables to alter the effects of others.

Stereoscopic 3D (S3D) geometric distortions can be introduced by mismatches among image capture, display, and viewing configurations. In this paper, we combine geometric models and retinal disparity models to analyze geometric distortions from the viewer’s perspective where both monocular and binocular depth cues are considered. Results show that binocular and monocular depth-cue conflicts in a geometrically distorted S3D space. Moreover, user-initiated head translations averting from the optimal viewing position in conventional S3D displays can also introduce geometric distortions, which are inconsistent with our natural 3D viewing condition. The inconsistency of depth cues in a dynamic scene may be a source of visually induced motions sickness.


Geometric models are tools used to analyze geometric distortions in S3D viewing. In this paper, we extend standard geometric models to cover the conditions with vertical screen disparity. To test our models, we conduct experiments using random-dot stereograms (RDS) with vertical screen disparities introduced by rotating RDS images. This paper introduces a retinal eccentricity model to analyze geometric distortions from the viewer’s perspective, which indicates that the geometric distortions only affect the binocular depth cue and not monocular depth cues. The monocular-binocular depth cue conflicts with motions may induce motion sickness in S3D viewing.

Psychovisual Modulation Display Technology

Temporal psychovisual modulation (TPVM) is a new information display technology that aims to generate multiple visual precepts for different viewers on a single display simultaneously. In this paper, we proposed to solve the constrained NMF problem by a modified version of hierarchical alternating least squares (HALS) algorithms. Through experiments, we analyze the choice of parameters in the setup of TPVM system. This work serves as a guideline for practical implementation of TPVM display system.

Dual-view display is a technology that provides two different views concurrently for different users on a single medium. We propose a dual-view display system where users can see one view through a pair of specific glasses (called personal view) and see another view without the glasses (called shared view). In this paper, we introduce heuristic and iterative algorithms for the dual-view display. The iterative algorithm utilizes the Gaussian-like spatial integration window of human eyes. Compared to the heuristic algorithm, the iterative algorithm has significant improvements for the shared view with a preference in percentage of 90.2% on average. This paper serves as guidance for dual-view-based applications. Moreover, the method discussed in this paper can be also used to explore the temporal redundancy of displays.

QR (Quick Response) Codes are widely used as a convenient unidirectional communication channel to convey information, such as emails, hyperlinks, or phone numbers, from publicity materials to mobile devices. But the QR Code is not visually appealing and takes up valuable space for publicity materials. In this paper, we propose a new method to embed QR Code on digital screens via temporal psychovisual modulation (TPVM). We make QR Code perceptually transparent to humans but detectable for mobile devices. Based on the idea of invisible QR Code, many applications can be implemented, e.g., "physical hyperlink" for something interesting on TV or digital signage , "invisible watermark" for anti-piracy in theaters.

Nonnegative Matrix Factorization (NMF), which decomposes a target matrix in- to the product of two matrices with nonnegative elements, has been widely used in various fields of signal processing. In visual signal processing, the spatially nonuniformed distribution of perceptually meaningful information in image and video frames calls for a kind of Spatially-Weighted NMF (swNMF) that applies location-dependent weights into the decomposition problem. In this paper, we introduce swNMF solution based on the hierarchical alternating least squares (HALS) approach. Then we exemplify its application to a new information display diagram named temporal psychovisual modulation (TPVM) with a comparison with traditional HALS method and baseline algorithm of multiplicative update (MU).

Current display technologies have achieved high refresh rates and resolutions. However, spatial and temporal discrimination abilities of human eyes are limited. Some displays have spatial or temporal resolutions that are beyond the spatial and temporal discrimination thresholds of human visual systems (HVS). Those displays have psychovisual redundancy in spatial or temporal domains, thus extra capacity in the displays can be used. Psychovisual modulation exploits spatial and temporal redundancies of modern displays. There are mainly two ways to utilize the extra capacity. First, the psychovisual redundancy of display device can be used to generate multiple visual percepts for different viewers. Second, the redundancy can be used for data transmission to computing devices (i.e., visible light communication from display devices to computing devices). Thus, a unidirectional communication channel from display device to computing device can be widely used in many fields.