Emerging applications such as wireless visual sensor networks (WVSN) and wireless video surveillance are requiring lightweight video encoding with high coding efficiency and error-resilience. The WVSNs have a range of applications including surveillance networks, health care systems, and monitoring systems. The surveillance visual sensors combined with signal processing and computer vision techniques can be used to locate criminals, terrorists, or accidents. The sensor networks can be integrated with other multimedia networks to provide health care services. Remote medical centers are able to perform advanced remote monitoring of their patients via multimedia sensors with remote assistance services. The wireless capsule endoscopy provides visual recordings inside the human body for diagnosis and monitoring. The WVSN is possibly a part of advance health informatics challenge, which is one of the grand challenges enabling a new system of distributed tools to collect medical data. In addition, the monitoring systems using visual sensors are used to monitor natural environment, health of human-made structures, e.g bridges, building, ships, etc., and disasters. Multimedia sensors can be used to monitor and control the industrial processes and systems in critical conditions.
The WVSNs are challenged by requiring advanced video coding and processing techniques in the energy-constrained wireless communications. One of the main design objectives of the WVSNs is a local (on-board) coding and processing technique with high compression efficiency, low-complexity, and error-resilience. The WVSNs also require real-time performance for the process extracting visual information from physical environments (by cameras) to transmit it to control centers (by users). Thus, most camera sensors have embedded processors that only support lightweight processing algorithms. Distributed Video Coding (DVC) is a new coding paradigm which exploits the source statistics at the decoder side offering such benefits for these applications. Although there have been some advanced improvement techniques, improving the DVC coding efficiency is still challenging.
The project addresses this challenge by proposing several iterative algorithms at different working levels, e.g. bitplane, band, and frame levels. The first proposed algorithm applies parallel iterative decoding using multiple LDPC decoders to utilize cross bitplane correlation. To improve Side Information (SI) generation and noise modeling and also learn from the previous decoded Wyner-Ziv (WZ) frames, side information and noise learning (SING) is proposed. The SING scheme introduces an optical flow technique to compensate the weaknesses of the block based SI generation and also utilizes clustering of DCT blocks to capture cross band correlation and increase local adaptivity in noise modeling. During decoding, the updated information is used to iteratively reestimate the motion and reconstruction in the proposed motion and reconstruction reestimation (MORE) scheme. The MORE scheme not only reestimates the motion vectors for improving SI and noise modeling but also compensates the residual motion based on the previously decoded WZ frames. Furthermore, the MORE codec enhances the reconstruction by proposing a generalized reconstruction algorithm to optimize reconstructing with multiple competitive SIs. Finally, an adaptive mode decision is investigated to take advantage of skip and intra mode in DVC by deciding the coding modes based on the quality of key frames and rate of WZ frames. Overall, the proposed algorithms significantly improve the coding efficiency of DVC contributing valuable solutions for the emerging applications.
References
Copyright notice: Copyright of the above material belongs to the author. The material may not be copied or re-used without permission from the author.