Xiang Xiang: Image-set, Temporal and Spatiotemporal Representations of Videos for Recognizing, Localizing and Quantifying Actions. Johns Hopkins University Ph.D. Dissertation, Baltimore, USA, June 2018. [pdf][talk-a][talk-b][official page at JHU Sheridan libraries][official announcement at JHU CS Dept][semanticscholar]
Xiang Xiang: Object Tracking and Segmentation in Dynamic Scenes: University of Chinese Academy of Sciences M.S. Dissertation, Beijing, China, June 2012. [PDF][bib]
H. Wang, Xiang Xiang, K. Zon. Subject Identification Systems & Methods. US 10832035B2, EP 3642757A1, CN 111095264A, WO 2018234542A1, JP 2020524850A.
[CVML]: for generic Computer Vision & Machine Learning; [CVHI]: for Computer Vision in Health Informatics; [MISC] for miscellaneous applications[CVML] Xiang Xiang, F. Wang, Y. Tan, A. L. Yuille: Imbalanced Regression for Intensity Series of Pain Expression from Videos by Regularizing Spatio-Temporal Face Nets. Pattern Recognition Letters, 2022.
[MISC] Xiang Xiang: Bootstrapping Autonomous Lane Changes with Self-Supervised Augmented Runs. In ECCV workshops, 2022.
[CVML] S. Li, Xiang Xiang: Lightweight Human Pose Estimation Using Heatmap-Weighting Loss. In ICPR workshops, 2022. Best Paper Award (Honorable Mention)
[MISC] X. Wei, R. Qiu, H. Yu, Y. Yang, H. Tian, Xiang Xiang: Entropy-based Optimization via A* Algorithm for Parking Space Recommendation. International Conference on Traffic Engineering and Transportation System, Chongqing, China, 2021. [PDF]
[CVML]: Xiang Xiang*, Z. Wang, S. Lao, B. Zhang*: Pruning Multi-view Stereo Net for Efficient 3D Reconstruction. ISPRS Journal of Photogrammetry and Remote Sensing, vol. 168, pp. 17-27, Elsevier, Oct. 2020 (*correspondence co-author). [Official][Preprint]
[CVML] J. Huang, Z. Huang, Xiang Xiang, X. Gong, B. Zhang: Long-Short Graph Memory Network for Skeleton-based Action Recognition. In IEEE Winter Conference on Applications of Computer Vision 2020 (WACV 2020), Aspen, USA. [PDF][Code][Slides]
[CVHI] Xiang Xiang: Beyond Deep Feature Averaging: Sampling Videos Towards Practical Facial Pain Recognition. Full paper at IEEE Conference on Computer Vision and Pattern Recognition 2019 (CVPR 2019) workshop on Face and Gesture Analysis for Health Informatics, Long Beach, USA. [PDF][thecvf][upmc.fr]
[CVML] Xiang Xiang and Trac D. Tran: Linear Disentangled Representation Learning for Facial Actions. IEEE Transactions on Circuits and System for Video Technology (IEEE T-CSVT), Volume: 28, Issue: 12, 2018. [ieee][arxiv][github][Modeling Work (new perspective): video-based sparsity + PCP]
[CVML] Xiang Xiang*, Ye Tian*, Austin Reiter, Gregory D. Hager, Trac D. Tran: S3D: Stacking Segmental P3D for Action Quality Assessment. Full paper IEEE International Conference on Image Processing 2018 (ICIP 2018), Athens, Greece. (* Equal contribution.) [ieee][sigport][researchgate][github][linkedin][grant proposal][followup works at cvpr19]
[CVHI] Xiang Xiang: Effect of Spatial Alignment in Cataract Surgical Phase Recognition. Abstract paper at IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2018) workshop on Fine-Grained Instructional Video Understanding, Salt Lake City, USA. [pdf][umich][poster]
[CVHI] Wentao Zhu, Xiang Xiang, Trac D. Tran, Gregory D. Hager and Xiaohui Xie: Adversarial Deep Structural Networks for Mammographic Mass Segmentation. Full paper at IEEE International Symposium on Biomedical Imaging 2018 (ISBI 2018), Washington DC, USA. [arxiv] [github][linkedin]
[CVHI] Xiang Xiang, Wentao Zhu, Trac D. Tran, Gregory D. Hager: Survey on Multi-Scale CNNs for Lung Nodule Detection. Abstract paper at IEEE International Symposium on Biomedical Imaging 2018 (ISBI 2018), Washington DC, USA.
[CVHC] Xiang Xiang*, Ye Tian*, Gregory D. Hager, Trac D. Tran: Assessing Pain Levels from Videos Using Temporal Convolutional Networks. Abstract paper at IEEE Winter Conf. on Applications of Computer Vision 2018 (WACV 2018) workshop on Computer Vision for Active and Assisted Living , Lake Tahoe, USA. (* Equal contribution.) [youtube]
[CVML] Feng Wang, Xiang Xiang, Jian Cheng, Alan L. Yuille. NormFace: L2 Hypersphere Embedding for Face Verification. Full long paper at ACM MultiMedia (MM) Conference 2017, Mountain View, USA. [arxiv][github][researchgate][comment][linkedin][csdn in Chinese][Theoretic work (3 new propositions): the norm of weights and features matter for nonlinear representation; although demonstrated on faces, this is a generic deep learning paper showing that when L2 norm is applied prior to softmax, the networks dont converge. The novel design is to add a scale layer that is learnt to help the network converge. Theoretical results are shown then lower bounds of loss with respect to the scale parameters]
[CVHI] Feng Wang, Xiang Xiang*, Chang Liu, Trac D. Tran, Austin Reiter, Gregory D. Hager, Jian Cheng and Alan L. Yuille. Regularizing Face Verification Nets for Pain Intensity Regression. Full paper at IEEE International Conference on Image Processing 2017 (ICIP 2017), Beijing, China. (*corresponding author. Oral presentation.) [ieee][researchgate][arxiv] [github][slides][linkedin][Modeling work (new objective): fine-tuning a face net with a regression loss regularized by a classification loss to induce discrete values]
[CVML] Hao Zhu, Feng Wang, Xiang Xiang and Trac D. Tran. Supervised Hashing with Jointly Learning Embedding and Quantization. Full paper at IEEE International Conference on Image Processing 2017 (ICIP 2017), Beijing, China. [ieee][researchgate][linkedin][Modeling work (new objective): relaxed optimization for image retrieval]
[CVML] Xiang Xiang, Dung N. Tran, Trac D. Tran. Sparse Unsupervised Clustering with Mixture Observations for Video Summarization. Abstract paper at IEEE Applied Imagery Pattern Recognition Workshop (AIPR) 2017, Washington DC, USA. [ieee][researchgate]
[CVML&HI] Xiang Xiang and Trac D. Tran: Pose-Selective Max Pooling for Measuring Similarity. Workshop paper at IAPR International Conference on Pattern Recognition (ICPR) 2016 workshops, Cancun, Mexico. LNCS, vol. 10165 (Video Analytics), ISBN: 978-3-319-56686-3. [springer][github][researchgate][arxiv][aau][linkedin][google][Algorithmic work (2 new algorithms): video-based keyframe + deepface]
[CVML] Xiang Xiang and Trac D. Tran: Recursively Measured Action Units. Workshop paper at IAPR International Conference on Pattern Recognition (ICPR) 2016 workshops, Cancun, Mexico. LNAI, vol. 10183 (Pattern Recognition of Social Signals), 978-3-319-59258-9. [researchgate][books.google][springer][Algorithmic work (new algorithm): video-based LSTM + SMP]
[CVML] Xiang Xiang, Minh Dao, Gregory D. Hager, Trac D. Tran: Hierarchical Sparse and Collaborative Low-Rank Representation for Emotion Recognition. Full paper at IEEE International Conference on Acoustics, Speech, and Signal Processing 2015 (ICASSP 2015), Brisbane, Australia. ISBN: 978-1-4673-6997-8. [ieee][arxiv][github] [mathworks] [youtube] [elsevier][Modeling work (2 new objectives): video-based sparsity + PCP]
[CVHI] Xiang Xiang, Daniel Mirota, Austin Reiter, Gregory D. Hager: Is Multi-Model Feature Matching Better for Endoscopic Motion Estimation? Workshop paper at International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) 2014 workshops, Boston, USA. LNCS, vol. 8899 (Comp.-Ass. & Rob. End.), 2014, ISBN: 978-3-319-13409-3. [springer][nih][researchgate][code][google][Empirical work: video-based 3D vision]
[CVML] Xiang Xiang, Hong Chang, Jiebo Luo: Online Web-Data-Driven Segmentation of Selected Moving Objects in Videos. Full paper at Asian Conference on Computer Vision 2012 (ACCV 2012): 134-146, Daejeon, Korean. LNCS, vol. 7725 (ACCV), 2013, ISBN: 978-3-642-37443-2. [demo: over 10 seconds!!!][springer][acm][pdf][researchgate][github][youtube] [dataset][google][cvpapers][visionbib][slides][Modeling work (new objective for segmentation + new algorithm for tracking): video-based MIL tracking + Graph cuts]
[CVHI] Xiang Xiang: A Localization Framework under Non-rigid Deformation for Rob. Surg.. Full paper at International Symposium on Visual Computing (ISVC) 2011: 11-22, Lake Tahoe, USA. LNCS, vol. 6938 (Advances in Visual Computing), 2011, ISBN: 978-3-642-24027-0. [springer][acm][linkedin][researchgate][google][visionbib][Modeling work (new formulation): geometry based registration]
[CVML] Xiang Xiang: An Attempt to Segment Foreground in Dynamic Scenes. Full paper at International Symposium on Visual Computing (ISVC) 2011: 124-134, Lake Tahoe, USA. LNCS, vol. 6938 (Advances in Visual Computing), 2011, ISBN: 978-3-642-24027-0. [springer][acm][researchgate][google][Empirical work: video-based graph cuts]
[CVML] Xiang Xiang, Wenhui Chen, Du Zeng: Intelligent Target Tracking and Shooting System with Mean Shift. Full paper at IEEE International Symposium on Parallel and Distributed Processing and Applications (ISPA) 2008: 417-421, Sydney, Australia. Parallel and Distributed Processing, IEEE, 2008, ISBN: 978-0-7695-3471-8. [ieee][pdf][github][youtube][whu][sjtu][Embedded vision work: video-based tracking camera] [Interesting cited paper] (a typical paper from this conference; no free lunch, but welcome to the jungle! It turns out that I start doing multi-core 10 years ago in college and observe it move to multiple CPU/GPU which induces great success of modern vision and graphics applications.)