Publications
Publications
Ahonen, J. I., & Le, N. (2024). NN-VVC: Versatile Video Coding boosted by self-supervisedly learned image coding for machines. https://synthical.com/article/2811d62d-1b22-4146-9724-d4676c6b66bd [Best paper award]
Zhang, H., Le, N., Cricri, F., Ahonen, J., & Tavakoli, H. R. (2023). Stabilizing the Convolution Operations for Neural Network-Based Image and Video Codecs for Machines. 2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW), 170–175. https://doi.org/10.1109/ICMEW59549.2023.00036
Ahonen, J. I., Le, N., Zhang, H., Cricri, F., & Rahtu, E. (2023). Region of Interest Enabled Learned Image Coding for Machines. 2023 IEEE 25th International Workshop on Multimedia Signal Processing (MMSP), 1–6. https://doi.org/10.1109/MMSP59012.2023.10337731
Afrabandpey, H., Rangu, G., Zhang, H., Criri, F., Aksu, E., & Tavakoli, H. R. (2022). On the Importance of Temporal Dependencies of Weight Updates in Communication Efficient Federated Learning. 2022 IEEE International Conference on Visual Communications and Image Processing (VCIP), 1–5. https://ieeexplore.ieee.org/abstract/document/10008860/
7. R. Yang, M. Santamaria, F. Cricri, H. Zhang, J. Lainema, R. G. Youvalari, M. M. Hannuksela, and T. Elomaa, "Overfitting NN loop-filters in video coding", IEEE International Conference on Visual Communications and Image Processing (VCIP), Dec. 2023.
Zou, Nannan, Francesco Cricri, Honglei Zhang, Hamed R. Tavakoli, Miska M. Hannuksela, and Esa Rahtu. 2022. “The Lottery Ticket Adaptation for Neural Video Coding.” In 2022 IEEE International Symposium on Multimedia (ISM), 141–45. https://doi.org/10.1109/ISM55400.2022.00028.
Zhang, Honglei, Francesco Cricri, Nannan Zou, Hamed R. Tavakoli, and Miska M. Hannuksela. 2022. “Adaptive Multi-Scale Progressive Probability Model for Lossless Image Compression.” In 2022 IEEE International Conference on Image Processing (ICIP), 721–25. https://doi.org/10.1109/ICIP46576.2022.9897690.
Zhang, Honglei, Francesco Cricri, Hamed Rezazadegan Tavakoli, Emre Aksu, and Miska M. Hannuksela. 2022. “Leveraging Progressive Model and Overfitting for Efficient Learned Image Compression.” arXiv. https://doi.org/10.48550/arXiv.2210.04112.
Yang, Ruiying, Maria Santamaria, Francesco Cricri, Honglei Zhang, Jani Lainema, Ramin G. Youvalari, and Miska M. Hannuksela. 2022. “Low-Precision Post-Filtering in Video Coding.” In 2022 IEEE International Symposium on Multimedia (ISM), 137–40. https://doi.org/10.1109/ISM55400.2022.00027.
Santamaria, Maria, Ruiying Yang, Francesco Cricri, Honglei Zhang, Jani Lainema, Ramin G. Youvalari, Hamed R. Tavakoli, and Miska M. Hannuksela. 2022. “Overfitting Multiplier Parameters for Content-Adaptive Post-Filtering in Video Coding.” In 2022 10th European Workshop on Visual Information Processing (EUVIP), 1–6. https://doi.org/10.1109/EUVIP53989.2022.9922721.
Santamaria, Maria, Francesco Cricri, Jani Lainema, Ramin G. Youvalari, Honglei Zhang, and Miska M. Hannuksela. 2022. “Content-Adaptive Neural Network Post-Processing Filter with NNR-Coded Weight-Updates.” In 2022 IEEE International Conference on Image Processing (ICIP), 2251–55. https://doi.org/10.1109/ICIP46576.2022.9897757.
Le, Nam, Honglei Zhang, Francesco Cricri, Ramin G. Youvalari, Hamed Rezazadegan Tavakoli, Emre Aksu, Miska M. Hannuksela, and Esa Rahtu. 2022. “Bridging the Gap Between Image Coding for Machines and Humans.” In 2022 IEEE International Conference on Image Processing (ICIP), 3411–15. https://doi.org/10.1109/ICIP46576.2022.9897916.
Ahonen, Jukka I., Ramin G. Youvalari, Nam Le, Honglei Zhang, Francesco Cricri, Hamed Rezazadegan Tavakoli, Miska M. Hannuksela, and Esa Rahtu. 2021. “Learned Enhancement Filters for Image Coding for Machines.” In 2021 IEEE International Symposium on Multimedia (ISM), 235–39. https://doi.org/10.1109/ISM52913.2021.00046.
Zou, Nannan, Honglei Zhang, Francesco Cricri, Hamed R. Tavakoli, Jani Lainema, Emre Aksu, Miska Hannuksela, and Esa Rahtu. 2021. “Learned Video Compression with Intra-Guided Enhancement and Implicit Motion Information.” Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops.
Zou, Nannan, Honglei Zhang, Francesco Cricri, Ramin G. Youvalari, Hamed R. Tavakoli, Jani Lainema, Emre Aksu, Miska Hannuksela, and Esa Rahtu. 2021. “Adaptation and Attention for Neural Video Coding.” In 2021 IEEE International Symposium on Multimedia (ISM), 240–44. https://doi.org/10.1109/ISM52913.2021.00047.
Seppälä, Joni, Honglei Zhang, Nam Le, Ramin G. Youvalari, Francesco Cricri, Hamed Rezazadegan Tavakoli, Emre Aksu, Miska M. Hannuksela, and Esa Rahtu. 2021. “Enhancing Image Coding for Machines with Compressed Feature Residuals.” In 2021 IEEE International Symposium on Multimedia (ISM), 217–25. https://doi.org/10.1109/ISM52913.2021.00044.
Santamaria, Maria, Yat-Hong Lam, Francesco Cricri, Jani Lainema, Ramin G. Youvalari, Honglei Zhang, Miska M. Hannuksela, Esa Rahtu, and Moncef Gaubbuj. 2021. “Content-Adaptive Convolutional Neural Network Post-Processing Filter.” In 2021 IEEE International Symposium on Multimedia (ISM), 99–106. https://doi.org/10.1109/ISM52913.2021.00025.
Zhang, Honglei, Francesco Cricri, Hamed Rezazadegan Tavakoli, Maria Santamaria, Yat-Hong Lam, and Miska M. Hannuksela. 2021. “Learn to Overfit Better: Finding the Important Parameters for Learned Image Compression.” In 2021 International Conference on Visual Communications and Image Processing (VCIP), 1–5. https://doi.org/10.1109/VCIP53242.2021.9675360.
Zou, Nannan, Honglei Zhang, Francesco Cricri, Hamed R. Tavakoli, Jani Lainema, Emre Aksu, Miska Hannuksela, and Esa Rahtu. “Learned Video Compression with Intra-Guided Enhancement and Implicit Motion Information.” Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops. 2021
Afrabandpey, Homayun, Anton Muravev, Hamed R. Tavakoli, Honglei Zhang, Francesco Cricri, Moncef Gabbouj, and Emre Aksu. “Mind the Structure: Adopting Structural Information for Deep Neural Network Compression.” IEEE International Conference on Image Processing (ICIP). 2021
Rezazadegan Tavakoli, Hamed, Joachim Wabnig, Francesco Cricri, Honglei Zhang, Emre Aksu, and Iraj Saniee. “Hybrid Pruning and Sparsification.” IEEE International Conference on Image Processing (ICIP). 2021
Le, Nam, Honglei Zhang, Francesco Cricri, Ramin Ghaznavi-Youvalari, Hamed R. Tavakoli, and Esa Rahtu. “Learned Image Coding for Machines: Content Adaptive Approach.” IEEE International Conference on Multimedia and Expo (ICME). 2021
Le, Nam, Honglei Zhang, Francesco Cricri, Hamed R. Tavakoli, Ramin Ghaznavi-Youvalari, Emre Aksu, Miska Hannuksela, and Esa Rahtu. “Image Coding For Machines: An End-To-End Learned Approach.” IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2021
Zhang, Honglei, Francesco Cricri, Hamed R. Tavakoli, Nannan Zou, Emre Aksu, and Miska M. Hannuksela. “Lossless Image Compression Using a Multi-Scale Progressive Statistical Model.” In Proceedings of the Asian Conference on Computer Vision (ACCV). [Oral, Best paper candidate], 2020
Zou, Nannan, Honglei Zhang, Francesco Cricri, Hamed R. Tavakoli, Jani Lainema, Miska Hannuksela, Emre Aksu, and Esa Rahtu. “L2C -- Learning to Learn to Compress.” Proceedings of the IEEE 22nd International Workshop on Multimedia Signal Processing (MMSP), July. http://arxiv.org/abs/2007.16054. [ Most Innovative Solution Award ], 2020
Zou, Nannan, Honglei Zhang, Francesco Cricri, Hamed R. Tavakoli, Jani Lainema, Emre Aksu, Miska Hannuksela, and Esa Rahtu. “End-to-End Learning for Video Frame Compression With Self-Attention.” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2020
Hyppönen, Jelena, Anna Hakala, Kaapo Annala, Honglei Zhang, Jukka Peltola, Esa Mervaala, and Reetta Kälviäinen. “Automatic Assessment of the Myoclonus Severity from Videos Recorded According to Standardized Unified Myoclonus Rating Scale Protocol and Using Human Pose and Body Movement Analysis.” Seizure 76 (January 24, 2020): 72–78. https://doi.org/10.1016/j.seizure.2020.01.014.
Villa, Jose, Jussi Taipalmaa, Mikhail Gerasimenko, Alexander Pyattaev, Mikko Ukonaho, Honglei Zhang, Jenni Raitoharju, et al. “AColor: Mechatronics, Machine Learning, and Communications in an Unmanned Surface Vehicle.” ArXiv:2003.00745 [Math], March 2, 2020. http://arxiv.org/abs/2003.00745.
Taipalmaa, Jussi, Nikolaos Passalis, Honglei Zhang, Moncef Gabbouj, and Jenni Raitoharju. “High-Resolution Water Segmentation for Autonomous Unmanned Surface Vehicles: A Novel Dataset and Evaluation.” In 2019 IEEE 29th International Workshop on Machine Learning for Signal Processing (MLSP), 1–6. Pittsburgh, PA, USA: IEEE, 2019. https://doi.org/10.1109/MLSP.2019.8918694.
H. Zhang and M. Gabbouj, “Feature Dimensionality Reduction with Graph Embedding and Generalized Hamming Distance,” IEEE International Conference on Image Processing (ICIP), 2018.
H. Zhang, S. Kiranyaz, and M. Gabbouj, “Finding Better Topologies for Deep Convolutional Neural Networks by Evolution,” ArXiv e-prints, Sept. 2018.
H. Zhang, S. Kiranyaz, and M. Gabbouj, “Data Clustering Based on Community Structure in Mutual k-Nearest Neighbor Graph,” in 2018 41st International Conference on Telecommunications and Signal Processing (TSP), pp. 1–7, July 2018. [ Best Paper Award ]
L. Xu, H. Zhang, J. Raitoharju, and M. Gabbouj, “Unsupervised Facial Image De-occlusion with Optimized Deep Generative Models,” in 2018 Eighth International Conference on Image Processing Theory, Tools and Applications (IPTA), pp. 1–6, Nov. 2018.
H. Zhang, S. Kiranyaz, and M. Gabbouj, “Outlier edge detection using random graph generation models and applications,” Journal of Big Data, vol. 4, p. 11, Apr. 2017.
H. Zhang, S. Kiranyaz, and M. Gabbouj, “A k-nearest neighbor multilabel ranking algorithm with application to content-based image retrieval,” in 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017 - Proceedings, pp. 2587–2591, IEEE, 2017.
H. Zhang, J. Raitoharju, S. Kiranyaz, and M. Gabbouj, “Limited random walk algorithm for big graph data clustering,” Journal of Big Data, vol. 3, no. 1, p. 26, 2016. [source code]
H. Zhang, S. Kiranyaz, and M. Gabbouj, “Cardinal Sparse Partial Least Square Feature Selection and Its Application in Face Recognition,” in Signal Processing Conference (EUSIPCO), 2014 Proceedings of the 22st European, Sept. 2014.
J. Raitoharju, H. Zhang, E. C. Ozan, M. A. Waris, M. Faisal, G. Cao, M. Roininen, R. S. Iftikhar Ahmed, S. P.C, S. Uhlmann, K. Samiee, S. Kiranyaz, and M. Gabbouj, “MUVIS: A Solution to MSR-Bing Challenge on Image Retrieval 2014,” in IEEE International Conference on Multimedia & Expo (ICME 2014), Chengdu China, July 2014.
G. Cao, I. Ahmad, H. Zhang, W. Xie, and M. Gabbouj, “Balance Learning to Rank in Big Data,” in Signal Processing Conference (EUSIPCO), 2014 Proceedings of the 22st European, Sept. 2014.
M.-A. Waris, H. Zhang, I. Ahmad, S. Kiranyaz, and M. Gabbouj, “Analysis of textural features for face biometric anti-spoofing,” in Signal Processing Conference (EUSIPCO), 2013 Proceedings of the 21st European, pp. 1–5, Sept. 2013.
M. Gunther, A. Costa-Pazo, C. Ding, E. Boutellaa, G. Chiachia, H. Zhang, M. de Assis Angeloni, V. Struc, E. Khoury, E. Vazquez-Fernandez, D. Tao, M. Bengherabi, D. Cox, S. Kiranyaz, T. de Freitas Pereira, J. Zganec-Gros, E. Argones-Rua, N. Pinto, M. Gabbouj, F. Simoes, S. Dobrisek, D. Gonzalez-Jimenez, A. Rocha, M. Neto, N. Pavesic, A. Falcao, R. Violato, and S. Marcel, “The 2013 face recognition evaluation in mobile environment,” in Biometrics (ICB), 2013 International Conference on, pp. 1–7, June 2013.
I. Chingovska, Z. Honglei, Z. Lei, D. Yi, S. Z. Li, O. Kähm, C. Glaser, N. Damer, A. Kuijper, and A. Nouak, “The 2nd Competition on Counter Measures to 2d Face Spoofing Attacks,” 2013.
Ph.D. Thesis
Zhang, Honglei. Graph Analysis and Applications in Clustering and Content-Based Image Retrieval. Tampere University, 2019. https://trepo.tuni.fi/handle/10024/116286.
Patents
H. Zhang, et.al,12/21/2023,"Apparatus, method and computer program product for quantizing neural networks ",US-2023412806-A1
H. Zhang, et.al,1/4/2024,"A method, an apparatus and a computer program product for video coding ",WO-2024002579-A1
H. Zhang, et.al,12/14/2023,"A method, an apparatus and a computer program product for video encoding and video decoding ",WO-2023237809-A1
H. Zhang, et.al,11/2/2023,Post processing filters suitable for neural-network-based codecs ,WO-2023208638-A1
H. Zhang, et.al,11/23/2023,"A method, an apparatus and a computer program product for machine learning ",WO-2023222313-A1
H. Zhang, et.al,10/19/2023,Model level update skipping in compressed incremental learning ,WO-2023200752-A1
H. Zhang, et.al,10/12/2023,Implementation Aspects Of Predictive Residual Encoding In Neural Networks Compression ,US-2023325644-A1
H. Zhang, et.al,10/19/2023,Apparatus and method for optimizing the overfitting of neural network filters ,WO-2023199172-A1
H. Zhang, et.al,10/12/2023,Apparatus and method for joint training of multiple neural networks ,US-2023325639-A1
H. Zhang, et.al,10/12/2023,"A method, an apparatus and a computer program product for video coding ",WO-2023194650-A1
H. Zhang, et.al,10/12/2023,"A method, an apparatus and a computer program product for video encoding and video decoding ",WO-2023194651-A1
H. Zhang, et.al,7/20/2023,Predictive and Residual Coding of Sparse Signals for Weight Update Compression ,US-2023232015-A1
H. Zhang, et.al,8/17/2023,"A method, an apparatus and a computer program product for video coding ",WO-2023151903-A1
H. Zhang, et.al,7/20/2023,High-level syntax of predictive residual encoding in neural network compression ,WO-2023135518-A1
H. Zhang, et.al,7/6/2023,"A method, an apparatus and a computer program product for video encoding and video decoding ",WO-2023126568-A1
H. Zhang, et.al,5/25/2023,"A method, an apparatus and a computer program product for video encoding and video decoding ",WO-2023089231-A1
H. Zhang, et.al,6/22/2023,"A method, an apparatus and a computer program product for video encoding and video decoding ",WO-2023111384-A1
H. Zhang, et.al,5/18/2023,Decoder-side fine-tuning of neural networks for video coding for machines ,US-2023154054-A1
H. Zhang, et.al,5/4/2023,"A method, an apparatus and a computer program product for video coding ",WO-2023073281-A1
H. Zhang, et.al,3/29/2023,"A method, an apparatus and a computer program product for video encoding and video decoding ",EP-4156691-A2
H. Zhang, et.al,3/1/2023,"A method, an apparatus and a computer program product for video encoding and video decoding ",EP-4142289-A1
H. Zhang, et.al,3/9/2023,"A method, an apparatus and a computer program product for video encoding and video decoding ",WO-2023031503-A1
H. Zhang, et.al,1/12/2023,Performance improvements of machine vision tasks via learned neural network based filter ,WO-2023280558-A1
H. Zhang, et.al,12/29/2022,"Method, apparatus and computer program product for federated learning for non independent and non identically distributed data ",WO-2022269469-A1
H. Zhang, et.al,12/29/2022,"Method, apparatus and computer program product for defining importance mask and importance ordering list ",WO-2022269432-A1
H. Zhang, et.al,12/29/2022,Learned adaptive motion estimation for neural video coding ,WO-2022269441-A1
H. Zhang, et.al,12/29/2022,"Method, apparatus and computer program product for providng an attention block for neural network-based image and video compression ",WO-2022269415-A1
H. Zhang, et.al,11/17/2022,"Method, apparatus and computer program product for providing finetuned neural network ",WO-2022238967-A1
H. Zhang, et.al,10/27/2022,"Method, apparatus and computer program product for providing finetuned neural network filter ",WO-2022224113-A1
H. Zhang, et.al,10/20/2022,A compression framework for distributed or federated learning with predictive compression paradigm ,WO-2022219240-A1
H. Zhang, et.al,10/20/2022,"A method, an apparatus and a computer program product for updating neural networks ",WO-2022219232-A1
H. Zhang, et.al,9/22/2022,"Method, apparatus and computer program product for end-to-end learned predictive coding of media frames ",WO-2022195409-A1
H. Zhang, et.al,9/29/2022,"A method, an apparatus and a computer program product for creating animated 3d models ",WO-2022200678-A1
H. Zhang, et.al,11/3/2022,"A method, an apparatus and a computer program product for video encoding and video decoding ",WO-2022229495-A1
H. Zhang, et.al,9/15/2022,"A method, an apparatus and a computer program product for generating three-dimensional models of a subject ",WO-2022189693-A1
H. Zhang, et.al,6/22/2023,Iterative overfitting and freezing of decoder-side neural networks ,US-2023196072-A1
H. Zhang, et.al,6/15/2023,Task-dependent selection of decoder-side neural network ,US-2023186054-A1
H. Zhang, et.al,6/23/2022,A Caching And Clearing Mechanism For Deep Convolutional Neural Networks ,WO-2022129694-A1
H. Zhang, et.al,6/1/2023,"Appratus, method and computer program product for probability model overfitting ",US-2023169372-A1
H. Zhang, et.al,4/28/2022,"Apparatus, method and computer program product for learned video coding for machine ",WO-2022084762-A1
H. Zhang,12/23/2021,Graph Diffusion for Structured Pruning of Neural Networks ,US-2021397965-A1
H. Zhang, et.al,1/5/2022,Encoding and decoding of extracted features for use with machines ,EP-3934254-A1
H. Zhang, et.al,4/26/2023,"Apparatus, method and computer program product for optimizing parameters of a compressed representation of a neural network ",EP-4168936-A1
H. Zhang, et.al,12/23/2021,Guided probability model for compressed representation of neural networks ,WO-2021255567-A1
H. Zhang, et.al,10/14/2021,Training a data coding system for use with machines ,WO-2021205066-A1
H. Zhang, et.al,6/28/2022,Feature-domain residual for video coding for machines ,US-11375204-B2
H. Zhang, et.al,10/14/2021,Training a data coding system comprising a feature extractor neural network ,WO-2021205065-A1
H. Zhang, et.al,7/15/2021,A cascaded prediction-transform approach for mixed machine-human targeted video coding ,WO-2021140273-A1
H. Zhang,3/2/2006,Enabling access to private information ,US-2006046742-A1
H. Zhang,12/11/2007,Method for determining a location ,US-7308273-B2
H. Zhang, A integrated monitoring system for biobankings, G05B19/042(2006.01)I, 06-Sep-2015.