Liyun Zhang

Specially-Appointed Researcher/Fellow

Intelligence and Sensing Lab (ISLab)
D3 Center
Osaka University

Address:
Techno-Alliance Building C, Floor 5, C503
Osaka University
2-8 Yamadaoka, Suita, Osaka 565-0871, Japan

Email: liyun.zhang@lab.ime.cmc.osaka-u.ac.jp

                   

Biography

I am currently a Specially-Appointed Researcher/Fellow at  ISLab,  Osaka University. I obtained my Ph.D. degree at  Takemura Lab,  Osaka University, advised by Prof. Haruo Takemura and Prof. Photchara Ratsamee. I was a visiting scholar at  Georgia Tech advised by Prof. Animesh Garg. I was a specially appointed researcher at  Sysmex Corporation, and embedded software engineer worked at   Huawei,     ZTE Corporation, etc.

Research interests: Multimodal LLMs, Embodied AI, Cognitive Interaction, Robot Learning, Affective Computing, and Multi-Annotator Learning.

Openings: I am open to potential research positions from Universities, Research Institutes, or Corporation Research Departments.
I look forward to receiving all kinds and forms of collaboration opportunities related to my research interests.

News

2025.4: One paper about Multi-Annotator Learning with Missing Labels submitted to arXiv.
2025.2: One paper about Multi-Annotator Tendency Learning submitted to arXiv.
2024.11: One paper about Diffusion Policy submitted to arXiv.
2024.8: One paper about Multimodal Large Language Model accepted to MRAC’24 @ ACMMM.
2024.4: I start a new research career as a Specially-Appointed Researcher/Fellow at ISLab.
2023.6: One paper about Vision Enhancement accepted to TCSVT.
2023.2: I start a research collaboration with Georgia Tech advised by Prof. Animesh Garg.
2023.1: One paper about Image Translation accepted to WACV 2023.
2022.11: One paper about Robotic Perception accepted to SSRR 2022.

Publications

# Corresponding author; * Equal contribution

MicroEmo: Time-Sensitive Multimodal Emotion Recognition with Subtle Clue Dynamics in Video Dialogues
Liyun Zhang, Zhaojie Luo, Shuqiong Wu, Yuta Nakashima#.
In Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing (MRAC’24 @ ACMMM), 2024.
[Paper

Panoptic-Level Image-to-Image Translation for Object Recognition and Visual Odometry Enhancement
Liyun Zhang, Photchara Ratsamee#, Zhaojie Luo#, Yuki Uranishi, Manabu Higashida, Haruo Takemura.
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2023.
[Paper]  [Code,Data

Panoptic-aware Image-to-Image Translation
Liyun Zhang, Photchara Ratsamee, Bowen Wang, Zhaojie Luo, Yuki Uranishi, Manabu Higashida, Haruo Takemura.
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2023.
[Paper]  [Code,Data]  [DOI]

Thermal-to-Color Image Translation for Enhancing Visual Odometry of Thermal Vision
Liyun Zhang, Photchara Ratsamee, Yuki Uranishi, Manabu Higashida, Haruo Takemura.
IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR), 2022.
[Paper

Uneven Illumination Image Segmentation Based on Multi-threshold S-F
Liyun Zhang, Nanyan Liu, Yuanbin Hou, Xiaojian Liu.
Opto-Electronic Engineering (OEE), 2014.
[Paper

Teams

Embodied AI:


Animesh Garg
Assistant Professor
(Georgia Tech)

Jason Orlosky
Associate Professor
(Augusta University & Osaka University)

Photchara Ratsamee
Lecturer
(Osaka Institute of Technology & Osaka University)

Masato Kobayashi
Assistant Professor
(Osaka University)

Cognitive Interaction:


Zheng Lian
Associate Professor
(Institute of Automation, Chinese Academy of Sciences)

Zhaojie Luo
Associate Professor
(Southeast University)

Shuqiong Wu
Assistant Professor
(Osaka University)

Student:


Xuanmeng Sha
PhD Student
(Osaka University)

Teaching & Working

2024.4-Current Specially-Appointed Researcher/Fellow Osaka University
2023.2-2024.3 Visiting Scholar  Georgia Tech
2023.7-2024.2 Research Assistant Osaka University
2021.5-2023.3 Student Researcher  Sysmex
2021.4-2021.9 Teaching Assistant Osaka University
2020.10-2021.3 Research Assistant Osaka University
2017.12-2018.9 Embedded Software Engineer           ZTE
2016.11-2017.3 Software R&D Engineer        Huawei
2015.7-2017.12 Embedded Software Engineer  Huaqin Tech

Invited Talks

In Progress

Recent Community Services

Reviewer Service

ACM International Conference on Multimedia (ACMMM) 2025
International Conference on Computer Vision (ICCV) 2025
Conference on Computer Vision and Pattern Recognition (CVPR) 2025
Pattern Recognition (PR) 2024
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT) 2023
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023
Conference on Computer Vision and Pattern Recognition (CVPR) 2022
European Conference on Computer Vision (ECCV) 2022


©Liyun Zhang.  Last update: April, 2025.