Henry Hao-Tang Tsui

henrytsu@andrew.cmu.edu

Hi! I’m Hao-Tang Tsui, feel free to call me Henry!

I work on computer vision, vision–language models, and trying to build a usable bridge between pixels and words. I’m currently a Master’s student in Computer Vision at Carnegie Mellon University, working with Prof. Deva Ramanan on vision–language benchmarking.

Previously, I was a research assistant at Academia Sinica (YOLO-Lab) with Prof. Mark Liao, where I worked on YOLO-related research and re-release YOLO under the MIT license. I earned my B.S. in Electrical Engineering from National Yang Ming Chiao Tung University, collaborating with Prof. Hong-Han Shuai and Prof. Wen-Huang Cheng.

I enjoy turning research ideas into clean code, benchmarks, and occasionally opinions—ideally ones that connect vision and language a little better than before.

news

Apr 05, 2026	Our paper TTSG was accepted by CVPR 2026 Workshop!
Apr 05, 2026	My paper Σ StaDy4D was accepted by CVPR 2026 Workshop!
Dec 31, 2025	My code YOLO-MIT is now available on GitHub!
Aug 11, 2025	Started my Master of Science in Computer Vision at the Carnegie Mellon University.
Jan 23, 2025	My paper YOLO-RD was accepted by ICLR 2025!

selected publications

CVPRw

ΣStaDy4D: Towards Complete 4D Static-Dynamic Reconstruction with SIGMA

Hao-Tang Tsui^*, Ethan Lai^*, and Yu-Rou Tuan^*

In , Jun 2026

Bib Website

@inproceedings{tsui2026sigma,
  author = {Tsui, Hao-Tang and Lai, Ethan and Tuan, Yu-Rou},
  title = {$\Sigma$ StaDy4D: Towards Complete 4D Static-Dynamic Reconstruction with SIGMA},
  year = {2026},
  month = jun,
}

ECCV

TrajPrompt: Aligning Color Trajectory with Vision-Language Representations

Li-Wu Tsao, Hao-Tang Tsui, Yu-Rou Tuan, and 5 more authors

In Proceedings of the European Conference on Computer Vision (ECCV), Oct 2024

Bib PDF Website

@inproceedings{tsao2024trajprompt,
  author = {Tsao, Li-Wu and Tsui, Hao-Tang and Tuan, Yu-Rou and Chen, Pei-Chi and Wang, Kuan-Lin and Wu, Jhih-Ciang and Shuai, Hong-Han and Cheng, Wen-Huang},
  title = {TrajPrompt: Aligning Color Trajectory with Vision-Language Representations},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year = {2024},
  month = oct,
}

ICLR

YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary

Hao-Tang Tsui, Chien-Yao Wang, and Hong-Yuan Mark Liao

In Proceedings of the International Conference on Learning Representations (ICLR), Apr 2025

arXiv Bib Code Website

@inproceedings{tsui2024yolord,
  author = {Tsui, Hao-Tang and Wang, Chien-Yao and Liao, Hong-Yuan Mark},
  title = {YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary},
  booktitle = {Proceedings of the International Conference on Learning Representations (ICLR)},
  year = {2025},
  month = apr,
  eprint = {2410.15346},
  archiveprefix = {arXiv},
  primaryclass = {cs.CV},
}