Henry Hao-Tang Tsui
henrytsu@andrew.cmu.edu
Hi! I’m Hao-Tang Tsui, feel free to call me Henry!
I work on computer vision, vision–language models, and trying to build a usable bridge between pixels and words. I’m currently a Master’s student in Computer Vision at Carnegie Mellon University, working with Prof. Deva Ramanan on vision–language benchmarking.
Previously, I was a research assistant at Academia Sinica (YOLO-Lab) with Prof. Mark Liao, where I worked on YOLO-related research and re-release YOLO under the MIT license. I earned my B.S. in Electrical Engineering from National Yang Ming Chiao Tung University, collaborating with Prof. Hong-Han Shuai and Prof. Wen-Huang Cheng.
I enjoy turning research ideas into clean code, benchmarks, and occasionally opinions—ideally ones that connect vision and language a little better than before.
news
| Dec 31, 2025 | My code YOLO-MIT is now available on GitHub! |
|---|---|
| Aug 11, 2025 | Started my Master of Science in Computer Vision at the Carnegie Mellon University. |
| Jan 23, 2025 | My paper YOLO-RD was accepted by ICLR 2025! |
| Jul 01, 2024 | My second-author paper TrajPrompt was accepted by ECCV 2024! |