We are a research lab exploring the intersection of artificial intelligence, multimodal understanding, and human creativity. Our goal is to develop intelligent systems that can perceive, reason, and generate across modalities, including text and vision.
📍 Located at National Taiwan University, Department of Computer Science & Information Engineering.
-
RIDGE: Relation-Rich Visual Document Generator for Visual Information Extraction
Zi-Han Jiang, Chien-Wei Lin, Wei-Hua Li, Hsuan-Tung Liu, Yi-Ren Yeh, Chu-Song Chen
Conference on Computer Vision and Pattern Recognition (CVPR)
[paper][code]
-
PDSeg: Patch-Wise Distillation and Controllable Image Generation for Weakly-Supervised Histopathology Tissue Segmentation
Wei-Hua Li, Yu-Hsing Hsieh, Huei-Fang Yang, Chu-Song Chen
International Conference on Acoustics, Speech and Signal Processing (ICASSP)
[paper][code]
-
ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning
Yu-Chen Lin, Wei-Hua Li, Jun-Cheng Chen, Chu-Song Chen
Findings of the Association for Computational Linguistics: EMNLP
[paper][code]
-
SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation
Yi-Chia Chen, Wei-Hua Li, Cheng Sun, Yu-Chiang Frank Wang, Chu-Song Chen
European Conference on Computer Vision (ECCV)
[paper][code]
-
RDPN6D: Residual-based Dense Point-wise Network for 6Dof Object Pose Estimation Based on RGB-D Images
Zong-Wei Hong, Yen-Yang Hung, Chu-Song Chen
CVPR Workshop DLGC, 2024
[paper][code]
-
Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model
Yi-Chia Chen, Wei-Hua Li, Chu-Song Chen
International Conference on Image Processing (ICIP)
[paper][code]
-
Class-incremental Continual Learning for Instance Segmentation with Image-level Weak Supervision
Yu-Hsing Hsieh, Guan-Sheng Chen, Shun-Xian Cai, Ting-Yun Wei, Huei-Fang Yang, Chu-Song Chen
International Conference on Computer Vision (ICCV)
[paper][code]
-
Domain-Generalized Face Anti-Spoofing with Unknown Attacks
Zong-Wei Hong, Yu-Chen Lin, Hsuan-Tung Liu, Yi-Ren Yeh, Chu-Song Chen
IEEE International Conference on Image Processing (ICIP)
[paper][code]
This profile is maintained by members of AI²Lab. For questions or collaborations, please open an issue or contact us directly.