Publications

Here is the full list of publications.

2024

Learning Unified Distance Metric Across Diverse Data Distributions with Parameter-Efficient Transfer Learning
Sungyeon Kim, Donghyun Kim, Suha Kwak
WACV 2024

Too many frames, not all useful: Efficient Strategies for Long-Form Video QA
Jongwoo Park, Kanchana Ranasinghe, Kumara Kahatapitiya, Wonjeong Ryoo, Donghyun Kim, Michael S Ryoo
NeurIPS 2024 Workshop

Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection
Gyusam Chang, Jiwon Lee, Donghyun Kim, Jinkyu Kim, Dongwook Lee, Daehyun Ji, Sujin Jang, Sangpil Kim
NeurIPS 2024

MATE: Meet At The Embedding–Connecting Images with Long Texts
Young Kyun Jang, Junmo Kang, Yong Jae Lee, Donghyun Kim
EMNLP Findings 2024

Weak-to-Strong Compositional Learning from Generative Models for Language-based Object Detection
Kwanyong Park, Kuniaki Saito, Donghyun Kim
ECCV 2024 and CVPR 2024 Workshop on “Generative Models for Computer Vision”

Efficient and Versatile Robust Fine-Tuning of Zero-shot Models
Sungyeon Kim, Boseung Jeong, Donghyun Kim, Suha Kwak
ECCV 2024

Visual Delta Generator for Semi-supervised Composed Image Retrieval
Young Kyun Jang*, Donghyun Kim*, Zihang Meng, Dat Huynh, Ser-Nam Lim
CVPR 2024

What, How, and When Should Object Detectors Update in Continually Changing Test Domains?
Jayeon Yoo, Dongkwan Lee, Inseop Chung, Donghyun Kim*, Nojun Kwak*
CVPR 2024

LLM4SGG: Large Language Model for Weakly Supervised Scene Graph Generation
Kibum Kim, Kanghoon Yoon, Jaehyeong Jeon, Yeonjun In, Jinyoung Moon, Donghyun Kim, Chanyoung Park
CVPR 2024

Adaptive Self-training Framework for Fine-grained Scene Graph Generation
Kibum Kim, Kanghoon Yoon, Yeonjun In, Jinyoung Moon, Donghyun Kim, Chanyoung Park
ICLR 2024

Grafting Vision Transformers
Jongwoo Park, Kumara Kahatapitiya, Donghyun Kim, Shivchander Sudalairaj, Quanfu Fan, Michael S. Ryoo
WACV 2024

2023

Learning Human Action Recognition Representations Without Real Humans
Howard Zhong, Samarth Mishra, Donghyun Kim, SouYoung Jin, Rameswar Panda, Hilde Kuehne, Leonid Karlinsky, Venkatesh Saligrama, Aude Oliva, Rogerio Feris
NeurIPS 2023

Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models
Sivan Doveh, Assaf Arbelle, Sivan Harary, Roei Herzig, Donghyun Kim, Paola Cascante-Bonilla, Amit Alfassy, Rameswar Panda, Raja Giryes, Rogerio Feris, Shimon Ullman, Leonid Karlinsky
NeurIPS 2023 (Spotlight)

CDAC: Cross-domain Attention Consistency in Transformer for Domain Adaptive Semantic Segmentation
Kaihong Wang, Donghyun Kim, Rogerio Feris, Margrit Betke
ICCV 2023

Going Beyond Nouns With Vision & Language Models Using Synthetic Data
Cascante-Bonilla P, Shehada K, Smith JS, Doveh S, Kim D, Panda R, Varol G, Oliva A, Ordonez V, Feris R, Karlinsky L
ICCV 2023

Teaching Structured Vision&Language Concepts to Vision&Language Models
Doveh, S., Arbelle, A., Harary, S., Panda, R., Herzig, R., Schwartz, E., Kim, D., Giryes, R., Feris, R., Ullman, S. and Karlinsky, L
CVPR 2023

CODA-Prompt: COntinual Decomposed Attention-based Prompting for Rehearsal-Free Continual Learning
Smith, J.S., Karlinsky, L., Gutta, V., Cascante-Bonilla, P., Kim, D., Arbelle, A., Panda, R., Feris, R. and Kira, Z
CVPR 2023

ConStruct-VL: Data-Free Continual Structured VL Concepts Learning
Smith, J.S., Cascante-Bonilla, P., Arbelle, A., Kim, D., Panda, R., Cox, D., Yang, D., Kira, Z., Feris, R. and Karlinsky, L
CVPR 2023

2022

VisDA 2022 Challenge: Sim2Real Domain Adaptation for Industrial Recycling
D. Bashkirova, P. Teterwak, S. Mishra, D. Kim, , R. Lai, F. Alladkani, J. Akl, B. Calli, V. Ablavsky, S. Bargal, K. Saenko
NeurIPS 2022 Competition

A Broad Study of Pre-training for Domain Generalization and Adaptation
Donghyun Kim, Kaihong Wang, Stan Sclaroff, and Kate Saenko
ECCV 2022

A Unified Framework for Domain Adaptive Pose Estimation
Donghyun Kim, Kaihong Wang, Kate Saenko, Margrit Betke, Stan Sclaroff
ECCV 2022

2021

VisDA-2021 Competition: Universal Domain Adaptation to Improve Performance on Out-of-Distribution Data
D. Bashkirova, D Hendrycks, D. Kim, S. Mishra, K. Saenko, K. Saito, P. Teterwak, B. Usman
NeurIPS 2021 Competition

OpenMatch: Open-set Consistency Regularization for Semi-supervised Learning with Outliers
Kuniaki Saito, Donghyun Kim, Kate Saenko
NeurIPS 2021

CDS: Cross-domain Self-supervised Pre-training
Donghyun Kim, Kuniaki Saito, Tae-Hyun Oh, Bryan A. Plummer, Stan Sclaroff, Kate Saenko
ICCV 2021

Learning Cross-Modal Contrastive Features for Video Domain Adaptation
Donghyun Kim, Yi-Hsuan Tsai, Bingbing Zhuang, Xiang Yu, Stan Sclaroff, Kate Saenko, Manmohan Chandraker
ICCV 2021

Tune it the Right Way: Unsupervised Validation of Domain Adaptation via Neighborhood Density
Kuniaki Saito, Donghyun Kim, Piotr Teterwak, Stan Sclaroff, Trevor Darrell, Kate Saenko
ICCV 2021

Self-supervised Visual Attribute Learning for Fashion Compatibility
Donghyun Kim, Kuniaki Saito, Kate Saenko, Stan Sclaroff, Bryan A Plummer
ICCV VIPriors Workshop 2021

Multi-Task Learning from Videos via Efficient Inter-Frame Attention
Donghyun Kim, Tian Lan, Chuhang Zou, Ning Xu, Bryan A Plummer, Stan Sclaroff, Jayan Eledath, Gerard Medioni
ICCV MTL Workshop 2021

2020

Universal Domain Adaptation through Self Supervision
Kuniaki Saito, Donghyun Kim, Stan Sclaroff, Kate Saenko
NeurIPS 2020

Learning to Scale Multilingual Representations for Vision-Language Tasks
Andrea Burns, Donghyun Kim, Derry Wijaya, Kate Saenko, Bryan A Plummer
ECCV 2020 (spotlight)

Multi-way Encoding for Robustness
Donghyun Kim, Sarah Adel Bargal, Jianming Zhang, Stan Sclaroff
WACV 2020

MULE: Multimodal Universal Language Embedding
Donghyun Kim, Kuniaki Saito, Kate Saenko, Stan Sclaroff, Bryan A. Plummer
AAAI 2020

2019

Semi-supervised Domain Adaptation via Minimax Entropy
Kuniaki Saito, Donghyun Kim, Stan Sclaroff, Trevor Darrell, Kate Saenko
ICCV 2019

2018

Excitation Backprop for RNNs
Sarah Adel Bargal, Andrea Zunino, Donghyun Kim, Jianming Zhang, Vittorio Murino, Stan Sclaroff
CVPR 2018

2017

Deep 3D Face Identification
Donghyun Kim, Matthias Hernandez, Jongmoo Choi, G ́erard Medioni
International Joint Conference on Biometrics (IJCB) 2017

2016

Expression Invariant 3D Face Modeling from an RGB-D Video
Donghyun Kim, Jongmoo Choi, Jatuporn Toy Leksut, G ́erard Medioni
International Conference on Pattern Recognition (ICPR) 2016 (Oral)

Accurate 3D face modeling and recognition from RGB-D stream in the presence of large pose changes
Donghyun Kim, Jongmoo Choi, Jatuporn Toy Leksut, G ́erard Medioni
IEEE International Conference on Image Processing (ICIP) 2016