ATM: Action Temporality Modeling for Video Question Answering
Junwen Chen, Jie Zhu, and Yu Kong ACM Multimedia (ACM MM), 2023 Comming Soon!
Uncertainty-aware State Space Transformer for Egocentric 3D Trajectory Forecasting
Wentao Bao, Lele Chen, Libing Zeng, Zhong Li, Yi Xu, Junsong Yuan, and Yu Kong International Conference on Computer Vision (ICCV), 2023 ProjectarXiv
Catch Missing Details: Image Reconstruction with Frequency Augmented Variational Autoencoder
Xinmiao Lin, Yikang Li, Jenhao Hsiao, Chiu Man Ho, and Yu Kong IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023 PDFarXiv
Ancestor Search: Generalized Open Set Recognition via Hyperbolic Side Information Learning
Xiwen Dengxiong, Yu Kong Winter Conference on Applications of Computer Vision (WACV), 2023 PDF
2022
GateHUB: Gated History Unit with Background Suppression for Online Action Detection
Junwen Chen, Gaurav Mittal, Ye Yu, Yu Kong, Mei Chen IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022 PDFarXiv
OpenTAL: Towards Open Set Temporal Action Localization
Wentao Bao, Qi Yu, Yu Kong IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022 (Oral) PDFarXivCode
Learning of Global Objective for Network Flow in Multi-Object Tracking
Shuai Li, Yu Kong, Hamid Rezatofighi IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022 PDFarXivCode
2021
Explainable video entailment with grounded visual evidence
Junwen Chen, Yu Kong International Conference on Computer Vision (ICCV), 2021 PDF
Evidential Deep Learning for Open Set Action Recognition
Wentao Bao, Qi Yu, Yu Kong International Conference on Computer Vision (ICCV), 2021 (Oral) PDFarXivCode
DRIVE: Deep Reinforced Accident Anticipation with Visual Explanation
Wentao Bao, Qi Yu, Yu Kong International Conference on Computer Vision (ICCV), 2021 PDFarXivCode
Gradient Frequency Modulation for Visually Explaining Video Understanding Models
Xinmiao Lin, Wentao Bao, Matthew Wright, Yu Kong British Machine Vision Conference (BMVC), 2021 PDFarXiv
Multiple Instance Relational Learning for Video Anomaly Detection
Xiwen Dengxiong, Wentao Bao, Yu Kong International Joint Conference on Neural Network (IJCNN), 2021 DOI
Few-shot human motion prediction via learning novel motion dynamics
Chuanqi Zang, Mingtao Pei, Yu Kong International Conference on International Joint Conferences on Artificial Intelligence (IJCAI), 2021 PDFDOI
Revealing a history: palimpsest text separation with generative networks
Anna Starynska, David Messinger, Yu Kong International Journal on Document Analysis and Recognition (IJDAR), 2021 PDFDOI
2020
Group Activity Prediction with Sequential Relational Anticipation Model
Junwen Chen, Wentao Bao, Yu Kong European Conference on Computer Vision (ECCV), 2020 PDFarXivCode
Activity-driven Weakly-Supervised Spatio-Temporal Grounding from Untrimmed Videos
Junwen Chen, Wentao Bao, Yu Kong The 28th ACM International Conference on Multimedia (MM), 2020 DOI
Uncertainty-based Traffic Accident Anticipation with Spatio-Temporal Relational Learning
Wentao Bao, Qi Yu, Yu Kong The 28th ACM International Conference on Multimedia (MM), 2020 DOIarXivCodeDataset
RIT-18: A novel dataset for compositional group activity understanding
Junwen Chen, Haiting Hao, Hanbin Hong, Yu Kong IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR Workshop), 2020 PDFDataset
Object-Aware Centroid Voting for Monocular 3D Object Detection
Wentao Bao, Qi Yu, Yu Kong IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2020 PDFarXiv
Privacy Attributes-aware Message Passing Neural Network for Visual Privacy Attributes Classification
Hanbin Hong, Wentao Bao, Yuan Hong, Yu Kong International Conference on Pattern Recognition (ICPR), 2020 DOI