2024 Publication

DisCo: Disentangled Control for Realistic Human Dance Generation

Tan Wang, Linjie Li, Kevin Lin, Yuanhao Zhai, Chung-Ching Lin, Zhengyuan Yang, Hanwang Zhang, Zicheng Liu, Lijuan Wang

The IEEE/CVF Computer Vision and Pattern Recognition Conference. CVPR 2024

[paper][code]


Diffusion Time-step Curriculum for One Image to 3D Generation

Xuanyu Yi, Zike Wu, Qingshan Xu, Pan Zhou, Joo Hwee Lim, Hanwang Zhang

The IEEE/CVF Computer Vision and Pattern Recognition Conference. CVPR 2024


Distributionally Generative Augmentation for Fair Facial Attribute Classification

Fengda Zhang, Qianpei He, Kun Kuang, Jiashuo Liu, Long Chen, Chao Wu, Jun Xiao, Hanwang Zhang

The IEEE/CVF Computer Vision and Pattern Recognition Conference. CVPR 2024

[paper][code]


Doubly Abductive Counterfactual Inference for Text-based Image Editing

Xue Song, Jiequan Cui, Hanwang Zhang, Jingjing Chen, Richang Hong, Yu-Gang Jiang

The IEEE/CVF Computer Vision and Pattern Recognition Conference. CVPR 2024

[paper][code]


Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation with Deterministic Sampling Prior

Zike Wu, Pan Zhou, Xuanyu Yi, Xiaoding Yuan, Hanwang Zhang

The IEEE/CVF Computer Vision and Pattern Recognition Conference. CVPR 2024

[paper][code]


Few‑shot Learner Parameterization by Diffusion Time‑steps

Zhongqi Yue, Pan Zhou, Richang Hong, Hanwang Zhang, Qianru Sun.

The IEEE/CVF Computer Vision and Pattern Recognition Conference. CVPR 2024

[paper][code]


Discriminative Probing and Tuning for Text-to-Image Generation

Leigang Qu, Wenjie Wang, Yongqi Li, Hanwang Zhang, Liqiang Nie, Tat-Seng Chua

The IEEE/CVF Computer Vision and Pattern Recognition Conference. CVPR 2024

[paper][code]


Empowering Dynamics-aware Text-to-Video Diffusion with LLMs

Hao Fei, Shengqiong Wu, Wei Ji, Hanwang Zhang, Tat-Seng Chua

The IEEE/CVF Computer Vision and Pattern Recognition Conference. CVPR 2024

[paper]


Classes Are Not Equal: An Empirical Study on Image Recognition Fairness

Jiequan Cui, Beier Zhu, Xin Wen, Xiaojuan Qi, Bei Yu, Hanwang Zhang

The IEEE/CVF Computer Vision and Pattern Recognition Conference. CVPR 2024

[paper][code]


Exploring Diffusion Time-steps for Unsupervised Representation Learning

Zhongqi Yue, Jiankun Wang, Qianru Sun, Lei Ji, Eric I-Chao Chang, Hanwang Zhang

International Conference on Learning Representations. ICLR 2024.

[paper][code]


Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions

Juncheng Li, Kaihang Pan, Zhiqi Ge, Minghe Gao, Wei Ji, Wenqiao Zhang, Tat-Seng Chua, Siliang Tang, Hanwang Zhang, Yueting Zhuang

International Conference on Learning Representations. ICLR Spotlight 2024.

[code]


Dual-Perspective Knowledge Enrichment for Semi-Supervised 3D Object Detection

Yucheng Han, Na Zhao, Weiling Chen, Keng Teck Ma, Hanwang Zhang

The AAAI Conference on Artificial Intelligence. AAAI 2024

[paper][code]


MGNet: Learning Correspondences via Multiple Graphs

Luanyuan Dai, Xiaoyu Du, Hanwang Zhang, Jinhui Tang

The AAAI Conference on Artificial Intelligence. AAAI 2024

[paper]



2023 Publication

Generalized Logit Adjustment: Calibrating Fine-tuned Models by Removing Label Bias in Foundation Models

Beier Zhu, Kaihua Tang, Qianru Sun, Hanwang Zhang

Conference on Neural Information Processing Systems. NeurIPS 2023

[paper][code]


Make the U in UDA Matter: Invariant Consistency Learning for Unsupervised Domain Adaptation

Zhongqi Yue, Hanwang Zhang, Qianru Sun

Conference on Neural Information Processing Systems. NeurIPS 2023

[paper][code]


Tuning Multi-mode Token-level Prompt Alignment across Modalities

Dongsheng Wang, Miaoge Li, Xinyang Liu, MingSheng Xu, Bo Chen, Hanwang Zhang

Conference on Neural Information Processing Systems. NeurIPS 2023

[paper][code]


Imagine That! Abstract-to-Intricate Text-to-Image Synthesis with Scene Graph Hallucination Diffusion

Shengqiong Wu, Hao Fei, Hanwang Zhang, Tat-Seng Chua

Conference on Neural Information Processing Systems. NeurIPS 2023

[paper]


Learning Trajectory-Word Alignments for Video-Language Tasks

Xu Yang, Zhangzikang Li, Haiyang Xu, Qinghao Ye, Chenliang Li, Ming Yan, Yu Zhang, Fei Huang, Songfang Huang, Hanwang Zhang

International Conference on Computer Vision. ICCV 2023

[paper]


Equivariant Similarity for Vision-Language Foundation Models

Tan Wang, Kevin Lin, Linjie Li, Chung-Ching Lin, Zhengyuan Yang, Hanwang Zhang, Zicheng Liu, Lijuan Wang

International Conference on Computer Vision. ICCV 2023

[paper][code]


Prompt-aligned Gradient for Prompt Tuning

Beier Zhu, Yulei Niu, Yucheng Han, Yue Wu, Hanwang Zhang

International Conference on Computer Vision. ICCV 2023

[paper][code]


Random Boxes Are Open-world Object Detectors

Yanghao Wang, Zhongqi Yue, Xian-Sheng Hua, Hanwang Zhang

International Conference on Computer Vision. ICCV 2023

[paper][code]


Invariant Feature Regularization for Fair Face Recognition

Jiali Ma, Zhongqi Yue, Tomoyuki Kagaya, Tomoki Suzuki, Karlekar Jayashree, Sugiri Pranata, Hanwang Zhang.

International Conference on Computer Vision. ICCV 2023


Invariant Training 2D-3D Joint Hard Samples for Few-Shot Point Cloud Recognition

Xuanyu Yi, Jiajun Deng, Qianru Sun, Xian-Sheng Hua, Joo Hwee Lim, Hanwang Zhang

International Conference on Computer Vision. ICCV 2023

[paper][code]


Mitigating and Evaluating Static Bias of Action Representations in the Background and the Foreground

Haoxin Li, Yuan Liu, Hanwang Zhang, Boyang Li

International Conference on Computer Vision. ICCV 2023

[paper][code]


Counterfactual Active Learning for Out-of-Distribution Generalization

Xun Deng, Wenjie Wang, Fuli Feng, Hanwang Zhang, Xiangnan He, Yong Liao

Annual Meeting of the Association for Computational Linguistics. ACL 2023

[paper]


Hypothetical Training for Robust Machine Reading Comprehension of Tabular Context

Moxin Li, Wenjie Wang*, Fuli Feng, Hanwang Zhang, Qifan Wang, Tat-Seng Chua

Annual Meeting of the Association for Computational Linguistics. ACL Findings 2023

[paper]


Counterfactual Samples Synthesizing and Training for Robust Visual Question Answering

Long Chen, Yuhang Zheng, Yulei Niu, Hanwang Zhang, and Jun Xiao

IEEE Transactions on Pattern Analysis and Machine Intelligence. TPAMI 2023

[paper][code]


Bootstrap Your Own Prior: Towards Distribution-Agnostic Novel Class Discovery

Muli Yang, Liancheng Wang, Cheng Deng, Hanwang Zhang

The IEEE/CVF Computer Vision and Pattern Recognition Conference. CVPR 2023

[paper][code]


Semantic Scene Completion with Cleaner Self

Fengyun Wang, Dong Zhang, Hanwang Zhang, Jinhui Tang, Qianru Sun

The IEEE/CVF Computer Vision and Pattern Recognition Conference. CVPR 2023

[paper][code]


Unbiased Multiple Instance Learning for Weakly Supervised Video Anomaly Detection

Hui Lv, Zhongqi Yue, Qianru Sun, Bin Luo, Zhen Cui, Hanwang Zhang

The IEEE/CVF Computer Vision and Pattern Recognition Conference. CVPR 2023

[paper][code]


Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection

Kaifeng Gao, Long Chen, Hanwang Zhang, Jun Xiao, Qianru Sun

The Eleventh International Conference on Learning Representations. ICLR 2023

[paper][code]


Debiased Fine-Tuning for Vision-language Models by Prompt Regularization

Beier Zhu, Yulei Niu, Saeil Lee, Minhoe Hur, Hanwang Zhang

The AAAI Conference on Artificial Intelligence. AAAI 2023

[paper][code]



2022 Publication

Respecting Transfer Gap in Knowledge Distillation

Yulei Niu, Long Chen, Chang Zhou, Hanwang Zhang

Conference on Neural Information Processing Systems. NeurIPS 2022

[arxiv][github]


Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning

Xu Yang, Hanwang Zhang, Chongyang Gao, Jianfei Cai

International Journal of Computer Vision

[arxiv]


Identifying Hard Noise in Long-Tailed Sample Distribution[oral]

Xuanyu Yi, Xian-Sheng Hua, Joo-Hwee Lim, Hanwang Zhang

European Conference on Computer Vision. ECCV 2022

[arxiv][github]


Invariant Feature Learning for Generalized Long-Tailed Classification

Kaihua Tang, Mingyuan Tao, Jiaxin Qi, Zhenguang Liu, Hanwang Zhang

European Conference on Computer Vision. ECCV 2022

[arxiv][github]


Class Is Invariant to Context and Vice Versa: On Learning Invariance for Out-Of-Distribution Generalization

Jiaxin Qi, Kaihua Tang, Qianru Sun, Xian-Sheng Hua, Hanwang Zhang

European Conference on Computer Vision. ECCV 2022

[arxiv][github]


Equivariance and Invariance Inductive Bias for Learning from Insufficient Data

Tan Wang, Qianru Sun, Sugiri Pranata, Karlekar Jayashree, Hanwang Zhang

European Conference on Computer Vision. ECCV 2022

[arxiv][github]


Certified Robustness Against Natural Language Attacks by Causal Intervention

Haiteng Zhao*, Chang Ma*, Xinshuai Dong*, Anh Tuan Luu, Zhi-Hong Deng, Hanwang Zhang

International Conference on Machine Learning. ICML 2022


Class Re-Activation Maps for Weakly-Supervised Semantic Segmentation

Zhaozheng Chen, Tan Wang, Xiongwei Wu, Xian-Sheng Hua, Hanwang Zhang, Qianru Sun

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2022

[arxiv][github]


KQA Pro: A Dataset with Explicit Compositional Programs for Complex Question Answering over Knowledge Base

Shulin Cao, Jiaxin Shi, Liangming Pan, Lunyiu Nie, Yutong Xiang, Lei Hou, Juanzi Li, Bin He, Hanwang Zhang

Main Conference of ACL 2022


Learning to Imagine: Integrating Counterfactual Thinking in Neural Discrete Reasoning

Moxin Li, Fuli Feng, Hanwang Zhang, Xiangnan He, Fengbin Zhu, Tat-Seng Chua

Main Conference of ACL 2022


On Non-Random Missing Labels in Semi-Supervised Learning

Xinting Hu, Yulei Niu, Chunyan Miao, Xian-Sheng Hua, Hanwang Zhang

International Conference on Learning Representations. ICLR 2022.


Cross-Domain Empirical Risk Minimization for Unbiased Long-tailed Classification[oral]

Beier Zhu, Yulei Niu, Xian-Sheng Hua, Hanwang Zhang

The AAAI Conference on Artificial Intelligence. AAAI 2022.

[arxiv][github]


Deconfounded Visual Grounding[oral]

Jianqiang Huang, Yu Qin, Jiaxin Qi, Qianru Sun, Hanwang Zhang

The AAAI Conference on Artificial Intelligence. AAAI 2022.

[arxiv]


Deconfounded Image Captioning: A Causal Retrospect

Xu Yang, Hanwang Zhang, Jianfei Cai

IEEE Transactions on Pattern Analysis and Machine Intelligence. TPAMI.

[link]



2021 Publication

How Should Pre-Trained Language Models Be Fine-Tuned Towards Adversarial Robustness?

Xinshuai Dong, Anh Tuan Luu, Min Lin, Shuicheng Yan, Hanwang Zhang

Conference on Neural Information Processing Systems. NeurIPS 2021. Virtual. December 2021

[arxiv]


Introspective Distillation for Robust Question Answering

Yulei Niu, Hanwang Zhang

Conference on Neural Information Processing Systems. NeurIPS 2021. Virtual. December 2021

[arxiv][github]


Self-Supervised Learning Disentangled Group Representation as Feature[spotlight]

Tan Wang, Zhongqi Yue, Jianqiang Huang, Qianru Sun, Hanwang Zhang

Conference on Neural Information Processing Systems. NeurIPS 2021. Virtual. December 2021

[arxiv][github]


TransferNet: An Effective and Transparent Framework for Multi-hop Question Answering over Relation Graph

Jiaxin Shi, Shulin Cao, Lei Hou, Juanzi Li, Hanwang Zhang

Conference on Empirical Methods in Natural Language Processing. EMNLP 2021. Virtual. November 2021

[arxiv][github]


Transporting Causal Mechanisms for Unsupervised Domain Adaptation[oral]

Zhongqi Yue, Qianru Sun, Xian-Sheng Hua, Hanwang Zhang

IEEE International Conference on Computer Vision. ICCV 2021. Virtual. October 2021

[arxiv][github]


Causal Attention for Unbiased Visual Recognition

Tan Wang, Chang Zhou, Qianru Sun, Hanwang Zhang

IEEE International Conference on Computer Vision. ICCV 2021. Virtual. October 2021

[arxiv][github]


Self-Regulation for Semantic Segmentation

Dong Zhang, Hanwang Zhang, Jinhui Tang, Xian-Sheng Hua, Qianru Sun

IEEE International Conference on Computer Vision. ICCV 2021. Virtual. October 2021

[arxiv][github]


Auto-Parsing Network for Image Captioning and Visual Question Answering

Xu Yang, Chongyang Gao, Hanwang Zhang, Jianfei Cai

IEEE International Conference on Computer Vision. ICCV 2021. Virtual. October 2021

[arxiv]


Adversarial Visual Robustness by Causal Intervention

Kaihua Tang, Mingyuan Tao, Hanwang Zhang

arXiv preprint 2021

[arxiv]


Are Missing Links Predictable? An Inferential Benchmark for Knowledge Graph Completion

Yixin Cao, Xiang Ji, Xin Lv, Juanzi Li, Yonggang Wen and Hanwang Zhang

Association for Computational Linguistics and International Joint Conference on Natural Language Processing. ACL-IJCNLP 2021

[preprint coming soon]


Clicks can be Cheating: Counterfactual Recommendation for Mitigating Clickbait Issue

Wenjie Wang, Fuli Feng, Xiangnan He, Hanwang Zhang, Tat-Seng Chua

Special Interest Group on Information Retrieval. ACM SIGIR 2021

[arxiv]


Empowering Language Understanding with Counterfactual Reasoning

Fuli Feng, Jizhi Zhang, Xiangnan He, Hanwang Zhang and Tat-Seng Chua

Findings of ACL 2021

[preprint coming soon]


Cross-GCN: Enhancing Graph Convolutional Network with k-Order Feature Interactions

Fuli Feng, Xiangnan He, Hanwang Zhang, Tat-Seng Chua

IEEE Transactions on Knowledge and Data Engineering. IEEE TKDE 2021

[arxiv]


Distilling Causal Effect of Data in Class-Incremental Learning

Xinting Hu, Kaihua Tang, Chunyan Miao, Xian-Sheng Hua, Hanwang Zhang

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2021

[arxiv][github]


Counterfactual VQA: A Cause-Effect Look at Language Bias

Yulei Niu, Kaihua Tang, Hanwang Zhang, Zhiwu Lu, Xian-Sheng Hua, Ji-Rong Wen

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2021

[arxiv][github]


Counterfactual Zero-Shot and Open-Set Visual Recognition

Zhongqi Yue, Tan Wang, Hanwang Zhang, Qianru Sun, Xian-Sheng Hua

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2021

[arxiv][github]


Causal Attention for Vision-Language Tasks

Xu Yang, Hanwang Zhang, Guo-Jun Qi, Jianfei Cai

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2021

[arxiv][github]


The Blessings of Unlabeled Background in Untrimmed Videos

Yuan Liu, Jingyuan Chen, Zhenfang Chen, Bing Deng, Jianqiang Huang, Hanwang Zhang

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2021

[github][github]


Align R-CNN: A Pairwise Head Network for Visual Relationship Detection

Mitra Tajrobehkar, Kaihua Tang, Hanwang Zhang, Joo-Hwee Lim

IEEE Transactions on Multimedia. TMM

[IEEE Xplore]


Auto-encoding and Distilling Scene Graphs for Image Captioning

Xu Yang, Hanwang Zhang, Jianfei Cai

IEEE Transactions on Pattern Analysis and Machine Intelligence. TPAMI

[link]


Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding

Long Chen, Wenbo Ma, Jun Xiao, Hanwang Zhang, Shih-Fu Chang

The AAAI Conference on Artificial Intelligence. AAAI 2021

[arxiv preprint]



2020 Publication

Causal Intervention for Weakly-Supervised Semantic Segmentation[oral]

Dong Zhang, Hanwang Zhang, Jinhui Tang, Xian-Sheng Hua, Qianru Sun

34th Conference on Neural Information Processing Systems, NeurIPS 2020, Vancouver, Canada.

[arxiv][github]


Interventional Few-Shot Learning

Zhongqi Yue, Hanwang Zhang, Qianru Sun, Xian-Sheng Hua

34th Conference on Neural Information Processing Systems, NeurIPS 2020, Vancouver, Canada.

[arxiv][github]


Long-Tailed Classification by Keeping the Good and Removing the Bad Momentum Causal Effect

Kaihua Tang, Jianqiang Huang, Hanwang Zhang

34th Conference on Neural Information Processing Systems, NeurIPS 2020, Vancouver, Canada.

[arxiv][github]


Hierarchical Scene Graph Encoder-Decoder for Image Paragraph Captioning

Xu Yang*, Chongyang Gao*, Hanwang Zhang, Jianfei Cai

ACM Multimedia 2020.

[preprint coming soon]


Counterfactual VQA: A Cause-Effect Look at Language Bias

Yulei Niu, Kaihua Tang, Hanwang Zhang, Zhiwu Lu, Xian-Sheng Hua, Ji-Rong Wen

[arxiv preprint]


Feature Pyramid Transformer

Dong Zhang, Hanwang Zhang, Jinhui Tang, Meng Wang, Xian-Sheng Hua, Qianru Sun

European Conference on Computer Vision. ECCV 2020

[arxiv preprint][github]


Self-Adaptive Neural Module Transformer for Visual Question Answering

Zhong Huasong, Jingyuan Chen, Chen Shen, Hanwang Zhang, Jianqiang Huang, Xian-Sheng Hua

IEEE Transactions on Multimedia. TMM 2020

[IEEE Xplore]


Iterative Context-Aware Graph Inference for Visual Dialog[oral]

Guo Dan, Wang Hui, Hanwang Zhang, Zheng-Jun Zha, Meng Wang

IEEE International Conference on Computer Vision and Pattern Recognition. CVPR 2020. Seattle, USA. June 2020

[arxiv preprint][github]


Unbiased Scene Graph Generation from Biased Training[oral]

Kaihua Tang, Yulei Niu, Jianqiang Huang, Jiaxin Shi, Hanwang Zhang

IEEE International Conference on Computer Vision and Pattern Recognition. CVPR 2020. Seattle, USA. June 2020

[arxiv preprint][github]


Visual Commonsense R-CNN

Tan Wang, Jianqiang Huang, Hanwang Zhang, Qianru Sun

IEEE International Conference on Computer Vision and Pattern Recognition. CVPR 2020. Seattle, USA. June 2020

[arxiv preprint][github]


Two Causal Principles for Improving Visual Dialog

Jiaxin Qi, Yulei Niu, Jianqiang Huang, Hanwang Zhang

IEEE International Conference on Computer Vision and Pattern Recognition. CVPR 2020. Seattle, USA. June 2020

[arxiv preprint][github]


Learning to Segment the Tail

Xinting Hu, Yi Jiang, Kaihua Tang, Chunyan Miao, Jingyuan Chen, Hanwang Zhang

IEEE International Conference on Computer Vision and Pattern Recognition. CVPR 2020. Seattle, USA. June 2020

[arxiv preprint][github]


Counterfactual Samples Synthesizing for Robust Visual Question Answering

Long Chen, Xin Yan, Jun Xiao, Hanwang Zhang, Shiliang Pu, Yueting Zhuang

IEEE International Conference on Computer Vision and Pattern Recognition. CVPR 2020. Seattle, USA. June 2020

[arxiv preprint][github]


More Grounded Image Captioning by Distilling Image-Text Matching Model

Yuanen Zhou, Zhenzhen Hu, Daqing Liu, Meng Wang, Hanwang Zhang

IEEE International Conference on Computer Vision and Pattern Recognition. CVPR 2020. Seattle, USA. June 2020

[arxiv preprint][github]


Learning Filter Pruning Criteria for Deep Convolutional Neural Networks Acceleration

Yang He, Yuhang Ding, Ping Liu, Linchao Zhu, Hanwang Zhang, Yi Yang

IEEE International Conference on Computer Vision and Pattern Recognition. CVPR 2020. Seattle, USA. June 2020

[release soon]


Multi-Level Policy and Reward-Based Deep Reinforcement Learning Framework for Image Captioning

Ning Xu, Hanwang Zhang, An-An Liu, Weizhi Nie, Yuting Su, Jie Nie, Yongdong Zhang

IEEE Transactions on Multimedia. TMM 2020

[Link]


General Partial Label Learning via Dual Bipartite Graph Autoencoder

Brian Chen, Bo Wu, Alireza Zareian, Hanwang Zhang, Shih-Fu Chang

The AAAI Conference on Artificial Intelligence. AAAI 2020

[arxiv preprint]



2019 Publication

Learning to Assemble Neural Module Tree Networks for Visual Grounding [oral]

Daqing Liu, Hanwang Zhang, Zheng-Jun Zha, Feng Wu

IEEE International Conference on Computer Vision. ICCV 2019. Seoul, Korea, November 2019

[arxiv preprint]


Counterfactual Critic Multi-Agent Training for Scene Graph Generation [oral]

Long Chen, Hanwang Zhang, Jun Xiao, Xiangnan He, Shiliang Pu, Shih-Fu Chang

IEEE International Conference on Computer Vision. ICCV 2019. Seoul, Korea, November 2019

[arxiv preprint]


Learning to Collocate Neural Modules for Image Captioning

Xu Yang, Hanwang Zhang, Jianfei Cai

IEEE International Conference on Computer Vision. ICCV 2019. Seoul, Korea, November 2019

[arxiv preprint]


Making History Matter: History-Advantage Sequence Training for Visual Dialog

Tianhao Yang, Zheng-Jun Zha, Hanwang Zhang

IEEE International Conference on Computer Vision. ICCV 2019. Seoul, Korea, November 2019

[arxiv preprint]  [2nd place in 1st VisualDialog Challenge]


Variational Context: Exploiting Visual and Textual Context for Grounding Referring Expressions

Yulei Niu, Hanwang Zhang, Zhiwu Lu, and Shih-Fu Chang

IEEE Transactions on Pattern Analysis and Machine Intelligence. TPAMI 2019

[arxiv preprint]


Single-shot Semantic Image Inpainting with Densely Connected Generative Networks

Ling Shen, Richang Hong, Haoran Zhang, Hanwang Zhang and Meng Wang

ACM International Conference on Multimedia. MM 2019. Nice, France, October 2019

[Link]


Learning Using Privileged Information for Food Recognition

Lei Meng, Long Chen, Xun Yang, Hanwang Zhang, Dacheng Tao, Chunyan Miao and Tat-Seng Chua

ACM International Conference on Multimedia. MM 2019. Nice, France, October 2019

[Link]


Question-Aware Tube-Switch Network for Video Question Answering

Tianhao Yang, Zheng-Jun Zha, Hongtao Xie, Meng Wang and Hanwang Zhang

ACM International Conference on Multimedia. MM 2019. Nice, France, October 2019

[Link]


Fast Discrete Collaborative Multi-modal Hashing for Large-scale Multimedia Retrieval

Chaoqun Zheng, Lei Zhu, Xu Lu, Jingjing Li, Zhiyong Cheng, Hanwang Zhang

IEEE Transactions on Knowledge and Data Engineering. TKDE 2019

[Link]


Learning to Compose and Reason with Language Tree Structures for Visual Grounding

Richang Hong, Daqing Liu, Xiaoyu Mo, Xiangnan He, Hanwang Zhang

IEEE Transactions on Pattern Analysis and Machine Intelligence. TPAMI 2019

[arxiv preprint]


Context-Aware Visual Policy Network for Fine-Grained Image Captioning

Zheng-Jun Zha, Daqing Liu, Hanwang Zhang, Yongdong Zhang, Feng Wu

IEEE Transactions on Pattern Analysis and Machine Intelligence. TPAMI 2019

[arxiv preprint]


Explainable and Explicit Visual Reasoning over Scene Graphs

Jiaxin Shi, Hanwang Zhang, Juanzi Li

IEEE International Conference on Computer Vision and Pattern Recognition. CVPR 2019. Long Beach, USA. June 2019

[arxiv preprint]


Recursive Visual Attention in Visual Dialog  [oral]

Yulei Niu, Hanwang Zhang, Manli Zhang, Jianhong Zhang, Zhiwu Lu, Ji-Rong Wen

IEEE International Conference on Computer Vision and Pattern Recognition. CVPR 2019. Long Beach, USA. June 2019

[arxiv preprint]


Auto-Encoding Scene Graphs for Image Captioning  [oral]

Xu Yang, Kaihua Tang, Hanwang Zhang, Jianfei Cai

IEEE International Conference on Computer Vision and Pattern Recognition. CVPR 2019. Long Beach, USA. June 2019

[arxiv preprint]


Learning to Compose Dynamic Tree Structures for Visual Contexts  [oral]

Kaihua Tang, Hanwang Zhang, Baoyuan Wu, Wenhan Luo, Wei Liu

IEEE International Conference on Computer Vision and Pattern Recognition. CVPR 2019. Long Beach, USA. June 2019

[arxiv preprint]


DeepChannel: Salience Estimation by Contrastive Learning for Extractive Document Summarization

Jiaxin Shi, Chen Liang, Lei Hou, Juanzi Li, Zhiyuan Liu, Hanwang Zhang

The Thirty-Third AAAI Conference on Artificial Intelligence. AAAI 2019

[arxiv preprint]  [codes]


Learning to Embed Sentences Using Attentive Recursive Trees

Jiaxin Shi, Lei Hou, Juanzi Li, Zhiyuan Liu, Hanwang Zhang

The Thirty-Third AAAI Conference on Artificial Intelligence. AAAI 2019

[arxiv preprint]  [codes]


2018 Publication

Low-shot Learning via Covariance-Preserving Adversarial Augmentation Network

Hang Gao, Zheng Shou, Alireza Zareian, Hanwang Zhang, Shih-Fu Chang

Thirty-second Conference on Neural Information Processing Systems. NIPS 2018

[arxiv preprint]


More is Better: Precise and Detailed Image Captioning using Online Positive Recall and Missing Concepts Mining

Mingxing Zhang, Yang Yang, Hanwang Zhang, Yanli Ji, Heng-Tao Shen, Tat-Seng Chua

IEEE Transactions on Image Processing. TIP 2018


Shuffle-Then-Assemble: Learning Object-Agnostic Visual Relationship Features

Xu Yang, Hanwang Zhang, Jianfei Cai

15th European Conference on Computer Vision. ECCV 2018. Munich, Germany. Sep 2018

[arxiv preprint]  [merged in vtranse]


Context-Aware Visual Policy Network for Sequence-Level Image Captioning  [oral]

Daqing Liu, Zheng-Jun Zha, Hanwang Zhang, Yongdong Zhang, Feng Wu

ACM International Conference on Multimedia. MM 2018. Seoul, Korea, October 2018

[arxiv preprint]  [codes]


Discrete Factorization Machines for Fast Feature-based Recommendation

Han Liu, Xiangnan He, Fuli Feng, Liqiang Nie, Rui Liu, Hanwang Zhang

The 27th International Joint Conference on Artificial Intelligence. IJCAI 2018. Stockholm, Sweden, July, 2018


Multi-Level Policy and Reward Reinforcement Learning for Image Captioning

An-An Liu, Ning Xu, Hanwang Zhang, Weizhi Nie, Yuting Su, Yongdong Zhang

The 27th International Joint Conference on Artificial Intelligence. IJCAI 2018. Stockholm, Sweden, July, 2018


Self-Supervised Video Hashing with Hierarchical Binary Auto-encoder

Jingkuan Song, Hanwang Zhang, Xiangpeng Li, Lianli Gao, Meng Wang, Richang Hong

IEEE Transactions on Image Processing. TIP 2018


Attributed Social Network Embedding

Lizi Liao, Xiangnan He, Hanwang Zhang, Tat-Seng Chua

IEEE Transactions on Knowledge and Data Engineering. TKDE 2018


Zero-Shot Visual Recognition using Semantics-Preserving Adversarial Embedding Network

Long Chen, Hanwang Zhang, Jun Xiao, Wei Liu, Shih-Fu Chang

IEEE International Conference on Computer Vision and Pattern Recognition. CVPR 2018. Salt Lake City, USA. June 2018

[arxiv preprint]  [codes]


Grounding Referring Expressions in Images by Variational Context

Hanwang Zhang, Yulei Niu, Shih-Fu Chang

IEEE International Conference on Computer Vision and Pattern Recognition. CVPR 2018. Salt Lake City, USA. June 2018

[arxiv preprint]  [codes]


Learning to Guide Decoding for Image Captioning

Wenhao Jiang, Lin Ma, Xinpeng Chen, Hanwang Zhang, Wei Liu

The Thirty-Second AAAI Conference on Artificial Intelligence. AAAI 2018. New Orleans, USA, Feb 2018


Early Publication

PPR-FCN: Weakly Supervised Visual Relation Detection via Parallel Pairwise R-FCN

Hanwang Zhang, Zawlin Kyaw, Jinyang Yu, and Shih-Fu Chang

International Conference on Computer Vision. ICCV 2017. Venice, Italy, October 2017

[arxiv preprint]


Improving Event Extraction via Cross-Modal Integration

Tongtao Zhang, Spencer Whitehead, Hanwang Zhang, Hongzhi Li, Joseph Ellis, Lifu Huang, Wei Liu, Heng Ji and Shih-Fu Chang

ACM International Conference on Multimedia. MM 2017. Mountain View, CA USA, October 2017


Enhancing Micro-video Understanding by Harnessing External Sounds  [oral]

Liqiang Nie, Xiang Wang, Jianglong Zhang, Xiangnan He, Hanwang Zhang, Richang Hong and Qi Tian

ACM International Conference on Multimedia. MM 2017. Mountain View, CA USA, October 2017


Video Visual Relation Detection

Xindi Shang, Tongwei Ren, Jingfan Guo, Hanwang Zhang and Tat-Seng Chua

ACM International Conference on Multimedia. MM 2017. Mountain View, CA USA, October 2017


Video Question Answering via Gradually Refined Attention over Appearance and Motion

Dejing Xu, Zhou Zhao, Jun Xiao, Fei Wu, Hanwang Zhang, Xiangnan He and Yueting Zhuang

ACM International Conference on Multimedia. MM 2017. Mountain View, CA USA, October 2017


Attentional Factorization Machines: Learning the Weight of Feature Interactions via Attention Networks

Jun Xiao, Hao Ye, Xiangnan He, Hanwang Zhang, Fei Wu and Tat-Seng Chua

The 26th International Joint Conference on Artificial Intelligence. IJCAI 2017. Melbourne, Australia, August 2017


Videos Captioning with Attention-based LSTM and Semantic Consistency

Lianli Gao, Zhao Guo, Hanwang Zhang, Xing Xu, and Heng-Tao Shen

IEEE Transactions on Multimedia. TMM 2017


VideoWhisper: Towards Discriminative Unsupervised Video Feature Learning with Attention Based Recurrent Neural Networks

Na Zhao, Hanwang Zhang, Richang Hong, Meng Wang, Tat-Seng Chua

IEEE Transactions on Multimedia. TMM 2017


Attentive Collaborative Filtering: Multimedia Recommendation with Feature- and Item-level Attention  [oral]

Jingyuan Chen, Hanwang Zhang, Xiangnan He, Liqiang Nie, Wei Liu, Tat-Seng Chua

International ACM SIGIR Conference on Research and Development in Information Retrieval. SIGIR 2017. Tokyo, Japan. August 2017

[pdf]  [codes]


Visual Translation Embedding Network for Visual Relation Detection

Hanwang Zhang, Zawlin Kyaw, Shih-Fu Chang, Tat-Seng Chua

IEEE International Conference on Computer Vision and Pattern Recognition. CVPR 2017. Hawaii, USA. July 2017

[arxiv preprint]  [codes & data]


SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning

Long Chen, Hanwang Zhang, Jun Xiao, Liqiang Nie, Jian Shao, Wei Liu, Tat-Seng Chua

IEEE International Conference on Computer Vision and Pattern Recognition. CVPR 2017. Hawaii, USA. July 2017

[arxiv preprint]  [codes]


Matryoshka Peek: Towards Learning Fine-grained, Robust, Discriminative Features for Product Search

Zawlin Kyaw, Shuhan Qi, Ke Gao, Hanwang Zhang, Luming Zhang, Jun Xiao, Xuan Wang, Tat-Seng Chua

IEEE Transactions on Multimedia. TMM 2017


Neural Collaborative Filtering  [oral]

Xiangnan He, Lizi Liao, Hanwang Zhang, Liqiang Nie, Xia Hu, Tat-Seng Chua

26th International World Wide Web Conference. WWW 2017. Perth, Australia, April 2017

[pdf]  [codes]


I Know What You Want to Express: Sentence Element Inference by Incorporating External Knowledge Base

Xiaochi Wei, Heyan Huang, Liqiang Nie, Hanwang Zhang, Xian-Ling Mao, Tat-Seng Chua

IEEE Transactions on Knowledge and Data Engineering. TKDE 2016


Micro Tells Macro: Predicting the Popularity of Micro-Videos via a Transductive Model  [oral]

Jingyuan Chen, Xuemeng Song, Liqiang Nie, Xiang Wang, Hanwang Zhang, Tat-Seng Chua

ACM International Conference on Multimedia. MM 2016. Amsterdam, The Netherlands, October 2016


Play and Rewind: Optimizing Binary Representations of Videos by Self-Supervised Temporal Hashing  [oral]

Hanwang Zhang, Meng Wang, Richang Hong, Tat-Seng Chua

ACM International Conference on Multimedia. MM 2016. Amsterdam, The Netherlands, October 2016

[pdf]  [codes]


Learning from Collective Intelligence: Feature Learning Using Social Images and Tags

Hanwang Zhang, Xindi Shang, Huanbo Luan, Meng Wang, Tat-Seng Chua.

ACM Transactions on Multimedia Computing, Communications and Applications. TOMM (formerly known as TOMCCAP) 2016

[pdf]


Event Classification in Microblog via Social Tracking

Yue Gao, Hanwang Zhang, Xibin Zhao, Shuicheng Yan

ACM Transactions on Intelligent Systems and Technology. TIST. 2016


Discrete Collaborative Filtering  [oral]  Best Paper Honorable Mention

Hanwang Zhang, Fumin Shen, Wei Liu, Xiangnan He, Huanbo Luan, Tat-Seng Chua

International ACM SIGIR Conference on Research and Development in Information Retrieval. SIGIR 2016. Pisa, Italy. July 2016

[pdf]  [codes]


Fast Matrix Factorization for Online Recommendation with Implicit Feedback  [oral]

Xiangnan He, Hanwang Zhang, Min-Yen Kan, Tat-Seng Chua

International ACM SIGIR Conference on Research and Development in Information Retrieval. SIGIR 2016. Pisa, Italy. July 2016


Online Collaborative Learning for Open-Vocabulary Visual Classifiers

Hanwang Zhang, Xindi Shang, Wenzhuo Yang, Huan Xu, Huanbo Luan, Tat-Seng Chua.

IEEE International Conference on Computer Vision and Pattern Recognition. CVPR 2016. Las Vegas, USA. Jun 2016

[pdf]  [codes]


Discrete Image Hashing Using Large Weakly Annotated Photo Collections

Hanwang Zhang, Na Zhao, Xindi Shang, Huanbo Luan, Tat-Seng Chua.

The Thirtieth AAAI Conference on Artificial Intelligence. AAAI 2016. Phoenix, Arizona, USA. Feb 2016.


Learning Image and User Features for Recommendation in Social Networks

Xue Geng, Hanwang Zhang, Jingwen Bian, Tat-Seng Chua.

IEEE International Conference on Computer Vision. ICCV 2015. Santiago, Chile. Dec 2015.


Learning Features from Large-Scale, Noisy and Social Image-Tag Collection

Hanwang Zhang, Xindi Shang, Huanbo Luan, Yang Yang, Tat-Seng Chua.

ACM International Conference on Multimedia. MM 2015. Brisbane, Australia. Oct 2015.


Visual Coding in a Semantic Hierarchy  [oral]

Yang Yang, Hanwang Zhang, Mingxing Zhang, Fumin Shen, Xuelong Li.

ACM International Conference on Multimedia. MM 2015. Brisbane, Australia. Oct 2015.


Deep Fusion of Multiple Semantic Cues for Complex Event Recognition

Xishan Zhang, Hanwang Zhang, Yongdong Zhang, Yang Yang, Meng Wang, Huanbo Luan, Jintao Li, Tat-Seng Chua.

IEEE Transactions on Image Processing. TIP 2015


Deep Aging Face Verification with Large Gaps

Luoqi Liu, Xiong Chao, Hanwang Zhang, Zhiheng Niu, Meng Wang, Shuicheng Yan

IEEE Transactions on Multimedia. TMM 2015


Multimedia Summarization for Social Events in Microblog Stream

Jingwen Bian, Yang Yang, Hanwang Zhang, Yue Gao, Tat-Seng Chua

IEEE Transactions on Multimedia. TMM 2015


Enhancing Video Event Recognition using Automatically Constructed Semantic-Visual Knowledge Base

Xishan Zhang, Yang Yang, Yongdong Zhang, Huanbo Luan, Jintao Li, Hanwang Zhang, Tat-Seng Chua.

IEEE Transactions on Multimedia. TMM 2015


Start from Scratch: Towards Automatically Identifying, Modeling, and Naming Visual Attributes  [oral]

Hanwang Zhang, Yang Yang, Huanbo Luan, Shuicheng Yan, Tat-Seng Chua.

ACM International Conference on Multimedia. MM 2014. Orlando, USA. Nov 2014.

[pdf]  [slides]  [demo (ID:deep, PWD: deep123456)]  [models]  [codes]


Perception-Guided Multimodal Feature Fusion for Photo Aesthetics Assessment  [oral]

Luming Zhang, Yue Gao, Chao Zhang, Hanwang Zhang, Qi Tian, Roger Zimmermann.

ACM International Conference on Multimedia. MM 2014. Orlando, USA. Nov 2014.


One of a Kind: User Profiling by Social Curation  [oral]

Xue Geng, Hanwang Zhang, Zheng Song, Yang Yang, Huanbo Luan, Tat-Seng Chua.

ACM International Conference on Multimedia. MM 2014. Orlando, USA. Nov 2014.

[pdf][slides]


Image Tagging with Social Assistance  [oral]

Yang Yang, Yue Gao, Hanwang Zhang, Jie Shao and Tat-Seng Chua.

ACM International Conference on Multimedia Retrieval. ICMR 2014. Glasgow, Scotland, Apr 2014.


Robust (Semi-) Nonnegative Graph Embedding

Hanwang Zhang, Zheng-Jun Zha, Yang Yang, Shuicheng Yan, Tat-Seng Chua.

IEEE Transactions on Image Processing. TIP 2014

[pdf][codes]


Attribute-augmented Semantic Hierarchy: Towards a Unified Framework for Content-based Image Retrieval

Hanwang Zhang, Zheng-Jun Zha, Yang Yang, Shuicheng Yan, Yue Gao, Tat-Seng Chua.

ACM Transactions on Multimedia Computing, Communications and Applications. TOMM (formerly known as TOMCCAP) 2014

[pdf]


Attribute-augmented Semantic Hierarchy  [oral]

Hanwang Zhang, Zheng-Jun Zha, Yang Yang, Shuicheng Yan, Yue Gao, Tat-Seng Chua.

ACM International Conference on Multimedia. MM 2013. Barcelona, Spain. Oct 2013.

[pdf]  [slides]  Best Student Paper


Attribute Feedback  [oral]

Hanwang Zhang, Zheng-Jun Zha, Shuicheng Yan, Jingwen Bian, Tat-Seng Chua.

ACM International Conference on Multimedia. MM 2012. Nara, Japan. Oct 2012.

[pdf]  [slides]  [demo]  Best Demo Runner-up


Robust Non-negative Graph Embedding: Towards Noisy Data, Unreliable Graphs, and Noisy Labels

Hanwang Zhang, Zheng-Jun Zha, Shuicheng Yan, Meng Wang, Tat-Seng Chua.

IEEE International Conference on Computer Vision and Pattern Recognition. CVPR 2012. Rhode Island, USA. June 2012.

[pdf]  [codes]