Rui Qian (钱瑞)

I work at Google CoreML on Generative AI. I received my Ph.D. in Computer Science from Cornell University and Cornell Tech, advised by Prof. Serge Belongie. Prior to Cornell, I received my B.S. in Computer Science, summa cum laude, from Peking University, where I worked with Prof. Jiaying Liu. I'm interested in label-efficient and multimodal video understanding. I have completed several wonderful internships at Google Research (2020-2022), Bytedance AI Lab (2019), and Microsoft Research (2018-2019).

[GitHub] [Google Scholar] [LinkedIn]


(arXiv, 2022)
Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models

Rui Qian, Yeqing Li, Zheng Xu, Ming-Hsuan Yang, Serge Belongie, Yin Cui

(arXiv, 2021)
Revisiting 3D ResNets for Video Recognition

Xianzhi Du, Yeqing Li, Yin Cui, Rui Qian, Jing Li, Irwan Bello

Selected Publications

(ECCV 2022)
Exploring Fine-grained Audiovisual Categorization

Rui Qian, Kimberly Wilber, Hartwig Adam, Oisin Mac Aodha, Serge Belongie, Grant Van Horn

(BMVC 2022)
Exploring Temporal Granularity in Self-Supervised Video Representation Learning

Rui Qian, Yeqing Li, Liangzhe Yuan, Boqing Gong, Ting Liu, Matthew Brown, Serge Belongie, Ming-Hsuan Yang, Hartwig Adam, Yin Cui

(CVPR 2022)
Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision

Liangzhe Yuan, Rui Qian, Yin Cui, Boqing Gong, Florian Schroff, Ming-Hsuan Yang, Hartwig Adam, Ting Liu

(CVPR 2021)
Spatiotemporal Contrastive Video Representation Learning

Rui Qian*, Tianjian Meng*, Boqing Gong, Ming-Hsuan Yang, Huisheng Wang, Serge Belongie, Yin Cui

(CVPR 2021, Oral)
Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation

Golnaz Ghiasi*, Yin Cui*, Aravind Srinivas*, Rui Qian, Tsung-Yi Lin, Ekin D. Cubuk, Quoc V. Le, Barret Zoph

(NeurIPS 2021)
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text

Hassan Akbari, Liangzhe Yuan, Rui Qian, Wei-Hong Chuang, Shih-Fu Chang, Yin Cui, Boqing Gong

(CVPR 2020)
End-to-end Pseudo-LiDAR for Image-Based 3D Object Detection

Rui Qian*, Divyansh Garg*, Yan Wang*, Yurong You*, Serge Belongie, Bharath Hariharan, Mark Campbell, Kilian Weinberger, Wei-Lun Chao

(AAAI 2019, Spotlight)
Weakly Supervised Scene Parsing with Point-Based Distance Metric Learning

Rui Qian, Yunchao Wei, Honghui Shi, Jiachen Li, Jiaying Liu, Thomas Huang

(CVPR 2018, Spotlight)
Attentive Generative Adversarial Network for Raindrop Removal from A Single Image

Rui Qian, Robby T. Tan, Wenhan Yang, Jiajun Su, Jiaying Liu


Google Research

Research Intern
May 2022 - Aug 2022
Hosts: Dr. Yin Cui, Dr. Boqing Gong, Dr. Tsung-Yi Lin, Prof. Ming-Hsuan Yang

Bytedance AI Research

Research Intern
Mar 2019 - Jul 2019
Hosts: Dr. Ding Liu, Dr. Xiaohui Shen

Microsoft Research

Research Intern
Sep 2018 - Mar 2019
Host: Dr. Stephen Lin


I really love my workspace at Cornell Tech, which has a 180-degree view of Manhattan (day and night).

Here is the view from the House at Cornell Tech (summer and winter).