Tong He
Email : tonghe90[at]gmail[dot]com
I am now a Research Fellow at Shanghai AI Lab, working with Prof. Ouyang Wanli and Prof. Qiao Yu. I was previously a Research Fellow at the Australian Institute for Machine Learning (AIML), the University of Adelaide, working with Prof. Chunhua Shen and Prof. Anton van den Hengel.

I received my PhD in computer science from the University of Adelaide, where I was supervised by Chunhua Shen. I was a visiting student at MMLAB of the Chinese University of Hong Kong at Shenzhen under the supervision of Dr. Weilin Huang and Prof. Yu Qiao.


  • Oct, 2022: One paper has been accepted by SIGGRAPH Asia.
  • Oct, 2022: The extended version of DyCo3D has been accepted by T-PAMI.
  • July, 2022: One paper has been accepted by ECCV 2022.
  • April, 2022: Check out our latest paper on instance segmentation for 3D point clouds.
  • March, 2021: One T-PAMI paper has been accepted.
  • March, 2021: One IJCV paper has been accepted.
  • March, 2021: Two CVPR papers have been accepted.
  • Nov, 2020: Received my Ph.D. degree; my thesis was awarded the Dean’s Commendation for Doctoral Thesis Excellence.
  • Oct, 2020: The extended version of FCOS has been accepted by T-PAMI.
  • July, 2020: Two ECCV papers have been accepted.
  • March, 2020: One CVPR paper has been accepted.


Ponder: Point Cloud Pre-training via Neural Rendering
D. Huang, S. Peng, T. He, X. Zhou and W. Ouyang.
arxiv 2023, [PDF] [code]
MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled Consistency
M. Xu, M. Xu, T. He, W. Ouyang, Y. Wang, X. Han, and Y. Qiao.
arxiv 2022, [PDF]
OBMO: One Bounding Box Multiple Objects for Monocular 3D Object Detection
C. Huang, T. He, H. Ren, W. Wang, B. Lin, and D. Cai.
arxiv 2022, [PDF]
Frozen CLIP Model is An Efficient Point Cloud Backbone
X. Huang, S. Li, W. Qu, T. He, Y. Zuo and W. Ouyang.
arxiv 2022, [PDF] [code]
GD-MAE: Generative Decoder for MAE Pre-training on LiDAR Point Clouds
H. Yang, T. He, J. Liu, H. Chen, B. Wu, B. Lin, X. He and W. Ouyang.
arxiv 2022, [PDF] [code]
3D-QueryIS: A Query-based Framework for 3D Instance Segmentation
J. Liu, T. He, H. Yang, R. Su, J. Tian, J. Wu, H. Guo, K. Xu and W. Ouyang.
arxiv 2022, [PDF] [code]
CP3: Unifying Point Cloud Completion by Pretrain-Prompt-Predict Paradigm
M. Xu, Y. Wang, Y. Liu, T. He, and Y. Qiao.
arxiv 2022, [PDF] [code]
Reconstructing Hand-Held Objects from Monocular Video
D. Huang, X. Ji, X. He, J. Sun, T. He, Q. Shuai, W. Ouyang, and X. Zhou.
SIGGRAPH Asia 2022, [PDF] [Project Page] [code]
The Equalization Losses: Gradient-Driven Training for Long-tailed Object Recognition
J. Tan, B. Li, X. Lu, Y. Yao, F. Yu, T. He, W. Ouyang.
arxiv 2022, [PDF] [code]
Dynamic Convolution for 3D Point Cloud Instance Segmentation
T. He, C. Shen and A. van den Hengel
T-PAMI, 2022 [PDF] [code]
PointInst3D: Segmenting 3D Instances by Points
T. He, W. Yin, C. Shen, A. van den Hengel
ECCV, 2022 [PDF] [code]
ABCNet v2: Adaptive Bezier-Curve Network for Real-time End-to-end Text Spotting
Y. Liu, C. Shen, L. Jin, T. He, P. Chen, C. Liu and H. Chen
T-PAMI, 2021 [PDF] [code]
Exploring the Capacity of Sequential-free Box Discretization Network for Omnidirectional Scene Text Detection
Y. Liu, T. He, H. Chen, X. Wang, C. Luo, S. Zhang, C. Shen and L. Jin
IJCV, 2021 [PDF] [code]
HCRF-Flow: Scene Flow from Point Clouds with Continuous High-order CRFs and Position-aware Flow Embedding
R. Li, G. Lin, T. He, F. Liu and C. Shen
CVPR, 2021 [PDF]
DyCo3D: Robust Instance Segmentation of 3D Point Clouds through Dynamic Convolution
T. He, C. Shen, and A. van den Hengel
CVPR, 2021 [PDF] [Code]
Learning and Memorizing Representative Prototypes for 3D Point Cloud Semantic and Instance Segmentation
T. He, D. Gong, Z. Tian and C. Shen
ECCV, 2020 [PDF]
Instance-Aware Embedding for Point Cloud Instance Segmentation
T. He, Y. Liu, C. Shen, X. Wang and C. Sun
ECCV, 2020 [PDF]
FCOS: A Simple and Strong Anchor-free Object Detector
Z. Tian, C. Shen, H. Chen, T. He
T-PAMI, 2020. [PDF] [Code]
ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network
Y. Liu, H. Chen, C. Shen, T. He, L. Jin, L. Wang
CVPR 2020 [PDF] [Code]
FCOS: Fully Convolutional One-Stage Object Detection
Z. Tian, C. Shen, H. Chen, T. He
ICCV, 2019 [PDF] [Code]
Knowledge Translation and Adaptation for Efficient Semantic Segmentation
T. He, C. Shen, Z. Tian, D. Gong, C. Sun, Y. Yan
CVPR, 2019 [PDF] [Results On Cityscapes Test Set]
Decoders Matter for Semantic Segmentation: Data-Dependent Decoding Enables Flexible Feature Aggregation
Z. Tian, T. He, C. Shen, Y. You
CVPR, 2019 [PDF]
An End-to-End TextSpotter with Explicit Alignment and Attention
T. He, Z. Tian, W. Huang, C. Shen, Y. Qiao, C. Sun
CVPR, 2018 [PDF] [code]
Single Shot Text Detector with Regional Attention
P. He, W. Huang, T. He, Q. Zhu, Y. Qiao, X. Li
ICCV, 2017 [PDF] [code]
Orientation-Aware Text Proposals Network for Scene Text Detection
H. Huang, Z. Tian, T. He, W. Huang, Y. Qiao
CCBR, 2017
Detecting Text in Natural Image with Connectionist Text Proposal Network
Z. Tian, W. Huang, T. He, P. He, Y. Qiao
ECCV, 2016 [demo] [code]
Accurate Text Localization in Natural Image with Cascaded Convolutional Text Network
T. He, W. Huang, Y. Qiao and J. Yao
arxiv [arxiv 1510.03283]
Text-Attentional Convolutional Neural Networks for Scene Text Detection
T. He, W. Huang, Y. Qiao and J. Yao
T-IP 2016 [arxiv 1510.03283]
An efficient method for text detection from indoor panorama images using extremal regions
Y. Liu, K. Zhang, J. Yao, T. He, Y. Liu and J. Tu
ICIA, 2015.
Accurate Multi-Scale License Plate Localization Via Image Saliency
T. He, J. Yao, K. Zhang, Y. Hou and S. Han
ITSC, 2014. [oral]

