Hou-Ning Hu

My Profile Avatar

Ph.D. Candidate, Vision Science Lab
National Tsing Hua University, Taiwan

Take a look at his résumé [pdf].
last updated in December 2020.

NEWS:
Oct. 2020 - I am honored to receive the 2020 Google Ph.D. Fellowship!
Hou-Ning Hu is a Ph.D. candidate at the Department of Electrical Engineering, National Tsing Hua University. He works with Prof. Min Sun as a member of the Vision Science Lab on Deep Learning and its applications in Computer Vision.

He is the recipant of the 2020 Google Ph.D. Fellowship in Machine Perception, Speech Technology and Computer Vision field. His research interests span on wide applications of computer vision techniques, such as user experience, visual saliency and depth estimation on 360 videos, video dynamic, super-resolution.

Recently, his research focuses are learning 3D geometry from visual perception of the surroundings, in applications of 3D tracking and multi-sensor fusion. Also, he actively participates in CarePLUS.ai, an AI-assisted home care system project debuted at CES 2020, as a principal AI technical leader. For 3D tracking, he works with Prof. Trevor Darrell and Dr. Fisher Yu. For depth completion, he works with Prof. Wei-Chen Chiu and NEC Labs America researcher Yi-Hsuan Tsai.

Monocular Quasi-Dense 3D Object Tracking

Hou-Ning Hu, Yong-Hsu Yang, Tobias Fischer, Trevor Darrell, and Fisher Yu, Min Sun
In Submission

@article{Hu2021QD3DT,
author = {Hu, Hou-Ning and Yang, Yung-Hsu and Fischer, Tobias and Yu, Fisher and Darrell, Trevor and Sun, Min},
title = {Monocular Quasi-Dense 3D Object Tracking},
journal = {ArXiv:2103.07351},
year = {2021}
}

Joint Monocular 3D Vehicle Detection and Tracking

Hou-Ning Hu, Qizhi Cai, Dequan Wang, Ji Lin, Min Sun, Philipp Krähenbühl, Trevor Darrell, and Fisher Yu
ICCV 2019
ICCV 2019 Workshop

@inproceedings{Hu2019Mono3DT,
author = {Hu, Hou-Ning and Cai, Qi-Zhi and Wang, Dequan and Lin, Ji and Sun, Min and Krähenbühl, Philipp and Darrell, Trevor and Yu, Fisher},
title = {Joint Monocular 3D Vehicle Detection and Tracking},
journal = {ICCV},
year = {2019}
}

3D LiDAR and Stereo Fusion using Stereo Matching Network with Conditional Cost Volume Normalization

Tsun-Hsuan Wang, Hou-Ning Hu, Chieh Hubert Lin, Yi-Hsuan Tsai, Wei-Chen Chiu, and Min Sun
IROS 2019

@inproceedings{WangCCVNorm19,
author = {Wang, Tsun-Hsuan and Hu, Hou-Ning and Lin, Chieh Hubert and Tsai, Yi-Hsuan and Chiu, Wei-Chen and Sun, Min},
title = {3D LiDAR and Stereo Fusion using Stereo Matching Network with Conditional Cost Volume Normalization},
journal = {IROS},
year = {2019}
}

Self-Supervised Learning of Depth and Camera Motion from 360° Videos

Fu-En Wang*, Hou-Ning Hu*, Hsien-Tzu Cheng*, Juan-Ting Lin, Shang-Ta Yang, Meng-Li Shih, Hung-Kuo Chu, and Min Sun (*indicate equal contribution)
ACCV 2018 Oral
ECCV 2018 Workshop

@inproceedings{WangACCV18,
author = {Wang, Fu-En and Hu, Hou-Ning and Cheng, Hsien-Tzu and Lin, Juan-Ting and Yang, Shang-Ta and Shih, Meng-Li and Chu, Hung-Kuo and Sun, Min},
title = {Self-Supervised Learning of Depth and Camera Motion from 360° Videos},
journal = {Asian Conference on Computer Vision (ACCV)},
year = {2018}
}

Self-view Grounding Given a Narrated 360° Video

Shih-Han Chou, Yi-Chun Chen, Kuo-Hao Zeng, Hou-Ning Hu, Jianlong Fu, and Min Sun
AAAI 2018
ICCV 2017 Workshop

@inproceedings{ChouAAAI18,
author = {Chou, Shih-Han and Chen, Yi-Chun and Zeng, Kuo-Hao and Hu, Hou-Ning and Fu, Jianlong and Sun, Min},
title = {Self-view Grounding Given a Narrated 360° Video},
journal = {AAAI Conference on Artificial Intelligence (AAAI)},
year = {2018}
}

Deep 360 Pilot: Learning a Deep Agent for Piloting through 360° Sports Videos

Hou-Ning Hu*, Yen-Chen Lin*, Ming-Yu Liu, Hsien-Tzu Cheng, Yung-Ju Chang, and Min Sun
IEEE CVPR 2017 Oral
(* indicates equal contribution)

@inproceedings{HuCVPR17,
author = {Hu, Hou-Ning and Lin, Yen-Chen and Liu, Ming-Yu and Cheng, Hsien-Tzu and Chang, Yung-Ju and Sun, Min},
title = {Deep 360 Pilot: Learning a Deep Agent for Piloting through 360° Sports Videos},
journal = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year = {2017}
}

Tell Me Where to Look: Investigating Ways for Assisting Focus in 360° Video

Yen-Chen Lin, Yung-Ju Chang, Hou-Ning Hu, Hsien-Tzu Cheng, Chi-Wen Huang, and Min Sun
ACM CHI 2017

@inproceedings{LinCHI17,
author = {Lin, Yen-Chen and Chang, Yung-Ju and Hu, Hou-Ning and Cheng, Hsien-Tzu and Huang, Chi-Wen and Sun, Min},
title = {Tell Me Where to Look: Investigating Ways for Assisting Focus in 360° Video},
booktitle = {Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems},
series = {CHI '17},
year = {2017},
isbn = {978-1-4503-4655-9},
location = {Denver, Colorado, USA},
pages = {2535--2545},
numpages = {11},
url = {http://doi.acm.org/10.1145/3025453.3025757},
doi = {10.1145/3025453.3025757},
acmid = {3025757},
publisher = {ACM},
address = {New York, NY, USA},
keywords = {360-degree videos, auto pilot, focus assistance, video experience, visual guidance},
}

Services

Organizer

2nd 360° Perception and Interaction Workshop.

Principal Organizer
Oct. 2019 - Oct. 2019
Seoul, Korea

1st 360° Perception and Interaction Workshop.

Principal Organizer
Sep. 2018 - Sep. 2018
Munich, Germany

Student Staff

3rd Augmented Intelligence and Interaction (AII) Workshop

Major Student Staff
June 2019 - July 2019

1st Augmented Intelligence and Interaction (AII) Workshop

Major Student Staff
June 2017 - June 2017

The 13th Asian Conference on Computer Vision (ACCV’16)

Student Staff
Nov. 2016 - Nov. 2016

Reviewer

ICCV 2019, WACV 2020, CVPR 2020, ECCV 2020, ICRA2021

1st 360PI Workshop, 2nd 360PI Workshop

Experiences

Novatek Microelectronics Corp.

Research Intern
Jul. 2017 - Aug. 2017
Computer Vision Algorithm Development in Display Gamma Curves Correction

CAREPLUS.ai 居家照護 AI新選擇

Principal AI Technical Leader
Jan. 2019 - Feb. 2021
Home Care System Architecture
and Data Annotation Pipeline Design

EE6485 Computer Vision

Head Teaching Assistant
Sep. 2016 - Jan. 2017

Side Projects

Tensorflow Implementation of SoundNet

A Tensorflow implementation of SoundNet from the paper "SoundNet: Learning Sound Representations from Unlabeled Video" by Yusuf Aytar, Carl Vondrick, Antonio Torralba. NIPS 2016

Perspective Transformation along Specific Axes

A wrapper to conduct perspective transformation along given axes. It was modified and used in Kaggle contests.

Get in touch