Automatic disparity search range estimation in deep learning stereo

PhD Thesis

Perera, Ruveen. 2021. Automatic disparity search range estimation in deep learning stereo. PhD Thesis Doctor of Philosophy. University of Southern Queensland. https://doi.org/10.26192/m0vj-5p57

Authors
Title	Automatic disparity search range estimation in deep learning stereo
Type	PhD Thesis
Author	Perera, Ruveen
Supervisor	Low, Tobias
	Billingsley, John
Institution of Origin	University of Southern Queensland
Qualification Name	Doctor of Philosophy
Number of Pages	207
Year	2021
Digital Object Identifier (DOI)	https://doi.org/10.26192/m0vj-5p57
Abstract	This research concerns the deep stereo networks used for inferring depth from images captured with a stereo pair of cameras. Central to the process is the measurement of disparity between the images, with the computational effort depending on the limit of the 'disparity search range (DSR)' over which the search is to be performed. Like the traditional stereo techniques, deep learning stereo methods also require users to specify the upper bound of DSR, commonly known as the ‘Maximum Disparity’, manually. Selecting a substantially lower or a higher maximum disparity than necessary for a given scene can lead to disparity estimation errors or performance degradations during disparity inference. This thesis presents an automatic disparity search range estimation technique for deep learning stereo which can be seamlessly embedded into the stereo algorithm itself without requiring pre-processing or explicit configuration by the users. The method incorporates a novel metric referred to as the Sum of New Cost Extrema (SNCE). The stated metric can be estimated on a per-layer basis during the cost volume construction phase of the stereo method. This can serve as the criterion on which to decide whether to continue the cost volume creation process at a given disparity, or to terminate it. In this way, the maximum disparity for a given scene is determined. The SCNE metric is further utilised in a deep stereo algorithm which produces accurate disparity maps while estimating the disparity search range automatically without user intervention. The memory efficient design of the algorithm makes it possible to optimise memory for standard desktop computers and consumer-grade graphics processing hardware. Evaluations conducted using the benchmark stereo datasets indicate that the algorithm is able to produce accurate results for synthetic and real stereo image sequences without requiring users to set any parameter values during disparity inference, setting a new state-of-the-art benchmark in stereo disparity estimation. Results indicate improvements in performance due to the computational efficiency arising from the optimal cost volume sizes. When used with standard stereo image sequences, the algorithm performed up to 50% faster compared to a reference implementation with a fixed cost volume size. Extended testing with common stereo evaluation metrics on various real-world stereo datasets showed that the algorithm can produce accurate results under varying scene conditions. Additional tests on images captured with a custom-built stereo camera confirmed the generalization capabilities of the algorithm while demonstrating the possibility of achieving complete independence from user specified parameters values when inferring depth from real world stereo imagery.
Keywords	Stereo Vision, Deep Learning Stereo, Automatics Disparity Search Range Estimation, Deep Stereo, Parameterless Stereo Vision
ANZSRC Field of Research 2020	460304. Computer vision
	461103. Deep learning
Byline Affiliations	School of Mechanical and Electrical Engineering

Permalink -

https://research.usq.edu.au/item/q6782/automatic-disparity-search-range-estimation-in-deep-learning-stereo

Download files

Published Version

	PhD_Thesis_Ruveen_Perera_revised_Final.pdf
File access level: Anyone

342
total views
117
total downloads
1
views this month
1
downloads this month

Export as

Related outputs

Robustness of Deep Transfer Learning-Based Crack Detection against Uncertainty in Hyperparameter Tuning and Input Data

Nguyen, Andy, Chianese, Riccardo R., Gharehbaghi, Vahid R., Perera, Ruveen, Low, Tobias, Aravinthan, Thiru, Yu, Yang, Samali, Bijan, Guan, Hong, Khuc, Tung and Le, Thach N.. 2022. "Robustness of Deep Transfer Learning-Based Crack Detection against Uncertainty in Hyperparameter Tuning and Input Data." Guan, Hong, Chan, Tommy H. T. and Li, Jianchun (ed.) Recent Advances in Structural Health Monitoring Research in Australia. New York, United States. Nova Science Publishers. pp. 215-243

A computationally efficient crack detection approach based on deep learning assisted by stockwell transform and linear discriminant analysis

Nguyen, Andy, Nguyen, Canh Long, Gharehbaghi, Vahidreza, Perera, Ruveen, Brown, Jason, Yu, Yang and Kalbkhani, Hashem. 2022. "A computationally efficient crack detection approach based on deep learning assisted by stockwell transform and linear discriminant analysis." Structures. 45, pp. 1962-1970. https://doi.org/10.1016/j.istruc.2022.09.107