Depth Estimation for Light-Field Images Using Stereo Matching and Convolutional Neural Networks

Research output: Contribution to journalArticle

1 Citation (Scopus)
22 Downloads (Pure)


The paper presents a novel depth-estimation method for light-field (LF) images based on innovative multi-stereo matching and machine-learning techniques. In the first stage, a novel block-based stereo matching algorithm is employed to compute the initial estimation. The proposed algorithm is specifically designed to operate on any pair of sub-aperture images (SAIs) in the LF image and to compute the pair’s corresponding disparity map. For the central SAI, a disparity fusion technique is proposed to compute the initial disparity map based on all available pairwise disparities. In the second stage, a novel pixel-wise deep-learning (DL)-based method for residual error prediction is employed to further refine the disparity estimation. A novel neural network architecture is proposed based on a new structure of layers. The proposed DL-based method is employed to predict the residual error of the initial estimation and to refine the final disparity map. The experimental results demonstrate the superiority of the proposed framework and reveal that the proposed method achieves an average improvement of 15.65% in root mean squared error (RMSE), 43.62% in mean absolute error (MAE), and 5.03% in structural similarity index (SSIM) over machine-learning-based state-of-the-art methods.
Original languageEnglish
Article number6188
Pages (from-to)1-20
Number of pages20
Issue number21
Publication statusPublished - 1 Nov 2020


  • Depth Estimation
  • Machine Learning
  • Residual learning

Fingerprint Dive into the research topics of 'Depth Estimation for Light-Field Images Using Stereo Matching and Convolutional Neural Networks'. Together they form a unique fingerprint.

Cite this