Please use this identifier to cite or link to this item: doi:10.22028/D291-37878
Volltext verfügbar? / Dokumentlieferung
Title: Deep Learning-based Semantic Analysis of Sparse Light Field Ray Sets
Author(s): Chelli, Kelvin
Tamboli, Roopak R.
Herfet, Thorsten
Language: English
Title: 2021 IEEE 23rd International Workshop on Multimedia Signal Processing (MMSP)
Pages: 1-6
Publisher/Platform: IEEE
Year of Publication: 2021
Place of publication: Piscataway, NJ
Place of the conference: Tampere, Finland
Free key words: light fields
light field analysis
deep learning
froxels
froxel histograms
DDC notations: 004 Computer science, internet
621.3 Electrical engineering, electronics
Publikation type: Conference Paper
Abstract: With the emergence of various light field (LF) acquisition systems and of novel techniques for processing and visualizing LFs, end-to-end LF systems start to head for the consumer market. Towards this, the semantic analysis of LFs can play a crucial role in LF processing (e.g. compression, storage and transmission), and in the standardization of LF representation schemes across various use cases. In this regard, we earlier have introduced fristograms as a tool to integrate semantics into LF processing. Fristograms collect sets of rays within a volume of a number of pixels in all 3 directions (horizontal, vertical and disparity) and thus enable semantic analysis based on the ray sets, and consequently semantic processing of LFs. Consequently, fristograms enable the application of filtering techniques considering the underlying characteristic of the scene (e.g. differentiate between Lambertian and non-Lambertian, occluded and dis-occluded regions in the scene). Motivated by the earlier results through statistical analysis of froxels enabling a significant reduction in number of rays while maintaining quality, in this paper, we explore learning-based analysis of froxels. Specifically, we propose to use a deep learning network to classify material properties (such as Lambertian, non-Lambertian, and outliers). Once the classification is done, the LF is filtered semantically. Preliminary results show that compared to the statistical ray analysis of froxels, a learning-based approach can reduce the number of rays even further, yet maintain the visual quality of the LF as measured by well-known quality metrics.
DOI of the first publication: 10.1109/MMSP53017.2021.9733489
Link to this record: urn:nbn:de:bsz:291--ds-378788
hdl:20.500.11880/34255
http://dx.doi.org/10.22028/D291-37878
ISBN: 978-1-6654-3288-7
978-1-66543-287-0 (ISBN der parallelen Ausg. auf e. anderen Datenträger)
978-1-66543-289-4 (ISBN der Printausgabe)
Date of registration: 8-Nov-2022
Third-party funds sponsorship: The work underlying this paper has been funded by the German National Science Foundation (DFG) under the project FiDaLiS, grant number 429078454.
Faculty: MI - Fakultät für Mathematik und Informatik
Department: MI - Informatik
Professorship: MI - Prof. Dr. Thorsten Herfet
Collections:SciDok - Der Wissenschaftsserver der Universität des Saarlandes

Files for this record:
There are no files associated with this item.


Items in SciDok are protected by copyright, with all rights reserved, unless otherwise indicated.