Document Type

Article

Abstract

We present a method for extracting high-level semantic information through successful landmark detection using feature fusion between RGB and depth information. We focus on the classification of specific labels (open path, humans, staircases, doorways, obstacles) in the encountered scene, which can be a fundamental source of information enhancing scene understanding, and acting towards the safe navigation of the mobile unit. Experiments are conducted using a manual wheelchair equipped with a stereo RGB-D camera that captures image instances consisting of multiple labels before fine-tuning on a pre-trained Vision Transformer (ViT).

Disciplines

Computer Sciences | Physical Sciences and Mathematics

Publication Date

7-1-2023

Language

English

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Recommended Citation

Sevastopoulos, Christos; Acharya, Sneh; and Makedon, Fillia, "An RGB-D Fusion System for Indoor Wheelchair Navigation" (2023). Computer Science and Engineering Faculty Publications. 4.
https://mavmatrix.uta.edu/cse_facpubs/4

Download

Included in

Computer Sciences Commons

COinS

Computer Science and Engineering Faculty Publications

An RGB-D Fusion System for Indoor Wheelchair Navigation

Document Type

Abstract

Disciplines

Publication Date

Language

License

Recommended Citation

Included in

Search

Browse

Author & Creator Corner

Links

Computer Science and Engineering Faculty Publications

An RGB-D Fusion System for Indoor Wheelchair Navigation

Authors

Document Type

Abstract

Disciplines

Publication Date

Language

License

Recommended Citation

Included in

Share

Search

Browse

Author & Creator Corner

Links