Towards Visually Intelligent Agents (VIA): A Hybrid Approach

Chiatti, Agnese (2021). Towards Visually Intelligent Agents (VIA): A Hybrid Approach. In: The Semantic Web: ESWC 2021 Satellite Events (Verborgh, Ruben; Hogan, Aidan; Tiddi, Ilaria; Mayer, Simon; Tommasini, Riccardo; Dimou, Anastasia; d' Amato, Claudia; Bröring, Arne; Ongenae, Femke and Alam, Mehwish eds.), Lecture Notes in Computer Science, Springer, Cham, pp. 195–206.



Service robots can undertake tasks that are impractical or even dangerous for us - e.g., industrial welding, space exploration, and others. To carry out these tasks reliably, however, they need Visual Intelligence capabilities at least comparable to those of humans. Despite the technological advances enabled by Deep Learning (DL) methods, Machine Visual Intelligence is still vastly inferior to Human Visual Intelligence. Methods which augment DL with Semantic Web technologies, on the other hand, have shown promising results. In the lack of concrete guidelines on which knowledge properties and reasoning capabilities to leverage within this new class of hybrid methods, this PhD work provides a reference framework of epistemic requirements for the development of Visually Intelligent Agents (VIA). Moreover, the proposed framework is used to derive a novel hybrid reasoning architecture, to address real-world robotic scenarios which require Visual Intelligence.

Viewing alternatives


Public Attention

Altmetrics from Altmetric

Number of Citations

Citations from Dimensions

Item Actions