Skip to content
enhancing-human-dog-interaction-through-deep-learning-and-explainable-ai-–-scientific-reports

Enhancing human-dog interaction through deep learning and explainable AI – Scientific Reports

  • Article
  • Open access
  • Published:
  • Michał Kopczyński1 &
  • Michał Czubenko1 

Scientific Reports (2026) Cite this article

We are providing an unedited version of this manuscript to give early access to its findings. Before final publication, the manuscript will undergo further editing. Please note there may be errors present which affect the content, and all legal disclaimers apply.

Subjects

Abstract

This article ventures into the novel field of recognizing dogs’ emotions, employing the Deep Learning and Transfer Learning techniques. A unique aspect of our study was the creation of a robust dataset from scratch, based on the latest research in the field of dogs’ emotion recognition. Using this dataset, we trained a model employing a transfer learning approach, fine-tuning established architectures to accurately identify dogs’ emotions. Our results demonstrate substantial success rates, underscoring the efficacy of these methodologies in enhancing human-dog interaction and improving animal welfare. To ensure that our model’s predictions were transparent and interpretable, we incorporated an eXplainable Artificial Intelligence (XAI) approach, utilizing Gradient-weighted Class Activation Mapping (Grad-CAM). This technique offers visual elucidations of the model’s predictions, identifying the critical regions within the images for the emotional classification. Our findings underscore the substantial potential of deep learning and transfer learning approaches for understanding canine emotions. We evaluated ten image classification models, including convolutional neural network architectures, transformer-based models (ConvNeXt), and a classical machine learning approach based on Support Vector Machines (SVM) using DINO v2 features. The highest performance was achieved by the YOLO v11 (You Only Look Once) architecture, with both accuracy and F1-score reaching 0.84. Notably, the DINO v2-SVM model ranked second, attaining an accuracy of 0.75 and an F1-score of 0.76, while the ensemble model combining MobileNet, EfficientNet, and ResNet50 (Residual Network with 50 layers) achieved third place, with both metrics equal to 0.75.

Acknowledgements

Thanks to the developers team: Tymoteusz Byrwa, Jakub Kłopotek-Główczewski, Maksymilian Terebus, Krzysztof Dymanowski.

Author information

Authors and Affiliations

  1. Department of Decision Systems and Robotics, Faculty of Electronics Telecommunications and Informatics, Gdańsk University of Technology, Narutowicza 11/12, 80-233, Gdańsk, Pomeranian, Poland

    Michał Kopczyński & Michał Czubenko

Authors

  1. Michał Kopczyński
  2. Michał Czubenko

Corresponding author

Correspondence to Michał Czubenko.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kopczyński, M., Czubenko, M. Enhancing human-dog interaction through deep learning and explainable AI. Sci Rep (2026). https://doi.org/10.1038/s41598-026-51009-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1038/s41598-026-51009-9

Keywords

colind88

Back To Top