Advancing Facial Expression Recognition -- Enhanced MobileNetV3 with Integrated Coordinate Attention and Dynamic Kernel Adaptation

Miloud Kamline; Ridha Ilyas Bendjillali; Mohammed Sofiane Bendelhoum; Asma Ouardas; Ali Abderrazak Tadjeddine

doi:10.26636/jtit.2025.2.2146

Authors

Miloud Kamline, Tahri Mohammed University, Bechar, Algeria, https://orcid.org/0009-0007-2949-4859
Ridha Ilyas Bendjillali, University Center Nour Bachir, El Bayadh, Algeria, https://orcid.org/0000-0003-2465-8192
Mohammed Sofiane Bendelhoum, University Center Nour Bachir, El Bayadh, Algeria, https://orcid.org/0000-0002-9789-8712
Asma Ouardas, University Center Nour Bachir, El Bayadh, Algeria, https://orcid.org/0000-0002-7569-3572
Ali Abderrazak Tadjeddine, University Center Nour Bachir, El Bayadh, Algeria, https://orcid.org/0000-0003-0926-3440

DOI:

https://doi.org/10.26636/jtit.2025.2.2146

Keywords:

coordinate attention mechanism, dynamic kernel adaptation, facial expression recognition, MobileNetV3, SoftSwish activation function

Abstract

This paper presents an improved approach to facial expression recognition (FER) that incorporates the coordinate attention (CA) mechanism into MobileNetV3, a lightweight CNN widely used in real-time applications on low-power devices. By encoding positional information, the CA mechanism improves the model's ability to focus on facial regions of interest, making feature extraction more accurate. Additionally, dynamic kernel adaptation (DKA) and the SoftSwish activation function are incorporated into the model to enhance the flexibility and computational efficiency of MobileNetV3. The proposed model was evaluated on three datasets, JAFFE, CK+, and FER2013, achieving accuracies of 98.84%, 99.56%, and 88.50%, respectively. These results support the viability and utility of the proposed approach for FER, especially in applications where recognition accuracy is the primary requirement.
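To illustrate how coordinate attention can encode positional information, the following is a minimal NumPy sketch of the commonly described formulation (directional average pooling, a shared channel-mixing step, and two sigmoid gates). It is not the authors' implementation: the function names, the tensor sizes, the random weights, the hard-swish nonlinearity, and the omission of batch normalization are all assumptions made for illustration, and DKA and SoftSwish are not shown because the abstract does not define them.

```python
import numpy as np

def hard_swish(x):
    # Piecewise-linear approximation of swish: x * relu6(x + 3) / 6.
    return x * np.clip(x + 3.0, 0.0, 6.0) / 6.0

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def coordinate_attention(x, w_shared, w_h, w_w):
    """Apply a coordinate-attention gate to one feature map.

    x:        (C, H, W) feature map
    w_shared: (M, C) weights of the shared 1x1 convolution (hypothetical)
    w_h, w_w: (C, M) weights of the per-direction 1x1 convolutions (hypothetical)
    """
    c, h, w = x.shape
    pooled_h = x.mean(axis=2)  # (C, H): average over the width axis
    pooled_w = x.mean(axis=1)  # (C, W): average over the height axis
    # Concatenate both directional descriptors and mix channels jointly.
    y = np.concatenate([pooled_h, pooled_w], axis=1)  # (C, H + W)
    y = hard_swish(w_shared @ y)                      # (M, H + W)
    y_h, y_w = y[:, :h], y[:, h:]
    # One attention vector per spatial direction, each with values in (0, 1).
    a_h = sigmoid(w_h @ y_h)  # (C, H)
    a_w = sigmoid(w_w @ y_w)  # (C, W)
    # Reweight every position by its row gate and its column gate.
    return x * a_h[:, :, None] * a_w[:, None, :]

# Illustrative sizes and random weights (assumptions, not values from the paper).
rng = np.random.default_rng(0)
C, H, W, M = 16, 6, 5, 8
x = rng.standard_normal((C, H, W))
w_shared = rng.standard_normal((M, C)) * 0.1
w_h = rng.standard_normal((C, M)) * 0.1
w_w = rng.standard_normal((C, M)) * 0.1
out = coordinate_attention(x, w_shared, w_h, w_w)
```

Because the two gates are indexed by row and by column, each output position is scaled by a factor that depends on where it lies in the feature map, which is the sense in which the mechanism carries positional information.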
