Advancing Facial Expression Recognition -- Enhanced MobileNetV3 with Integrated Coordinate Attention and Dynamic Kernel Adaptation
DOI:
https://doi.org/10.26636/jtit.2025.2.2146Keywords:
coordinate attention mechanism, dynamic kernel adaptation, facial expression recognition, MobileNetV3, SoftSwish activation functionAbstract
This paper presents an improved approach for facial expression recognition (FER), which incorporates the Coordinate Attention (CAM) mechanism into MobileNetV3, a lightweight CNN widely used for its real-time applications on low-power devices. The CA mechanism greatly improves the ability of the model to focus on face regions of interest, as it incorporates positional information, making feature extraction more accurate. Additionally, dynamic kernel adaptation (DKA) and SoftSwish are incorporated into the model to enhance the flexibility and computational efficiency of MobileNetV3. The proposed model was tested in three sets of JAFFE, CK+, and FER2013, where accuracy improvements were reported of 98.84% in the JAFFE dataset, 99.56% on the CK+ dataset, and 88.50% on the FER2013 dataset. These results support the viability and utility of the proposed approach to improve FER, especially in applications that favor higher numerical performance.
Downloads
References
[1] R.I. Bendjillali, M. Beladgham, K. Merit, and T.A. Abdelmalik, "Wavelet-based Facial Recognition", 2018 6th International Conference on Control Engineering & Information Technology (CEIT), Istanbul, Türkiye, 2018.
View in Google Scholar
[2] R.I. Bendjillali et al., "A Robust-facial Expressions Recognition System Using Deep Learning Architectures", 2023 International Conference on Decision Aid Sciences and Applications (DASA), Annaba, Algeria, 2023.
View in Google Scholar
[3] M. Kamline, M.L. Abdelmounaim, and R.I. Bendjillali, "Arabic Handwriting Recognition System Based on Genetic Algorithm and Deep CNN Architectures", 2021 International Conference on Decision Aid Sciences and Application (DASA), Sakheer, Bahrain, 2021.
View in Google Scholar
[4] R.I. Bendjillali, M.S. Bendelhoum, A.A. Tadjeddine, and M. Kamline, "Deep Learning-powered Beamforming for 5G Massive MIMO Systems", Journal of Telecommunications and Information Technology, no. 4, pp. 38-45, 2023.
View in Google Scholar
[5] A. Howard et al., "Searching for MobileNetV3", 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, South Korea, 2019.
View in Google Scholar
[6] Q. Hou, D. Zhou, and J. Feng, "Coordinate Attention for Efficient Mobile Network Design", 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, USA, 2021.
View in Google Scholar
[7] J.L. Ngwe et al., "PAtt-Lite: Lightweight Patch and Attention MobileNet for Challenging Facial Expression Recognition", IEEE Access, vol. 12, pp. 79327-79341, 2024.
View in Google Scholar
[8] J. Zhu and Y. Cao, "Face Expression Recognition Combining Improved DeeplabV3+ and Migration Learning", Journal of Physics: Conference Series, vol. 2555, 2023.
View in Google Scholar
[9] Y. Gan, J. Chen, Z. Yang, and L. Xu, "Multiple Attention Network for Facial Expression Recognition", IEEE Access, vol. 8, pp. 7383-7393, 2020.
View in Google Scholar
[10] Z. Hu and C. Yan, "Lightweight Multi-scale Network with Attention for Facial Expression Recognition", 2021 4th International Conference on Advanced Electronic Materials, Computers and Software Engineering (AEMCSE), Changsha, China, 2021.
View in Google Scholar
[11] R. Poyiadzi et al., "Domain Generalization for Apparent Emotional Facial Expression Recognition across Age-groups", ArXiv, 2021.
View in Google Scholar
[12] I. Dominguez-Catena, D. Paternain, and M. Galar, "Gender Stereotyping Impact in Facial Expression Recognition", 7th Workshop on Data Science for Social Good, Grenoble, France, 2022.
View in Google Scholar
[13] S. Xie, M. Li, S. Liu, and X. Tang, "ResNet with Attention Mechanism and Deformable Convolution for Facial Expression Recognition", 2021 4th International Conference on Information Communication and Signal Processing (ICICSP), Shanghai, China, 2021.
View in Google Scholar
[14] M. Zeng, Y. Luo, and G. Liu, "Lightweight Facial Expression Recognition Network with Dynamic Deep Mutual Learning", Proc. of the 2023 3rd International Conference on Bioinformatics and Intelligent Computing, pp. 222-226, 2023.
View in Google Scholar
[15] B. Zoph, V. Vasudevan, J. Shlens, and Q.V. Le, "Learning Transferable Architectures for Scalable Image Recognition", IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, USA, 2018.
View in Google Scholar
[16] A.G. Howard et al., "MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications", ArXiv, 2017.
View in Google Scholar
[17] M. Tan and Q.V. Le, "EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks", ArXiv, 2019.
View in Google Scholar
[18] M. Tan et al., "MnasNet: Platform-aware Neural Architecture Search for Mobile", IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, USA, 2019.
View in Google Scholar
[19] X. Zhang, X. Zhou, M. Lin, and J. Sun, "ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices", Proc. of Conference on Computer Vision and Pattern Recognition, pp. 6848-6856, 2018.
View in Google Scholar
[20] D. Han, J. Kim, and J. Kim, "Deep Pyramidal Residual Networks", 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, USA, 2017.
View in Google Scholar
[21] P. Ramachandran, B. Zoph, and Q.V. Le, "Searching for Activation Functions", ArXiv, 2017.
View in Google Scholar
[22] M. Lyons, S. Akamatsu, M. Kamachi, and J. Gyoba, "Coding Facial Expressions with Gabor Wavelets", Third IEEE International Conference on Automatic Face and Gesture Recognition, Nara, Japan, 1998.
View in Google Scholar
[23] P. Lucey et al., "The Extended Cohn-Kanade Dataset (CK+): A Complete Dataset for Action Unit and Emotion-specified Expression", 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops, San Francisco, USA, 2010.
View in Google Scholar
[24] I.J. Goodfellow et al., "Challenges in Representation Learning: A Report on Three Machine Learning Contests", ArXiv, 2013.
View in Google Scholar
[25] J. Li et al., "Attention Mechanism-based CNN for Facial Expression Recognition", Neurocomputing, vol. 411, pp. 340-350, 2020.
View in Google Scholar
[26] C. Liang et al., "Facial Expression Recognition Using LBP and CNN Networks Integrating Attention Mechanism", 2023 Asia Symposium on Image Processing (ASIP), Tianjin, China, 2023.
View in Google Scholar
[27] M. Chen et al., "Facial Expression Recognition Method Combined with Attention Mechanism", Mobile Information Systems, 2021.
View in Google Scholar
[28] S. Li et al., "Auto-FERNet: A Facial Expression Recognition Network with Architecture Search", IEEE Transactions on Network Science and Engineering, vol. 8, pp. 2213-2222, 2021.
View in Google Scholar
[29] Y. Kong et al., "Lightweight Facial Expression Recognition Method Based on Attention Mechanism and Key Region Fusion", Journal of Electronic Imaging, vol. 30, art. no. 063002, 2021.
View in Google Scholar
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Miloud Kamline, Ridha Ilyas Bendjillali, Mohammed Sofiane Bendelhoum, Asma Ouardas, Ali Abderrazak Tadjeddine

This work is licensed under a Creative Commons Attribution 4.0 International License.