Improved Lung Sound Classification Model Using Combined Residual Attention Network and Vision Transformer for Limited Dataset

Muhammad Jurej, Roslidar Roslidar, Yunida Yunida

Abstract


According to WHO data, the prevalence of respiratory disorders is increasing, exacerbated by a shortage of skilled medical professionals. Consequently, there is an urgent need for an automated lung sound classification system. Current methods rely on deep learning, but limited lung sound data resulted in low model accuracy. The widely used ICBHI 2017 dataset has an imbalanced class distribution, with a normal class at 52.8%, wheezing at 27.0%, crackles at 12.8%, and combined wheeze and crackles at 7.3%. The imbalance of the dataset may affect the model's efficiency and performance in classifying lung sounds. Given these data limitations, we propose a hybrid model, combining residual attention network (RAN) and vision transformer (ViT), to construct an effective respiratory sound classification model with a small dataset. We employ feature fusion techniques between convolutional neural network (CNN) feature maps and image patches to enrich lung sound features. Additionally, our preprocessing involves bandpass filtering, resampling sounds to 16 kHz, and normalizing volume to 15 dB. Our model achieves impressive ICBHI scores with 97.28% specificity, 92.83% sensitivity, and an average score of 95.05%, marking a 10% improvement over state-of-the-art models in previous research.

Keywords


lung sound; ICBHI score; residual attention network; vision transformer

References


Forum of International Respiratory Societies, The Global Impact of Respiratory Disease, 2nd ed. Sheffi eld, European Respiratory Society, 2017.

A. M. Alqudah, S. Qazan, and Y. M. Obeidat, “Deep learning models for detecting respiratory pathologies from raw lung auscultation sounds,” Soft comput., vol. 26, no. 24, pp. 13405–13429, 2022.

T. Aptekarev, V. Sokolovsky, E. Furman, N. Kalinina, and G. Furman, “Application of deep learning for bronchial asthma diagnostics using respiratory sound recordings,” PeerJ Comput Sci., vol. 9, pp. e1173, 2023.

Y. Kim, Y. Hyon, S. Soo Jung, S. Lee, G. Yoo, C. Chung, and T. Ha., “Respiratory sound classification for crackles, wheezes, and rhonchi in the clinical field using deep learning,” Sci Rep., vol. 11, no. 1, 2021.

N. Klaembt, R. Conradt, U. Koehler, W. Cassel, and P. Fischer, “Overnight registration of crackles, cough and wheezing in patients with interstitial lung disease”, A. miner, 2023.

B. Herrero-Cortina, M. Francín-Gallego, A. Sáez-Pérez, J., M. San Miguel-Pagola, L. Anoro-Abenoza, C. Gómez-González, J. Montero-Marco, M. Charlo-Bernardos, E. Altarribas-Bolsa, A. Pérez-Trullén, and C. Jácome, “Reliability and Validity of Computerized Adventitious Respiratory Sounds in People with Bronchiectasis,” J Clin Med, vol. 11, no. 24, 2022.

H. Melbye, J. Ravn, M. Pabiszczak, L. A. Bongo, J. Carlos, and A. Solis, “Validity of deep learning algorithms for detecting wheezes and crackles from lung sound recordings in adults,” medRxiv, vol. 75, pp. 1–12, 2022.

J. C. Aviles-Solis, C. Jácome, A. Davidsen, R. Einarsen, S. Vanbelle, H. Pasterkamp, and H. Melbye, “Prevalence and clinical associations of wheezes and crackles in the general population: The Tromsø study,” BMC Pulm Med, vol. 19, no. 1, pp. 1–11, 2019.

J. S. Park, K. Kim, J. H. Kim, Y. J. Choi, K. Kim, and D. I. Suh, “A machine learning approach to the development and prospective evaluation of a pediatric lung sound classification model,” Sci Rep, vol. 13, no. 1, pp. 1–10, 2023.

S. Reichert, R. Gass, C. Brandt, and E. Andrès, “Analysis of Respiratory Sounds: State of the Art,” Clinic. Med.: Circul., Respir. and Pulmon. Med., vol. 2, no. 5, 2008.

H. Useyin Polat, Inan, and Guler, “A Simple Computer-Based Measurement and Analysis System of Pulmonary Auscultation Sounds,” J. of Med. Syst., vol. 28, pp. 665–672, 2005.

N. S. Haider, B. K. Singh, R. Periyasamy, and A. K. Behera, “Respiratory Sound Based Classification of Chronic Obstructive Pulmonary Disease: A Risk Stratification Approach in Machine Learning Paradigm,” J Med Syst, vol. 43, no. 8, 2019.

N. Jakovljević, et al., “Hidden Markov model based respiratory sound classification,” in International Federation for Medical and Biological Engineering, 2009. IFMBE Proceedings, 2009. Springer Verlag, 2018, pp. 39–43.

G. Chambres, et al., “Automatic Detection of Patient with Respiratory Diseases Using Lung Sound Analysis,” in 2018 International Conference on Content-Based Multimedia Indexing (CBMI), IEEE, 2018, pp. 1–6.

J. Acharya and A. Basu, “Deep Neural Network for Respiratory Sound Classification in Wearable Devices Enabled by Patient Specific Model Tuning,” IEEE Trans Biomed Circuits Syst., vol. 14, no. 3, pp. 535–544, 2020.

S. Gairola, et al., “RespireNet: A Deep Neural Network for Accurately Detecting Abnormal Lung Sounds in Limited Data Setting,” in Annual International Conference of the IEEE Engineering in Medicine and Biology Society. 2021, pp. 527-530.

R. Liu, et al., “Detection of Adventitious Respiratory Sounds based on Convolutional Neural Network,” in 2019 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS), IEEE, 2019, pp. 298–303.

L. Shi, K. Du, C. Zhang, H. Ma, and W. Yan, “Lung Sound Recognition Algorithm Based on VGGish-BiGRU,” IEEE Access, vol. 7, pp. 139438–139449, 2019.

J. J. M. Escobar, O. Morales Matamoros, R. Tejeida Padilla, L. Chanona Hernández, J. P. F. Posadas Durán, A. K. Pérez Martínez, I. Lina Reyes, and H. Quintana Espinosa, “Biomedical signal acquisition using sensors under the paradigm of parallel computing,” Sensors (Switzerland), vol. 20, no. 23, pp. 1–36, 2020.

K. Kochetov, et al., “Noise masking recurrent neural network for respiratory sound classification,” in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Springer Verlag, 2018, pp. 208–217.

T. Nguyen and F. Pernkopf, “Lung Sound Classification Using Co-tuning and Stochastic Normalization,” IEEE Trans. Biomed. Eng., vol. 69, no. 9, pp. 2872–2882, 2022.

F. Demir, A. M. Ismael, and A. Sengur, “Classification of Lung Sounds with CNN Model Using Parallel Pooling Structure,” IEEE Access, vol. 8, pp. 105376–105383, 2020.

E. Messner, M. Fediuk, P. Swatek, S. Scheidl, F. Maria, S. Jüttner, H. Olschewski, and F. Pernkopf, “Multi-channel lung sound classification with convolutional recurrent neural networks,” Comput Biol Med., vol. 122, pp. 103831, 2020.

G. Petmezas, G. A. Cheimariotis, L. Stefanopoulos, L. Rocha, R. P. Paiva, A. K. Katsaggelos, and N. Maglaveras, “Automated Lung Sound Classification Using a Hybrid CNN-LSTM Network and Focal Loss Function,” Sensors, vol. 22, no. 3, 2022.

C. Wu, D. Lei, et al., “Respiratory Disease Classification Model Based on Feature Fusion,” in 2023 4th International Conference on Intelligent Computing and Human-Computer Interaction (ICHCI), IEEE, Aug. 2023, pp. 148–155.

L. D. Mang, F. J. Canadas-Quesada, J. J. Carabias-Orti, E. F. Combarro, and J. Ranilla, “Cochleogram-based adventitious sounds classification using convolutional neural networks,” Biomed Signal Process Control, vol. 82, pp. 104555, 2023.

P. Bhushan et al., “A Self-Attention Based Hybrid CNN-LSTM Architecture for Respiratory Sound Classification,” GMSARN International Journal, vol 18, pp. 54-61, 2024.

I. Moummad, et al., “Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning,” in 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, NY, USA, 2023, pp. 1-5.

Y. Gong, Y.-A. Chung, and J. Glass, “AST: Audio Spectrogram Transformer,” arXiv, vol. 3, 2021.

W. Ariyanti, et al., “Abnormal Respiratory Sound Identification Using Audio-Spectrogram Vision Transformer,” in 2023 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), IEEE, 2023, pp. 1–4.

H. Yan, Z. Li, W. Li, C. Wang, M. Wu, and C. Zhang, “ConTNet: Why not use convolution and transformer at the same time?,” arXiv, vol. 3, 2021.

A. Hassani, S. Walton, N. Shah, A. Abuduweili, J. Li, and H. Shi, “Escaping the Big Data Paradigm with Compact Transformers,” arXiv, vol. 4, 2022.

J. Neto, et al., “Convolution-Vision Transformer for Automatic Lung Sound Classification,” in 2022 35th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), Natal, Brazil, 2022, pp. 97-102.

F. Wang et al., “Residual Attention Network for Image Classification,” in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA: IEEE, Jul 2017, pp. 6450–6458.

S. Woo, J. Park, J.-Y. Lee, and I. S. Kweon, “CBAM: Convolutional Block Attention Module,” Computer Vision – ECCV, vol. 11211, pp. 3–19, 2018.

B. M. Rocha et al., “A respiratory sound database for the development of automated classification,” in IFMBE Proceedings, Springer Verlag, 2018, pp. 33–37.

B. M. Rocha, D. Filos, L. Mendes, G. Serbes, S. Ulukaya, Y. P. Kahya, N. Jakovljevic, T. L. Turukalo, I. M. Vogiatzis, E. Perantoni, E. Kaimakamis, P. Natsiavas, A. Oliveira, C. Jácome, A. Marques, N. Maglaveras, R. Pedro Paiva, I. Chouvarda, and P. de Carvalho, “An open access database for the evaluation of respiratory sound classification algorithms,” Physiol Meas, vol. 40, no. 3, 2019.

B. Ustubioglu, G. Tahaoglu, and G. Ulutas, “Detection of audio copy-move-forgery with novel feature matching on Mel spectrogram,” Expert Syst Appl, vol. 213, p. 118963, 2023.

H. Chen, X. Yuan, Z. Pei, M. Li, and J. Li, “Triple-classification of respiratory sounds using optimized S-transform and deep residual networks,” IEEE Access, vol. 7, pp. 32845–32852, 2019.


Full Text: PDF

Refbacks

  • There are currently no refbacks.


 

Indonesian Journal of Electrical Engineering and Informatics (IJEEI)
ISSN 2089-3272

Creative Commons Licence

This work is licensed under a Creative Commons Attribution 4.0 International License.

web analytics
View IJEEI Stats