Improved Lung Sound Classification Model Using Combined Residual Attention Network and Vision Transformer for Limited Dataset
Abstract
Keywords
References
Forum of International Respiratory Societies, The Global Impact of Respiratory Disease, 2nd ed. Sheffi eld, European Respiratory Society, 2017.
A. M. Alqudah, S. Qazan, and Y. M. Obeidat, “Deep learning models for detecting respiratory pathologies from raw lung auscultation sounds,” Soft comput., vol. 26, no. 24, pp. 13405–13429, 2022.
T. Aptekarev, V. Sokolovsky, E. Furman, N. Kalinina, and G. Furman, “Application of deep learning for bronchial asthma diagnostics using respiratory sound recordings,” PeerJ Comput Sci., vol. 9, pp. e1173, 2023.
Y. Kim, Y. Hyon, S. Soo Jung, S. Lee, G. Yoo, C. Chung, and T. Ha., “Respiratory sound classification for crackles, wheezes, and rhonchi in the clinical field using deep learning,” Sci Rep., vol. 11, no. 1, 2021.
N. Klaembt, R. Conradt, U. Koehler, W. Cassel, and P. Fischer, “Overnight registration of crackles, cough and wheezing in patients with interstitial lung disease”, A. miner, 2023.
B. Herrero-Cortina, M. Francín-Gallego, A. Sáez-Pérez, J., M. San Miguel-Pagola, L. Anoro-Abenoza, C. Gómez-González, J. Montero-Marco, M. Charlo-Bernardos, E. Altarribas-Bolsa, A. Pérez-Trullén, and C. Jácome, “Reliability and Validity of Computerized Adventitious Respiratory Sounds in People with Bronchiectasis,” J Clin Med, vol. 11, no. 24, 2022.
H. Melbye, J. Ravn, M. Pabiszczak, L. A. Bongo, J. Carlos, and A. Solis, “Validity of deep learning algorithms for detecting wheezes and crackles from lung sound recordings in adults,” medRxiv, vol. 75, pp. 1–12, 2022.
J. C. Aviles-Solis, C. Jácome, A. Davidsen, R. Einarsen, S. Vanbelle, H. Pasterkamp, and H. Melbye, “Prevalence and clinical associations of wheezes and crackles in the general population: The Tromsø study,” BMC Pulm Med, vol. 19, no. 1, pp. 1–11, 2019.
J. S. Park, K. Kim, J. H. Kim, Y. J. Choi, K. Kim, and D. I. Suh, “A machine learning approach to the development and prospective evaluation of a pediatric lung sound classification model,” Sci Rep, vol. 13, no. 1, pp. 1–10, 2023.
S. Reichert, R. Gass, C. Brandt, and E. Andrès, “Analysis of Respiratory Sounds: State of the Art,” Clinic. Med.: Circul., Respir. and Pulmon. Med., vol. 2, no. 5, 2008.
H. Useyin Polat, Inan, and Guler, “A Simple Computer-Based Measurement and Analysis System of Pulmonary Auscultation Sounds,” J. of Med. Syst., vol. 28, pp. 665–672, 2005.
N. S. Haider, B. K. Singh, R. Periyasamy, and A. K. Behera, “Respiratory Sound Based Classification of Chronic Obstructive Pulmonary Disease: A Risk Stratification Approach in Machine Learning Paradigm,” J Med Syst, vol. 43, no. 8, 2019.
N. Jakovljević, et al., “Hidden Markov model based respiratory sound classification,” in International Federation for Medical and Biological Engineering, 2009. IFMBE Proceedings, 2009. Springer Verlag, 2018, pp. 39–43.
G. Chambres, et al., “Automatic Detection of Patient with Respiratory Diseases Using Lung Sound Analysis,” in 2018 International Conference on Content-Based Multimedia Indexing (CBMI), IEEE, 2018, pp. 1–6.
J. Acharya and A. Basu, “Deep Neural Network for Respiratory Sound Classification in Wearable Devices Enabled by Patient Specific Model Tuning,” IEEE Trans Biomed Circuits Syst., vol. 14, no. 3, pp. 535–544, 2020.
S. Gairola, et al., “RespireNet: A Deep Neural Network for Accurately Detecting Abnormal Lung Sounds in Limited Data Setting,” in Annual International Conference of the IEEE Engineering in Medicine and Biology Society. 2021, pp. 527-530.
R. Liu, et al., “Detection of Adventitious Respiratory Sounds based on Convolutional Neural Network,” in 2019 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS), IEEE, 2019, pp. 298–303.
L. Shi, K. Du, C. Zhang, H. Ma, and W. Yan, “Lung Sound Recognition Algorithm Based on VGGish-BiGRU,” IEEE Access, vol. 7, pp. 139438–139449, 2019.
J. J. M. Escobar, O. Morales Matamoros, R. Tejeida Padilla, L. Chanona Hernández, J. P. F. Posadas Durán, A. K. Pérez Martínez, I. Lina Reyes, and H. Quintana Espinosa, “Biomedical signal acquisition using sensors under the paradigm of parallel computing,” Sensors (Switzerland), vol. 20, no. 23, pp. 1–36, 2020.
K. Kochetov, et al., “Noise masking recurrent neural network for respiratory sound classification,” in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Springer Verlag, 2018, pp. 208–217.
T. Nguyen and F. Pernkopf, “Lung Sound Classification Using Co-tuning and Stochastic Normalization,” IEEE Trans. Biomed. Eng., vol. 69, no. 9, pp. 2872–2882, 2022.
F. Demir, A. M. Ismael, and A. Sengur, “Classification of Lung Sounds with CNN Model Using Parallel Pooling Structure,” IEEE Access, vol. 8, pp. 105376–105383, 2020.
E. Messner, M. Fediuk, P. Swatek, S. Scheidl, F. Maria, S. Jüttner, H. Olschewski, and F. Pernkopf, “Multi-channel lung sound classification with convolutional recurrent neural networks,” Comput Biol Med., vol. 122, pp. 103831, 2020.
G. Petmezas, G. A. Cheimariotis, L. Stefanopoulos, L. Rocha, R. P. Paiva, A. K. Katsaggelos, and N. Maglaveras, “Automated Lung Sound Classification Using a Hybrid CNN-LSTM Network and Focal Loss Function,” Sensors, vol. 22, no. 3, 2022.
C. Wu, D. Lei, et al., “Respiratory Disease Classification Model Based on Feature Fusion,” in 2023 4th International Conference on Intelligent Computing and Human-Computer Interaction (ICHCI), IEEE, Aug. 2023, pp. 148–155.
L. D. Mang, F. J. Canadas-Quesada, J. J. Carabias-Orti, E. F. Combarro, and J. Ranilla, “Cochleogram-based adventitious sounds classification using convolutional neural networks,” Biomed Signal Process Control, vol. 82, pp. 104555, 2023.
P. Bhushan et al., “A Self-Attention Based Hybrid CNN-LSTM Architecture for Respiratory Sound Classification,” GMSARN International Journal, vol 18, pp. 54-61, 2024.
I. Moummad, et al., “Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning,” in 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, NY, USA, 2023, pp. 1-5.
Y. Gong, Y.-A. Chung, and J. Glass, “AST: Audio Spectrogram Transformer,” arXiv, vol. 3, 2021.
W. Ariyanti, et al., “Abnormal Respiratory Sound Identification Using Audio-Spectrogram Vision Transformer,” in 2023 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), IEEE, 2023, pp. 1–4.
H. Yan, Z. Li, W. Li, C. Wang, M. Wu, and C. Zhang, “ConTNet: Why not use convolution and transformer at the same time?,” arXiv, vol. 3, 2021.
A. Hassani, S. Walton, N. Shah, A. Abuduweili, J. Li, and H. Shi, “Escaping the Big Data Paradigm with Compact Transformers,” arXiv, vol. 4, 2022.
J. Neto, et al., “Convolution-Vision Transformer for Automatic Lung Sound Classification,” in 2022 35th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), Natal, Brazil, 2022, pp. 97-102.
F. Wang et al., “Residual Attention Network for Image Classification,” in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA: IEEE, Jul 2017, pp. 6450–6458.
S. Woo, J. Park, J.-Y. Lee, and I. S. Kweon, “CBAM: Convolutional Block Attention Module,” Computer Vision – ECCV, vol. 11211, pp. 3–19, 2018.
B. M. Rocha et al., “A respiratory sound database for the development of automated classification,” in IFMBE Proceedings, Springer Verlag, 2018, pp. 33–37.
B. M. Rocha, D. Filos, L. Mendes, G. Serbes, S. Ulukaya, Y. P. Kahya, N. Jakovljevic, T. L. Turukalo, I. M. Vogiatzis, E. Perantoni, E. Kaimakamis, P. Natsiavas, A. Oliveira, C. Jácome, A. Marques, N. Maglaveras, R. Pedro Paiva, I. Chouvarda, and P. de Carvalho, “An open access database for the evaluation of respiratory sound classification algorithms,” Physiol Meas, vol. 40, no. 3, 2019.
B. Ustubioglu, G. Tahaoglu, and G. Ulutas, “Detection of audio copy-move-forgery with novel feature matching on Mel spectrogram,” Expert Syst Appl, vol. 213, p. 118963, 2023.
H. Chen, X. Yuan, Z. Pei, M. Li, and J. Li, “Triple-classification of respiratory sounds using optimized S-transform and deep residual networks,” IEEE Access, vol. 7, pp. 32845–32852, 2019.
Refbacks
- There are currently no refbacks.
Indonesian Journal of Electrical Engineering and Informatics (IJEEI)
ISSN 2089-3272
This work is licensed under a Creative Commons Attribution 4.0 International License.