J Korean Soc Laryngol Phoniatr Logoped. 2024 Apr;35(1):24-29. doi: 10.22469/jkslp.2024.35.1.24.

Prediction of Unilateral Vocal Cord Paralysis Patients Through Machine Learning Analysis of Acoustic Parameters: A Preliminary Study

Affiliations
  • 1. School of Electronics Engineering, Kyungpook National University, Daegu, Korea
  • 2. Department of Speech-Language Pathology, Kyungpook National University Chilgok Hospital, Daegu, Korea
  • 3. Department of Speech-Language Pathology, Daegu University, Daegu, Korea
  • 4. Department of Neurosurgery, Kyungpook National University School of Medicine, Daegu, Korea
  • 5. Department of Industrial Engineering, Konkuk University, Seoul, Korea
  • 6. Department of Otorhinolaryngology-Head and Neck Surgery, Kyungpook National University School of Medicine, Daegu, Korea

Abstract

Background and Objectives
The purpose of this study is to evaluate the value of an artificial intelligence-based diagnostic tool for vocal cord paralysis that does not require laryngoscopy.
Materials and Method
The dataset consisted of recordings from patients with unilateral vocal cord paralysis (n=54) and from normal individuals (n=163). It included sustained phonations of the vowels /ah/, /u/, and /i/, as well as vocal cord data from the patients with paralysis. Various acoustic parameters, including Mel-frequency cepstral coefficients, jitter, shimmer, harmonics-to-noise ratio, and fundamental frequency statistics, were analyzed. Vocal cord paralysis was classified by paralysis status, paralysis degree, and paralysis location. The deep learning model employed the leave-one-out method, and the feature set with the highest performance was selected.
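As a rough illustration of two of the acoustic parameters and of the leave-one-out procedure named above, the following Python sketch computes local jitter and local shimmer from per-cycle measurements and then runs a leave-one-out evaluation. It is only a sketch under stated assumptions: the function names, the toy feature values, and the 1-nearest-neighbour classifier are all hypothetical stand-ins chosen to keep the example short. The study itself used a deep learning model whose details are not given in this abstract.

```python
from statistics import mean


def local_jitter(periods):
    """Mean absolute difference between consecutive glottal periods,
    divided by the mean period (the usual 'local jitter' definition)."""
    diffs = [abs(a - b) for a, b in zip(periods, periods[1:])]
    return mean(diffs) / mean(periods)


def local_shimmer(amplitudes):
    """Same idea as local jitter, applied to per-cycle peak amplitudes."""
    diffs = [abs(a - b) for a, b in zip(amplitudes, amplitudes[1:])]
    return mean(diffs) / mean(amplitudes)


def nearest_neighbor_predict(train_x, train_y, x):
    """1-nearest-neighbour by squared Euclidean distance.
    ASSUMPTION: a stand-in classifier, not the model used in the study."""
    best = min(
        range(len(train_x)),
        key=lambda i: sum((p - q) ** 2 for p, q in zip(train_x[i], x)),
    )
    return train_y[best]


def leave_one_out_accuracy(features, labels):
    """Hold out each sample once, fit on the rest, and score the held-out one."""
    correct = 0
    for i in range(len(features)):
        train_x = features[:i] + features[i + 1:]
        train_y = labels[:i] + labels[i + 1:]
        if nearest_neighbor_predict(train_x, train_y, features[i]) == labels[i]:
            correct += 1
    return correct / len(features)


# ASSUMPTION: made-up (jitter, shimmer) pairs for illustration only;
# these are not values from the study's dataset.
toy_features = [
    (0.004, 0.020), (0.005, 0.030), (0.006, 0.025),
    (0.030, 0.120), (0.028, 0.100), (0.035, 0.150),
]
toy_labels = ["normal", "normal", "normal",
              "paralysis", "paralysis", "paralysis"]

if __name__ == "__main__":
    print(local_jitter([10.0, 12.0, 10.0, 12.0]))
    print(leave_one_out_accuracy(toy_features, toy_labels))
```

With only a handful of well-separated toy points, a perfect leave-one-out score is easy to obtain, so the number printed here says nothing about real-world performance.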
Results
Vocal Cord Paralysis Classifier: The classifier accurately distinguished normal voice from vocal cord paralysis, achieving an accuracy and F1 score of 1.0. Paralysis Location Classifier: The classifier accurately differentiated between median and paramedian vocal cord paralysis, achieving an accuracy and micro F1 score of 1.0. Breathiness Degree Classifier: The classifier achieved an accuracy of 0.795 and a mean absolute error of 0.2857 in distinguishing different degrees of breathiness.
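The metrics reported above (accuracy, F1 score, micro F1 score, and mean absolute error) can be computed as in the following minimal Python sketch. The function names and the example labels are hypothetical, and treating breathiness degree as an integer grade is an assumption made here for illustration; the abstract does not state the grading scale.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the reference labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def f1_binary(y_true, y_pred, positive):
    """F1 for one positive class: 2*TP / (2*TP + FP + FN)."""
    tp = sum(t == positive and p == positive for t, p in zip(y_true, y_pred))
    fp = sum(t != positive and p == positive for t, p in zip(y_true, y_pred))
    fn = sum(t == positive and p != positive for t, p in zip(y_true, y_pred))
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def micro_f1(y_true, y_pred):
    """Micro-averaged F1: pool TP, FP, and FN over all classes first."""
    tp = fp = fn = 0
    for label in set(y_true) | set(y_pred):
        tp += sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp += sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn += sum(t == label and p != label for t, p in zip(y_true, y_pred))
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def mean_absolute_error(y_true, y_pred):
    """Average absolute distance between predicted and reference grades."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


# ASSUMPTION: made-up integer grades, not data from the study.
true_grades = [0, 1, 2, 3, 1]
pred_grades = [0, 1, 1, 3, 2]

if __name__ == "__main__":
    print(accuracy(true_grades, pred_grades))
    print(micro_f1(true_grades, pred_grades))
    print(mean_absolute_error(true_grades, pred_grades))
```

For single-label classification, micro F1 equals accuracy, because every misclassified sample counts once as a false positive and once as a false negative. Mean absolute error adds information that accuracy lacks for graded outcomes: it distinguishes a prediction that is off by one grade from one that is off by several.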
Conclusion
Although the small sample size raises concerns of potential overfitting, this preliminary study highlights distinctive acoustic features in cases of unilateral vocal fold paralysis compared to those of normal individuals. These findings suggest the feasibility of determining the presence, degree, and location of paralysis through the utilization of acoustic parameters. Further research is warranted to validate and expand upon these results.

Keywords

Unilateral vocal cord paralysis; Machine learning; Acoustic parameters; Artificial intelligence; Classification