ASHA journals
Browse

LENA ASP validation for adult and child segments (Meera et al., 2025)

Download (662.8 kB)
Version 2 2025-01-02, 20:07
Version 1 2024-12-06, 14:19
online resource
posted on 2025-01-02, 20:07 authored by Shoba S. Meera, Divya Swaminathan, Sri Ranjani Venkata Murali, Reny Raju, Malavi Srikar, Sahana Shyam Sundar, Senthil Amudhan, Alejandrina Cristia, Rahul Pawar, Achuth Rao, Prathyusha P. Vasuki, Shree Volme, Ashok Mysore

Purpose: The Language ENvironment Analysis (LENA) technology uses automated speech processing (ASP) algorithms to estimate counts such as total adult words and child vocalizations, which helps understand children’s early language environment. This ASP has been validated in North American English and other languages in predominantly monolingual contexts but not in a multilingual context like India. Thus, the current study aims to validate the classification accuracy of the LENA algorithm specifically focusing on speaker recognition of adult segments (AdS) and child segments (ChS) in a sample of bi/multilingual families from India.

Method: Thirty neurotypical children between 6 and 24 months (M = 12.89, SD = 4.95) were recruited. Participants were growing up in bi/multilingual environment hearing a combination of Kannada, Tamil, Malayalam, Telugu, Hindi, and/or English. Daylong audio recordings were collected using LENA and processed using the ASP to automatically detect segments across speaker categories. Two human annotators manually annotated ~900 min (37,431 segments across speaker categories). Performance accuracy (recall and precision) was calculated for AdS and ChS.

Results: The recall and precision for AdS were 0.62 (95% confidence interval [CI] [0.61, 0.63]) and 0.83 (95% CI [0.8, 0.83]), respectively. This indicated that 62% of the segments identified as AdS by the human annotator were also identified as AdS by the LENA ASP algorithm and 83% of the segments labeled by the LENA ASP as AdS were also labeled by the human annotator as AdS. Similarly, the recall and precision for ChS were 0.65 (95% CI [0.64, 0.66]) and 0.55 (95% CI [0.54, 0.56]), respectively.

Conclusions: This study documents the performance of the ASP in correctly classifying speakers as adult or child in a sample of families from India, indicating recall and precision that is relatively low. This study lays the groundwork for future investigations aiming to refine the algorithm models, potentially facilitating more accurate performance in bi/multilingual societies like India.

Supplemental Material S1. Validation of the Language ENvironment Analysis (LENA®) automated speech processing algorithm labels for adult and child segments in a sample of families from India.

Meera, S. S., Swaminathan, D., Murali, S. R. V., Raju, R., Srikar, M., Sundar, S. S., Amudhan, S., Cristia, A., Pawar, R., Rao, A., Vasuki, P. P., Volme, S., & Mysore, A. (2025). Validation of the Language ENvironment Analysis (LENA) automated speech processing algorithm labels for adult and child segments in a sample of families from India. Journal of Speech, Language, and Hearing Research, 68(1), 40–53. https://doi.org/10.1044/2024_JSLHR-24-00099

Funding

This work was supported by the Department of Biotechnology/Wellcome Trust India Alliance Fellowship Grant IA/CPHE/18/1/503968, awarded to Shoba S. Meera.

History