Coswara: A respiratory sounds and symptoms dataset for remote screening of SARS-CoV-2 infection

madhukar.pai · July 4, 2023, 10:45am

This paper presents the Coswara dataset, which contains a diverse set of respiratory sounds and rich metadata, recorded between April 2020 and February 2022 from 2635 individuals (1819 SARS-CoV-2 negative, 674 positive, and 142 recovered subjects). The respiratory sounds span nine sound categories associated with variants of breathing, cough and speech. The metadata contains demographic information (age, gender and geographic location) as well as health information relating to symptoms, pre-existing respiratory ailments, comorbidity and SARS-CoV-2 test status. Our study is the first of its kind to annotate the audio quality of the entire dataset (amounting to 65 hours) through manual listening. The paper summarizes the data collection procedure and the demographic, symptom and audio data information. A COVID-19 classifier based on a bi-directional long short-term memory (BLSTM) architecture is trained and evaluated on the different population sub-groups contained in the dataset to understand the bias/fairness of the model. This enables analysis of the impact of gender, geographic location, date of recording, and language proficiency on COVID-19 detection performance.

https://www.nature.com/articles/s41597-023-02266-0
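
For readers wondering what "evaluated on the different population sub-groups" could look like in practice, here is a minimal sketch. It is not the authors' code: the record fields (`gender`, `label`, `score`), the toy values, and the choice of AUC as the metric are all assumptions made for illustration. The idea is simply to score the same classifier outputs separately per sub-group and compare.

```python
from collections import defaultdict

def auc(labels, scores):
    """ROC AUC by pairwise comparison of positives vs. negatives (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        return None  # AUC is undefined unless both classes are present
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def auc_by_group(records, key):
    """Split records on a metadata field and compute AUC within each group."""
    groups = defaultdict(list)
    for r in records:
        groups[r[key]].append(r)
    return {g: auc([r["label"] for r in rs], [r["score"] for r in rs])
            for g, rs in groups.items()}

# Hypothetical classifier outputs: label 1 = positive, score = predicted probability.
records = [
    {"gender": "female", "label": 1, "score": 0.90},
    {"gender": "female", "label": 0, "score": 0.20},
    {"gender": "female", "label": 0, "score": 0.95},
    {"gender": "male",   "label": 1, "score": 0.80},
    {"gender": "male",   "label": 0, "score": 0.30},
    {"gender": "male",   "label": 1, "score": 0.60},
]

result = auc_by_group(records, "gender")
print(result)
```

A large gap between groups on real data would be a hint of bias worth investigating. The same function works for any other metadata field (location, recording date bucket, language proficiency) by changing `key`.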
