. 2018 Nov 27;15(11):e1002705.

doi: 10.1371/journal.pmed.1002705. eCollection 2018 Nov.

Mobile detection of autism through machine learning on home video: A development and prospective validation study

Qandeel Tariq^{1

2}, Jena Daniels^{1

2}, Jessey Nicole Schwartz^{1

2}, Peter Washington^{1

2}, Haik Kalantarian^{1

2}, Dennis Paul Wall^{1

2}

Affiliations

¹ Department of Pediatrics, Division of Systems Medicine, Stanford University, California, United States of America.
² Department of Biomedical Data Science, Stanford University, California, United States of America.

PMID: 30481180
PMCID: PMC6258501
DOI: 10.1371/journal.pmed.1002705

Mobile detection of autism through machine learning on home video: A development and prospective validation study

Qandeel Tariq et al. PLoS Med. 2018.

. 2018 Nov 27;15(11):e1002705.

doi: 10.1371/journal.pmed.1002705. eCollection 2018 Nov.

Authors

Qandeel Tariq^{1

2}, Jena Daniels^{1

2}, Jessey Nicole Schwartz^{1

2}, Peter Washington^{1

2}, Haik Kalantarian^{1

2}, Dennis Paul Wall^{1

2}

Affiliations

¹ Department of Pediatrics, Division of Systems Medicine, Stanford University, California, United States of America.
² Department of Biomedical Data Science, Stanford University, California, United States of America.

PMID: 30481180
PMCID: PMC6258501
DOI: 10.1371/journal.pmed.1002705

Abstract

Background: The standard approaches to diagnosing autism spectrum disorder (ASD) evaluate between 20 and 100 behaviors and take several hours to complete. This has in part contributed to long wait times for a diagnosis and subsequent delays in access to therapy. We hypothesize that the use of machine learning analysis on home video can speed the diagnosis without compromising accuracy. We have analyzed item-level records from 2 standard diagnostic instruments to construct machine learning classifiers optimized for sparsity, interpretability, and accuracy. In the present study, we prospectively test whether the features from these optimized models can be extracted by blinded nonexpert raters from 3-minute home videos of children with and without ASD to arrive at a rapid and accurate machine learning autism classification.

Methods and findings: We created a mobile web portal for video raters to assess 30 behavioral features (e.g., eye contact, social smile) that are used by 8 independent machine learning models for identifying ASD, each with >94% accuracy in cross-validation testing and subsequent independent validation from previous work. We then collected 116 short home videos of children with autism (mean age = 4 years 10 months, SD = 2 years 3 months) and 46 videos of typically developing children (mean age = 2 years 11 months, SD = 1 year 2 months). Three raters blind to the diagnosis independently measured each of the 30 features from the 8 models, with a median time to completion of 4 minutes. Although several models (consisting of alternating decision trees, support vector machine [SVM], logistic regression (LR), radial kernel, and linear SVM) performed well, a sparse 5-feature LR classifier (LR5) yielded the highest accuracy (area under the curve [AUC]: 92% [95% CI 88%-97%]) across all ages tested. We used a prospectively collected independent validation set of 66 videos (33 ASD and 33 non-ASD) and 3 independent rater measurements to validate the outcome, achieving lower but comparable accuracy (AUC: 89% [95% CI 81%-95%]). Finally, we applied LR to the 162-video-feature matrix to construct an 8-feature model, which achieved 0.93 AUC (95% CI 0.90-0.97) on the held-out test set and 0.86 on the validation set of 66 videos. Validation on children with an existing diagnosis limited the ability to generalize the performance to undiagnosed populations.

Conclusions: These results support the hypothesis that feature tagging of home videos for machine learning classification of autism can yield accurate outcomes in short time frames, using mobile devices. Further work will be needed to confirm that this approach can accelerate autism diagnosis at scale.

PubMed Disclaimer

Conflict of interest statement

I have read the journal's policy and the authors of this manuscript have the following competing interests: DPW is the scientific founder of Cognoa, a company focused on digital pediatric healthcare; the approach and findings presented in this paper are independent from/not related to Cognoa. All other authors have declared no competing interests exist.

Figures

**Fig 1. Feature-to-classifier mapping.**
Video analysts scored each video with 30 features. This matrix shows which feature corresponds to which classifier. Darker colored features indicate higher overlap, and lighter colors indicate lower overlap across the models. The features are rank ordered according to their frequency of use across the 8 classifiers. Further details about the classifiers are provided in Table 1. The bottom 7 features were not part of the machine learning process but were chosen because of their potential relationship with the autism phenotype and for use in further evaluation of the models’ feature sets when constructing a video feature–specific classifier. ADTree7, 7-feature alternating decision tree; ADTree8, 8-feature alternating decision tree; LR5, 5-feature logistic regression classifier; LR10, 10-feature logistic regression classifier; SVM5, 5-feature support vector machine; SVM10, 10-feature support vector machine; SVM12, 12-feature support vector machine.

**Fig 2. Accuracy across different permutations of 9 raters for 50 videos.**
We performed the analysis to determine the optimal number (the minimum number to reach a consensus on classification) of video raters needed to maintain accuracy without loss of power. Nine raters analyzed and generated feature tags for a subset of n = 50 videos (n = 25 ASD, n = 25 non-ASD) on which we ran the ADTree8 classifier (Table 1). The increase in accuracy conferred by the use of 3 versus 9 raters was not significant. We therefore set the optimal rater number to 3 for subsequent analyses. ADTree8, 8-feature alternating decision tree; ASD, autism spectrum disorder.

**Fig 3. Overall procedure for rapid and mobile classification of ASD versus non-ASD and performance of models from Table 1.**
Participants were recruited to participate via crowdsourcing methods and provided video by direct upload or via a preexisting YouTube link. The minimum for majority rules of 3 video raters tagged all features, generating feature vectors to run each of the 8 classifiers automatically. The sensitivity and specificity based on majority outcome generated by the 3 raters on 162 (119 with autism) videos are provided. Highlighted in yellow is the best performing model, LR5. ADTree7, 7-feature alternating decision tree; ADTree8, 8-feature alternating decision tree; ASD, autism spectrum disorder; LR5, 5-feature logistic regression classifier; LR9, 9-feature logistic regression classifier; LR10, 10-feature logistic regression classifier; SVM5, 5-feature support vector machine; SVM10, 10-feature support vector machine; SVM12, 12-feature support vector machine.

**Fig 4. Performance for LR5 by age.**
LR5 exhibited the highest classifier performance (89% accuracy) out of the 8 classifiers tested (Table 1). This model performed best on children between the ages of 2 and 6 years. (A) shows the performance of LR5 across 4 age ranges, and (B) provides the ROC curve for LR5’s performance for children ages 2 to 6 years. Table 3 provides additional details, including the number of affected and unaffected control participants within each age range. AUC, area under the curve; LR5, 5-feature logistic regression classifier; ROC, receiver operating characteristic.

**Fig 5. ROC curve for LR-EN-VF showing performance on test data along with an ROC for L2 loss with no feature reduction.**
The former chose 8 out of 30 video features. AUC, area under the curve; LR-EN-VF, logistic regression with an elastic net penalty; ROC, receiver operating characteristic.

See this image and copyright information in PMC

Cited by

From Clinic to Computer and Back Again: Practical Considerations When Designing and Implementing Machine Learning Solutions for Pediatrics.
Nagaraj S, Harish V, McCoy LG, Morgado F, Stedman I, Lu S, Drysdale E, Brudno M, Singh D. Nagaraj S, et al. Curr Treat Options Pediatr. 2020;6(4):336-349. doi: 10.1007/s40746-020-00205-4. Epub 2020 Sep 15. Curr Treat Options Pediatr. 2020. PMID: 38624409 Free PMC article. Review.
A Perspective on Crowdsourcing and Human-in-the-Loop Workflows in Precision Health.
Washington P. Washington P. J Med Internet Res. 2024 Apr 11;26:e51138. doi: 10.2196/51138. J Med Internet Res. 2024. PMID: 38602750 Free PMC article.
Harnessing the Power of Mobile Phone Technology: Screening and Identifying Autism Spectrum Disorder With Smartphone Apps.
Reddy K, Taksande A, Kurian B. Reddy K, et al. Cureus. 2024 Feb 26;16(2):e55004. doi: 10.7759/cureus.55004. eCollection 2024 Feb. Cureus. 2024. PMID: 38550425 Free PMC article. Review.
Using #ActuallyAutistic on Twitter for Precision Diagnosis of Autism Spectrum Disorder: Machine Learning Study.
Jaiswal A, Washington P. Jaiswal A, et al. JMIR Form Res. 2024 Feb 14;8:e52660. doi: 10.2196/52660. JMIR Form Res. 2024. PMID: 38354045 Free PMC article.
Longitudinal study of stool-associated microbial taxa in sibling pairs with and without autism spectrum disorder.
Tataru C, Martin A, Dunlap K, Peras M, Chrisman BS, Rutherford E, Deitzler GE, Phillips A, Yin X, Sabino K, Hannibal RL, Hartono W, Lin M, Raack E, Wu Y, DeSantis TZ, Iwai S, Wall DP, David MM. Tataru C, et al. ISME Commun. 2021 Dec 18;1(1):80. doi: 10.1038/s43705-021-00080-6. ISME Commun. 2021. PMID: 37938270 Free PMC article.

See all "Cited by" articles

References

1. Prince M, Patel V, Saxena S, Maj M, Maselko J, Phillips MR, et al. Global mental health 1 - No health without mental health. Lancet. 2007;370(9590):859–77. 10.1016/S0140-6736(07)61238-0 - DOI - PubMed
1. Baio J, Wiggins L, Christensen DL, Maenner MJ, Daniels J, Warren Z, et al. Prevalence of Autism Spectrum Disorder Among Children Aged 8 Years—Autism and Developmental Disabilities Monitoring Network, 11 Sites, United States, 2014. MMWR Surveillance Summaries. 2018;67(6):1 10.15585/mmwr.ss6706a1 . PMCID: PMC5919599. - DOI - PMC - PubMed
1. Hertz-Picciotto I, Delwiche L. The Rise in Autism and the Role of Age at Diagnosis. Epidemiology. 2009;20(1):84–90. 10.1097/EDE.0b013e3181902d15 . PMCID: PMC4113600. - DOI - PMC - PubMed
1. Christensen DL, Baio J, Van Naarden Braun K, Bilder D, Charles J, Constantino JN, et al. Prevalence and Characteristics of Autism Spectrum Disorder Among Children Aged 8 Years--Autism and Developmental Disabilities Monitoring Network, 11 Sites, United States, 2012. MMWR Surveill Summ. 2016;65(3):1–23. 10.15585/mmwr.ss6503a1 . - DOI - PMC - PubMed
1. Christensen DL, Bilder DA, Zahorodny W, Pettygrove S, Durkin MS, Fitzgerald RT, et al. Prevalence and characteristics of autism spectrum disorder among 4-year-old children in the autism and developmental disabilities monitoring network. Journal of Developmental & Behavioral Pediatrics. 2016;37(1):1–8. 10.1097/DBP.0000000000000235 . - DOI - PubMed

Publication types

Actions
Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Medical
- Genetic Alliance

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Mobile detection of autism through machine learning on home video: A development and prospective validation study

Affiliations

Mobile detection of autism through machine learning on home video: A development and prospective validation study

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Medical

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Related information

Grants and funding

LinkOut - more resources

Full Text Sources

Medical