# A Specialized Automatic Speech Recognition and Conversational Platform to Enable Socially Assistive Robots for Persons with Mild-to-Moderate Alzheimer's Disease and Related Dementia

> **NIH NIH R44** · CARE.COACH CORPORATION · 2020 · $1,502,690

## Abstract

Abstract
1 in 3 seniors in the United States dies with dementia, of which Alzheimer’s disease (AD) is the most common
form. AD patients suffer from decreased ability to meaningfully communicate and interact, which causes
significant stress and burden for both professional caregivers and family members. Socially assistive robots
(SARs) have been designed to promote therapeutic interaction and communication. Unfortunately, artificial
intelligence (AI) has long been challenged by the speech of elderly persons, who exhibit age-related voice
tremors, hesitations, imprecise production of consonants, increased variability of fundamental frequency, and
other barriers that can be exacerbated by the neurological changes associated with AD, further complicated by
common environmental noises such as the ceiling fan, television, etc. Because of the resulting poor real-world
speech and language understanding by available SAR technologies, scarce human caregivers are often
required to guide AD patients through SAR interactions, limiting SARs to small deployments, mostly as part of
research studies. Unlike existing approaches relying purely on AI, care.coach™ is developing a SAR-like
avatar that converses with elderly and AD patients through truly natural speech. Each avatar is controlled by a
24x7 team of trained human staff who can cost-effectively monitor and engage 12 or more patients
sequentially (2 simultaneously) through the audio/visual feeds from the patient’s avatar device. The staff
communicate with each patient by sending text commands which are converted into the avatar’s voice through
a speech synthesis engine. The staff contribute to the system their human abilities for speech and natural
language processing (NLP) and for generating free-form conversational responses to help patients build
personal relationships with the avatar. The staff are guided by a software-driven expert system embedded into
their work interface, which is programmed with evidence-based prompting and protocols to support healthy
behaviors and self-care. This SBIR Fast-Track project will leverage the unique data generated by our human-
in-the-loop platform to develop new ASR capabilities, enabling fully automatic conversational protocols to
engage and support AD patients without human intervention. We aim in Phase I to leverage our unique prior
work dataset to train an automatic speech recognition (ASR) engine to enable the understanding of certain
types of elderly and AD patient speech more successfully than any currently available engine. We aim in
Phase II to incorporate this new engine along with an NLP module into our existing human-in-the-loop avatar
system, recruiting a population of AD patients to further train and validate with during a 2-year human subjects
study so that we can demonstrate full automation of a significant portion of our avatar conversations with mild-
to-moderate level AD patients. Thus, we will improve the commercial scalability of our avatars, whil...

## Key facts

- **NIH application ID:** 10230460
- **Project number:** 4R44AG062014-02
- **Recipient organization:** CARE.COACH CORPORATION
- **Principal Investigator:** Victor Wang
- **Activity code:** R44 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2020
- **Award amount:** $1,502,690
- **Award type:** 4N
- **Project period:** 2019-07-01 → 2022-06-30

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10230460

## Citation

> US National Institutes of Health, RePORTER application 10230460, A Specialized Automatic Speech Recognition and Conversational Platform to Enable Socially Assistive Robots for Persons with Mild-to-Moderate Alzheimer's Disease and Related Dementia (4R44AG062014-02). Retrieved via AI Analytics 2026-05-23 from https://api.ai-analytics.org/grant/nih/10230460. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
