The mobile app for users with visual impairment needs to hear user speech and provide voice responses, requiring speech-to-text (speech recognition) and text-to-speech capabilities. Deep learning neural networks are widely used for speech recognition tasks, as they can effectively process and transcribe spoken language. AWS services like Amazon Transcribe, which uses deep learning for speech recognition, can fulfill this requirement by converting user speech to text, and Amazon Polly can generate voice responses.
Exact Extract from AWS AI Documents:
From the AWS Documentation on Amazon Transcribe:
"Amazon Transcribe uses deep learning neural networks to perform automatic speech recognition (ASR), converting spoken language into text with high accuracy. This is ideal for applications requiring voice input, such as accessibility features for visually impaired users."
(Source: Amazon Transcribe Developer Guide, Introduction to Amazon Transcribe)
Detailed Explanation:
Option A: Use a deep learning neural network to perform speech recognition.This is the correct answer. Deep learning neural networks are the foundation of modern speech recognition systems, as used in AWS services like Amazon Transcribe. They enable the app to hear and transcribe user speech, and a service like Amazon Polly can handle voice responses, meeting the requirements.
Option B: Build ML models to search for patterns in numeric data.This option is irrelevant, as the task involves processing speech (audio data) and generating voice responses, not analyzing numeric data patterns.
Option C: Use generative AI summarization to generate human-like text.Generative AI summarization focuses on summarizing text, not processing speech orgenerating voice responses. This option does not address the core requirement of speech recognition.
Option D: Build custom models for image classification and recognition.Image classification and recognition are unrelated to processing speech or generating voice responses, making this option incorrect for an app focused on audio interaction.
[References:, Amazon Transcribe Developer Guide: Introduction to Amazon Transcribe (https://docs.aws.amazon.com/transcribe/latest/dg/what-is.html), Amazon Polly Developer Guide: Text-to-Speech Overview (https://docs.aws.amazon.com/polly/latest/dg/what-is.html), AWS AI Practitioner Learning Path: Module on Speech Recognition and Synthesis, , , , ]