Intermediate

AWS AI/ML Services (36%)

Domain 3 of the AIF-C01 exam — the highest-weighted domain. Master every AWS AI and ML service, know when to use each one, and understand how they fit together to build AI-powered applications.

This is the most important domain. At 36% of the exam, this domain alone determines whether most candidates pass or fail. Spend the most study time here. For each service, know: what it does, when to use it, and what type of data it works with.

AWS AI Services Overview

AWS AI services fall into two categories: AI Services (pre-trained, no ML expertise needed) and ML Services (for building custom models). The exam tests your ability to choose the right category and the right service within that category.

Amazon Bedrock (Generative AI)

Bedrock is the central generative AI service on AWS. We covered it in the previous lesson, but here is what matters for this domain:

  • Use when: You need text generation, summarization, chatbots, code generation, image generation, or any generative AI task
  • Key feature: Access to multiple foundation models (Claude, Titan, Llama, etc.) through one API
  • Knowledge Bases: Managed RAG for Q&A over your documents
  • Agents: Multi-step task automation with API calling
  • Guardrails: Content filtering, PII redaction, denied topics

Amazon SageMaker (Custom ML)

SageMaker is the comprehensive ML platform for building, training, and deploying custom machine learning models. It is the most feature-rich ML service on AWS.

  • Use when: You need a custom ML model, the pre-trained AI services do not meet your requirements, or you have specific data and performance needs
  • SageMaker Studio — Web-based IDE for ML development
  • SageMaker Canvas — No-code ML for business analysts (drag-and-drop model building)
  • SageMaker JumpStart — Pre-trained models and solution templates you can deploy with one click
  • SageMaker Autopilot — AutoML that automatically trains and tunes models from your data
  • Built-in algorithms — XGBoost, Linear Learner, K-Means, and more
  • Model deployment — Real-time endpoints, batch transform, serverless inference
💡
Exam decision: "No ML expertise" or "pre-trained" or "managed AI service" = use an AI Service (Rekognition, Comprehend, etc.) or Bedrock. "Custom model" or "specific training data" or "unique requirements" = use SageMaker. "No-code ML" = SageMaker Canvas.

Vision Services

Amazon Rekognition

Pre-trained computer vision service for image and video analysis.

  • Use when: You need to analyze images or video without building a custom vision model
  • Capabilities: Object/scene detection, facial analysis, face comparison, celebrity recognition, text in images (OCR), content moderation (inappropriate content), custom labels (train with your own images), video analysis (people pathing, activity detection)
  • Input: Images (JPEG, PNG) or video (stored in S3 or streamed via Kinesis Video Streams)

Amazon Textract

Extract text, forms, and tables from scanned documents.

  • Use when: You need to extract structured data from documents (invoices, receipts, forms, IDs)
  • Key difference from Rekognition: Textract understands document structure (tables, forms with key-value pairs). Rekognition just reads text in images.
  • Capabilities: Plain text extraction, form data extraction (key-value pairs), table extraction, specialized queries ("What is the invoice total?")
💡
Exam trap: Rekognition can detect text in images, and Textract extracts text from documents. The exam will test whether you know the difference. If the question mentions "documents," "forms," "invoices," or "tables" = Textract. If it mentions "images," "video," "faces," or "objects" = Rekognition.

Language Services

Amazon Comprehend

Natural Language Processing (NLP) service for text analysis.

  • Use when: You need to analyze text for sentiment, entities, key phrases, language, or topics
  • Capabilities: Sentiment analysis (positive/negative/neutral/mixed), entity recognition (people, places, dates, organizations), key phrase extraction, language detection, topic modeling, PII detection and redaction
  • Comprehend Medical: Specialized version for healthcare text (extracts medical conditions, medications, dosages, procedures)

Amazon Translate

Neural machine translation between languages.

  • Use when: You need to translate text between languages in real-time or batch
  • Supports: 75+ languages, custom terminology (enforce specific translations for brand names or technical terms), real-time and batch translation

Amazon Transcribe

Automatic speech-to-text (speech recognition).

  • Use when: You need to convert audio/video to text
  • Capabilities: Real-time and batch transcription, speaker identification, custom vocabularies, automatic language identification, subtitle generation
  • Transcribe Medical: Specialized for medical conversations (doctor-patient dialogues)

Amazon Polly

Text-to-speech service that converts text to lifelike audio.

  • Use when: You need to convert text to spoken audio
  • Capabilities: Multiple voices and languages, Neural TTS (most natural), SSML support (control pronunciation, pauses, emphasis), real-time streaming
💡
Remember the pair: Transcribe = speech TO text (ears). Polly = text TO speech (mouth). The exam often presents scenarios where you need to pick the correct direction.

Conversational AI

Amazon Lex

Build conversational interfaces (chatbots and voice bots).

  • Use when: You need a chatbot or voice bot with structured conversation flows (intents, slots, fulfillment)
  • Key concepts: Intents (what the user wants), utterances (how they say it), slots (parameters to collect), fulfillment (action to take)
  • Integration: Works with Lambda for business logic, Connect for call centers, and Bedrock for generative responses
  • Same technology as Alexa

Business Intelligence & Personalization

Amazon Personalize

Real-time personalization and recommendation engine.

  • Use when: You need product recommendations, personalized content, or personalized search results
  • Input: User interaction data (clicks, purchases, views), item metadata, user metadata

Amazon Forecast

Time-series forecasting service.

  • Use when: You need to predict future values based on historical time-series data (demand forecasting, inventory planning, revenue forecasting)

Amazon Kendra

Intelligent enterprise search powered by ML.

  • Use when: You need an enterprise search engine that understands natural language questions and finds answers across documents, FAQs, and knowledge bases
  • Difference from Bedrock Knowledge Bases: Kendra returns search results (ranked documents). Bedrock Knowledge Bases generates answers using an LLM. Kendra can feed into Bedrock for a RAG solution.

Other Important Services

Amazon Q

Generative AI assistant for business and development.

  • Amazon Q Business — AI assistant that answers questions from company data (documents, wikis, databases)
  • Amazon Q Developer — AI coding assistant (code generation, debugging, transformation)

AWS DeepRacer

Autonomous racing car for learning reinforcement learning. Used for education and competitions.

Amazon CodeWhisperer (now Amazon Q Developer)

AI-powered code generation and suggestion tool integrated into IDEs.

Service Selection Cheat Sheet

💡
Images/Video: Rekognition
Documents/Forms: Textract
Text analysis (sentiment, entities): Comprehend
Translation: Translate
Speech to text: Transcribe
Text to speech: Polly
Chatbots: Lex
Recommendations: Personalize
Time-series forecasting: Forecast
Enterprise search: Kendra
Generative AI: Bedrock
Custom ML models: SageMaker
No-code ML: SageMaker Canvas
Business AI assistant: Amazon Q Business
Code assistant: Amazon Q Developer

Practice Questions

📝
Q1: A company needs to automatically extract invoice numbers, dates, and line items from thousands of scanned PDF invoices. Which AWS service should they use?

A) Amazon Rekognition
B) Amazon Textract
C) Amazon Comprehend
D) Amazon Translate
Show Answer

B) Amazon Textract. Textract is designed to extract text, forms, and tables from documents. It understands document structure and can extract key-value pairs (invoice number, date) and tables (line items). Rekognition (A) can read text in images but does not understand document structure. Comprehend (C) analyzes text meaning, not document structure. Translate (D) is for language translation.

📝
Q2: A media company wants to automatically detect inappropriate content in user-uploaded images before displaying them on their website. Which AWS service is MOST appropriate?

A) Amazon Comprehend
B) Amazon Bedrock Guardrails
C) Amazon Rekognition
D) Amazon Textract
Show Answer

C) Amazon Rekognition. Rekognition includes content moderation capabilities that detect inappropriate, suggestive, or violent content in images. Comprehend (A) analyzes text, not images. Bedrock Guardrails (B) filters LLM text outputs. Textract (D) extracts text from documents.

📝
Q3: A call center wants to build a voice bot that greets callers, identifies their intent, collects account information, and routes them to the appropriate department. Which AWS service should they use?

A) Amazon Polly
B) Amazon Transcribe
C) Amazon Lex
D) Amazon Comprehend
Show Answer

C) Amazon Lex. Lex is the conversational AI service that builds chatbots and voice bots. It handles intent recognition, slot filling (collecting information), and fulfillment (routing). Polly (A) converts text to speech but does not understand conversation. Transcribe (B) converts speech to text but does not manage conversation flow. Comprehend (C) analyzes text meaning but does not drive conversations.

📝
Q4: A business analyst with no coding experience wants to build a machine learning model to predict customer churn using a CSV of historical data. Which AWS service is MOST appropriate?

A) Amazon SageMaker Studio
B) Amazon SageMaker Canvas
C) Amazon Bedrock
D) Amazon Comprehend
Show Answer

B) Amazon SageMaker Canvas. Canvas is the no-code ML tool designed for business analysts. It provides a visual, drag-and-drop interface for building ML models without writing code. SageMaker Studio (A) requires coding experience. Bedrock (C) is for generative AI, not tabular prediction. Comprehend (D) is for NLP, not churn prediction.

📝
Q5: An e-commerce company wants to show personalized product recommendations to each customer based on their browsing and purchase history. Which AWS service should they use?

A) Amazon Kendra
B) Amazon Personalize
C) Amazon Forecast
D) Amazon Comprehend
Show Answer

B) Amazon Personalize. Personalize is the real-time personalization and recommendation service. It uses user interaction data (browsing, purchases) to generate product recommendations. Kendra (A) is for enterprise search. Forecast (C) is for time-series prediction. Comprehend (D) is for text analysis.