Amazon SageMaker のアイコン

Amazon SageMaker Popular2017年〜

An integrated platform for building, training, and deploying machine learning models

What It Does

Amazon SageMaker is an integrated platform that manages the entire machine learning (ML) model lifecycle. It covers data preparation, model building and training, tuning, deployment, and monitoring in a single service. It provides Jupyter notebook environments, built-in algorithms, distributed training, and automatic model tuning.

Use Cases

Image classification, natural language processing, recommendations, fraud detection, demand forecasting, anomaly detection - virtually any ML workload. It serves a wide range of skill levels from ML engineers to data scientists.

Everyday Analogy

Think of a professional kitchen with cooking classes. It has everything you need for every step - from ingredient prep (data preparation) to cooking (training), tasting (evaluation), and plating (deployment). Beginner recipes (built-in algorithms) are included too.

What Is SageMaker?

Amazon SageMaker is an integrated service covering ML development through operations. Centered around SageMaker Studio (an IDE), it brings together data labeling (Ground Truth), feature store (Feature Store), experiment tracking (Experiments), model registry, and pipelines.

Training and Deployment

SageMaker training jobs train models on specified instance types and automatically terminate instances when complete. Distributed training is supported for efficient training on large datasets. Trained models can be deployed as real-time inference endpoints, batch transforms, or serverless inference.

SageMaker Canvas and JumpStart

SageMaker Canvas is a no-code tool for building ML models without writing code. Upload CSV data, select the target column, and AutoML builds the model automatically. SageMaker JumpStart is a catalog of pre-trained foundation models and ML solutions that can be deployed in just a few clicks. To broaden your knowledge of SageMaker Canvas and JumpStart, related books on Amazon are a great resource.

Getting Started

Launch SageMaker Studio from the SageMaker console and open a Jupyter notebook. Numerous sample notebooks are available, letting you try image classification and NLP tutorials right away. For a code-free start, SageMaker Canvas is the easiest entry point.

Things to Watch Out For

  • Training and notebook instances are billed hourly - always stop them when not in use
  • Real-time inference endpoints run continuously and incur charges. Consider serverless inference for low-traffic scenarios
  • Improper VPC configuration during SageMaker Studio domain creation is difficult to change later
共有するXB!