SlideShare a Scribd company logo
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
J U N E 2 0 2 4
Customizing Foundation Models on
Amazon SageMaker
Miron Perel
Principal AI/ML GTM Specialist
Kristine Pearce
Principal AI/ML GTM Specialist
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Today’s Agenda
1. GenAI Trends
2. Gen AI on Amazon SageMaker
3. SageMaker Training Demo
4. Getting Started
G E N E R A T I V E A I
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2023, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Financial
Services
Healthcare and
Life Sciences
Automotive Manufacturing
Media &
Entertainment
Telecom Energy
Generative AI is transforming all industries
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. 4
Innovations at
Amazon
SCALE
How can I
scale this?
Rufus
An expert
shopping assistant
Customer
reviews
On Amazon.com
Amazon
Pharmacy
faster prescriptions and
more helpful support
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
More than use Amazon SageMaker for GenAI/ML
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Primary reasons cited is due to
control and customization for
a given use case
Prefer
fine-tuning or training
models for specific needs
Enterprises increasing
open source usage or
switching in 2024 and onward
80% 78%
90%
GenAI Trends in the Enterprise
Ways Enterprises are Building, Buying and Optimizing FMs (a16z)
Source: https://a16z.com/generative-ai-enterprise-2024/
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Cost differentials can skyrocket with high end LLMs at scale
The Problem of Sustained Inference
Price to generate a million
tokens with a open source fine
tuned LLM
Price to generate a million
tokens with a closed source LLM
$60 $3
* Upwards of
** For example
**
*
Source: https://medium.com/@ja_adimi/comparison-cost-analysis-should-
we-invest-in-open-source-or-closed-source-llms-bfd646ae1f74
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Common approaches for customizing FMs
Zero-shot Few-shots Retrieval
Augmented
Generation
(RAG)
Fine-tuning Re-Training
Complexity,
Quality,
Cost,
Time
8
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Common approaches for customizing FMs
Zero-shot Retrieval
Augmented
Generation
(RAG)
Fine-tuning P/re-Training
Complexity,
Quality,
Cost,
Time
9
Few-shots
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. 10
Automatically generate
a customer response
with the knowledge of
your products
Make faster decisions
by automatically
extracting data from
documents
PDF
Extraction
Customer
Support
Automating process
of manually
categorizing
documents and
content
Classification
Use an LLM to
understand how
customers feel about
products and services
Customer
Sentiment
72% of enterprises are fine-tuning Domain/Task adapted LLMs balancing customization with cost and efficiency
Use Case Examples of Domain & Task Adapted LLM
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. 11
Parameter efficient
re-training of FM
parameters with new
training data
Adjusting pre-trained
FMs on a specific
dataset with labeled
examples
Supervised Fine
Tuning (SFT)
Re-Training
Develop more precise
context-aware FMs
based on un/semi-
supervised learning
Domain Adaptation
Fine Tuning (DAFT)
FM is fine-tuned on the
data, with specific
instructions or
guidelines, and formats
Instruction
Fine-Tuning (IT)
72% of enterprises are fine-tuning Domain/Task adapted LLMs balancing customization with cost and efficiency
Fine Tuning & Re-Training Techniques
Fine-tune any open source LLM for your Domain Adapted LLMs (finance, healthcare, programming) OR
your Task Adapted LLMs (data summarization, PDF extraction, sentiment analysis, audio transcription)
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
FM Customization approaches using Amazon SageMaker
Foundation
model
Customize Task-
specific FM
W H Y Y O U U S E I T
• Faster time to market
• Maximize accuracy for specific tasks
• Achieve domain adaptation
Customizing FMs for task and
domain specific use cases
W H Y Y O U U S E I T
• Build a proprietary or re-train open-source
• Commercialize or deploy for internal use
• Drive revenue and cut operational costs
Build Train Deploy
Build/ReTrain FMs
from scratch
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Select Evaluate Customize Deploy
Amazon
SageMaker
Build, train, and
deploy ML models at
scale, including FMs
Jumpstart Clarify
Ground Truth
Training
Studio
Inference
SageMaker Generative AI for ML Practitioners
Customizing FMs for Task & Domain Specific Use Cases
Customization
and control with
TTM / TCO
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Discover
Fine tune,
evaluate,
and deploy
Select from the broadest and latest selection of foundation models
A V A I L A B L E I N A M A Z O N S A G E M A K E R J U M P S T A R T
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Unique challenges to manage hardware resources
efficiently for large scale FM training
Strategies for
distributed training
Infrastructure
stability
Clusters provision &
management
15
Collect data
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon SageMaker for Large Scale FM Training:
Self-healing clusters
reduce training time
by up to 20%
Resilient
environment
SageMaker distributed
training libraries
improve performance
by up to 20%
Streamline distributed
training
Control over compute
environment and
workload scheduling
Optimized resources
utilization (SMHP)
Focus on ML without
the need to manage
infrastructure
Managed training
environment (SMTJ)
OR
AND AND
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Demo video
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
SageMaker Training Jobs for FM Training
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2023, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Financial
Services
Healthcare and
Life Sciences
Automotive Manufacturing
Media &
Entertainment
Telecom Energy
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
GenAI Success Stories
H E L P I N G E N T E R P R I S E S T O U S E F O U N D A T I O N M O D E L S
50%
lower costs for
hosting FMs
7 months
reduced time-to-
value from 12-18
months
80%
reduction in
inference latency
66%
cost savings with
GPU utilization
Enhance Customer Experience
Enhance Customer Experience Boost Employee Productivity
Streamline Business Processes
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Start your generative AI journey today
Additional Resources
View
product page
Watch deep
dive demo videos
Read
blog posts
Read the
technical docs
SageMaker
Training
Example
© 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Thank you!

More Related Content

Similar to [D2T2S04] SageMaker를 활용한 Generative AI Foundation Model Training and Tuning

Machine learning for developers & data scientists with Amazon SageMaker - AIM...
Machine learning for developers & data scientists with Amazon SageMaker - AIM...Machine learning for developers & data scientists with Amazon SageMaker - AIM...
Machine learning for developers & data scientists with Amazon SageMaker - AIM...
Amazon Web Services
 
Amazon SageMaker Deep Dive for Builders
Amazon SageMaker Deep Dive for BuildersAmazon SageMaker Deep Dive for Builders
Amazon SageMaker Deep Dive for Builders
Amazon Web Services
 
AWS re:Invent 2018 - Machine Learning recap (December 2018)
AWS re:Invent 2018 - Machine Learning recap (December 2018)AWS re:Invent 2018 - Machine Learning recap (December 2018)
AWS re:Invent 2018 - Machine Learning recap (December 2018)
Julien SIMON
 
AI Services for Developers - Floor28
AI Services for Developers - Floor28AI Services for Developers - Floor28
AI Services for Developers - Floor28
Boaz Ziniman
 
AI Services for Developers | AWS Floor28
AI Services for Developers | AWS Floor28AI Services for Developers | AWS Floor28
AI Services for Developers | AWS Floor28
Amazon Web Services
 
Securing your Amazon SageMaker model development in a highly regulated enviro...
Securing your Amazon SageMaker model development in a highly regulated enviro...Securing your Amazon SageMaker model development in a highly regulated enviro...
Securing your Amazon SageMaker model development in a highly regulated enviro...
Amazon Web Services
 
Introduction to AWS Global Accelerator - SVC212 - New York AWS Summit
Introduction to AWS Global Accelerator - SVC212 - New York AWS SummitIntroduction to AWS Global Accelerator - SVC212 - New York AWS Summit
Introduction to AWS Global Accelerator - SVC212 - New York AWS Summit
Amazon Web Services
 
Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018
Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018
Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018
Amazon Web Services
 
Best of re:Invent for Startups
Best of re:Invent for StartupsBest of re:Invent for Startups
Best of re:Invent for Startups
Amazon Web Services
 
Rendi le tue app più smart con i servizi AI di AWS
Rendi le tue app più smart con i servizi AI di AWSRendi le tue app più smart con i servizi AI di AWS
Rendi le tue app più smart con i servizi AI di AWS
Amazon Web Services
 
Introduction to Sagemaker
Introduction to SagemakerIntroduction to Sagemaker
Introduction to Sagemaker
Amazon Web Services
 
Racing with Artificial Intelligence
Racing with Artificial IntelligenceRacing with Artificial Intelligence
Racing with Artificial Intelligence
Daniel Zivkovic
 
High Performance Computing (HPC) on AWS 101
High Performance Computing (HPC) on AWS 101High Performance Computing (HPC) on AWS 101
High Performance Computing (HPC) on AWS 101
Amazon Web Services
 
打造新一代的企業 IT - Transforming Enterprise IT
打造新一代的企業 IT - Transforming Enterprise IT打造新一代的企業 IT - Transforming Enterprise IT
打造新一代的企業 IT - Transforming Enterprise IT
Amazon Web Services
 
Predicting Demand In A Diverse Retail Environment - AWS Summit Sydney
Predicting Demand In A Diverse Retail Environment - AWS Summit SydneyPredicting Demand In A Diverse Retail Environment - AWS Summit Sydney
Predicting Demand In A Diverse Retail Environment - AWS Summit Sydney
Amazon Web Services
 
Artifical Intelligence and Machine Learning 201, AWS Federal Pop-Up Loft
Artifical Intelligence and Machine Learning 201, AWS Federal Pop-Up LoftArtifical Intelligence and Machine Learning 201, AWS Federal Pop-Up Loft
Artifical Intelligence and Machine Learning 201, AWS Federal Pop-Up Loft
Amazon Web Services
 
Build Machine Learning Models with Amazon SageMaker (April 2019)
Build Machine Learning Models with Amazon SageMaker (April 2019)Build Machine Learning Models with Amazon SageMaker (April 2019)
Build Machine Learning Models with Amazon SageMaker (April 2019)
Julien SIMON
 
Supercharge Your Machine Learning Model with Amazon SageMaker
Supercharge Your Machine Learning Model with Amazon SageMakerSupercharge Your Machine Learning Model with Amazon SageMaker
Supercharge Your Machine Learning Model with Amazon SageMaker
Amazon Web Services
 
Building intelligent applications using AI services
Building intelligent applications using AI servicesBuilding intelligent applications using AI services
Building intelligent applications using AI services
Amazon Web Services
 
Are you Well Architected?
Are you Well Architected?Are you Well Architected?
Are you Well Architected?
Amazon Web Services
 

Similar to [D2T2S04] SageMaker를 활용한 Generative AI Foundation Model Training and Tuning (20)

Machine learning for developers & data scientists with Amazon SageMaker - AIM...
Machine learning for developers & data scientists with Amazon SageMaker - AIM...Machine learning for developers & data scientists with Amazon SageMaker - AIM...
Machine learning for developers & data scientists with Amazon SageMaker - AIM...
 
Amazon SageMaker Deep Dive for Builders
Amazon SageMaker Deep Dive for BuildersAmazon SageMaker Deep Dive for Builders
Amazon SageMaker Deep Dive for Builders
 
AWS re:Invent 2018 - Machine Learning recap (December 2018)
AWS re:Invent 2018 - Machine Learning recap (December 2018)AWS re:Invent 2018 - Machine Learning recap (December 2018)
AWS re:Invent 2018 - Machine Learning recap (December 2018)
 
AI Services for Developers - Floor28
AI Services for Developers - Floor28AI Services for Developers - Floor28
AI Services for Developers - Floor28
 
AI Services for Developers | AWS Floor28
AI Services for Developers | AWS Floor28AI Services for Developers | AWS Floor28
AI Services for Developers | AWS Floor28
 
Securing your Amazon SageMaker model development in a highly regulated enviro...
Securing your Amazon SageMaker model development in a highly regulated enviro...Securing your Amazon SageMaker model development in a highly regulated enviro...
Securing your Amazon SageMaker model development in a highly regulated enviro...
 
Introduction to AWS Global Accelerator - SVC212 - New York AWS Summit
Introduction to AWS Global Accelerator - SVC212 - New York AWS SummitIntroduction to AWS Global Accelerator - SVC212 - New York AWS Summit
Introduction to AWS Global Accelerator - SVC212 - New York AWS Summit
 
Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018
Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018
Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018
 
Best of re:Invent for Startups
Best of re:Invent for StartupsBest of re:Invent for Startups
Best of re:Invent for Startups
 
Rendi le tue app più smart con i servizi AI di AWS
Rendi le tue app più smart con i servizi AI di AWSRendi le tue app più smart con i servizi AI di AWS
Rendi le tue app più smart con i servizi AI di AWS
 
Introduction to Sagemaker
Introduction to SagemakerIntroduction to Sagemaker
Introduction to Sagemaker
 
Racing with Artificial Intelligence
Racing with Artificial IntelligenceRacing with Artificial Intelligence
Racing with Artificial Intelligence
 
High Performance Computing (HPC) on AWS 101
High Performance Computing (HPC) on AWS 101High Performance Computing (HPC) on AWS 101
High Performance Computing (HPC) on AWS 101
 
打造新一代的企業 IT - Transforming Enterprise IT
打造新一代的企業 IT - Transforming Enterprise IT打造新一代的企業 IT - Transforming Enterprise IT
打造新一代的企業 IT - Transforming Enterprise IT
 
Predicting Demand In A Diverse Retail Environment - AWS Summit Sydney
Predicting Demand In A Diverse Retail Environment - AWS Summit SydneyPredicting Demand In A Diverse Retail Environment - AWS Summit Sydney
Predicting Demand In A Diverse Retail Environment - AWS Summit Sydney
 
Artifical Intelligence and Machine Learning 201, AWS Federal Pop-Up Loft
Artifical Intelligence and Machine Learning 201, AWS Federal Pop-Up LoftArtifical Intelligence and Machine Learning 201, AWS Federal Pop-Up Loft
Artifical Intelligence and Machine Learning 201, AWS Federal Pop-Up Loft
 
Build Machine Learning Models with Amazon SageMaker (April 2019)
Build Machine Learning Models with Amazon SageMaker (April 2019)Build Machine Learning Models with Amazon SageMaker (April 2019)
Build Machine Learning Models with Amazon SageMaker (April 2019)
 
Supercharge Your Machine Learning Model with Amazon SageMaker
Supercharge Your Machine Learning Model with Amazon SageMakerSupercharge Your Machine Learning Model with Amazon SageMaker
Supercharge Your Machine Learning Model with Amazon SageMaker
 
Building intelligent applications using AI services
Building intelligent applications using AI servicesBuilding intelligent applications using AI services
Building intelligent applications using AI services
 
Are you Well Architected?
Are you Well Architected?Are you Well Architected?
Are you Well Architected?
 

Recently uploaded

Training on CSPro and step by steps.pptx
Training on CSPro and step by steps.pptxTraining on CSPro and step by steps.pptx
Training on CSPro and step by steps.pptx
lenjisoHussein
 
Accounting and Auditing Laws-Rules-and-Regulations
Accounting and Auditing Laws-Rules-and-RegulationsAccounting and Auditing Laws-Rules-and-Regulations
Accounting and Auditing Laws-Rules-and-Regulations
DALubis
 
Dataguard Switchover Best Practices using DGMGRL (Dataguard Broker Command Line)
Dataguard Switchover Best Practices using DGMGRL (Dataguard Broker Command Line)Dataguard Switchover Best Practices using DGMGRL (Dataguard Broker Command Line)
Dataguard Switchover Best Practices using DGMGRL (Dataguard Broker Command Line)
Alireza Kamrani
 
Parcel Delivery - Intel Segmentation and Last Mile Opt.pdf
Parcel Delivery - Intel Segmentation and Last Mile Opt.pdfParcel Delivery - Intel Segmentation and Last Mile Opt.pdf
Parcel Delivery - Intel Segmentation and Last Mile Opt.pdf
AltanAtabarut
 
Data Analytics for Decision Making By District 11 Solutions
Data Analytics for Decision Making By District 11 SolutionsData Analytics for Decision Making By District 11 Solutions
Data Analytics for Decision Making By District 11 Solutions
District 11 Solutions
 
Bimbingan kaunseling untuk pelajar IPTA/IPTS di Malaysia
Bimbingan kaunseling untuk pelajar IPTA/IPTS di MalaysiaBimbingan kaunseling untuk pelajar IPTA/IPTS di Malaysia
Bimbingan kaunseling untuk pelajar IPTA/IPTS di Malaysia
aznidajailani
 
From Signals to Solutions: Effective Strategies for CDR Analysis in Fraud Det...
From Signals to Solutions: Effective Strategies for CDR Analysis in Fraud Det...From Signals to Solutions: Effective Strategies for CDR Analysis in Fraud Det...
From Signals to Solutions: Effective Strategies for CDR Analysis in Fraud Det...
Milind Agarwal
 
Acid Base Practice Test 4- KEY.pdfkkjkjk
Acid Base Practice Test 4- KEY.pdfkkjkjkAcid Base Practice Test 4- KEY.pdfkkjkjk
Acid Base Practice Test 4- KEY.pdfkkjkjk
talha2khan2k
 
DESIGN AND DEVELOPMENT OF AUTO OXYGEN CONCENTRATOR WITH SOS ALERT FOR HIKING ...
DESIGN AND DEVELOPMENT OF AUTO OXYGEN CONCENTRATOR WITH SOS ALERT FOR HIKING ...DESIGN AND DEVELOPMENT OF AUTO OXYGEN CONCENTRATOR WITH SOS ALERT FOR HIKING ...
DESIGN AND DEVELOPMENT OF AUTO OXYGEN CONCENTRATOR WITH SOS ALERT FOR HIKING ...
JeevanKp7
 
Audits Of Complaints Against the PPD Report_2022.pdf
Audits Of Complaints Against the PPD Report_2022.pdfAudits Of Complaints Against the PPD Report_2022.pdf
Audits Of Complaints Against the PPD Report_2022.pdf
evwcarr
 
Parcel Delivery - Intel Segmentation and Last Mile Opt.pptx
Parcel Delivery - Intel Segmentation and Last Mile Opt.pptxParcel Delivery - Intel Segmentation and Last Mile Opt.pptx
Parcel Delivery - Intel Segmentation and Last Mile Opt.pptx
AltanAtabarut
 
The Rise of Python in Finance,Automating Trading Strategies: _.pdf
The Rise of Python in Finance,Automating Trading Strategies: _.pdfThe Rise of Python in Finance,Automating Trading Strategies: _.pdf
The Rise of Python in Finance,Automating Trading Strategies: _.pdf
Riya Sen
 
Selcuk Topal Arbitrum Scientific Report.pdf
Selcuk Topal Arbitrum Scientific Report.pdfSelcuk Topal Arbitrum Scientific Report.pdf
Selcuk Topal Arbitrum Scientific Report.pdf
SelcukTOPAL2
 
Full Disclosure Board Policy.docx BRGY LICUMA
Full  Disclosure Board Policy.docx BRGY LICUMAFull  Disclosure Board Policy.docx BRGY LICUMA
Full Disclosure Board Policy.docx BRGY LICUMA
brgylicumaormoccity
 
SFBA Splunk Usergroup meeting July 17, 2024
SFBA Splunk Usergroup meeting July 17, 2024SFBA Splunk Usergroup meeting July 17, 2024
SFBA Splunk Usergroup meeting July 17, 2024
Becky Burwell
 
Combined supervised and unsupervised neural networks for pulse shape discrimi...
Combined supervised and unsupervised neural networks for pulse shape discrimi...Combined supervised and unsupervised neural networks for pulse shape discrimi...
Combined supervised and unsupervised neural networks for pulse shape discrimi...
Samuel Jackson
 
Field Diary and lab record, Importance.pdf
Field Diary and lab record, Importance.pdfField Diary and lab record, Importance.pdf
Field Diary and lab record, Importance.pdf
hritikbui
 
Systane Global education training centre
Systane Global education training centreSystane Global education training centre
Systane Global education training centre
AkhinaRomdoni
 
SOFTWARE ENGINEERING-UNIT-1SOFTWARE ENGINEERING
SOFTWARE ENGINEERING-UNIT-1SOFTWARE ENGINEERINGSOFTWARE ENGINEERING-UNIT-1SOFTWARE ENGINEERING
SOFTWARE ENGINEERING-UNIT-1SOFTWARE ENGINEERING
PrabhuB33
 
Aws MLOps Interview Questions with answers
Aws MLOps Interview Questions  with answersAws MLOps Interview Questions  with answers
Aws MLOps Interview Questions with answers
Sathiakumar Chandr
 

Recently uploaded (20)

Training on CSPro and step by steps.pptx
Training on CSPro and step by steps.pptxTraining on CSPro and step by steps.pptx
Training on CSPro and step by steps.pptx
 
Accounting and Auditing Laws-Rules-and-Regulations
Accounting and Auditing Laws-Rules-and-RegulationsAccounting and Auditing Laws-Rules-and-Regulations
Accounting and Auditing Laws-Rules-and-Regulations
 
Dataguard Switchover Best Practices using DGMGRL (Dataguard Broker Command Line)
Dataguard Switchover Best Practices using DGMGRL (Dataguard Broker Command Line)Dataguard Switchover Best Practices using DGMGRL (Dataguard Broker Command Line)
Dataguard Switchover Best Practices using DGMGRL (Dataguard Broker Command Line)
 
Parcel Delivery - Intel Segmentation and Last Mile Opt.pdf
Parcel Delivery - Intel Segmentation and Last Mile Opt.pdfParcel Delivery - Intel Segmentation and Last Mile Opt.pdf
Parcel Delivery - Intel Segmentation and Last Mile Opt.pdf
 
Data Analytics for Decision Making By District 11 Solutions
Data Analytics for Decision Making By District 11 SolutionsData Analytics for Decision Making By District 11 Solutions
Data Analytics for Decision Making By District 11 Solutions
 
Bimbingan kaunseling untuk pelajar IPTA/IPTS di Malaysia
Bimbingan kaunseling untuk pelajar IPTA/IPTS di MalaysiaBimbingan kaunseling untuk pelajar IPTA/IPTS di Malaysia
Bimbingan kaunseling untuk pelajar IPTA/IPTS di Malaysia
 
From Signals to Solutions: Effective Strategies for CDR Analysis in Fraud Det...
From Signals to Solutions: Effective Strategies for CDR Analysis in Fraud Det...From Signals to Solutions: Effective Strategies for CDR Analysis in Fraud Det...
From Signals to Solutions: Effective Strategies for CDR Analysis in Fraud Det...
 
Acid Base Practice Test 4- KEY.pdfkkjkjk
Acid Base Practice Test 4- KEY.pdfkkjkjkAcid Base Practice Test 4- KEY.pdfkkjkjk
Acid Base Practice Test 4- KEY.pdfkkjkjk
 
DESIGN AND DEVELOPMENT OF AUTO OXYGEN CONCENTRATOR WITH SOS ALERT FOR HIKING ...
DESIGN AND DEVELOPMENT OF AUTO OXYGEN CONCENTRATOR WITH SOS ALERT FOR HIKING ...DESIGN AND DEVELOPMENT OF AUTO OXYGEN CONCENTRATOR WITH SOS ALERT FOR HIKING ...
DESIGN AND DEVELOPMENT OF AUTO OXYGEN CONCENTRATOR WITH SOS ALERT FOR HIKING ...
 
Audits Of Complaints Against the PPD Report_2022.pdf
Audits Of Complaints Against the PPD Report_2022.pdfAudits Of Complaints Against the PPD Report_2022.pdf
Audits Of Complaints Against the PPD Report_2022.pdf
 
Parcel Delivery - Intel Segmentation and Last Mile Opt.pptx
Parcel Delivery - Intel Segmentation and Last Mile Opt.pptxParcel Delivery - Intel Segmentation and Last Mile Opt.pptx
Parcel Delivery - Intel Segmentation and Last Mile Opt.pptx
 
The Rise of Python in Finance,Automating Trading Strategies: _.pdf
The Rise of Python in Finance,Automating Trading Strategies: _.pdfThe Rise of Python in Finance,Automating Trading Strategies: _.pdf
The Rise of Python in Finance,Automating Trading Strategies: _.pdf
 
Selcuk Topal Arbitrum Scientific Report.pdf
Selcuk Topal Arbitrum Scientific Report.pdfSelcuk Topal Arbitrum Scientific Report.pdf
Selcuk Topal Arbitrum Scientific Report.pdf
 
Full Disclosure Board Policy.docx BRGY LICUMA
Full  Disclosure Board Policy.docx BRGY LICUMAFull  Disclosure Board Policy.docx BRGY LICUMA
Full Disclosure Board Policy.docx BRGY LICUMA
 
SFBA Splunk Usergroup meeting July 17, 2024
SFBA Splunk Usergroup meeting July 17, 2024SFBA Splunk Usergroup meeting July 17, 2024
SFBA Splunk Usergroup meeting July 17, 2024
 
Combined supervised and unsupervised neural networks for pulse shape discrimi...
Combined supervised and unsupervised neural networks for pulse shape discrimi...Combined supervised and unsupervised neural networks for pulse shape discrimi...
Combined supervised and unsupervised neural networks for pulse shape discrimi...
 
Field Diary and lab record, Importance.pdf
Field Diary and lab record, Importance.pdfField Diary and lab record, Importance.pdf
Field Diary and lab record, Importance.pdf
 
Systane Global education training centre
Systane Global education training centreSystane Global education training centre
Systane Global education training centre
 
SOFTWARE ENGINEERING-UNIT-1SOFTWARE ENGINEERING
SOFTWARE ENGINEERING-UNIT-1SOFTWARE ENGINEERINGSOFTWARE ENGINEERING-UNIT-1SOFTWARE ENGINEERING
SOFTWARE ENGINEERING-UNIT-1SOFTWARE ENGINEERING
 
Aws MLOps Interview Questions with answers
Aws MLOps Interview Questions  with answersAws MLOps Interview Questions  with answers
Aws MLOps Interview Questions with answers
 

[D2T2S04] SageMaker를 활용한 Generative AI Foundation Model Training and Tuning

  • 1. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. J U N E 2 0 2 4 Customizing Foundation Models on Amazon SageMaker Miron Perel Principal AI/ML GTM Specialist Kristine Pearce Principal AI/ML GTM Specialist
  • 2. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Today’s Agenda 1. GenAI Trends 2. Gen AI on Amazon SageMaker 3. SageMaker Training Demo 4. Getting Started G E N E R A T I V E A I
  • 3. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. © 2023, Amazon Web Services, Inc. or its affiliates. All rights reserved. Financial Services Healthcare and Life Sciences Automotive Manufacturing Media & Entertainment Telecom Energy Generative AI is transforming all industries
  • 4. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. 4 Innovations at Amazon SCALE How can I scale this? Rufus An expert shopping assistant Customer reviews On Amazon.com Amazon Pharmacy faster prescriptions and more helpful support
  • 5. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. More than use Amazon SageMaker for GenAI/ML
  • 6. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Primary reasons cited is due to control and customization for a given use case Prefer fine-tuning or training models for specific needs Enterprises increasing open source usage or switching in 2024 and onward 80% 78% 90% GenAI Trends in the Enterprise Ways Enterprises are Building, Buying and Optimizing FMs (a16z) Source: https://a16z.com/generative-ai-enterprise-2024/
  • 7. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Cost differentials can skyrocket with high end LLMs at scale The Problem of Sustained Inference Price to generate a million tokens with a open source fine tuned LLM Price to generate a million tokens with a closed source LLM $60 $3 * Upwards of ** For example ** * Source: https://medium.com/@ja_adimi/comparison-cost-analysis-should- we-invest-in-open-source-or-closed-source-llms-bfd646ae1f74
  • 8. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Common approaches for customizing FMs Zero-shot Few-shots Retrieval Augmented Generation (RAG) Fine-tuning Re-Training Complexity, Quality, Cost, Time 8
  • 9. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Common approaches for customizing FMs Zero-shot Retrieval Augmented Generation (RAG) Fine-tuning P/re-Training Complexity, Quality, Cost, Time 9 Few-shots
  • 10. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. 10 Automatically generate a customer response with the knowledge of your products Make faster decisions by automatically extracting data from documents PDF Extraction Customer Support Automating process of manually categorizing documents and content Classification Use an LLM to understand how customers feel about products and services Customer Sentiment 72% of enterprises are fine-tuning Domain/Task adapted LLMs balancing customization with cost and efficiency Use Case Examples of Domain & Task Adapted LLM
  • 11. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. 11 Parameter efficient re-training of FM parameters with new training data Adjusting pre-trained FMs on a specific dataset with labeled examples Supervised Fine Tuning (SFT) Re-Training Develop more precise context-aware FMs based on un/semi- supervised learning Domain Adaptation Fine Tuning (DAFT) FM is fine-tuned on the data, with specific instructions or guidelines, and formats Instruction Fine-Tuning (IT) 72% of enterprises are fine-tuning Domain/Task adapted LLMs balancing customization with cost and efficiency Fine Tuning & Re-Training Techniques Fine-tune any open source LLM for your Domain Adapted LLMs (finance, healthcare, programming) OR your Task Adapted LLMs (data summarization, PDF extraction, sentiment analysis, audio transcription)
  • 12. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. FM Customization approaches using Amazon SageMaker Foundation model Customize Task- specific FM W H Y Y O U U S E I T • Faster time to market • Maximize accuracy for specific tasks • Achieve domain adaptation Customizing FMs for task and domain specific use cases W H Y Y O U U S E I T • Build a proprietary or re-train open-source • Commercialize or deploy for internal use • Drive revenue and cut operational costs Build Train Deploy Build/ReTrain FMs from scratch
  • 13. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Select Evaluate Customize Deploy Amazon SageMaker Build, train, and deploy ML models at scale, including FMs Jumpstart Clarify Ground Truth Training Studio Inference SageMaker Generative AI for ML Practitioners Customizing FMs for Task & Domain Specific Use Cases Customization and control with TTM / TCO
  • 14. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Discover Fine tune, evaluate, and deploy Select from the broadest and latest selection of foundation models A V A I L A B L E I N A M A Z O N S A G E M A K E R J U M P S T A R T
  • 15. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Unique challenges to manage hardware resources efficiently for large scale FM training Strategies for distributed training Infrastructure stability Clusters provision & management 15 Collect data
  • 16. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon SageMaker for Large Scale FM Training: Self-healing clusters reduce training time by up to 20% Resilient environment SageMaker distributed training libraries improve performance by up to 20% Streamline distributed training Control over compute environment and workload scheduling Optimized resources utilization (SMHP) Focus on ML without the need to manage infrastructure Managed training environment (SMTJ) OR AND AND
  • 17. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Demo video
  • 18. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 19. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 20. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. SageMaker Training Jobs for FM Training
  • 21. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. © 2023, Amazon Web Services, Inc. or its affiliates. All rights reserved. Financial Services Healthcare and Life Sciences Automotive Manufacturing Media & Entertainment Telecom Energy
  • 22. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. GenAI Success Stories H E L P I N G E N T E R P R I S E S T O U S E F O U N D A T I O N M O D E L S 50% lower costs for hosting FMs 7 months reduced time-to- value from 12-18 months 80% reduction in inference latency 66% cost savings with GPU utilization Enhance Customer Experience Enhance Customer Experience Boost Employee Productivity Streamline Business Processes
  • 23. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Start your generative AI journey today Additional Resources View product page Watch deep dive demo videos Read blog posts Read the technical docs SageMaker Training Example
  • 24. © 2024, Amazon Web Services, Inc. or its affiliates. All rights reserved. Thank you!