ããã«ã¡ã¯ãã¢ãžã£ã€ã«äºæ¥éšã®ã¿ã¡ã®ããã§ããAWS re:Invent 2025 ã«çŸå°åå ããŠããŸã!
ãã®èšäºã¯ãOptimize for AWS with intelligent automation (AIM235-S)ãã®ã»ãã·ã§ã³ã¬ããŒãã§ãã
IBM ã® Turbonomic ããŒã ããAgentic AI ã®ãªãœãŒã¹æé©åãšã³ã¹ãåæžã«ã€ããŠç޹ä»ããŸããã
æŠèŠ
ã»ãã·ã§ã³ã§ã¯ãAgentic AI ã AI ã¢ããªã±ãŒã·ã§ã³ãããããäºæž¬äžèœãªãªãœãŒã¹éèŠã®èª²é¡ãšãããã解決ããã€ã³ããªãžã§ã³ããªèªååã®ææ³ãèªãããŸãããç¹ã«å°è±¡çã ã£ãã®ã¯ãGPU ã€ã³ã¹ã¿ã³ã¹ãèªåã§é©åãªãµã€ãºã«èª¿æŽããããšã§æé¡ $13,800 ã®ã³ã¹ãåæžãå®çŸããäºäŸããã㊠IBM å éšããŒã ã GPU ã®ãããã«ãŒã ã 3 ãã 16 ã«æ¡å€§ãã13 åã® GPU ãä»ã®ã¯ãŒã¯ããŒãã«åé åã§ããå®çžŸã§ãã
ãããªæ¹ã«ãããã
- Agentic AI ã AI ã¢ããªã±ãŒã·ã§ã³ãæ¬çªç°å¢ã§éçšããŠããããŸãã¯éçšãæ€èšããŠããæ¹
- GPU ã€ã³ã¹ã¿ã³ã¹ã®ã³ã¹ãæé©åã«èª²é¡ãæããŠããæ¹
- AWS ç°å¢ã§ã®ãªãœãŒã¹èªåæé©åã«èå³ãããæ¹
- FinOps ãå¯èŠ³æž¬æ§ããŒã«ã ãã§ã¯è§£æ±ºã§ããªã課é¡ã«çŽé¢ããŠããæ¹
ç»å£è
- Chris Zaloumis ããïŒIBM, Director of Product ManagementïŒ
Agentic AI ãããããé ããã³ã¹ã
Chris ããã¯ãŸããAgentic AI ã®æ¡çšç¶æ³ã«ã€ããŠäŒå Žã«ã¢ã³ã±ãŒããåãããŸããã

- æ¢çŽ¢äžïŒåŠç¿æ®µéïŒ: æ°å
- PoC 宿œäž: ããªãã®æ°
- æ©æãããã¯ã·ã§ã³: æ°å
- ããžãã¹ã¯ãªãã£ã«ã«: ã»ãŒãŒã
ããžãã¹ã¯ãªãã£ã«ã«ãªã¬ãã«ãŸã§å°éããŠããçµç¹ã¯ãŸã ã»ãšãã©ãªããšããçµæã§ããããã㯠IBM ã®é¡§å®¢ã§ãåæ§ã®åŸåã ããã§ãã
ãªãŒããŒããããžã§ãã³ã°ãšãªãœãŒã¹ã®ç¡é§
Agentic AI ã AI ã¢ããªã±ãŒã·ã§ã³ã®å€§ããªèª²é¡ã¯ããªãœãŒã¹ã®äœ¿çšãäºæž¬äžèœã§ããããšã§ããåŸæ¥ã®ã¢ããªã±ãŒã·ã§ã³ã¯ç·åœ¢ã«ãªãœãŒã¹ãæ¶è²»ããŸãããAgentic AI ã¯ç°ãªããŸãã
Chris ããã®èª¬æã«ãããšãAgentic AI ã¯ä»¥äžã®ãããªç¹åŸŽããããšã®ããšã§ãã
- èšç»ãç«ãŠãåå²ããæ°çŸã®ãã€ã¯ãã¯ãŒã¯ããŒããçæãã
- Agentic AI ãå¥ã® Agentic AI ãšéä¿¡ããããè€æ°ã® API ãåŒã³åºããããã
- 1 ã€ã®ãŠãŒã¶ãŒãªã¯ãšã¹ããæ°çŸã®äžæµã¿ã¹ã¯ãçæããããšããã
ãã®äºæž¬äžèœãªåäœã«ãããå€ãã®çµç¹ã¯ããã©ãŒãã³ã¹äœäžãæããŠãªãœãŒã¹ãéå°ã«ããããžã§ãã³ã°ãããšããéžæãããŠããŸããŸãã

å ·äœçã«ã¯ã
- GPU ã€ã³ã¹ã¿ã³ã¹ã®éå°ãªãµã€ãžã³ã°
- RAM ãã¡ãŒã ã®éå°å²ãåœãŠ
- ã¹ãã¬ãŒãžã®éå°ããããžã§ãã³ã°
ããã«ãããé«é¡ãªãªãœãŒã¹ãã¢ã€ãã«ç¶æ ã®ãŸãŸæŸçœ®ããããšããäºæ ãçºçããŠããŸãã
Chris ããã瀺ããäºäŸã§ã¯ã以äžã®ãããªç¶æ³ããã£ããšã®ããšã§ãã
- GPU 䜿çšçã 30% ãããªã
- å¿ èŠéã® 10ã20 åã® GPU æéã確ä¿ããŠãã
ããããçµç¹ã§ã¯ãããã©ãŒãã³ã¹ãªã¹ã¯ãæããããŸãããªãœãŒã¹ãã¢ã€ãã«ç¶æ ã§æŸçœ®ãããé ããã³ã¹ããšããŠçµç¹ã«è² æ ããããŠããŸãã
æ£çŽããã®åé¡ã¯å®æã§ããŸãããAgentic AI ã®ãããªäºæž¬äžèœãªã¯ãŒã¯ããŒãã«å¯ŸããŠãããšãããã倧ããã«ç¢ºä¿ããŠããããšãã倿ã¯çè§£ã§ããŸãããã³ã¹ãé¢ã§ã¯å€§ããªèª²é¡ã«ãªããšæããŸãã

éçãªã¹ã±ãŒãªã³ã°ããªã·ãŒã®éç
å€ãã®ããŒã ã¯éçãªã¹ã±ãŒãªã³ã°ããªã·ãŒãèšå®ããŠããŸãããããã«ãåé¡ããããš Chris ããã¯ææããŸããã
- ãªã¢ã¯ãã£ãïŒäºåŸå¯Ÿå¿åïŒ: ããŒã¯ãçºçããŠããã¹ã±ãŒã«ããããããã§ã«æé ã
- 人çä»å ¥ãå¿ èŠ: ã¹ã±ãŒãªã³ã°ããªã·ãŒã®ç¶ç¶çãªç£èŠãšèª¿æŽãå¿ èŠ
Agentic AI ã®ãããªäºæž¬äžèœãªã¯ãŒã¯ããŒãã§ã¯ãéçãªããªã·ãŒã§ã¯å¯Ÿå¿ããããªããšããããšã§ããã
æŽå¯ãšã¢ã¯ã·ã§ã³ã®éã®ã®ã£ãã
Chris ããã¯ãæ¢åã®å¯èŠ³æž¬æ§ããŒã«ã FinOps ããŒã«ã®éçã«ã€ããŠãèšåãããŸããã

å¯èŠ³æž¬æ§ããŒã«ã®èª²é¡
å¯èŠ³æž¬æ§ããŒã«ã¯ä»¥äžã®ãããªæ å ±ãæäŸããŠãããŸãã
- CPU ã¹ããããªã³ã°ãçºçããŠãããïŒ
- GPU ã®ç«¶åãèµ·ããŠãããïŒ
- ã¬ã€ãã³ã·ãŒã®ã¹ãã€ã¯ã¯ãããïŒ
ãããããããã®ããŒã«ã¯ãäœãèµ·ããŠããããã¯æããŠãããŸããããããã«å¯ŸããŠäœããã¹ãããã¯æããŠãããŸãããRCA(æ ¹æ¬åå åæ)ãäºåŸã«è¡ãããšã¯ã§ããŸããããªã¢ã«ã¿ã€ã ã§ã®ã¢ã¯ã·ã§ã³ã¯åããŸããã
FinOps ããŒã«ã®èª²é¡
FinOps ããŒã«ã¯ä»¥äžã®ãããªæ©èœãæäŸããŸãã
- ã³ã¹ãæ¯åºã®å¯èŠå
- ã¬ããŒãäœæ
- ã·ã§ãŒããã¯/ãã£ãŒãžããã¯
ããããFinOps ããŒã«ã¯äž»ã«ã¬ããŒãã£ã³ã°ã«çŠç¹ãåœãŠãŠãããGPU ã®ãªãµã€ãžã³ã°ãåçãªã·ããªãªãžã®å¯Ÿå¿ã¯ã§ããŸãããã³ã¹ãé åã¯åŸæã§ãããåé¡ã®è§£æ±ºã«ã¯è³ããªããšããããšã§ããã
çµæãšããŠãå€ãã®çµç¹ã§ã¯ã
- ããã©ãŒãã³ã¹ãš SLO ç¶æã®ããã«ãªãŒããŒãµã€ãžã³ã°ãããã©ã«ãã«ãªã
- éçãªã¹ã±ãŒãªã³ã°ã«é ŒããããåŸãªã
ãã®èª²é¡ã¯ãæ£çŽãªãšããå€ãã®çµç¹ã§å ±éããŠãããšæããŸããå¯èŠ³æž¬æ§ãš FinOps ã ãã§ã¯ãåçãªãªãœãŒã¹æé©åãŸã§ã¯å®çŸã§ããªããã§ããã
Turbonomic ã«ãããªã¢ã«ã¿ã€ã æé©å
Chris ããã¯ãIBM ã® Turbonomic ããããã®èª²é¡ãã©ã®ããã«è§£æ±ºãããã説æãããŸããã

Turbonomic ã®ä»çµã¿
Turbonomic ã¯ä»¥äžã®ãããªã¢ãããŒãã§ãªã¢ã«ã¿ã€ã æé©åãå®çŸããŠããŸãã
- ç¶ç¶çãªåæ: GPU ã€ã³ã¹ã¿ã³ã¹ãvCPUãvMEM ã®é£œå床ãã¹ãã¬ãŒãžãšãããã¯ãŒã¯ã®ã¹ã«ãŒããããæç³»åã§åæ
- ç°å¢å šäœã®ãµãã©ã€ãã§ãŒã³åæ: Agentic AI ã¢ããªã±ãŒã·ã§ã³ã ãã§ãªããããããµããŒããããªãœãŒã¹å šäœãèŠã
- ãªã¢ã«ã¿ã€ã ãªã¹ã±ãŒã«ã¢ãã/ããŠã³: éèŠã«å¿ããŠåçã«ãªãœãŒã¹ã調æŽ
- EC2ãGPUãEKSããã€ããªããç°å¢ã«å¯Ÿå¿: AWS ã ãã§ãªãããªã³ãã¬ãã¹ãä»ã®ã¯ã©ãŠãã«ã察å¿
éèŠãªã®ã¯ãããã©ãŒãã³ã¹ãç¶æããªãã SLO ãæºãããã³ã¹ããå¹ççã«ç®¡çãããšããç¹ã§ãã
GPU æé©åã®å ·äœäŸ
Chris ããã¯ãå®éã® Turbonomic ã®ç»é¢ãèŠããªãããå ·äœçãªæé©åã¢ã¯ã·ã§ã³ã説æãããŸããã

äºäŸ: P3DN.24xlarge GPU ã€ã³ã¹ã¿ã³ã¹ã®æé©å
- çŸåšã®ã€ã³ã¹ã¿ã³ã¹: P3DN.24xlargeïŒé«é¡ãª GPU ã€ã³ã¹ã¿ã³ã¹ïŒ
- æšå¥šã¢ã¯ã·ã§ã³: P3.8xlarge ã«ã¹ã±ãŒã«ããŠã³
- æé¡ã³ã¹ãåæž: $13,800
ãã®æšå¥šã¯ãGPU 䜿çšçãGPU ã¡ã¢ãªãã€ã³ã¹ã¿ã³ã¹ãµã€ãžã³ã°ãå šäœã®äœ¿çšçãåæããçµæã§ãã
詳现ãªåæããŒã¿
- GPU ã«ãŠã³ã䜿çšç: çŽ13%ïŒ8åã® GPU ã®ãã¡ãã»ãšãã©äœ¿ãããŠããªãïŒ
- GPU ã¡ã¢ãªäœ¿çšç: çŽ22%
- vCPU 䜿çšç: 3ã4%
ãã®ç¶æ³ãããTurbonomic ã¯ä»¥äžã®ãããªæšå¥šãè¡ã£ãŠããŸãã
- GPU ã«ãŠã³ã: 8 â 4 ã«åæž
- äºæž¬ããã GPU 䜿çšç: 13% â 26%ïŒå®å šãªç¯å²å ïŒ
- GPU ã¡ã¢ãª: 32GB â 16GB
- äºæž¬ãããã¡ã¢ãªäœ¿çšç: 22% â 44%ïŒå®å šãªç¯å²å ïŒ
- vCPU: 13% çšåºŠã«äžæïŒå®å šãªç¯å²å ïŒ
ã³ã¹ãåæžã®è©³çް
- ãªã³ããã³ãæé: $31.21/æé â $12/æé
- RIïŒãªã¶ãŒããã€ã³ã¹ã¿ã³ã¹ïŒã Savings Plans ãèæ ®
- æé¡åæžé¡: çŽ $13,800
ããã¯ãã£ã1ã€ã® GPU ã€ã³ã¹ã¿ã³ã¹ããã®åæžé¡ãªã®ã§ãè€æ°ã®ã€ã³ã¹ã¿ã³ã¹ãããã°ããã«å€§ããªç¯çŽã«ãªããšããããšã§ããã
Chris ããã¯ããªãã¬ãŒã¿ãŒããã®ã¢ã¯ã·ã§ã³ãä¿¡é Œã§ããããã«ã詳现ãªã¡ããªã¯ã¹ãæäŸããŠããç¹ã匷調ãããŸãããããã©ãŒãã³ã¹ãç¶æãããããšã確èªã§ããã®ã§ãèªä¿¡ãæã£ãŠã¢ã¯ã·ã§ã³ãå®è¡ã§ãããšã®ããšã§ãã
ãã®åãçµã¿ã¯ãã³ã¹ãåæžãšããã©ãŒãã³ã¹ç¶æã®ãã©ã³ã¹ãçŽ æŽããããšæããŸããç¹ã«ããªãã¬ãŒã¿ãŒã倿ã§ããããã«è©³çްãªããŒã¿ãæäŸããŠããç¹ãå®çšçã§ããã
Turbonomic ã®äž»èŠæ©èœ
Chris ããã¯ãTurbonomic ãæäŸããäž»èŠãªæ©èœã«ã€ããŠã説æãããŸããã

ã¹ããŒã GPU æé©å
- GPU ãèªåçã«ãã¥ãŒãã³ã°
- éèŠãå¢å ãããšãªã¢ã«ã¿ã€ã ã§ãªãœãŒã¹ã远å
- éèŠãæžå°ãããšå¹çåã®ããã«ãªãœãŒã¹ãåæž
- ã¢ã€ãã«å®¹éãæé€
ãªã¢ã«ã¿ã€ã å¯èŠæ§
- ãã¹ãŠã®ãªãœãŒã¹ããªã¢ã«ã¿ã€ã ã§ç£èŠ
- ããžãã¹ã¢ããªããç©çãªãœãŒã¹ãŸã§ã®ãµãã©ã€ãã§ãŒã³ããããã³ã°
- EC2ãEKSãGPU ç°å¢ãªã©ãããŸããŸãªãªãœãŒã¹éã®ã¡ããªã¯ã¹ãçžé¢åæ
- æœåšçãªåé¡ãããã«ããã¯ãç¹å®ããããã¢ã¯ãã£ãã«å¯ŸåŠ
ãªãŒã±ã¹ãã¬ãŒã·ã§ã³çµ±å
- Pod ã®é 眮ãã¢ãã£ããã£ããªãœãŒã¹ã¯ã©ãŒã¿ãã¹ã±ãžã¥ãŒãªã³ã°ãçè§£
- ã³ã³ããã€ã³ãã©ãšåŸæ¥ã®ã€ã³ãã©ã¬ã€ã€ãŒãäžç·ã«æé©å
ããã¢ã¯ãã£ããªèªåå
- åäžã®ã¢ã¯ã·ã§ã³ã ãã§ãªããå®å šãªèªååãå¯èœ
- 人çä»å ¥ãªãã§æé©åãå®è¡
- æœåšçãªåé¡ãç¹å®ããã€ã³ã·ãã³ããçºçããåã«å¯ŸåŠ
- äºåŸã® RCA ãåé¡è§£æ±ºãäžèŠ
ROI ã®å¯èŠå
- åå¥ã®ã¢ã¯ã·ã§ã³ã®å¹æãéèš
- ãã¹ãŠã®èªååã«ããå šäœç㪠ROI ãæç€º
- ããžãã¹äŸ¡å€ãæç¢ºã«ç€ºã
ãã®æ©èœã»ããã¯ãFinOps ãšå¯èŠ³æž¬æ§ã®ã®ã£ãããåãããã®ã ãšæããŸãããç¹ã«ãããã¢ã¯ãã£ããªèªååã«ãããåé¡ãçºçããåã«å¯ŸåŠã§ããç¹ãçŽ æŽãããã§ããã
IBM å éšã§ã®å®çžŸ: BAM ããŒã
Chris ããã¯ãIBM å éšã® Big AI Models ããŒã ïŒBAMïŒã®äºäŸã玹ä»ãããŸããã

BAM ããŒã 㯠watsonx ã®èåŸã«ãã LLM ããµããŒãããŠããããŒã ã§ã以äžã®ãããªèª²é¡ãæ±ããŠãããšã®ããšã§ãã
- ç°å¢ã管çããããã®æåãã¥ãŒãã³ã°ãæžãããã
- æ°çŸã®ã³ã³ãããšçŽ 100 åã® NVIDIA A100 GPU ã Kubernetes ã§éçš
- GPU ã®å¯åºŠãé«ã㊠ROI ãåäžãããã
Turbonomic å°å ¥åŸã®ææ
Turbonomic ãå°å ¥ããçµæã以äžã®ãããªææãåŸããããšã®ããšã§ãã

- ã¢ã€ãã« GPU ãªãœãŒã¹ã5.3ååæž: ãããã«ãŒã ã3ãã16ã«æ¡å€§
- ã¹ã«ãŒãããã2ååäž: ã¬ã€ãã³ã·ãŒã«åœ±é¿ãªã
- 13åã® GPU ãåæž: ãããã® GPU ãä»ã®ã¯ãŒã¯ããŒãã«åé å
13åãã® GPU ãä»ã®ã¯ãŒã¯ããŒãã«å²ãåœãŠããããšããã®ã¯ã倧ããªææã§ãããæ¢åã® GPU ãããå¯ã«æŽ»çšããããšã§ãæ°ãã AI ã¯ãŒã¯ããŒãã«ãªãœãŒã¹ãå²ãåœãŠãããããã«ãªããŸããã
ãã®äºäŸã¯ãã³ã¹ãåæžã ãã§ãªããªãœãŒã¹ã®æå¹æŽ»çšãšãã芳ç¹ã§ãçŽ æŽããããšæããŸãã
ãŸãšã: èŠããŠããã¹ã3ã€ã®ããš
Chris ããã¯ãã»ãã·ã§ã³ã®æåŸã«3ã€ã®éèŠãªãã€ã³ãã匷調ãããŸããã

1. Agentic AI ã¯äºæž¬äžèœã§ãªãœãŒã¹éçŽç
- ç·åœ¢ã«ã¹ã±ãŒã«ããªã: åŸæ¥ã®ã¢ããªã±ãŒã·ã§ã³ãšã¯ç°ãªãåäœ
- ããŒã¹ãæ§ãé«ãäºæž¬äžèœ: ãªãœãŒã¹éèŠãæ¥æ¿ã«å€åãã
- æç床æ²ç·ãæèãã: Agentic AI ãããžã§ã¯ããé²ããéã¯ããã®ç¹æ§ãèæ ®ããå¿ èŠããã
2. å¯èŠæ§ã ãã§ã¯äžåå
- å¯èŠæ§ã ãã§ã¯æåäœæ¥ãå¢ãã: åé¡ãèŠã€ããŠãã察åŠã«ã¯æåäœæ¥ãå¿ èŠ
- æŽå¯ãç¶ç¶çãªã¢ã¯ã·ã§ã³ã«å€æãã: ãªã¢ã«ã¿ã€ã ã§ãªãœãŒã¹ã驿£åããã¢ããªã±ãŒã·ã§ã³ã«å¿ èŠãªãªãœãŒã¹ã確ä¿ãããœãªã¥ãŒã·ã§ã³ãéèŠ
3. é«äŸ¡å€ãªã¯ãŒã¯ããŒãããå§ãã
- 1ã€ã®é«äŸ¡å€ã¯ãŒã¯ããŒããã¿ãŒã²ããã«ãã: ããã©ãŒãã³ã¹ãç¶æããªãã GPU ãå¹çåã§ããããšã蚌æ
- ãããã¹ã±ãŒã«ã¢ãŠããã: ææã確èªããŠããå±éãåºãã
å°ããå§ããŠæ€èšŒããŠããåºããããšããã¢ãããŒãã¯åœããåã®ããã§ãå®éã«ããã®ã¯é£ããã§ããããã§ãããã®æ éãªã¹ããããæåã®éµã«ãªããšæããŸãã
å šäœãéããŠã®ææ
Agentic AI ã®ãªãœãŒã¹æé©åãšããããŸãã«ä»ããããªèª²é¡ã«å¯ŸããŠãå ·äœçãªãœãªã¥ãŒã·ã§ã³ãšå®çžŸã瀺ããŠããã»ãã·ã§ã³ã§ããã
ç¹ã«å°è±¡çã ã£ãã®ã¯ãå¯èŠ³æž¬æ§ãš FinOps ã®ã®ã£ãããåãããšããæç¢ºãªããžã·ã§ãã³ã°ã§ããå€ãã®çµç¹ããäœãèµ·ããŠãããã¯åããããäœããã¹ããåãããªãããšããç¶æ³ã«é¥ã£ãŠããäžã§ãTurbonomic ã¯ãã®ã®ã£ãããåãããœãªã¥ãŒã·ã§ã³ãšããŠäœçœ®ã¥ããããŠããŸããã
ãŸãã$13,800/æã®ã³ã¹ãåæžãã13åã® GPU ã®åé åãšãã£ãå ·äœçãªæ°åã瀺ãããŠããç¹ã説åŸåããããŸãããåãªãçè«ã§ã¯ãªããå®éã«ææãåºãŠãããšããç¹ãéèŠã ãšæããŸããã
Agentic AI ã AI ã¢ããªã±ãŒã·ã§ã³ã®æ¬çªéçšãæ€èšããŠããæ¹ãç¹ã« GPU ã³ã¹ãã«èª²é¡ãæããŠããæ¹ã«ã¯ãéåžžã«åèã«ãªãå 容ã ã£ããšæããŸãã