All I Ever Wanted in Life Was to Smooch Mizuki-chan on My iPhone (AWS Polly + Swift)

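If you know a girl in the 3D world who speaks in a synthesized voice, please contact me immediately.
Pepper does not count.

With that out of the way, let's get started.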

AWS Polly

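A speech synthesis AI that speaks many languages. (Does it even count as AI?)
Most write-ups cover using it from the CLI, but I badly wanted to put it on an iPhone and make it say whatever I like, so I wrote a wrapper in Swift.
Note that the code in the official docs does not work as-is, probably because of a version mismatch.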

My environment

iOS = 10.0
Xcode = 8.2
Swift = 2.2

Setup steps

1. Create a Cognito identity pool.
   Check the option labelled something like "Enable access to unauthenticated identities".
2. Create the service class:

import Foundation
import UIKit
import AVFoundation
import AWSCore
import AWSPolly
import AWSCognito

class AudioGuidanceService: UIViewController, NSURLSessionDownloadDelegate {
    // Held as a property so the player is not deallocated while audio is playing
    var audioPlayer: AVAudioPlayer?

    func say(message: String) {
        let credentialsProvider = AWSCognitoCredentialsProvider(
            regionType: .USWest2,
            identityPoolId: "ID of the identity pool you created"
        )

        // Authenticate anonymously (unauthenticated Cognito identity)
        let configuration = AWSServiceConfiguration(region: .USWest2, credentialsProvider: credentialsProvider)
        AWSServiceManager.defaultServiceManager().defaultServiceConfiguration = configuration

        // Build the request in Mizuki-chan format
        let input = AWSPollySynthesizeSpeechURLBuilderRequest()

        input.textType = AWSPollyTextType.Text
        input.text = message
        input.outputFormat = AWSPollyOutputFormat.Mp3
        input.voiceId = AWSPollyVoiceId.Mizuki

        let builder = AWSPollySynthesizeSpeechURLBuilder.defaultPollySynthesizeSpeechURLBuilder().getPreSignedURL(input)

        // Summon Mizuki-chan!
        builder.continueWithSuccessBlock({ (awsTask: AWSTask) in
            // The task result is the pre-signed URL of the synthesized MP3
            guard let url = awsTask.result as? NSURL else {
                return nil
            }

            let config = NSURLSessionConfiguration.backgroundSessionConfigurationWithIdentifier("backgroundSession")
            let session = NSURLSession(configuration: config, delegate: self, delegateQueue: nil)

            let request = NSURLRequest(URL: url)
            let task = session.downloadTaskWithRequest(request)

            // Start the download task
            task.resume()

            return nil
        })
    }

    // Once Mizuki-chan has been summoned, make her speak
    func URLSession(
        session: NSURLSession,
        downloadTask: NSURLSessionDownloadTask,
        didFinishDownloadingToURL url: NSURL) {
        do {
            // The downloaded file is temporary, so load it into memory before this method returns
            let data = try NSData(contentsOfURL: url, options: [])
            let player = try AVAudioPlayer(data: data)
            self.audioPlayer = player

            player.prepareToPlay()
            player.play()
        } catch {
            print(error)
        }
    }
}
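
The class above does two separate jobs: it asks Polly for a pre-signed URL, then downloads the MP3 at that URL and plays it. The second half does not depend on the AWS SDK at all, so for readers on a current Swift toolchain here is a minimal sketch of just that part, using only Foundation and AVFoundation. `SpeechPlayer` and `play(from:)` are hypothetical names that do not appear in the original code, and the sketch swaps the background download task for a plain data task on `URLSession.shared`, which should be enough for a short clip fetched while the app is in the foreground.

```swift
import Foundation
import AVFoundation

/// Hypothetical helper (not part of the original article): downloads an MP3
/// from a URL, such as a Polly pre-signed URL, and plays it.
final class SpeechPlayer {
    // Kept as a property so the player is not deallocated mid-playback.
    private var audioPlayer: AVAudioPlayer?

    func play(from url: URL) {
        // Assumption: a plain data task replaces the article's background download task.
        let task = URLSession.shared.dataTask(with: url) { [weak self] data, _, error in
            guard let data = data, error == nil else {
                print("download failed: \(String(describing: error))")
                return
            }
            // Hop to the main queue before touching the player.
            DispatchQueue.main.async {
                do {
                    let player = try AVAudioPlayer(data: data)
                    self?.audioPlayer = player
                    player.prepareToPlay()
                    player.play()
                } catch {
                    print("playback failed: \(error)")
                }
            }
        }
        task.resume()
    }
}
```

To use it, keep a `SpeechPlayer` instance alive somewhere (a property on a view controller, for example) and call `play(from:)` with the URL that the Polly URL builder returns. Because the audio is decoded from in-memory data, nothing depends on a temporary download file still existing when playback starts.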

Usage

let audio = AudioGuidanceService()
audio.say("はろーわーるど！！")

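Aside

I have only been writing Swift for two weeks, so please point out anything that looks odd!

By the way, her pronunciation of "4" comes out sounding like a question, which is cute.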

Original article

「iPhoneでみずきちゃんとﾁｭｯﾁｭしたいだけの人生だった（AWS Polly + Swift）」

About the author

iret.media editorial team
