ElevenLabs 文本转语音

此笔记本演示了如何与 ElevenLabs API 交互以实现文本转语音功能。

首先，您需要设置一个 ElevenLabs 账户。您可以按照说明此处。

%pip install --upgrade --quiet  elevenlabs langchain-community

import os

os.environ["ELEVENLABS_API_KEY"] = ""

使用

from langchain_community.tools import ElevenLabsText2SpeechTool

text_to_speak = "Hello world! I am the real slim shady"

tts = ElevenLabsText2SpeechTool()
tts.name

API 参考：ElevenLabsText2SpeechTool

'eleven_labs_text2speech'

我们可以生成音频，将其保存到临时文件然后播放。

speech_file = tts.run(text_to_speak)
tts.play(speech_file)

或者直接流式传输音频。

tts.stream_speech(text_to_speak)

在 Agent 中使用

from langchain.agents import AgentType, initialize_agent, load_tools
from langchain_openai import OpenAI

API 参考：AgentType | initialize_agent | load_tools | OpenAI

llm = OpenAI(temperature=0)
tools = load_tools(["eleven_labs_text2speech"])
agent = initialize_agent(
    tools=tools,
    llm=llm,
    agent=AgentType.STRUCTURED_CHAT_ZERO_SHOT_REACT_DESCRIPTION,
    verbose=True,
)

audio_file = agent.run("Tell me a joke and read it out for me.")

[1m> Entering new AgentExecutor chain...[0m
[32;1m[1;3mAction:
\`\`\`
{
  "action": "eleven_labs_text2speech",
  "action_input": {
    "query": "Why did the chicken cross the playground? To get to the other slide!"
  }
}
\`\`\`

[0m
Observation: [36;1m[1;3m/tmp/tmpsfg783f1.wav[0m
Thought:[32;1m[1;3m I have the audio file ready to be sent to the human
Action:
\`\`\`
{
  "action": "Final Answer",
  "action_input": "/tmp/tmpsfg783f1.wav"
}
\`\`\`

[0m

[1m> Finished chain.[0m

tts.play(audio_file)

工具概念指南
工具操作指南

使用​

在 Agent 中使用​

相关​

使用

在 Agent 中使用

相关