跳到主要内容

Yuan2.0

Yuan2.0是由IEIT系统开发的新一代基础大型语言模型。我们发布了所有三个模型,Yuan 2.0-102B,Yuan 2.0-51B和Yuan 2.0-2B。我们还为其他开发人员提供了用于预训练、微调和推理服务的相关脚本。Yuan2.0基于Yuan1.0,利用更广泛的高质量预训练数据和指令微调数据集,以增强模型对语义、数学、推理、代码、知识等方面的理解。

此示例介绍了如何使用LangChain与Yuan2.0 (2B/51B/102B)推理进行文本生成交互。

Yuan2.0设置了一个推理服务,因此用户只需请求推理api即可获得结果,这在Yuan2.0推理服务器中介绍。

from langchain.chains import LLMChain
from langchain_community.llms.yuan2 import Yuan2
API 参考:LLMChain | Yuan2
# default infer_api for a local deployed Yuan2.0 inference server
infer_api = "http://127.0.0.1:8000/yuan"

# direct access endpoint in a proxied environment
# import os
# os.environ["no_proxy"]="localhost,127.0.0.1,::1"

yuan_llm = Yuan2(
infer_api=infer_api,
max_tokens=2048,
temp=1.0,
top_p=0.9,
use_history=False,
)

# turn on use_history only when you want the Yuan2.0 to keep track of the conversation history
# and send the accumulated context to the backend model api, which make it stateful. By default it is stateless.
# llm.use_history = True
question = "请介绍一下中国。"
print(yuan_llm.invoke(question))

此页面是否有所帮助?