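This section explores the inner workings of LangChain, a Python library that simplifies working with LLMs. With it you can build intuitive, reusable, and extensible applications that change how you interact with LLMs.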


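A large language model (LLM) is a very large deep learning model pretrained on vast amounts of data. The underlying Transformer is a set of neural networks built around self-attention (an encoder and a decoder in the original design; GPT-style models use only the decoder), which lets it understand and generate human-like text. These models are the foundation on which applications are built. One example is GPT-3.5-turbo, an LLM developed by OpenAI.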

The ChatOpenAI class lets you interact with the GPT-3.5-turbo model through the OpenAI API.

from langchain_openai import ChatOpenAI

# Create a ChatOpenAI instance
chat = ChatOpenAI(temperature=0.0, model="gpt-3.5-turbo")

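Setting the temperature parameter to 0.0 tells the model to always favor its most likely tokens, which makes the output as close to deterministic as the API allows and keeps results consistent across runs. This is especially useful when you are building applications that need reliable, repeatable results.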
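To see why a low temperature pushes the output toward a single choice, here is a toy sketch of temperature-scaled softmax. It is an illustration only, not OpenAI's implementation; the function name and the logits are made up for the example. Dividing by a temperature of exactly 0 is undefined, so the sketch uses a small positive value to approximate it.

```python
import math

def softmax_with_temperature(logits, temperature):
    """Turn raw scores into probabilities; lower temperature sharpens them."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical scores for three candidate next tokens
logits = [2.0, 1.0, 0.5]

warm = softmax_with_temperature(logits, 1.0)   # probability spread across tokens
cold = softmax_with_temperature(logits, 0.05)  # almost all mass on the top token

print([round(p, 3) for p in warm])
print([round(p, 3) for p in cold])
```

At the lower temperature nearly all of the probability lands on the highest-scoring token, so sampling picks the same token almost every time.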



from langchain_core.prompts import ChatPromptTemplate

# The instruction and the delimiter around {text} must agree
template_string = """Translate the text that is delimited by triple backticks into a style that is {style}. text: ```{text}```"""

prompt_template = ChatPromptTemplate.from_template(template_string)


# Output of prompt_template.messages[0].prompt (the exact repr varies by LangChain version)
PromptTemplate(input_variables=['style', 'text'], template='Translate the text that is delimited by triple backticks into a style that is {style}. text: ```{text}```')

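In the example above, we defined a prompt template that tells the language model to rewrite a given text in a specified style. Prompts let you carry out a wide range of tasks, from language translation to content generation and even complex analysis. When you need a more dynamic, personalized interaction with the model, the prompt_template.format_messages method fills the template's placeholders and returns formatted messages. This makes it easy to bring user input, context, or any other relevant data into the prompt, so each interaction is tailored to the case at hand.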

customer_style = "American English in a calm and respectful tone"
customer_email = "Arrr, I be fuming that me blender lid flew off and splattered me kitchen walls with smoothie! ..."

# Call the LLM to translate to the style of the customer message
customer_messages = prompt_template.format_messages(style=customer_style, text=customer_email)
customer_response = chat.invoke(customer_messages)
print(customer_response.content)

# Output
I am really frustrated that my blender lid flew off and splattered my kitchen walls with smoothie! And to make matters worse, the warranty doesn't cover the cost of cleaning up my kitchen. I could really use your help right now, friend.
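The idea behind a prompt template can be sketched in plain Python. The class below is a hypothetical stand-in, not LangChain's implementation: it only discovers the placeholder names in a template string and fills them in, which is enough to show how one template can be reused with different inputs.

```python
from string import Formatter

class SimplePromptTemplate:
    """Hypothetical stand-in for a prompt template; not LangChain's class."""

    def __init__(self, template):
        self.template = template
        # Collect the {placeholder} names found in the template string
        self.input_variables = sorted(
            {name for _, name, _, _ in Formatter().parse(template) if name}
        )

    def format(self, **values):
        # Fail early with a clear message if a placeholder was not supplied
        missing = [v for v in self.input_variables if v not in values]
        if missing:
            raise KeyError(f"missing template variables: {missing}")
        return self.template.format(**values)

template = SimplePromptTemplate(
    "Translate the text into a style that is {style}. text: {text}"
)
print(template.input_variables)

# The same template is reused with two different sets of inputs
prompt_a = template.format(style="formal English", text="hey, what's up?")
prompt_b = template.format(style="pirate English", text="Good morning.")
print(prompt_a)
print(prompt_b)
```

Keeping the wording in one template and varying only the inputs is what makes prompts reusable across requests.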



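While prompts shape what goes into the language model, output parsers play a key role in interpreting and structuring what comes back. A parser converts the raw text the model generates into a structured format that your application can consume and process easily. Imagine you are building an e-commerce application that relies on customer reviews for insights. With LangChain's ResponseSchema and StructuredOutputParser, you can define the output format you expect from the model and extract the relevant information from each review.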

from langchain.output_parsers import ResponseSchema, StructuredOutputParser

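# Define the expected output schema; these descriptions appear verbatim in the format instructions
gift_schema = ResponseSchema(
    name="gift",
    description="Was the item purchased as a gift for someone else? Answer True if yes, False if not or unknown.",
)
delivery_days_schema = ResponseSchema(
    name="delivery_days",
    description="How many days did it take for the product to arrive? If this information is not found, output -1.",
)
price_value_schema = ResponseSchema(
    name="price_value",
    description="Extract any sentences about the value or price, and output them as a comma separated Python list.",
)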

response_schemas = [gift_schema, delivery_days_schema, price_value_schema]

# Create the output parser
output_parser = StructuredOutputParser.from_response_schemas(response_schemas)
format_instructions = output_parser.get_format_instructions()

print(format_instructions)

# Output (the JSON code-fence markers around the schema are omitted here)
The output should be a markdown code snippet formatted in the following schema, including the leading and trailing "```json" and "```":

{
    "gift": string  // Was the item purchased as a gift for someone else? Answer True if yes, False if not or unknown.
    "delivery_days": string  // How many days did it take for the product to arrive? If this information is not found, output -1.
    "price_value": string  // Extract any sentences about the value or price, and output them as a comma separated Python list.
}

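Once you define the desired output schema, LangChain generates format instructions that guide the language model to produce its output in that specific format. The output_parser.parse method then turns the structured output into a Python dictionary, so you can pull out details such as whether the item was a gift, how long delivery took, and any comments about the product's value or price.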

review_template_2 = """
For the following text, extract the following information:

gift: Was the item purchased as a gift for someone else? 
Answer True if yes, False if not or unknown.

delivery_days: How many days did it take for the product
to arrive? If this information is not found, output -1.

price_value: Extract any sentences about the value or price,
and output them as a comma separated Python list.

text: {text}

{format_instructions}
"""

prompt = ChatPromptTemplate.from_template(template=review_template_2)

# customer_review is assumed to hold the raw review text as a string
messages = prompt.format_messages(
    text=customer_review, format_instructions=format_instructions
)

response = chat.invoke(messages)

print(response.content)

# Output (the JSON code-fence markers around the object are omitted here)
{
    "gift": "True",
    "delivery_days": "2",
    "price_value": "It's slightly more expensive than the other leaf blowers out there, but I think it's worth it for the extra features."
}

# Parse the model's reply into a Python dictionary
output_dict = output_parser.parse(response.content)
print(output_dict["gift"])  # Output: True
print(output_dict["delivery_days"])  # Output: 2
print(output_dict["price_value"])  # Output: It's slightly more expensive than the other leaf blowers out there, but I think it's worth it for the extra features.
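Conceptually, the parsing step extracts a JSON object from the model's reply and checks it against the expected keys. The sketch below illustrates that idea with the standard library only. The helper parse_fenced_json and the sample reply are hypothetical, and this is not how StructuredOutputParser is actually implemented.

```python
import json
import re

def parse_fenced_json(text, expected_keys):
    """Pull a JSON object out of a model reply and check the expected keys."""
    # Prefer the contents of a json code fence if one is present
    match = re.search(r"`{3}json\s*(.*?)`{3}", text, re.DOTALL)
    payload = match.group(1) if match else text
    data = json.loads(payload.strip())
    missing = [k for k in expected_keys if k not in data]
    if missing:
        raise ValueError(f"reply is missing keys: {missing}")
    return data

fence = "`" * 3  # built programmatically so the sample stays easy to embed
reply = (
    f"{fence}json\n"
    '{"gift": "True", "delivery_days": "2", "price_value": "worth it"}\n'
    f"{fence}"
)
parsed = parse_fenced_json(reply, ["gift", "delivery_days", "price_value"])

# The parsed values are still strings, so convert them before using them in logic
is_gift = parsed["gift"].strip().lower() == "true"
delivery_days = int(parsed["delivery_days"])
print(is_gift, delivery_days)
```

The explicit conversion matters because every field in the schema is declared as a string: a check such as `if parsed["gift"]:` would also succeed for the string "False".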




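  • Reusability: Instead of recreating prompts and parsers from scratch for every new task, LangChain lets you define reusable components that can be shared across applications, or with other developers on your team or in the community.
  • Consistency: By centralizing prompts and output formats, you can give users a consistent, predictable experience regardless of the task or language model in use.
  • Extensibility: As your application grows and evolves, LangChain's modular approach makes it easier to adapt to changing requirements, incorporate new language models, or integrate with other data sources or APIs.
  • Built-in library: LangChain provides a rich library of prebuilt prompts and parsers for common tasks such as summarization, question answering, and database interaction, so you can jump-start development and focus on the features that set your application apart.
  • Seamless integration: LangChain's abstractions smooth the interaction between your application, the language model, and the parsed output, so you can build end-to-end LLM solutions while keeping the codebase clean and maintainable.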
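The modular structure described in the list above can be sketched as three interchangeable parts: a template, a model call, and a parser. Everything in this example is hypothetical (build_pipeline, fake_model, and the sample review); the stub model returns a canned reply so the sketch runs without an API key, and it does not reflect LangChain's own chaining API.

```python
import json

def build_pipeline(template, model, parser):
    """Compose prompt formatting, a model call, and output parsing into one callable."""
    def run(**inputs):
        prompt = template.format(**inputs)  # fill the template placeholders
        raw = model(prompt)                 # get the raw text reply
        return parser(raw)                  # turn it into structured data
    return run

def fake_model(prompt):
    """Hypothetical stand-in for an LLM call; returns a canned JSON reply."""
    return json.dumps({"prompt_length": len(prompt), "sentiment": "positive"})

review_pipeline = build_pipeline(
    "Classify the sentiment of this review: {review}",
    fake_model,
    json.loads,
)

result = review_pipeline(review="Arrived in two days and works great.")
print(result["sentiment"])
```

Because each part is passed in separately, swapping the stub for a real model or replacing the parser does not require changes to the other two parts.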





