DeepInfra is a serverless inference-as-a-service platform that provides access to a variety of LLMs and embedding models. This notebook covers how to use LangChain with DeepInfra for language models.

Set the Environment API Key

Get your API key from DeepInfra: log in and generate a new token.

You are given 1 hour of free serverless GPU compute to test different models (see here). You can print your token with deepctl auth token.

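# get a new token from DeepInfra, then paste it at the prompt

from getpass import getpass

DEEPINFRA_API_TOKEN = getpass()

import os

# the LangChain DeepInfra wrapper reads the token from this environment variable
os.environ["DEEPINFRA_API_TOKEN"] = DEEPINFRA_API_TOKEN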
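
Prompting for the token on every run can be inconvenient when the notebook is re-executed or run non-interactively. The LangChain DeepInfra wrapper reads the token from the DEEPINFRA_API_TOKEN environment variable, so one option is to reuse a value already exported in the shell and prompt only as a fallback. A minimal sketch of that idea; the helper name ensure_token and its prompt_fn parameter are hypothetical and not part of LangChain or DeepInfra:

```python
import os
from getpass import getpass


def ensure_token(var_name="DEEPINFRA_API_TOKEN", prompt_fn=getpass):
    """Return the token stored in var_name, prompting only when it is unset.

    ensure_token is a hypothetical helper for illustration; prompt_fn is
    injectable so the function can be exercised without a terminal.
    """
    token = os.environ.get(var_name)
    if not token:
        # Nothing exported yet: ask once and store it for later cells.
        token = prompt_fn(f"{var_name}: ")
        os.environ[var_name] = token
    return token
```

Calling ensure_token() before constructing the LLM leaves an existing DEEPINFRA_API_TOKEN untouched and asks for input only when the variable is missing or empty.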


Create the DeepInfra instance

You can also use our open-source deepctl tool to manage your model deployments. You can view a list of available parameters here.

from langchain_community.llms import DeepInfra

llm = DeepInfra(model_id="meta-llama/Llama-2-70b-chat-hf")
llm.model_kwargs = {
    "temperature": 0.7,
    "repetition_penalty": 1.2,
    "max_new_tokens": 250,
    "top_p": 0.9,
}
# run inference directly via the wrapper
llm.invoke("Who let the dogs out?")
'This is a question that has puzzled many people'
# run streaming inference
for chunk in llm.stream("Who let the dogs out?"):
    print(chunk)
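
Streaming yields the completion piece by piece, so a caller usually wants both to display each chunk as it arrives and to keep the full text afterwards. A minimal sketch of that pattern, assuming each chunk is a plain string; fake_stream and collect_stream are hypothetical names, and fake_stream stands in for the model's stream so the example runs without a DeepInfra token:

```python
def fake_stream(prompt):
    """Hypothetical stand-in for llm.stream(prompt); yields string chunks."""
    for piece in ["Who? ", "Who? ", "Who?"]:
        yield piece


def collect_stream(chunks, on_chunk=None):
    """Consume an iterator of string chunks and return the joined text."""
    parts = []
    for chunk in chunks:
        if on_chunk is not None:
            on_chunk(chunk)  # e.g. print(chunk, end="", flush=True)
        parts.append(chunk)
    return "".join(parts)


full_text = collect_stream(fake_stream("Who let the dogs out?"))
```

With a real model, the stream returned by the LLM for the same prompt would be passed in place of fake_stream(...).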

Create a Prompt Template

We will create a prompt template for Question and Answer.

from langchain_core.prompts import PromptTemplate

template = """Question: {question}

Answer: Let's think step by step."""

prompt = PromptTemplate.from_template(template)
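
To see the text the model will actually receive, it can help to render the template by hand. The sketch below uses plain str.format rather than LangChain, on the assumption that the default f-string style of PromptTemplate substitutes a simple {question} placeholder the same way:

```python
template = """Question: {question}

Answer: Let's think step by step."""

# Plain-Python approximation of filling in the template's single placeholder.
rendered = template.format(question="Can penguins reach the North pole?")
print(rendered)
```

With the real object, prompt.format(question=...) should produce the same string.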

Initialize the LLMChain

from langchain.chains import LLMChain

llm_chain = LLMChain(prompt=prompt, llm=llm)

Run the LLMChain

Provide a question and run the LLMChain.

question = "Can penguins reach the North pole?"

llm_chain.run(question)
"Penguins are found in Antarctica and the surrounding islands, which are located at the southernmost tip of the planet. The North Pole is located at the northernmost tip of the planet, and it would be a long journey for penguins to get there. In fact, penguins don't have the ability to fly or migrate over such long distances. So, no, penguins cannot reach the North Pole. "
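
Conceptually, the chain performs two steps: it fills the prompt template with the inputs, then passes the resulting string to the model. A minimal sketch of that flow in plain Python, under the assumption that nothing else (callbacks, output parsing) is involved; run_chain and echo_llm are hypothetical names, and echo_llm stands in for the DeepInfra model so the example runs offline:

```python
def echo_llm(prompt_text):
    """Hypothetical stand-in for the model: reports the prompt it received."""
    return f"[model saw {len(prompt_text)} characters]"


def run_chain(template, llm, **inputs):
    """Fill the template with inputs, then hand the prompt to the llm callable."""
    prompt_text = template.format(**inputs)
    return llm(prompt_text)


template = "Question: {question}\n\nAnswer: Let's think step by step."
answer = run_chain(template, echo_llm, question="Can penguins reach the North pole?")
```

Swapping echo_llm for a callable that queries a real model gives the same two-step behavior, which is why the prompt wording ("Let's think step by step.") directly shapes the answer.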

