Create thread and run
Create a thread and run it in one request.
Request body
assistant_id
string
Required
The ID of the assistant to use to execute this run.
thread
object
Options to create a new thread. If no thread is provided when running a request, an empty thread will be created.messages
array
A list of messages to start the thread with.
items
object
role
string
Required
The role of the entity that is creating the message. Allowed values include:
user
: Indicates the message is sent by an actual user and should be used in most cases to represent user-generated messages.assistant
: Indicates the message is generated by the assistant. Use this value to insert messages from the assistant into the conversation.
user
string
assistant
string
content
string or array
Text content
string
Required
The text contents of the message.Array of content parts
array
Required
An array of content parts with a defined type, each can be of type
text
or images can be passed withimage_url
orimage_file
. Image types are only supported on Vision-compatible models.Image file
object
References an image File in the content of a message.
type
string
Required
Always
image_file
.image_file
string
image_file
object
Required
file_id
string
Required
The File ID of the image in the message content. Set
purpose="vision"
when uploading the File if you need to later display the file content.detail
string
Defaults: auto
Specifies the detail level of the image if specified by the user.
low
uses fewer tokens, you can opt in to high resolution usinghigh
.auto
string
low
string
high
string
Image URL
object
References an image URL in the content of a message.type
string
Required
The type of the content part.image_url
string
image_url
object
Required
url
string
Required
The external URL of the image, must be a supported image types: jpeg, jpg, png, gif, webp.detail
string
Defaults: auto
Specifies the detail level of the image.
low
uses fewer tokens, you can opt in to high resolution usinghigh
. Default value isauto
auto
string
low
string
high
string
Text
object
The text content that is part of a message.type
string
Required
Always
text
.text
string
text
string
Required
Text content to be sent to the model
attachments
array or null
A list of files attached to the message, and the tools they should be added to.items
object
file_id
string
The ID of the file to attach to the message.tools
array
The tools to add this file to.Code interpreter tool
object
type
string
Required
The type of tool being defined:
code_interpreter
code_interpreter
string
FileSearch tool
object
type
string
Required
The type of tool being defined:
file_search
file_search
string
metadata
object or null
Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard. Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
tool_resources
object or null
A set of resources that are made available to the assistant's tools in this thread. The resources are specific to the type of tool. For example, the
code_interpreter
tool requires a list of file IDs, while thefile_search
tool requires a list of vector store IDs.code_interpreter
object
file_ids
array
Defaults:
A list of file IDs made available to the
code_interpreter
tool. There can be a maximum of 20 files associated with the tool.items
string
file_search
object
file_search
object
file_search
object
metadata
object or null
Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard. Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.
model
string
The ID of the Model to be used to execute this run. If a value is provided here, it will override the model associated with the assistant. If not, the model associated with the assistant will be used.
model
string
model
string
gpt-4.1
string
gpt-4.1-mini
string
gpt-4.1-nano
string
gpt-4.1-2025-04-14
string
gpt-4.1-mini-2025-04-14
string
gpt-4.1-nano-2025-04-14
string
gpt-4o
string
gpt-4o-2024-11-20
string
gpt-4o-2024-08-06
string
gpt-4o-2024-05-13
string
gpt-4o-mini
string
gpt-4o-mini-2024-07-18
string
gpt-4.5-preview
string
gpt-4.5-preview-2025-02-27
string
gpt-4-turbo
string
gpt-4-turbo-2024-04-09
string
gpt-4-0125-preview
string
gpt-4-turbo-preview
string
gpt-4-1106-preview
string
gpt-4-vision-preview
string
gpt-4
string
gpt-4-0314
string
gpt-4-0613
string
gpt-4-32k
string
gpt-4-32k-0314
string
gpt-4-32k-0613
string
gpt-3.5-turbo
string
gpt-3.5-turbo-16k
string
gpt-3.5-turbo-0613
string
gpt-3.5-turbo-1106
string
gpt-3.5-turbo-0125
string
gpt-3.5-turbo-16k-0613
string
instructions
string or null
Override the default system message of the assistant. This is useful for modifying the behavior on a per-run basis.tools
array or null
Override the tools the assistant can use for this run. This is useful for modifying the behavior on a per-run basis.Code interpreter tool
object
type
string
Required
The type of tool being defined:
code_interpreter
code_interpreter
string
FileSearch tool
object
type
string
Required
The type of tool being defined:
file_search
file_search
string
file_search
object
Overrides for the file search tool.max_num_results
integer
The maximum number of results the file search tool should output. The default is 20 for
gpt-4*
models and 5 forgpt-3.5-turbo
. This number should be between 1 and 50 inclusive.Note that the file search tool may output fewer than
max_num_results
results. See the file search tool documentation for more information.ranking_options
object
The ranking options for the file search. If not specified, the file search tool will use the
auto
ranker and a score_threshold of 0.See the file search tool documentation for more information.
ranker
string
The ranker to use for the file search. If not specified will use the
auto
ranker.auto
string
default_2024_08_21
string
score_threshold
number
Required
The score threshold for the file search. All values must be a floating point number between 0 and 1.
Function tool
object
type
string
Required
The type of tool being defined:
function
function
string
function
object
Required
description
string
A description of what the function does, used by the model to choose when and how to call the function.name
string
Required
The name of the function to be called. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.parameters
object
The parameters the functions accepts, described as a JSON Schema object. See the guide for examples, and the JSON Schema reference for documentation about the format.
Omitting
parameters
defines a function with an empty parameter list.strict
boolean or null
Defaults: false
Whether to enable strict schema adherence when generating the function call. If set to true, the model will follow the exact schema defined in the
parameters
field. Only a subset of JSON Schema is supported whenstrict
istrue
. Learn more about Structured Outputs in the .
tool_resources
object or null
A set of resources that are used by the assistant's tools. The resources are specific to the type of tool. For example, the
code_interpreter
tool requires a list of file IDs, while thefile_search
tool requires a list of vector store IDs.code_interpreter
object
file_ids
array
Defaults:
A list of file IDs made available to the
code_interpreter
tool. There can be a maximum of 20 files associated with the tool.items
string
file_search
object
vector_store_ids
array
The ID of the vector store attached to this assistant. There can be a maximum of 1 vector store attached to the assistant.
items
string
metadata
object or null
Set of 16 key-value pairs that can be attached to an object. This can be useful for storing additional information about the object in a structured format, and querying for objects via API or the dashboard. Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters.temperature
number or null
Defaults: 1
What sampling temperature to use, between 0 and 2. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic.top_p
number or null
Defaults: 1
An alternative to sampling with temperature, called nucleus sampling, where the model considers the results of the tokens with top_p probability mass. So 0.1 means only the tokens comprising the top 10% probability mass are considered. We generally recommend altering this or temperature but not both.stream
boolean or null
If
true
, returns a stream of events that happen during the Run as server-sent events, terminating when the Run enters a terminal state with adata: [DONE]
message.max_prompt_tokens
integer or null
The maximum number of prompt tokens that may be used over the course of the run. The run will make a best effort to use only the number of prompt tokens specified, across multiple turns of the run. If the run exceeds the number of prompt tokens specified, the run will end with status
incomplete
. Seeincomplete_details
for more info.max_completion_tokens
integer or null
The maximum number of completion tokens that may be used over the course of the run. The run will make a best effort to use only the number of completion tokens specified, across multiple turns of the run. If the run exceeds the number of completion tokens specified, the run will end with status
incomplete
. Seeincomplete_details
for more info.truncation_strategy
object
tool_choice
object
parallel_tool_calls
boolean
Defaults: true
Whether to enable parallel function calling during tool use.
response_format
string or object
Specifies the format that the model must output. Compatible with GPT-4o, GPT-4 Turbo, and all GPT-3.5 Turbo models since
gpt-3.5-turbo-1106
.Setting to
enables Structured Outputs which ensures the model will match your supplied JSON schema. Learn more in the Structured Outputs guide.
Setting to
enables JSON mode, which ensures the message the model generates is valid JSON.
Important: when using JSON mode, you must also instruct the model to produce JSON yourself via a system or user message. Without this, the model may generate an unending stream of whitespace until the generation reaches the token limit, resulting in a long-running and seemingly "stuck" request. Also note that the message content may be partially cut off if
finish_reason="length"
, which indicates the generation exceededmax_tokens
or the conversation exceeded the max context length.response_format
string
auto
is the default valueauto
string
Text
object
Default response format. Used to generate text responses.type
string
Required
The type of response format being defined. Always
text
.text
string
JSON object
object
JSON object response format. An older method of generating JSON responses. Using
json_schema
is recommended for models that support it. Note that the model will not generate JSON without a system or user message instructing it to do so.type
string
Required
The type of response format being defined. Always
json_object
.json_object
string
JSON schema
object
JSON Schema response format. Used to generate structured JSON responses. Learn more about Structured Outputs.
type
string
Required
The type of response format being defined. Always
json_schema
.json_schema
string
json_schema
object
Required
Structured Outputs configuration options, including a JSON Schema.description
string
A description of what the response format is for, used by the model to determine how to respond in the format.name
string
Required
The name of the response format. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a maximum length of 64.schema
object
The schema for the response format, described as a JSON Schema object. Learn how to build JSON schemas here.
strict
boolean or null
Defaults: false
Whether to enable strict schema adherence when generating the output. If set to true, the model will always follow the exact schema defined in the
schema
field. Only a subset of JSON Schema is supported whenstrict
istrue
. To learn more, read the Structured Outputs guide.
Response
A run object.
1 curl https://api.openai.com/v1/threads/runs \2 -H "Authorization: Bearer $OPENAI_API_KEY" \3 -H "Content-Type: application/json" \4 -H "OpenAI-Beta: assistants=v2" \5 -d '{6 "assistant_id": "asst_abc123",7 "thread": {8 "messages": [9 {"role": "user", "content": "Explain deep learning to a 5 year old."}10 ]11 }12 }'
1 {2 "id": "run_abc123",3 "object": "thread.run",4 "created_at": 1699076792,5 "assistant_id": "asst_abc123",6 "thread_id": "thread_abc123",7 "status": "queued",8 "started_at": null,9 "expires_at": 1699077392,10 "cancelled_at": null,11 "failed_at": null,12 "completed_at": null,13 "required_action": null,14 "last_error": null,15 "model": "gpt-4o",16 "instructions": "You are a helpful assistant.",17 "tools": [],18 "tool_resources": {},19 "metadata": {},20 "temperature": 1.0,21 "top_p": 1.0,22 "max_completion_tokens": null,23 "max_prompt_tokens": null,24 "truncation_strategy": {25 "type": "auto",26 "last_messages": null27 },28 "incomplete_details": null,29 "usage": null,30 "response_format": "auto",31 "tool_choice": "auto",32 "parallel_tool_calls": true33 }