List evals

GEThttps:/api.openai.com/v1/evals

List evaluations for a project.

Query parameters

  • after
    string
    Identifier for the last eval from the previous pagination request.
  • limit
    integer
    Defaults: 20
    Number of evals to retrieve.
  • order
    string
    Defaults: asc

    Sort order for evals by timestamp. Use asc for ascending order or desc for descending order.

  • order_by
    string
    Defaults: created_at

    Evals can be ordered by creation time or last updated time. Use created_at for creation time or updated_at for last updated time.

Response

A list of evals matching the specified filters.

Example request
1
curl https://api.openai.com/v1/evals?limit=1 \
2
-H "Authorization: Bearer $OPENAI_API_KEY" \
3
-H "Content-Type: application/json"
Example response
1
{
2
"object": "list",
3
"data": [
4
{
5
"id": "eval_67abd54d9b0081909a86353f6fb9317a",
6
"object": "eval",
7
"data_source_config": {
8
"type": "stored_completions",
9
"metadata": {
10
"usecase": "push_notifications_summarizer"
11
},
12
"schema": {
13
"type": "object",
14
"properties": {
15
"item": {
16
"type": "object"
17
},
18
"sample": {
19
"type": "object"
20
}
21
},
22
"required": [
23
"item",
24
"sample"
25
]
26
}
27
},
28
"testing_criteria": [
29
{
30
"name": "Push Notification Summary Grader",
31
"id": "Push Notification Summary Grader-9b876f24-4762-4be9-aff4-db7a9b31c673",
32
"type": "label_model",
33
"model": "o3-mini",
34
"input": [
35
{
36
"type": "message",
37
"role": "developer",
38
"content": {
39
"type": "input_text",
40
"text": "\nLabel the following push notification summary as either correct or incorrect.\nThe push notification and the summary will be provided below.\nA good push notificiation summary is concise and snappy.\nIf it is good, then label it as correct, if not, then incorrect.\n"
41
}
42
},
43
{
44
"type": "message",
45
"role": "user",
46
"content": {
47
"type": "input_text",
48
"text": "\nPush notifications: {{item.input}}\nSummary: {{sample.output_text}}\n"
49
}
50
}
51
],
52
"passing_labels": [
53
"correct"
54
],
55
"labels": [
56
"correct",
57
"incorrect"
58
],
59
"sampling_params": null
60
}
61
],
62
"name": "Push Notification Summary Grader",
63
"created_at": 1739314509,
64
"metadata": {
65
"description": "A stored completions eval for push notification summaries"
66
}
67
}
68
],
69
"first_id": "eval_67abd54d9b0081909a86353f6fb9317a",
70
"last_id": "eval_67aa884cf6688190b58f657d4441c8b7",
71
"has_more": true
72
}
Built with