List evals

GET https://api.openai.com/v1/evals

List evaluations for a project.

Query parameters

after (string)
Identifier for the last eval from the previous pagination request.

limit (integer, default: 20)
Number of evals to retrieve.

order (string, default: asc)
Sort order for evals by timestamp. Use asc for ascending order or desc for descending order.

order_by (string, default: created_at)
Field to order evals by. Use created_at for creation time or updated_at for last updated time.
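
As a rough illustration of how these query parameters combine into a request URL, here is a minimal sketch using only Python's standard library. The helper name build_list_evals_url is hypothetical and not part of any SDK; its defaults simply mirror the ones listed above, and the validation it performs is an assumption about what a client might want to check before sending a request.

```python
from urllib.parse import urlencode

BASE_URL = "https://api.openai.com/v1/evals"


def build_list_evals_url(after=None, limit=20, order="asc", order_by="created_at"):
    """Hypothetical helper: assemble a List evals URL from the documented query parameters."""
    # Client-side checks against the values this page documents (an assumption,
    # not a statement about how the server validates input).
    if order not in ("asc", "desc"):
        raise ValueError("order must be 'asc' or 'desc'")
    if order_by not in ("created_at", "updated_at"):
        raise ValueError("order_by must be 'created_at' or 'updated_at'")

    params = {"limit": limit, "order": order, "order_by": order_by}
    # 'after' is only sent when continuing from a previous page.
    if after is not None:
        params["after"] = after
    return f"{BASE_URL}?{urlencode(params)}"
```

Calling build_list_evals_url(limit=1) spells out the defaults explicitly in the query string; omitting them, as the example request does, should be equivalent if the documented defaults apply.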

Response

A list of evals matching the specified filters.

Example request

curl "https://api.openai.com/v1/evals?limit=1" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -H "Content-Type: application/json"

Example response

{
  "object": "list",
  "data": [
    {
      "id": "eval_67abd54d9b0081909a86353f6fb9317a",
      "object": "eval",
      "data_source_config": {
        "type": "stored_completions",
        "metadata": {
          "usecase": "push_notifications_summarizer"
        },
        "schema": {
          "type": "object",
          "properties": {
            "item": {
              "type": "object"
            },
            "sample": {
              "type": "object"
            }
          },
          "required": [
            "item",
            "sample"
          ]
        }
      },
      "testing_criteria": [
        {
          "name": "Push Notification Summary Grader",
          "id": "Push Notification Summary Grader-9b876f24-4762-4be9-aff4-db7a9b31c673",
          "type": "label_model",
          "model": "o3-mini",
          "input": [
            {
              "type": "message",
              "role": "developer",
              "content": {
                "type": "input_text",
                "text": "\nLabel the following push notification summary as either correct or incorrect.\nThe push notification and the summary will be provided below.\nA good push notificiation summary is concise and snappy.\nIf it is good, then label it as correct, if not, then incorrect.\n"
              }
            },
            {
              "type": "message",
              "role": "user",
              "content": {
                "type": "input_text",
                "text": "\nPush notifications: {{item.input}}\nSummary: {{sample.output_text}}\n"
              }
            }
          ],
          "passing_labels": [
            "correct"
          ],
          "labels": [
            "correct",
            "incorrect"
          ],
          "sampling_params": null
        }
      ],
      "name": "Push Notification Summary Grader",
      "created_at": 1739314509,
      "metadata": {
        "description": "A stored completions eval for push notification summaries"
      }
    }
  ],
  "first_id": "eval_67abd54d9b0081909a86353f6fb9317a",
  "last_id": "eval_67aa884cf6688190b58f657d4441c8b7",
  "has_more": true
}
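
The has_more and last_id fields in the response, together with the after query parameter, suggest a cursor-style pagination loop. The sketch below assumes that passing the previous page's last_id as after returns the next page; that reading follows from the parameter description but is not stated explicitly on this page. The names iter_evals and fetch_page are hypothetical: fetch_page stands in for whatever function you use to call the endpoint and parse the JSON body.

```python
def iter_evals(fetch_page, limit=20):
    """Yield every eval across pages.

    fetch_page(after, limit) is a caller-supplied (hypothetical) function that
    performs the GET request and returns the parsed JSON response as a dict.
    """
    after = None  # No cursor on the first request.
    while True:
        page = fetch_page(after=after, limit=limit)
        yield from page["data"]
        # Stop once the API reports there are no further pages.
        if not page.get("has_more"):
            break
        # Assumption: the next page starts after the last eval of this one.
        after = page["last_id"]
```

Keeping the HTTP call behind fetch_page lets the loop be exercised with canned responses, and leaves authentication and error handling to the caller.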
