Model Usage Tracking
Per-completion token tracking and basic aggregation is available to all users.
Per-model usage reporting and filtering by tags is available to enterprise users via the Cost Tracking API. Contact sales (opens in a new tab) for more information.
Cost Tracking API
GET
/api/metrics/modelStats
Parameters
Name | Required | Description | Type | Example |
---|---|---|---|---|
organization_id | Yes | The organization to retrieve usage stats for | String | "org_12345" |
tags | No | Comma separated list of tags to filter by. Can be provided multiple times. Completions will only be counted if all supplied tags are matched (AND ). | String | "tag1,tag2,tag3" |
startDate | Yes | The start date of the range to retrieve stats for. | Date | "2024-01-01" |
endDate | Yes | The end date of the range to retrieve stats for. | Date | "2024-01-02" |
Response
Returns an array of datapoints that represent the input, output, and total token counts per model per day for the trailing 30 days.
Sample response
{
"data": [
{
"metric": "tokens_per_model_per_day",
"timestamp": "2024-04-25T00:00:00.000Z",
"total_tokens": 700,
"input_tokens": 366,
"output_tokens": 334,
"completions_count": 10,
"model": "gpt-4-1106-preview"
},
{
"metric": "tokens_per_model_per_day",
"timestamp": "2024-04-25T00:00:00.000Z",
"total_tokens": 400,
"input_tokens": 200,
"output_tokens": 200,
"completions_count": 6,
"model": "gpt-4-turbo"
},
{
"metric": "tokens_per_model_per_day",
"timestamp": "2024-04-24T00:00:00.000Z",
"total_tokens": 7325,
"input_tokens": 6951,
"output_tokens": 374,
"completions_count": 599,
"model": "gpt-4-1106-preview"
},
{
"metric": "tokens_per_model_per_day",
"timestamp": "2024-04-24T00:00:00.000Z",
"total_tokens": 4137,
"input_tokens": 3787,
"output_tokens": 350,
"completions_count": 400,
"model": "gpt-4-turbo"
},
...
]
}