API Latency Test

OpenAI-compatible API checker

OpenAI API Checker

Test latency, streaming, model access, and OpenAI-compatible response behavior before you buy or integrate an API relay.

We do not store your API key. It is used only for this live request, not saved to accounts, rankings, or hosted-key features. A low-scope test key is recommended.

No account. API keys are used only for this live test.

FAQ

What does this tool actually check?

It checks model listing, selected model availability, OpenAI-compatible response shape, streaming behavior, latency, and a fixed set of control questions for instruction following, basic knowledge, structured output, and model-family behavior.

What do latency, TTFT, streaming total, and tokens mean?

Latency is the time from sending a normal request to getting the first response. TTFT, or first token time, is when the first streamed word or chunk appears. Streaming total is the full end-to-end streaming time. Tokens/sec is output speed. Input tokens are the prompt tokens you sent, and output tokens are the tokens the model returned.

Why is the result useful?

The score combines multiple technical signals instead of trusting one field. That makes it easier to notice relays that look usable at first but have unstable structure, weak model behavior, or slow response patterns.

Do you store API keys?

No. The API key is used only for this live test. It is not saved to an account system, ranking database, or hosted-key feature. Prefer a low-scope or temporary test key.

Can this prove a relay is absolutely safe?

No. Treat it as a technical health check before buying, integrating, or relying on a relay. Long-term stability and business risk still require repeated testing and your own judgment.

What does the score actually mean?

The score is a single-run quick-check score built from question accuracy, model-target match, protocol and response-shape consistency, plus streaming and latency signals. It is not a long-term benchmark or statistical confidence score.

Why can the same model get different scores?

Relays may switch upstream providers, routes, or regions. Latency and response behavior can also fluctuate, so repeated tests are more meaningful than a single result.