Upload a file for synchronous OCR processing. This endpoint is intended for lightweight, low-volume document parsing and returns results inline in the response.
For large documents, high-throughput workloads, or asynchronous processing, use the /files endpoints, which are optimized for those use cases.
Supported file types: .pdf, .png, .jpg, .jpeg, .pptx, .ppt, .odp, .docx, .odt, .doc, .html
A maximum of 16 pages are processed per request. For documents exceeding this limit, split the content across multiple sequential calls using the pages parameter (e.g., pages="1-16" for the first call, pages="17-32" for the second).
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
The document to parse.
technical_name of an enabled parser model. Falls back to platform default.
Page range to parse. Formats: "all", "1-5", "1,3,7", "2-4,8". Maximum 16 pages per request. For larger documents, make multiple calls with different page ranges.
When enabled, detects repetitive generation loops and gradually increases the sampling temperature to break out of them.
Controls the randomness of the model output. Lower values (e.g. 0.1) produce more deterministic results, higher values (e.g. 1.0) increase variety. Range: 0.0–2.0.
0 <= x <= 2Maximum number of tokens the model can generate per page. Higher values allow longer outputs but increase processing time. Range: 1–16384.
1 <= x <= 16384Penalizes repeated tokens to reduce redundant output. A value of 1.0 applies no penalty; higher values (e.g. 1.2) discourage repetition more strongly. Range: 1.0–2.0.
1 <= x <= 2