Upload a file and get the parsing result immediately
output_formats
| Available output formats | Default | Description |
|---|---|---|
| json / markdown / zip | ["markdown", "json"] | Multiple selections supported. Uses the default when omitted. zip packages the Markdown output and all image files into an archive. When output_formats includes zip, element_formats.image must be file. |
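
The default and the zip constraint described above can be checked on the client before a request is sent. The following is a minimal sketch, not part of the service: the helper names `resolve_output_formats` and `check_zip_image_rule` are hypothetical, and the reverse rule (image set to file requires zip) is taken from the element_formats table.

```python
# Hypothetical client-side helpers; the names are illustrative, not part of the API.
DEFAULT_OUTPUT_FORMATS = ["markdown", "json"]
ALLOWED_OUTPUT_FORMATS = {"json", "markdown", "zip"}


def resolve_output_formats(output_formats=None):
    """Return the formats to request, applying the documented default when omitted."""
    if output_formats is None:
        return list(DEFAULT_OUTPUT_FORMATS)
    unknown = [f for f in output_formats if f not in ALLOWED_OUTPUT_FORMATS]
    if unknown:
        raise ValueError(f"unsupported output formats: {unknown}")
    return list(output_formats)


def check_zip_image_rule(output_formats, image_format="url"):
    """Enforce the two-way rule between zip output and element_formats.image."""
    wants_zip = "zip" in output_formats
    if wants_zip and image_format != "file":
        raise ValueError("output_formats includes zip, so image must be file")
    if image_format == "file" and not wants_zip:
        raise ValueError("image is file, so output_formats must include zip")
```

For example, `check_zip_image_rule(resolve_output_formats(["zip"]), "file")` passes silently, while the same call with `"url"` raises `ValueError`.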
element_formats
| Field | Available output formats | Default | Description |
|---|---|---|---|
| image | url / base64 / file / none | url | Single selection only. When image is set to file, output_formats must include zip. none means images are not returned. |
| formula | latex / mathml / ascii | latex | Single selection only. Specifies the output format for formulas. |
| table | markdown / html / image | html | Single selection only. In markdown mode, merged cells are automatically split into independent cells and filled with the same content. |
| cs | image | image | Single selection only. Output format for chemical structures; smiles format is coming soon. |
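
To make the markdown-mode behavior for merged cells concrete, here is a small sketch of what "split into independent cells and filled with the same content" means. It is an illustration only and not the service's implementation: the input shape (dicts with `row`, `col`, `rowspan`, `colspan`, `text`) and both function names are assumptions made for this example.

```python
def expand_merged_cells(cells, n_rows, n_cols):
    """Fill every grid position covered by a (possibly merged) cell with its text."""
    grid = [["" for _ in range(n_cols)] for _ in range(n_rows)]
    for cell in cells:
        row_end = cell["row"] + cell.get("rowspan", 1)
        col_end = cell["col"] + cell.get("colspan", 1)
        for r in range(cell["row"], row_end):
            for c in range(cell["col"], col_end):
                grid[r][c] = cell["text"]
    return grid


def to_markdown(grid):
    """Render a rectangular grid as a Markdown table, using the first row as header."""
    header, *body = grid
    lines = ["| " + " | ".join(header) + " |", "|" + "---|" * len(header)]
    lines += ["| " + " | ".join(row) + " |" for row in body]
    return "\n".join(lines)


# A header cell spanning two columns becomes two cells with identical content.
cells = [
    {"row": 0, "col": 0, "colspan": 2, "text": "Name"},
    {"row": 1, "col": 0, "text": "a"},
    {"row": 1, "col": 1, "text": "b"},
]
table_md = to_markdown(expand_merged_cells(cells, n_rows=2, n_cols=2))
```

If merged cells must be preserved exactly, the html table format is the safer choice, since Markdown tables cannot express row or column spans.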
feature_config
| Field | Default | Description |
|---|---|---|
| enable_text_cross_page | false | Cross-page text merging: merge text blocks spanning pages into continuous paragraphs |
| enable_table_cross_page | false | Cross-page table merging: merge tables spanning pages into a single table |
| enable_title_level_recognition | false | Heading level recognition: detect document heading hierarchy (H1/H2/H3…) |
| enable_inline_image | true | Inline images: return images inside text paragraphs |
| enable_table_image | true | Images in tables: return images inside table cells |
| enable_image_understanding | true | Image understanding: perform semantic understanding and structured description of document images |
| keep_header_footer | false | Keep headers and footers: headers and footers are filtered by default; enable this if you need to preserve them |
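
A client can mirror the defaults in the table above and override only the switches it needs. This is a sketch under that assumption: `build_feature_config` is a hypothetical helper, and whether the service accepts a partial feature_config object is not stated here, so the helper always returns the full set of fields.

```python
# Defaults copied from the feature_config table; the helper itself is hypothetical.
DEFAULT_FEATURE_CONFIG = {
    "enable_text_cross_page": False,
    "enable_table_cross_page": False,
    "enable_title_level_recognition": False,
    "enable_inline_image": True,
    "enable_table_image": True,
    "enable_image_understanding": True,
    "keep_header_footer": False,
}


def build_feature_config(**overrides):
    """Return the documented defaults with the given boolean overrides applied."""
    unknown = sorted(set(overrides) - set(DEFAULT_FEATURE_CONFIG))
    if unknown:
        raise ValueError(f"unknown feature_config fields: {unknown}")
    for name, value in overrides.items():
        if not isinstance(value, bool):
            raise TypeError(f"{name} must be a bool, got {type(value).__name__}")
    return {**DEFAULT_FEATURE_CONFIG, **overrides}


# Merge tables across pages and keep headers/footers; everything else stays default.
feature_config = build_feature_config(
    enable_table_cross_page=True,
    keep_header_footer=True,
)
```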
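The file to parse. Supports PDF, image, and Office formats.
API key, in the format sk-***
Output formats; multiple selections supported. Defaults to ["markdown", "json"] when omitted. Supports json / markdown / zip, where zip packages all output files into an archive.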
Allowed values: json, markdown, zip
Element format configuration; controls the output format of each element type.
Feature configuration (this parameter has been renamed from extract_config to feature_config).
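
Because of the rename from extract_config to feature_config, existing client code may still build requests with the old parameter name. The sketch below shows one way to migrate such a parameter dict on the client side. The function name `migrate_request_params` is hypothetical, and whether the service still accepts the old name is not stated here, so the helper always emits the new one.

```python
def migrate_request_params(params):
    """Return a copy of params with the legacy extract_config key renamed."""
    migrated = dict(params)
    if "extract_config" in migrated:
        if "feature_config" in migrated:
            # Refuse to guess which of the two values the caller intended.
            raise ValueError("pass either extract_config or feature_config, not both")
        migrated["feature_config"] = migrated.pop("extract_config")
    return migrated


legacy_params = {
    "output_formats": ["markdown", "json"],
    "extract_config": {"enable_table_cross_page": True},
}
current_params = migrate_request_params(legacy_params)
```

The input dict is left unchanged, so the helper can be applied at the request boundary without affecting callers that still hold the legacy structure.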