Docugym - Private Document AI

Home Why Docugym How it works Contact Sign In

Document Processing API

To use the API, you need an API key from your organization settings. Contact your administrator to obtain one.

API Endpoint

POST

/api/resolvesync

Authentication

All API requests require authentication using a Bearer token in the Authorization header:

Authorization: Bearer YOUR_API_KEY

HTTP Header

Request Schema

Field	Type	Required	Description
`id`	string	Required	Unique request identifier for tracking
`input.file`	string	Required	Base64 encoded document file (PDF, PNG, JPG)
`input.corpusName`	string	Required	Name of the corpus to use for processing
`input.saveOODToRepository`	string	Optional	Repository identifier to save out-of-distribution (unknown) documents
`input.metadata`	object	Optional	Custom metadata to attach to the request
`input.supervision`	object	Optional	Supervision data for training or evaluation
`input.useThresholds`	boolean	Optional	Enable unknown document detection using similarity thresholds (default: false)
`callback`	string	Optional	URL for async callback (webhook)

When a callback URL is provided, the API returns immediately with a status response and POSTs the full results to your webhook when processing completes.

Response Schema

{
  "status": "OK",
  "requestId": "request-123",
  "timestamp": "2024-01-01T00:00:00.000Z",
  "duration": "2.45",
  "model": "gpt-4-vision",
  "metadata": {},
  "pages": [
    {
      "page": 1,
      "documentType": "Invoice",
      "documentTypeVariant": "Standard",
      "documentTypeId": "52",
      "documentTypeVariantId": "222",
      "pageLayout": "invoice-standard-page1",
      "pipelineId": "1",
      "pipelineName": "Invoice Extraction",
      "pipelineResult": {
        "extraction": {
          "invoice_number": "INV-2024-001",
          "total_amount": 1250.5
        },
        "enhanced_image_png_base64": "BASE64_PNG"
      },
      "attributes": [
        {
          "item": "Invoice Number",
          "text": "INV-2024-001",
          "bbox_2d": [
            100,
            50,
            250,
            70
          ],
          "namedEntity": "invoice_number"
        }
      ],
      "image_height": 1100,
      "image_width": 850,
      "matchInfo": {
        "documentPageId": "layout-123",
        "layoutId": "layout-123",
        "maxSim": 0.95
      },
      "iclExamples": [
        {
          "documentPageId": "doc-456",
          "layoutId": "layout-123",
          "maxSim": 0.93,
          "documentType": "Invoice",
          "documentTypeVariant": "Standard",
          "pageLayout": "invoice-standard-page1"
        },
        {
          "documentPageId": "doc-789",
          "layoutId": "layout-123",
          "maxSim": 0.91,
          "documentType": "Invoice",
          "documentTypeVariant": "Standard",
          "pageLayout": "invoice-standard-page1"
        }
      ]
    },
    {
      "page": 2,
      "documentType": "unknown",
      "documentTypeVariant": null,
      "pageLayout": null,
      "attributes": [],
      "isUnknown": true,
      "unknownReason": "Below class similarity threshold (0.42 < 0.65)",
      "confidence": 0.42,
      "nearestMatch": {
        "documentType": "Receipt",
        "similarity": 0.42
      }
    }
  ],
  "pipelineSchemas": [
    {
      "documentTypeId": "52",
      "documentType": "Invoice",
      "documentTypeVariant": "Standard",
      "pipelineId": "1",
      "pipelineName": "Invoice Extraction",
      "schema": "{...JSON schema text...}"
    }
  ]
}

JSON

Immediate Response:

{
  "status": "processed",
  "requestId": "request-123",
  "callbackUrl": "https://your-callback-url.com/webhook"
}

JSON

Webhook POST (sent to callback URL):

Same format as synchronous response above

Error Responses

Status Code	Error Type	Example Response
401	Unauthorized	`{"error": "Unauthorized: Invalid API key"}`
400	Bad Request	`{"error": {"requestId": "123", "message": "Missing required field: input.file"}}`
500	Server Error	`{"error": {"requestId": "123", "message": "Internal processing error"}}`

Code Examples

curl -X POST https://api.docugym.com/api/resolvesync \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "id": "request-123",
    "input": {
      "file": "BASE64_ENCODED_FILE_CONTENT",
      "corpusName": "your-corpus-name",
      "saveOODToRepository": "my-repo-key",
      "metadata": {
        "source": "api-test",
        "timestamp": "2024-01-01T00:00:00Z"
      }
    },
    "callback": "https://your-callback-url.com/webhook"
  }'

cURL

Unknown Document Detection

Similarity Thresholds

Enable unknown document detection by setting input.useThresholds: true in your request.

When enabled, the system uses two-stage rejection:

Stage 1: Rejects documents with centroid similarity < 0.5
Stage 2: Rejects documents below class-specific thresholds (typically 0.92-1.0)

Unknown documents will have isUnknown: true with an explanation and nearest match information.

In-Context Learning (ICL)

Intelligent Entity Extraction

When enabled for a corpus, the system automatically uses In-Context Learning (ICL) to improve entity extraction accuracy by providing the VLM with labeled examples from similar documents.

How it works:

The system identifies the top matching examples from your corpus based on similarity scores
Only examples with the same document type and variant as the highest match are included
Up to 3 labeled examples are provided to the VLM as context
The iclExamples field in the response shows which examples were used
Each ICL example includes document metadata and similarity scores for transparency

Note: The iclExamples field only appears in the response when ICL is enabled for the corpus and examples were successfully used.

Processing Pipeline

How it works:

Authentication - API key is validated against your organization
Document Classification - Each page is analyzed and classified against known document layouts in your corpus
Unknown Detection - When useThresholds is enabled, documents below similarity thresholds are marked as unknown
Layout Matching - Pages are matched to specific layouts with similarity scores
In-Context Learning - If enabled, the system selects up to 3 similar labeled examples from the corpus to improve extraction accuracy
Pipeline Extraction - When a pipeline is configured for a document type, the system invokes it to produce schema-compliant structured output for each page
Response Generation - Structured data is returned with pipeline results per page and the schema used for each document type (inpipelineSchemas)

File Size Limitations

File size is limited to 10MB after Base64 encoding.

Document Processing API

API Endpoint

Authentication

Request Schema

Response Schema

Synchronous Response (no callback)

Asynchronous Response (with callback)

Immediate Response:

Webhook POST (sent to callback URL):

Error Responses

Code Examples

Unknown Document Detection

Similarity Thresholds

In-Context Learning (ICL)

Intelligent Entity Extraction

Processing Pipeline

How it works:

File Size Limitations