Brightspot CMS User Guide

Configuring Amazon Textract


This topic explains how to configure the Amazon Textract integration in Brightspot.

To configure Amazon Textract:

  1. Obtain the following Textract settings from your AWS console:

    • SQS Queue Name
    • Topic ARN
    • Role ARN
  2. Click the menu icon, then select Admin > Sites & Settings.
  3. In the Sites widget, select Global. The Edit Global widget appears.
  4. Configure the interface with AWS Textract by doing the following:

    1. Under Main, expand AWS Textract, and enter the SQS Queue Name, Topic ARN, and Role ARN you obtained in step 1.
    2. In the Minimum Block Confidence field, enter the minimum confidence score a block of text must have to be included in the extraction results. Generally, a higher minimum provides more accurate results (fewer false positives) but may exclude some correctly recognized text (more false negatives).
  5. Configure the thumbnail generator by doing the following:

    1. Expand DAM Document Data Extraction Settings.
    2. Under Extractor Services, click the add icon, and select Textract Document Data Extractor. A form appears.
    3. From the Thumbnail Extractor list, select Pdf Document Data Extractor.
  6. Click Save.

Textract is configured, and editors can view the results of a text extraction in the content edit form.
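
To get a feel for how the Minimum Block Confidence setting trades false positives against false negatives, the following Python sketch filters a few Textract-style blocks at different thresholds. It is an illustration only, not Brightspot's implementation: the sample data and the function names filter_blocks and extracted_text are hypothetical. The BlockType, Text, and Confidence keys follow the shape of Amazon Textract block objects, where Confidence is reported on a 0 to 100 scale; whether the Brightspot field uses the same scale is an assumption you should verify in your environment.

```python
# Hypothetical sample of Textract-style blocks. Real Textract responses
# contain many more fields (Geometry, Id, Relationships, and so on).
SAMPLE_BLOCKS = [
    {"BlockType": "LINE", "Text": "Invoice 2024-001", "Confidence": 99.1},
    {"BlockType": "LINE", "Text": "Tota1 due: $l50", "Confidence": 62.4},
    {"BlockType": "LINE", "Text": "Thank you", "Confidence": 88.0},
]


def filter_blocks(blocks, min_confidence):
    """Keep only blocks whose confidence is at or above the threshold.

    Assumes confidence is expressed on a 0-100 scale.
    Blocks without a Confidence value are treated as 0.
    """
    return [b for b in blocks if b.get("Confidence", 0.0) >= min_confidence]


def extracted_text(blocks, min_confidence):
    """Join the text of all blocks that pass the confidence threshold."""
    kept = filter_blocks(blocks, min_confidence)
    return "\n".join(b["Text"] for b in kept if "Text" in b)


if __name__ == "__main__":
    # A low threshold keeps the misread line; a high one drops valid text too.
    for threshold in (50.0, 80.0, 95.0):
        kept = filter_blocks(SAMPLE_BLOCKS, threshold)
        print(threshold, [b["Text"] for b in kept])
```

In this sample, a threshold of 50 keeps the misread line (a false positive), 80 removes it, and 95 also drops a correctly recognized line (a false negative). Trying a few values against representative documents is a reasonable way to choose the setting.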
