Skip to main content
Version: 3.1.0

OCR plugin setup

Infrastructure Prerequisites:

The OCR plugin is available as two docker images, so we need previously to configure:

RabbitMQ

This infrastructure item is required internally

basic RabbitMQ helm values.yaml

  ocr-queue:
image:
registry: docker.io
repository: bitnami/rabbitmq
tag: 3.7.17-debian-9-r37
persistence:
enabled: true
path: /opt/bitnami/rabbitmq/var/lib/rabbitmq
size: 1Gi
rabbitmq:
clustering:
address_type: hostname
k8s_domain: cluster.local
existingPasswordSecret: svc-ocr-application-config
username: users
service:
distPort: 25672
managerPort: 15672
port: 5672
tlsPort: 5671
type: ClusterIP

Min.io deployment

A basic configuration for minio:

  • A basic helm values.yaml

    MINIO_ACCESS_KEY: minio.access-key
    MINIO_SECRET_KEY: minio.secret-key

Configuration

OCR

Configuration for the two components should be grouped in a pod with two containers and configurations should be provided for:

  • Internal queue system
  • Kafka configuration
  • Min.io Configuration
CELERY_BROKER_URL: amqp://users:users_328947@jx-ocr-queue:5672

KAFKA_ADDRESS: flowxdev-default-noauth-kafka-bootstrap.kafka.svc:9092
KAFKA_CONSUME_SCHEDULE: "30"
KAFKA_INPUT_TOPIC: paperflow-ocr-receive
KAFKA_OCR_CONSUMER_GROUPID: ocr_group
KAFKA_OUTPUT_TOPIC: paperflow-ocr-send

MINIO_HOST: minio-service:9000
MINIO_LOCATION: zone
MINIO_OCR_SCANS_BUCKET: pdf-scans
MINIO_OCR_SIGNATURE_BUCKET: extracted-signatures
MINIO_OCR_SIGNATURE_FILENAME: extracted_signature_{}.png
caution

The Engine is listening for messages on topics with names of a certain pattern, make sure to use correct outgoing topic names when configuring the notifications plugin.


Was this page helpful?