pdf2Data Editor

pdf2Data Editor is a browser-based app that streamlines the process of extracting data from PDFs, making it more efficient and accessible for everyone.
With its identification and extraction interface, even Subject Matter Experts (SMEs) can create and maintain extraction templates without needing technical skills. Once you've created the rules, you can use them with the pdf2Data parsing engine to automatically extract the data you need, saving you time and effort.

important

For trial purposes, pdf2Data Editor is deployed on pdf2Data.online. Feel free to request your free 30-day trial.

That said, we recommend deploying the Editor locally. pdf2Data.online always runs the latest available pdf2Data UI, so templates downloaded from it may not be compatible with the version of the parsing engine used in your workflow.
Deploying the Editor locally on your system lets you keep the Editor and the parsing engine on the same version and avoids these compatibility issues.

Introduction

These instructions explain how to deploy pdf2Data Editor using Docker Compose. You can also use them to upgrade an existing pdf2Data 4 installation to a specific version.

caution

A legacy version of pdf2Data (1.0-3.1.1) cannot be seamlessly upgraded to pdf2Data 4. Instead, you need to create an empty instance of pdf2Data 4.0 and import your existing templates as described in the guide.

pdf2Data Editor has four main components (Docker containers):

Container         AWS ECR   Docker Hub (older versions)
manager-frontend  4.2.0+    4.0.0-4.1.3
manager-backend   4.2.0+    4.0.0-4.1.3
manager-database  4.2.0+    4.0.0-4.1.3
editor            4.2.0+    4.0.0-4.1.3
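
As an illustration of how to read the table above, the following sketch maps a version string to the registry that hosts its images. The function names `parse_version` and `registry_for_version` are hypothetical helpers written for this example and are not part of pdf2Data. The sketch assumes plain dotted numeric versions such as 4.1.3, and it treats any version the table does not list as an error rather than guessing.

```python
from typing import Tuple


def parse_version(version: str) -> Tuple[int, ...]:
    """Turn a dotted version string such as '4.1.3' into a comparable tuple."""
    parts = tuple(int(part) for part in version.strip().split("."))
    # Pad to three components so that "4.2" compares equal to "4.2.0".
    return parts + (0,) * (3 - len(parts))


def registry_for_version(version: str) -> str:
    """Return the registry listed in the table for a pdf2Data Editor version.

    Hypothetical helper: it only encodes the version ranges shown in the table.
    """
    parsed = parse_version(version)
    if parsed < (4, 0, 0):
        # Versions 1.0-3.1.1 are legacy releases and are not in the table.
        raise ValueError(f"version {version} is a legacy release, not listed")
    if parsed <= (4, 1, 3):
        return "Docker Hub"
    if parsed >= (4, 2, 0):
        return "AWS ECR"
    # Anything between 4.1.3 and 4.2.0 is not covered by the table.
    raise ValueError(f"version {version} is not listed in the table")


if __name__ == "__main__":
    for candidate in ("4.0.0", "4.1.3", "4.2.0"):
        print(candidate, "->", registry_for_version(candidate))
```

Raising an error for unlisted versions keeps the helper from silently pointing at a registry that may not hold the images.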

Depending on your needs, there are two different deployment modes of Editor available:

  • standalone-editor - installs only the basic pdf2Data Editor, which provides template creation and editing functionality
  • full - installs the fully functional UI. In addition to the basic pdf2Data Editor, it includes template storage as well as management and access control facilities

To deploy pdf2Data with Docker Compose, you need to create two configuration files:

  1. docker-compose.yml - the Docker Compose configuration
  2. .env - the environment configuration, which contains application-related parameters
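
Before running Docker Compose, it can help to confirm that both files are present. The following is a minimal sketch of such a check, assuming both files sit in the same deployment directory. The function name `missing_config_files` and the directory layout are assumptions made for this example, not part of pdf2Data or its helper script.

```python
from pathlib import Path
from typing import List

# The two configuration files named in the list above.
REQUIRED_FILES = ("docker-compose.yml", ".env")


def missing_config_files(deploy_dir: str) -> List[str]:
    """Return the names of required files that are absent from deploy_dir.

    Hypothetical helper: it checks only that the files exist, not their contents.
    """
    base = Path(deploy_dir)
    return [name for name in REQUIRED_FILES if not (base / name).is_file()]


if __name__ == "__main__":
    missing = missing_config_files(".")
    if missing:
        print("Missing configuration files:", ", ".join(missing))
    else:
        print("Both configuration files are present.")
```

An empty result means both files were found. The check does not validate what the files contain.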

For your convenience, we provide a helper script for each version of pdf2Data starting with 4.0. The script can automatically create the configuration files, pull the necessary images from Docker Hub, and deploy the pdf2Data UI. We strongly recommend installing pdf2Data Editor with the helper script.

If this does not work for you, or you just like rolling up your sleeves from time to time, manual deployment of pdf2Data Editor is also available.