An OCR solution for gas flow chart recognition | Portfolio

Type A set of scripts

TECHNOLOGIES Python

TEAM 1 developer

An OCR solution was implemented for one of the Canadian oil and gas companies. The implemented business process implies recognition of the charts which are generated by two or three pens driven by sensors attached to gas pipes. These pens plot traces on a rotating paper disk. As a result, the disc has the values of different parameters such as gas pressure, water, temperature. The next step is to convert this chart into a readable format.

This conversion process used to be performed manually by our customer’s employees. They had to manually delineate the contours of each trace using the mouse and then create the table with the values of the trace. Obviously, the whole process was very time-consuming and painful for the employee since he had no room for mistake. So, there was a need to build an OCR solution to automate this process.

Challenges

The main challenge we faced was how we actually should recognize the chart itself and all the traces it contains. The traces on the chart could hardly be seen or overlap each other.

These kinds of issues were the most challenging for us since we had to recognize each trace separately and form the table with the values of each parameter.

Solutions

To solve the problems which were described above, we did step-by-step work. To be more specific, we implemented the following things:

Implementation of recognizing the empty chart template without any traces;
Then recognizing the traces on the chart;
Then recognizing the colors of each trace since the traces can be red, black, blue, etc. It depends on the pen’s color;
Defining where the trace starts and where it ends;
Calculation of the average value of the trace.

To learn more about our approach to these challenges please see the article in our blog at the link.

Results

As a result of the first phase we developed the solution that takes the images of these charts input data, recognizes each trace and generates the result file as a table with the needed metrics of pressure, temperature, water.

This solution is currently used by the customer’s employees and provides them with a high level of accuracy. Now we are planning the second phase of development which is aimed to increase the current functionality.

Areas of expertise

External services integration (AWS S3, AWS Lambda) // Image rendering // Data mining // Image processing // Computer vision

Additional Technologies

NumPy // OpenCV // Tesseract

A Gas Flow Chart OCR solution

Fill contact form

Login Register

Login Register

Password reset

Change Password