20

Me: why are we paying for OCR when the API offers both json and pdf format for the data?
Manager: because we need to have the data in a PDF format for reporting to this 3rd party
Me: sure, but can we not just request both json and PDF from the vendor (it’s the same data). send the json for the automated workflow (save time, money and get better accuracy) and send the PDF to the 3rd party?
Manager: we made a commercial decision to use PDF, so we will use PDF as the format.
Me: but ...

Comments
  • 7
    Classic case of "I have no idea what I'm doing it that you're talking about so I'll just use my filler response"
  • 1
    Why do you need ocr for parsing textual data from pdf? It's not an image format.
  • 1
    @iiii Probably, because the OCR vendor prints the PDF before running it through a scanner which performs OCR
  • 2
    @iiii “ocr” is always used as a marketing term for processing any image or PDF/docx type documents. I’ve seen it used like that in a lot of companies .

    They more think of it as way to turn “humans data” to “machine data”.
Add Comment