Does Google Cloud Vision API detect formatting in OCRed text like bold, italics, font name (helvetica or times new roman), etc?

我们两清 提交于 2021-02-17 05:35:31

问题


The quick brown fox jumps over the lazy dog

In such a case like this, assuming there are different font families too, can cloud VIsion API detect this. Or any other OCR API detect this cleanly. Tesseract has capabilities but its so inaccurate.


回答1:


ABBYY Cloud OCR will be quite accurate, but at the end, everything depends on your fonts and scanning quality.




回答2:


Does google cloud vision API detect formatting in OCRed text like bold, italics, font name (helvetica or times new roman), etc?

Unfortunately, no.

In my project, I use ABBYY Cloud OCR SDK for this purpose. If you want to try, you can start free trial which includes 500 free requests (pages). After you create your trial account, you will receive an email from ABBYY which will contain your Application ID and Application password. Use these 2 values to create your authentication header according to Authentication.

See the following example:

  1. Perform processImage request. Pass your image in the request body.

Request:

POST / https://cloud.ocrsdk.com/v2/processImage?exportFormat=xml&profile=documentConversion&xml:writeFormatting=true
Authorization: <your token>

Response:

{
    "taskId": "a226a0b6-6705-4d6f-9f4c-517fa9b4e28e",
    "registrationTime": "2020-07-26T09:42:39Z",
    "statusChangeTime": "2020-07-26T09:42:39Z",
    "status": "Queued",
    "filesCount": 1,
    "requestStatusDelay": 10000
}
  1. Perform getTaskStatus request in order to check if your task is completed. Use taskId from the response of the previous step.

Request:

GET / https://cloud.ocrsdk.com/v2/getTaskStatus?taskId=a226a0b6-6705-4d6f-9f4c-517fa9b4e28e
Authorization: <your token>

Response:

{
    "taskId": "a226a0b6-6705-4d6f-9f4c-517fa9b4e28e",
    "registrationTime": "2020-07-26T09:42:39Z",
    "statusChangeTime": "2020-07-26T09:42:40Z",
    "status": "Completed",
    "filesCount": 1,
    "requestStatusDelay": 0,
    "resultUrls": [
        "https://ocrsdk.blob.core.windows.net/files/a226a0b6-6705-4d6f-9f4c-517fa9b4e28e.result?sv=2012-02-12&se=2020-07-26T19%3A00%3A00Z&sr=b&si=downloadResults&sig=4k9FcRoBfhodq%2BMj%2Ffj%2BGLBfwK2BsO7sj15JQOLcArk%3D"
    ]
}
  1. Download the result (see resultUrls from the response of the previous step).

I used the following picture and received the following result



来源:https://stackoverflow.com/questions/62947592/does-google-cloud-vision-api-detect-formatting-in-ocred-text-like-bold-italics

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!