google-cloud-vision

Does Google Cloud Vision API detect formatting in OCRed text like bold, italics, font name (helvetica or times new roman), etc?

我们两清 提交于 2021-02-17 05:35:31
问题 The quick brown fox jumps over the lazy dog In such a case like this, assuming there are different font families too, can cloud VIsion API detect this. Or any other OCR API detect this cleanly. Tesseract has capabilities but its so inaccurate. 回答1: ABBYY Cloud OCR will be quite accurate, but at the end, everything depends on your fonts and scanning quality. 回答2: Does google cloud vision API detect formatting in OCRed text like bold, italics, font name (helvetica or times new roman), etc?

Google cloud vision api- OCR

跟風遠走 提交于 2021-02-08 11:22:40
问题 I want to use text-detection from image (OCR) of google cloud vision api. But i dont know how to get the subscription key from and how to authenticate and make calls in C#. Can some body tell me the step by step procedure to do that. Im very new this btw. 回答1: I think the question is a bit messed up, so let me take a step back and try to cover the most important things regarding authentication when using the Cloud Vision API. First of all, the documentation offers a really clear explanation

Google Cloud Vision API - Python

余生长醉 提交于 2021-02-07 08:35:28
问题 I can't seem to find where to add the API key or where I need to locate to the google credentials file in my google cloud vision code: import argparse import base64 import httplib2 import validators import requests from apiclient.discovery import build from oauth2client.client import GoogleCredentials def main(photo_file): '''Run a label request on a single image''' API_DISCOVERY_FILE = 'https://vision.googleapis.com/$discovery/rest?version=v1' http = httplib2.Http() credentials =

Can Google Cloud Vision API be trained using your image data?

泄露秘密 提交于 2021-02-07 04:11:31
问题 IBM Watson has a capability where you can train the classifiers on Watson using your images but I am unable to find a similar capability on Google Cloud Vision API? What I want is that I upload 10-15 classes of images and on the bases of upload images classify any images loaded after that. IBM Bluemix (Watson) has this capability but their pricing is significantly higher than Google. I am open to other services as well, if prices ares below Google's 回答1: As far as I know Google Cloud Vision

Can Google Cloud Vision API be trained using your image data?

余生颓废 提交于 2021-02-07 04:10:04
问题 IBM Watson has a capability where you can train the classifiers on Watson using your images but I am unable to find a similar capability on Google Cloud Vision API? What I want is that I upload 10-15 classes of images and on the bases of upload images classify any images loaded after that. IBM Bluemix (Watson) has this capability but their pricing is significantly higher than Google. I am open to other services as well, if prices ares below Google's 回答1: As far as I know Google Cloud Vision

Google Vision hexadecimal numbers recognition

自作多情 提交于 2021-01-29 15:20:26
问题 Google Vision OCR recognizes for hexadecimal numbers with mistakes very often (the accuracy is about 60%). For example when I try to recognize a scanned image with muber "78 30 3D 61" the Google OCR recognizes it with text like "78 30 30 61". For OCR recognition I used the live demo and .NET Api client with the same incorrect result. Here is my C# code: var image = await Google.Cloud.Vision.V1.Image.FromFileAsync("c:\\path\\to\\file.png"); var imageContext = new ImageContext(); imageContext

Trying detect image Values using Google Cloud Vision using c# asp.net c#

半城伤御伤魂 提交于 2021-01-29 10:01:30
问题 Trying detect image Values using Google Cloud Vision using c# asp.net c# but i am getting below error. Error loading native library. Not found in any of the possible locations: C:\Users\mazharkhan\Documents\Visual Studio 2013\WebSites\googleapi\bin\grpc_csharp_ext.x86.dll,C:\Users\mazharkhan\Documents\Visual Studio 2013\WebSites\googleapi\bin\runtimes/win/native\grpc_csharp_ext.x86.dll,C:\Users\mazharkhan\Documents\Visual Studio 2013\WebSites\googleapi\bin\../..\runtimes/win/native\grpc

How to group blocks that are part of a bigger sentences in Google Cloud Vision API?

a 夏天 提交于 2021-01-29 09:40:57
问题 I am using Google Cloud Vision API on Python to detect text values in hoarding boards that are usually found above a shop/store. So far I have been able to detect individual words and their bounding polygons' coordinates. Is there a way to group the detected words based on their relative positions and sizes? For example, the name of the store is usually written in same size and the words are aligned. Does the API provide some functions that group those words which probably are parts of a

google vision API returns empty bounding box vertexes, instead it returns normalised_vertexes

て烟熏妆下的殇ゞ 提交于 2021-01-28 14:01:14
问题 I am using vision.enums.Feature.Type.DOCUMENT_TEXT_DETECTION to extract some dense text in a pdf document. Here is my code: from google.cloud import vision def extract_text(bucket, filename, mimetype): print('Looking for text in PDF {}'.format(filename)) # BATCH_SIZE; How many pages should be grouped into each json output file. # """OCR with PDF/TIFF as source files on GCS""" # Detect text feature = vision.types.Feature( type=vision.enums.Feature.Type.DOCUMENT_TEXT_DETECTION) # Extract text

google vision API returns empty bounding box vertexes, instead it returns normalised_vertexes

给你一囗甜甜゛ 提交于 2021-01-28 13:53:07
问题 I am using vision.enums.Feature.Type.DOCUMENT_TEXT_DETECTION to extract some dense text in a pdf document. Here is my code: from google.cloud import vision def extract_text(bucket, filename, mimetype): print('Looking for text in PDF {}'.format(filename)) # BATCH_SIZE; How many pages should be grouped into each json output file. # """OCR with PDF/TIFF as source files on GCS""" # Detect text feature = vision.types.Feature( type=vision.enums.Feature.Type.DOCUMENT_TEXT_DETECTION) # Extract text