aws-textract

How to retrieve tables that exist in a PDF using AWS Textract in Java

£可爱£侵袭症+ posted on 2020-06-25 19:00:39
Question: I found the article below, which does this in Python: https://docs.aws.amazon.com/textract/latest/dg/examples-export-table-csv.html I also used the article below to extract text: https://docs.aws.amazon.com/textract/latest/dg/detecting-document-text.html However, that article only helped me get the text. I also called "block.getBlockType()" on each Block, but no block returned its type as "CELL", even though there are tables in the image/PDF. Please help me find a Java library similar to "boto3" to extract all tables. Answer 1: What I
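One likely explanation for the missing "CELL" blocks, stated here as an assumption because the answer text is cut off: text detection (DetectDocumentText) returns only PAGE, LINE, and WORD blocks, whereas TABLE and CELL blocks are returned by AnalyzeDocument when the TABLES feature type is requested. The AWS SDK for Java includes a Textract client that fills the role boto3 plays in Python. The sketch below makes no AWS call. It uses a hypothetical stand-in `Block` class (not the SDK type) that carries only the fields needed to show how TABLE, CELL, and WORD blocks link together through child IDs and row/column indexes, which is the same structure the Python CSV example walks.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class TableBlocksSketch {

    // Hypothetical stand-in for the SDK's Block, reduced to the fields used here.
    static final class Block {
        final String id;
        final String blockType;   // "TABLE", "CELL" or "WORD"
        final int rowIndex;       // 1-based, meaningful for CELL blocks only
        final int columnIndex;    // 1-based, meaningful for CELL blocks only
        final String text;        // set for WORD blocks only
        final List<String> childIds;

        Block(String id, String blockType, int rowIndex, int columnIndex,
              String text, List<String> childIds) {
            this.id = id;
            this.blockType = blockType;
            this.rowIndex = rowIndex;
            this.columnIndex = columnIndex;
            this.text = text;
            this.childIds = childIds;
        }
    }

    // Rebuilds the rows of one table: TABLE -> CELL children -> WORD children.
    static List<List<String>> toRows(Block table, Map<String, Block> byId) {
        // row index -> (column index -> cell text), both kept in sorted order
        Map<Integer, Map<Integer, String>> grid = new TreeMap<>();
        for (String cellId : table.childIds) {
            Block cell = byId.get(cellId);
            if (cell == null || !"CELL".equals(cell.blockType)) {
                continue;
            }
            StringBuilder cellText = new StringBuilder();
            for (String wordId : cell.childIds) {
                Block word = byId.get(wordId);
                if (word != null && "WORD".equals(word.blockType)) {
                    if (cellText.length() > 0) {
                        cellText.append(' ');
                    }
                    cellText.append(word.text);
                }
            }
            grid.computeIfAbsent(cell.rowIndex, r -> new TreeMap<>())
                .put(cell.columnIndex, cellText.toString());
        }
        List<List<String>> rows = new ArrayList<>();
        for (Map<Integer, String> row : grid.values()) {
            rows.add(new ArrayList<>(row.values()));
        }
        return rows;
    }

    // Made-up sample data: a 2x2 table with one two-word cell.
    static Map<String, Block> sampleBlocks() {
        Map<String, Block> byId = new LinkedHashMap<>();
        byId.put("t1", new Block("t1", "TABLE", 0, 0, null, List.of("c1", "c2", "c3", "c4")));
        byId.put("c1", new Block("c1", "CELL", 1, 1, null, List.of("w1")));
        byId.put("c2", new Block("c2", "CELL", 1, 2, null, List.of("w2")));
        byId.put("c3", new Block("c3", "CELL", 2, 1, null, List.of("w3", "w4")));
        byId.put("c4", new Block("c4", "CELL", 2, 2, null, List.of("w5")));
        byId.put("w1", new Block("w1", "WORD", 0, 0, "Name", List.of()));
        byId.put("w2", new Block("w2", "WORD", 0, 0, "Qty", List.of()));
        byId.put("w3", new Block("w3", "WORD", 0, 0, "Blue", List.of()));
        byId.put("w4", new Block("w4", "WORD", 0, 0, "pen", List.of()));
        byId.put("w5", new Block("w5", "WORD", 0, 0, "3", List.of()));
        return byId;
    }

    public static void main(String[] args) {
        Map<String, Block> byId = sampleBlocks();
        for (List<String> row : toRows(byId.get("t1"), byId)) {
            System.out.println(String.join(",", row));
        }
    }
}
```

With real Textract output, the same walk would start from each block whose type is TABLE in the AnalyzeDocument response and read the child IDs from the block's CHILD relationships instead of the simplified `childIds` field.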

AWS-Textract-Key-Value-Pair Java - thread “main” java.lang.NullPointerException

天大地大妈咪最大 posted on 2020-06-22 04:19:26
Question: I am using AWS Textract in a Java Spring Boot project. I have set up the AWS CLI and added the SDK as a Maven dependency. I have written Java code, converted from C#, to extract the key and value pairs, and I receive the following error after some words are extracted successfully " AGENCYCUSTOMERID:FEIN(ifapplicable)MARITALSTATUS/CIVILUNION(ifapplicable)INSUREDLOCATIONCODEBUSPRIMARYE-MAILADDRESS:FEIN(ifapplicable)LINEOFBUSINESSCELLMARITALSTATUScivilUNION(ifapplicable)CELLCELLHOME ":
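The excerpt stops before the stack trace, so the actual cause is not shown. One common source of a NullPointerException in this kind of code, offered purely as an assumption, is iterating over a block's relationships when the block has none: in version 1 of the AWS SDK for Java, unset list fields such as the relationships list come back as null rather than as an empty list, and form fields with an empty value produce exactly such blocks. The sketch below shows a null-safe key/value walk. `Block` and `Relationship` here are hypothetical stand-ins, not the SDK classes, and the entity type is simplified to a single string.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class KeyValueNullSafeSketch {

    // Hypothetical stand-ins for the SDK model classes.
    static final class Relationship {
        final String type;        // "VALUE" or "CHILD"
        final List<String> ids;
        Relationship(String type, List<String> ids) {
            this.type = type;
            this.ids = ids;
        }
    }

    static final class Block {
        final String blockType;   // "KEY_VALUE_SET" or "WORD"
        final String entityType;  // "KEY", "VALUE", or null for WORD blocks
        final String text;        // null unless the block is a WORD
        final List<Relationship> relationships; // may be null
        Block(String blockType, String entityType, String text,
              List<Relationship> relationships) {
            this.blockType = blockType;
            this.entityType = entityType;
            this.text = text;
            this.relationships = relationships;
        }
    }

    // Returns the related IDs of one type; tolerates null blocks and null lists.
    static List<String> relatedIds(Block block, String type) {
        List<String> ids = new ArrayList<>();
        if (block == null || block.relationships == null) {
            return ids;
        }
        for (Relationship r : block.relationships) {
            if (type.equals(r.type) && r.ids != null) {
                ids.addAll(r.ids);
            }
        }
        return ids;
    }

    // Joins the text of the WORD children of a block, or "" if there are none.
    static String childText(Block block, Map<String, Block> byId) {
        StringBuilder sb = new StringBuilder();
        for (String id : relatedIds(block, "CHILD")) {
            Block word = byId.get(id);
            if (word != null && "WORD".equals(word.blockType) && word.text != null) {
                if (sb.length() > 0) {
                    sb.append(' ');
                }
                sb.append(word.text);
            }
        }
        return sb.toString();
    }

    // Maps each KEY block's text to the text of its linked VALUE block(s).
    static Map<String, String> keyValues(Map<String, Block> byId) {
        Map<String, String> result = new LinkedHashMap<>();
        for (Block block : byId.values()) {
            if (!"KEY_VALUE_SET".equals(block.blockType) || !"KEY".equals(block.entityType)) {
                continue;
            }
            StringBuilder value = new StringBuilder();
            for (String valueId : relatedIds(block, "VALUE")) {
                value.append(childText(byId.get(valueId), byId));
            }
            result.put(childText(block, byId), value.toString());
        }
        return result;
    }

    // Made-up sample: one key with an empty value (null relationships), one filled in.
    static Map<String, Block> sampleBlocks() {
        Map<String, Block> byId = new LinkedHashMap<>();
        byId.put("k1", new Block("KEY_VALUE_SET", "KEY", null, Arrays.asList(
                new Relationship("VALUE", List.of("v1")),
                new Relationship("CHILD", List.of("w1")))));
        byId.put("v1", new Block("KEY_VALUE_SET", "VALUE", null, null));
        byId.put("w1", new Block("WORD", null, "FEIN:", null));
        byId.put("k2", new Block("KEY_VALUE_SET", "KEY", null, Arrays.asList(
                new Relationship("VALUE", List.of("v2")),
                new Relationship("CHILD", List.of("w2")))));
        byId.put("v2", new Block("KEY_VALUE_SET", "VALUE", null, Arrays.asList(
                new Relationship("CHILD", List.of("w3")))));
        byId.put("w2", new Block("WORD", null, "Status:", null));
        byId.put("w3", new Block("WORD", null, "Single", null));
        return byId;
    }

    public static void main(String[] args) {
        keyValues(sampleBlocks()).forEach((k, v) -> System.out.println(k + " -> " + v));
    }
}
```

The guard lives in one helper (`relatedIds`) so that every traversal step goes through the same null check, instead of repeating it at each call site.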