avro

Accessing nested fields in AVRO GenericRecord (Java/Scala)

社会主义新天地 submitted on 2021-02-07 09:17:58
Question: I have a GenericRecord with nested fields. When I call genericRecord.get(1) it returns an Object that contains the nested Avro data. I want to be able to access that object like genericRecord.get(1).get(0), but I can't because Avro returns an Object. Is there an easy way around this? When I try something like returnedObject.get("item") it says item is not a member of returnedObject.

Answer 1: I figured out one way to do it. Cast the returned Object to a GenericRecord. Example (Scala): val data …
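
The answer above is cut off at the example; here is a minimal, self-contained sketch of that cast in Scala (the schema and field names are illustrative, not the asker's):

```scala
import org.apache.avro.Schema
import org.apache.avro.generic.{GenericData, GenericRecord}

object NestedFieldAccess extends App {
  // Illustrative schema: an outer record whose second field is itself a record.
  val schemaJson =
    """{"type":"record","name":"Outer","fields":[
      |  {"name":"id","type":"int"},
      |  {"name":"inner","type":{"type":"record","name":"Inner","fields":[
      |    {"name":"item","type":"string"}]}}
      |]}""".stripMargin
  val schema = new Schema.Parser().parse(schemaJson)

  // Build a sample record so the example runs on its own.
  val inner = new GenericData.Record(schema.getField("inner").schema())
  inner.put("item", "hello")
  val outer = new GenericData.Record(schema)
  outer.put("id", 42)
  outer.put("inner", inner)

  // get(...) is typed as Object, so cast the nested value back to
  // GenericRecord before reading its fields by name or position.
  val nested = outer.get(1).asInstanceOf[GenericRecord]
  println(nested.get("item")) // hello
  println(nested.get(0))      // hello (same field, by position)
}
```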

Confluent Schema Registry : Schema ID deletion

天大地大妈咪最大 submitted on 2021-01-29 08:39:12
Question: We are in development and trying to delete the schema for a topic, since a change is incompatible with the older schema. We deleted the schema/subject and then created the new schema under the same subject name; it was created successfully. However, when we run the application, it still points to the old schema ID.

Old schema ID (for subject "topic1"): 51
New schema ID (for subject "topic1"): 52

The application fails with an error deserializing the message at org.apache.kafka …
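
The question is truncated above, but a quick way to see what the registry is actually serving after the delete and re-register is to query it with the Confluent client. A hedged sketch, assuming a recent kafka-schema-registry-client on the classpath; the URL and the subject name "topic1" are placeholders taken from the question (with the default TopicNameStrategy the subject would normally be "topic1-value"):

```java
import java.util.List;

import io.confluent.kafka.schemaregistry.client.CachedSchemaRegistryClient;
import io.confluent.kafka.schemaregistry.client.SchemaMetadata;

public class SubjectIdCheck {
    public static void main(String[] args) throws Exception {
        // Second argument = how many schemas the client caches locally.
        CachedSchemaRegistryClient client =
                new CachedSchemaRegistryClient("http://localhost:8081", 100);

        // Delete every version registered under the subject
        // (a soft delete in recent Schema Registry versions).
        List<Integer> deletedVersions = client.deleteSubject("topic1");
        System.out.println("deleted versions: " + deletedVersions);

        // ...re-register the new schema (e.g. by producing with it), then
        // confirm which ID the registry now serves for the subject:
        SchemaMetadata latest = client.getLatestSchemaMetadata("topic1");
        System.out.println("latest id=" + latest.getId()
                + ", version=" + latest.getVersion());
    }
}
```

Note that the Confluent serializers and the registry client cache schema/ID lookups in memory, so a long-running application may keep using whatever it resolved before the delete until it is restarted.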

Apache Avro C Installation

倾然丶 夕夏残阳落幕 submitted on 2021-01-29 08:01:43
Question: I am working on a project that uses Apache Avro. I downloaded Apache Avro for C and followed the provided instructions to install it on my system (Ubuntu Linux 14.04). After the installation I have some header files under the /include directory and some libraries under the /lib directory, all of them installed by Apache Avro. At this point I have created my C source files, which are as follows:

1) socket_client.h: #include <stdio.h> #include <sys …
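
The source listing above is cut off, but a quick way to confirm the library actually installed where the toolchain can find it is to build a minimal program against it. A sketch, assuming avro-c installed its pkg-config file; the include/lib paths in the build comment may need adjusting to wherever "make install" placed them:

```c
/*
 * avro_check.c -- verify that the Avro C headers and library are usable.
 *
 * Possible build commands (adjust paths as needed):
 *   gcc avro_check.c -o avro_check $(pkg-config --cflags --libs avro-c)
 *   gcc avro_check.c -o avro_check -I/usr/local/include -L/usr/local/lib -lavro
 */
#include <stdio.h>
#include <string.h>
#include <avro.h>

int main(void)
{
    const char *json =
        "{\"type\":\"record\",\"name\":\"test\","
        "\"fields\":[{\"name\":\"x\",\"type\":\"int\"}]}";
    avro_schema_t schema;

    /* Parsing a schema exercises both the header and the linked library. */
    if (avro_schema_from_json_length(json, strlen(json), &schema)) {
        fprintf(stderr, "schema parse failed: %s\n", avro_strerror());
        return 1;
    }
    printf("Avro C headers and library are usable\n");
    avro_schema_decref(schema);
    return 0;
}
```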

In schema registry, the consumer's schema could differ from the producer's: what does that actually mean?

独自空忆成欢 submitted on 2021-01-29 05:40:39
Question: When producing Avro data to Kafka, the Avro serializer writes into the byte array the ID of the very schema that was used to write the data. The Kafka consumer then fetches the schema from the Schema Registry based on the schema ID found in the received byte array. So the same schema ID, and therefore the same schema, is used on both sides, producer and consumer. Why, then, do many articles, including this one, say "The consumer's schema could differ from the producer's"? Please help me understand this.

Answer 1: Kafka Consumer fetches the schema …
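
What that statement refers to is Avro schema resolution: the ID embedded in the message identifies the writer's (producer's) schema, which the deserializer fetches from the registry, but decoding can still be done against the consumer's own reader schema as long as the two are compatible. A hedged sketch in plain Avro Java (the schemas here are illustrative) showing a reader schema that differs from the writer's:

```java
import java.io.ByteArrayOutputStream;

import org.apache.avro.Schema;
import org.apache.avro.generic.*;
import org.apache.avro.io.*;

public class SchemaResolutionDemo {
    public static void main(String[] args) throws Exception {
        // Writer (producer) schema: two fields.
        Schema writer = new Schema.Parser().parse(
            "{\"type\":\"record\",\"name\":\"User\",\"fields\":["
          + "{\"name\":\"name\",\"type\":\"string\"},"
          + "{\"name\":\"age\",\"type\":\"int\"}]}");

        // Reader (consumer) schema: drops "age", adds "email" with a default.
        Schema reader = new Schema.Parser().parse(
            "{\"type\":\"record\",\"name\":\"User\",\"fields\":["
          + "{\"name\":\"name\",\"type\":\"string\"},"
          + "{\"name\":\"email\",\"type\":\"string\",\"default\":\"n/a\"}]}");

        // Encode with the writer schema (what the producer does).
        GenericRecord rec = new GenericData.Record(writer);
        rec.put("name", "alice");
        rec.put("age", 30);
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        BinaryEncoder enc = EncoderFactory.get().binaryEncoder(out, null);
        new GenericDatumWriter<GenericRecord>(writer).write(rec, enc);
        enc.flush();

        // Decode with BOTH schemas: the writer's (looked up via the ID)
        // and the consumer's own reader schema. Avro reconciles the two:
        // "age" is skipped, "email" gets its default.
        BinaryDecoder dec = DecoderFactory.get().binaryDecoder(out.toByteArray(), null);
        GenericRecord decoded =
            new GenericDatumReader<GenericRecord>(writer, reader).read(null, dec);
        System.out.println(decoded); // -> {"name": "alice", "email": "n/a"}
    }
}
```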

Dataflow Python SDK Avro Source/Sync

有些话、适合烂在心里 submitted on 2021-01-29 03:00:28
Question: I am looking to ingest and write Avro files in GCS with the Python SDK. Is this currently possible with Avro using the Python SDK? If so, how would I do it? I see TODO comments about this in the source, so I am not too optimistic.

Answer 1: You are correct: the Python SDK does not yet support this, but it will soon.

Answer 2: As of version 2.6.0 of the Apache Beam/Dataflow Python SDK, it is indeed possible to read (and write) Avro files in GCS. Even better, the Python SDK for Beam now …
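
A minimal sketch of what that looks like with the Beam Python SDK, assuming Beam 2.6.0 or newer with the GCP extras installed (pip install "apache-beam[gcp]"); the bucket paths are placeholders, and in recent Beam releases the schema is a plain dict (older releases expected a parsed avro.schema object):

```python
import apache_beam as beam
from apache_beam.io.avroio import ReadFromAvro, WriteToAvro

# Illustrative record schema for the output files.
schema = {
    "type": "record",
    "name": "Example",
    "fields": [{"name": "name", "type": "string"}],
}

with beam.Pipeline() as pipeline:
    (
        pipeline
        | "Read" >> ReadFromAvro("gs://my-bucket/input/*.avro")
        | "Write" >> WriteToAvro(
            "gs://my-bucket/output/part",
            schema=schema,
            file_name_suffix=".avro",
        )
    )
```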

Issue in using snappy with avro in python

◇◆丶佛笑我妖孽 submitted on 2021-01-29 00:47:02
Question: I am reading a .gz file and converting it to Avro format. With codec='deflate' it works fine, i.e. I am able to convert to Avro. When I use codec='snappy' it throws the error below:

raise DataFileException("Unknown codec: %r" % codec) avro.datafile.DataFileException: Unknown codec: 'snappy'

With deflate (works fine): writer = DataFileWriter(open(avro_file, "wb"), DatumWriter(), schema, codec='deflate')
With snappy (throws the error): writer = …
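
The "Unknown codec" error usually means the optional snappy dependency is missing: the avro package only registers the snappy codec when the python-snappy module (which in turn needs the system libsnappy) can be imported. A hedged sketch of the writer once that is installed; the file name and schema are illustrative, and avro.schema.parse is spelled Parse in the older avro-python3 package:

```python
# Assumed prerequisites:
#   apt-get install libsnappy-dev     (system snappy library)
#   pip install python-snappy         (Python binding the avro codec uses)
import avro.schema
from avro.datafile import DataFileWriter
from avro.io import DatumWriter

schema = avro.schema.parse("""
{"type": "record", "name": "Example",
 "fields": [{"name": "name", "type": "string"}]}
""")

# Same writer as in the question, but with the snappy codec available.
writer = DataFileWriter(open("example.avro", "wb"), DatumWriter(),
                        schema, codec="snappy")
writer.append({"name": "alice"})
writer.close()
```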

Write nullable item to avro record in Avro C

江枫思渺然 submitted on 2021-01-28 22:52:43
Question: Schema:

const char schema[] = "{ \"type\":\"record\", \"name\":\"foo\"," "\"fields\": [" "{ \"name\": \"nullableint\", \"type\":[\"int\",\"null\"]}" "]}";

Setting the schema: avro_datum_t foo_record = avro_record(schema);
Setting up the nullable datum: avro_datum_t nullableint = avro_int32(1);
Setting the item: int err = avro_record_set(foo_record, "nullableint", nullableint);
Writing the item: int err2 = avro_file_writer_append(avro_writer, foo_record);

And there is an error. Somehow, I must set the …
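
The question is cut off above. With avro-c's legacy datum API, the value for a union-typed field has to be wrapped in a union datum that selects the branch, rather than set as a bare int. A hedged sketch along those lines (branch index 0 corresponds to "int" in this schema; everything beyond what the question shows is illustrative):

```c
#include <stdio.h>
#include <string.h>
#include <avro.h>

int main(void)
{
    const char json[] =
        "{ \"type\":\"record\", \"name\":\"foo\","
        "  \"fields\": ["
        "    { \"name\": \"nullableint\", \"type\":[\"int\",\"null\"] }"
        "  ]}";

    /* Parse the JSON schema first; avro_record() needs an avro_schema_t,
     * not the raw JSON string. */
    avro_schema_t schema;
    if (avro_schema_from_json_length(json, strlen(json), &schema)) {
        fprintf(stderr, "bad schema: %s\n", avro_strerror());
        return 1;
    }

    avro_datum_t record = avro_record(schema);

    /* Schema of the union field itself (["int","null"]). */
    avro_schema_t union_schema =
        avro_schema_record_field_get(schema, "nullableint");

    /* Wrap the int32 in a union datum selecting branch 0 ("int"),
     * then set that union datum on the record. */
    avro_datum_t int_datum   = avro_int32(1);
    avro_datum_t union_datum = avro_union(union_schema, 0, int_datum);

    int err = avro_record_set(record, "nullableint", union_datum);
    printf("avro_record_set returned %d\n", err);

    /* The record now holds a properly typed union value and can be
     * appended with avro_file_writer_append() as in the question. */
    avro_datum_decref(int_datum);
    avro_datum_decref(union_datum);
    avro_datum_decref(record);
    avro_schema_decref(schema);
    return 0;
}
```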

KafkaAvroDeserializer failing with Kryo Exception

谁说我不能喝 submitted on 2021-01-28 11:02:00
Question: I have written a consumer to read Avro generic records using a schema registry.

FlinkKafkaConsumer010 kafkaConsumer010 = new FlinkKafkaConsumer010(KAFKA_TOPICS, new KafkaGenericAvroDeserializationSchema(schemaRegistryUrl), properties);

And the deserialization class looks like this:

public class KafkaGenericAvroDeserializationSchema implements KeyedDeserializationSchema<GenericRecord> { private final String registryUrl; private transient KafkaAvroDeserializer inner; public …
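
The class above is cut off; a hedged sketch of how such a KeyedDeserializationSchema is typically completed, assuming the Confluent KafkaAvroDeserializer and the Flink 1.x Kafka connector APIs shown in the question (everything past the two fields is illustrative, not the asker's code):

```java
import java.util.HashMap;
import java.util.Map;

import org.apache.avro.generic.GenericRecord;
import org.apache.flink.api.common.typeinfo.TypeInformation;
import org.apache.flink.api.java.typeutils.TypeExtractor;
import org.apache.flink.streaming.util.serialization.KeyedDeserializationSchema;

import io.confluent.kafka.serializers.KafkaAvroDeserializer;

public class KafkaGenericAvroDeserializationSchema
        implements KeyedDeserializationSchema<GenericRecord> {

    private final String registryUrl;
    private transient KafkaAvroDeserializer inner;

    public KafkaGenericAvroDeserializationSchema(String registryUrl) {
        this.registryUrl = registryUrl;
    }

    private void ensureInitialized() {
        // KafkaAvroDeserializer is not Serializable, so it is kept transient
        // and built lazily on the task managers.
        if (inner == null) {
            Map<String, Object> config = new HashMap<>();
            config.put("schema.registry.url", registryUrl);
            config.put("specific.avro.reader", false);
            inner = new KafkaAvroDeserializer();
            inner.configure(config, false); // false = value deserializer
        }
    }

    @Override
    public GenericRecord deserialize(byte[] messageKey, byte[] message,
                                     String topic, int partition, long offset) {
        ensureInitialized();
        return (GenericRecord) inner.deserialize(topic, message);
    }

    @Override
    public boolean isEndOfStream(GenericRecord nextElement) {
        return false;
    }

    @Override
    public TypeInformation<GenericRecord> getProducedType() {
        // Extracting the type like this makes Flink treat GenericRecord as a
        // generic type and fall back to Kryo, which is the usual source of
        // the Kryo exception; Flink's GenericRecordAvroTypeInfo (from
        // flink-avro, constructed with the reader schema) avoids that fallback.
        return TypeExtractor.getForClass(GenericRecord.class);
    }
}
```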