Post

2 followers Follow
0
Avatar

How create an Avro file with explicitly defining the Schema using Spring-XD create stream command

We are reading the RabbitMQ message as a source and creating the Avro using Sink hdfs-dataset, but the problem the Avro file will be created with the default schema file with content as "String" this is causing as problem while loading the data to HAWQ as well performing some transformation on the Avro file.

We are creating an stream Ex:- "stream create --name rabbitmq | hdfs-dataset --directory=/xd/Avrofile batchSize=1000 " --deploy

Please let us know if there is any option to explicitly the define the schema or any option where we can specify in the Groovy script as transformation.

Sudhakar Rao

Please sign in to leave a comment.

1 comment