Hi mates, according to the contract of org.apache.flink.api.common.serialization.DeserializationSchema, deserialize() should return null when the content cannot be deserialized.
But most implementations (for example org.apache.flink.formats.avro.AvroDeserializationSchema) throw an exception instead when the data is corrupted.
We've implemented our own SerDe class that returns null if the data doesn't match the Avro schema, but it's rather hard to maintain this functionality when migrating to the latest Flink version.
What do you think: would it be useful to support optionally skipping failed records in the Avro and other deserializers in the source code?
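As a rough illustration of the optional skip, here is a minimal sketch of a wrapper that turns deserialization failures into null. It is only a sketch under stated assumptions, not Flink API: `RecordDeserializer` is a hypothetical stand-in for the deserialize(byte[]) method of DeserializationSchema so that the example compiles without Flink on the classpath, and `SkippingDeserializer`, `skipFailedRecords` and `getSkippedCount` are made-up names.

```java
import java.io.IOException;

// Hypothetical stand-in for DeserializationSchema#deserialize(byte[]),
// used so the sketch compiles without Flink on the classpath.
interface RecordDeserializer<T> {
    T deserialize(byte[] message) throws IOException;
}

// Wraps any deserializer. When skipping is enabled, a failure yields null
// instead of an exception; when disabled, the failure is rethrown unchanged.
final class SkippingDeserializer<T> implements RecordDeserializer<T> {
    private final RecordDeserializer<T> delegate;
    private final boolean skipFailedRecords;
    private long skippedCount;

    SkippingDeserializer(RecordDeserializer<T> delegate, boolean skipFailedRecords) {
        this.delegate = delegate;
        this.skipFailedRecords = skipFailedRecords;
    }

    @Override
    public T deserialize(byte[] message) throws IOException {
        try {
            return delegate.deserialize(message);
        } catch (IOException | RuntimeException e) {
            if (!skipFailedRecords) {
                throw e; // keep the current fail-fast behaviour by default
            }
            skippedCount++; // count skipped records so data loss stays visible
            return null;    // null signals "record could not be deserialized"
        }
    }

    long getSkippedCount() {
        return skippedCount;
    }
}
```

Making the skip an explicit flag keeps the existing fail-fast behaviour as the default, and the counter is there so that silently dropped records can still be reported, for example through a metric.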
Software Engineer at 1DMP CORE Team