-
Notifications
You must be signed in to change notification settings - Fork 4.5k
Description
When a row object has a beam schema containing fields of type MAP which values are null, it will throw a NullPointerException while trying to call Row.toString() method.
This could happen, for example, when you convert an AVRO record (which has unions of null and map types) to Row object and then call toString() method.
Test showing the issue:
@Test(expected = NullPointerException.class)
public void givenANullableMap_whileRowToString_thenThrowNullPointerException()
{
Schema mapSchema = map().values().stringType();
Schema nullableMapUnion = SchemaBuilder.unionOf().nullType().and().type(mapSchema).endUnion();
Schema
recordSchema = SchemaBuilder.record("TestRecord").fields().name("union") .type(nullableMapUnion).withDefault(null).endRecord();
GenericRecord
genericRecord = new GenericRecordBuilder(recordSchema).build();
Row rowRecord = AvroUtils.toBeamRowStrict(genericRecord,
AvroUtils.toBeamSchema(recordSchema));
String rowAsString = rowRecord.toString();
}
This behaviour doesn't happen with the previous implementation of the Row.toString() method from version 2.29.0.
Additionally when you have a non null value for the map attribute, the row representation as string in version 2.30.0 is as follow:
Row: union:{(testing, value), }
Which has an extra comma despite the fact it is only one element in the map.
Imported from Jira BEAM-12581. Original Jira may contain additional context.
Reported by: LuisMi.