Goal
This is a source plugin that would allow users to read and process mainframe files defined using COBOL Copy Book. This should be basic first implementation.
...
Input Format implementation : here
Design
Assumptions:
- .cbl file will have the schema in data structure
- Both data file and .cbl files would reside on HDFS
Examples
Properties :
cobolFile : .cbl file contents to specify schema
binaryFilePath : hdfs path of .bin data file to be read
isCompressed : check if it is a compressed file.User can also specify a Native Compressed Codec as input.
outputSchema : list of fields in the output file
...