Goal
This source plugin allows users to read and process mainframe files whose layout is defined by a COBOL copybook. This should be a basic first implementation.
...
Input Format implementation : here
Design
Examples
Properties :
cobolFilePath : HDFS path of the COBOL copybook (.cbl) file; the .cbl file contents specify the schema
binaryFilePath : HDFS path of the .bin data file to be read
isCompressed : whether the data file is compressed. The user can also specify a native compression codec as input.
outputSchema : list of fields in the output file
...
}
This source plugin will read the fixed-length flat file sales.bin stored at hdfs://data/sales/sales.bin, using the schema specified by the COBOL copybook sales.cbl (stored at hdfs://data/sales/sales.cbl, or supplied as .cbl file contents in the text input field). The plugin will convert the input binary data to Base64 encoding format. The output data will be emitted as per the schema defined by the user.
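The conversion described above can be sketched roughly as follows. This is only an illustration, not the plugin's implementation: the class name FixedLengthBase64Sketch, the method encodeRecords and the recordLength parameter are hypothetical, and the sketch assumes the record length has already been derived from the copybook. It reads fixed-length records from a stream and Base64-encodes each whole record using the standard java.util.Base64 encoder.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Base64;
import java.util.List;

// Hypothetical helper, for illustration only.
final class FixedLengthBase64Sketch {

    // Splits the stream into records of recordLength bytes and Base64-encodes each one.
    // Assumption: recordLength is known up front (for example, computed from the copybook).
    static List<String> encodeRecords(InputStream in, int recordLength) throws IOException {
        if (recordLength <= 0) {
            throw new IllegalArgumentException("recordLength must be positive");
        }
        List<String> encoded = new ArrayList<>();
        byte[] record = new byte[recordLength];
        while (true) {
            // Fill the buffer completely; a single read() call may return fewer bytes.
            int filled = 0;
            while (filled < recordLength) {
                int n = in.read(record, filled, recordLength - filled);
                if (n < 0) {
                    break; // end of stream
                }
                filled += n;
            }
            if (filled == 0) {
                return encoded; // clean end of input
            }
            if (filled < recordLength) {
                throw new IOException("Truncated record: expected " + recordLength
                        + " bytes, got " + filled);
            }
            encoded.add(Base64.getEncoder().encodeToString(record));
        }
    }

    public static void main(String[] args) throws IOException {
        // In-memory stand-in for a .bin file: three 4-byte records.
        byte[] data = "ABCDEFGHIJKL".getBytes(StandardCharsets.US_ASCII);
        for (String line : encodeRecords(new ByteArrayInputStream(data), 4)) {
            System.out.println(line);
        }
    }
}
```

The sketch encodes whole records. Splitting a record into the individual fields declared in the copybook, and handling compressed input, are left out.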
Sample .cbl file:
...