Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Goal 

This is a source plugin that would allow users to read and process mainframe files defined using COBOL Copy Book. This should be basic first implementation.

...

Input Format implementation : here 

Design

Assumptions:

  1. .cbl file will have the schema in data structure
  2. Both data file and .cbl files would reside on HDFS

 

Examples

Properties

cobolFile :  .cbl file contents to specify schema
binaryFilePath : hdfs path of .bin data file to be read
isCompressed : check if it is a compressed file.User can also specify a Native Compressed Codec as input.
outputSchema : list of fields in the output file

...