Performance Evaluation
Setup
Hardware and software used for measuring the performance:
2.9 GHz Intel Core i5
16 GB 2133 MHz LPDDR3
Java 7
Light Data Transformation DMD
The following high-level transformations are performed on the data:
Parsing CSV
Dropping columns
Setting default values on columns
Changing case
Masking data
Filtering rows based on an expression
Directives
parse-as-csv demo , true
drop demo
drop demo_12
fill-null-or-empty demo_11 N/A
uppercase demo_17
mask-number demo_18 xxx###
drop demo_6
drop demo_7
fill-null-or-empty demo_5 N/A
uppercase demo_3
filter-row-if-true demo_9 =~ "CA"
mask-number demo_10 xxx##
mask-shuffle demo_4
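To make the effect of these directives concrete, here is a minimal Python sketch of what a few of them do to a single parsed row. It is an illustration only, not the engine's implementation. The sample data, the helper names (`mask_number`, `transform`), and the reduced set of columns are hypothetical. Two semantics are assumed rather than taken from this page: that in a mask pattern `#` keeps a digit and `x` hides it, and that `filter-row-if-true` removes rows whose value matches the expression as a whole.

```python
import csv
import io
import re

# Hypothetical sample input; the real data set has 18 columns.
SAMPLE = (
    "demo_9,demo_11,demo_12,demo_17,demo_18\n"
    "NY,,junk,alice,123456\n"
    "CA,x,junk,bob,654321\n"
)


def mask_number(value, pattern):
    """Mask digits of `value` by `pattern`.

    Assumed semantics: '#' emits the next digit unchanged, 'x' consumes
    the next digit and emits 'x', any other character is copied as-is.
    """
    digits = [c for c in value if c.isdigit()]
    out = []
    i = 0
    for p in pattern:
        if p == "#":
            out.append(digits[i] if i < len(digits) else "")
            i += 1
        elif p == "x":
            out.append("x")
            i += 1
        else:
            out.append(p)
    return "".join(out)


def transform(text):
    """Apply a subset of the directives above to CSV text with a header row."""
    rows = []
    for row in csv.DictReader(io.StringIO(text)):      # parse-as-csv demo , true
        row.pop("demo_12", None)                       # drop demo_12
        if not row.get("demo_11"):                     # fill-null-or-empty demo_11 N/A
            row["demo_11"] = "N/A"
        row["demo_17"] = row["demo_17"].upper()        # uppercase demo_17
        row["demo_18"] = mask_number(row["demo_18"], "xxx###")  # mask-number
        # filter-row-if-true demo_9 =~ "CA"
        # Assumption: the row is removed when the whole value matches.
        if re.fullmatch("CA", row["demo_9"]):
            continue
        rows.append(dict(row))
    return rows
```

Calling `transform(SAMPLE)` returns only the `NY` row, with the empty `demo_11` filled, `demo_17` uppercased, and the first three digits of `demo_18` masked.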
Experiments
Two experiments were run: the first with about 13 million records, and the second with about 80 million records.
Experiment #1
Number of records: 13,499,973
Number of bytes: 4,499,534,313 (~4.5 GB)
Number of columns: 18
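As a quick sanity check on the scale of this data set, the figures above can be turned into an average record and field size. The sketch below uses only the three numbers listed for Experiment #1. The variable names are illustrative, and the per-field figure is a rough average that counts delimiters and line endings as part of the data.

```python
# Figures taken from Experiment #1 above.
records = 13_499_973
total_bytes = 4_499_534_313
columns = 18

# Average size of one record and of one field, in bytes.
bytes_per_record = total_bytes / records
bytes_per_field = bytes_per_record / columns

# Total size in decimal gigabytes (10^9 bytes) and binary gibibytes (2^30 bytes).
size_gb = total_bytes / 10**9
size_gib = total_bytes / 2**30

print(f"{bytes_per_record:.1f} bytes per record")
print(f"{bytes_per_field:.1f} bytes per field")
print(f"{size_gb:.1f} GB, {size_gib:.2f} GiB")
```

This works out to roughly 333 bytes per record, or about 18.5 bytes per field, so the records are fairly narrow.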