PRs: Use "MapReduce Program" instead of "MapReduce Job"
Description
Release Notes
Activity
John Jackson March 6, 2015 at 2:16 AM
Here are all the PRs:
CDAP Apps: https://github.com/caskdata/cdap-apps/pull/69
cdap-bi-guide: https://github.com/cdap-guides/cdap-bi-guide/pull/25
cdap-flow-guide: https://github.com/cdap-guides/cdap-flow-guide/pull/10
cdap-flume-guide: https://github.com/cdap-guides/cdap-flume-guide/pull/16
cdap-kafka-ingest-guide: https://github.com/cdap-guides/cdap-kafka-ingest-guide/pull/14
cdap-mapreduce-guide: https://github.com/cdap-guides/cdap-mapreduce-guide/pull/15
cdap-spark-guide: https://github.com/cdap-guides/cdap-spark-guide/pull/39
cdap-timeseries-guide: https://github.com/cdap-guides/cdap-timeseries-guide/pull/11
cdap-twitter-ingest-guide: https://github.com/cdap-guides/cdap-twitter-ingest-guide/pull/10
cdap-packs: https://github.com/caskdata/cdap-packs/pull/33
cdap-clients: https://github.com/caskdata/cdap-clients/pull/71
cdap-ingest: https://github.com/caskdata/cdap-ingest/pull/104
John Jackson March 5, 2015 at 5:28 AM
Other Repos:
cdap-apps (https://github.com/caskdata/cdap-apps):
Used in WISE app (https://github.com/caskdata/cdap-apps/pull/68)cdap guides: https://github.com/cdap-guides
cdap-bi-guide: https://github.com/cdap-guides/cdap-bi-guide/pull/25 (changes format from markdown to rst)
cdap flow-guide(s): PRs have been created for these.
John Jackson January 21, 2015 at 5:53 PM
Even though PR-971 has been merged, the other repos need doing.
John Jackson January 15, 2015 at 7:51 PM
Need to also do the map reduce guide, the source of ~/Source/cdap/cdap-docs/examples-manual/build/_includes/cdap-mapreduce-guide/README_SOURCE.rst
John Jackson January 15, 2015 at 5:42 PM
It would include Javadocs, but that could be just done over time. Or in 2.8.0.
In the docs (and elsewhere) we have for a long time used the phrase "MapReduce Job". It was promoted by JJ because he saw it in the Hadoop book.
However, it really isn't a good term, as a "job" is really a instance or run of a Program. The Spark technology is a lot better at being clear on this point.
Andreas and I are in agreement that, for the most part, "MapReduce Job" can be replaced with either "MapReduce Program" or simply "MapReduce". As we are using similar language for a "Spark Program", this will bring MapReduce into alignment.
"MapReduce job" would be used to refer to a specific run of a MapReduce program.