0
I am taking a course offered by www.statistics.com
—Advanced Analytics and Machine Learning with Hadoop—and I reading through course material. I stumbled upon the following statement (emphasis mine):
…because MapReduce is designed to run in a batch-oriented, streaming-off-disk fashion, this presents a number of limitations when performing data analysis that entails iterative processing (including many machine-learning algorithms) or interactive data mining that requires responsive feedback.
I Googled the term “streaming off disk” and wasn’t satisfied with the first page results.
Can someone explain what it means for a programming model (e.g., MapReduce) to be “designed to run in a … streaming-off-disk fashion“ and why this fact is noteworthy (given the context above)?