13

I am still new to Hadoop, and this time I was trying to process a 106 GB file. I used -copyFromLocal to copy that big file to my Hadoop DFS, but since the file is big I had to wait a long time with no clue about the copy's current progress.

Is there any way to show the current file copying status with this command?

Thank you guys in advance for your help!

Bang Dao
  • 233
  • 2
  • 6

4 Answers

15

copyFromLocal does not have the ability to display copy progress. As a workaround, you could open another shell and run $ watch hadoop fs -ls <filenameyouarecopying>. This will display the file and its size once every 2.0 seconds.
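As a sketch of that approach (the destination path here is a hypothetical example; adjust it to wherever you are copying):

```shell
# In a second shell, re-run the HDFS listing every 2 seconds (watch's
# default interval) and highlight what changed between refreshes; the
# size column grows as the copy proceeds.
# /user/hadoop/bigfile.dat is a hypothetical destination path.
watch -d hadoop fs -ls /user/hadoop/bigfile.dat
```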

Deer Hunter
  • 1,070
  • 7
  • 17
  • 25
datarockz2
  • 176
  • 1
  • 3
4

It is also possible to track the progress of reading the local file by using the pv command and piping the file contents to hdfs dfs on stdin:

pv mylargefile.txt | hdfs dfs -put - /path/to/file/on/hdfs/mylargefile.txt

1

It doesn't look like there's a verbose option for any of the copy commands (copyFromLocal, copyToLocal, get, put). Your best bet is probably to look at the size of the file at its destination on HDFS in order to gauge its progress.
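A minimal sketch of that size-based estimate, assuming you have already read off the local size and the size reported so far on HDFS (e.g. from hadoop fs -du); the byte counts below are made-up example values:

```shell
# Progress is just the ratio of the destination size to the source size.
local_bytes=113816633344    # made-up size of the ~106 GB local file
hdfs_bytes=28454158336      # made-up size reported so far on HDFS
echo "$((100 * hdfs_bytes / local_bytes))% copied"   # prints: 25% copied
```

Integer arithmetic is enough here, since a whole-percent figure is all you need to gauge how far along the copy is.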

Travis Campbell
  • 1,456
  • 7
  • 15
1

You can use nohup with & to run the copy as a background process. nohup keeps the process running even after you log out of the server. Whenever you need to, you can check progress with hadoop fs -ls.
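A sketch of that background approach (the file and destination names are hypothetical examples):

```shell
# Start the copy in the background, immune to hangup when you log out;
# stdout/stderr go to copy.log so any errors are preserved.
nohup hadoop fs -copyFromLocal mylargefile.txt /path/on/hdfs/ > copy.log 2>&1 &

# Later (even from a new login session), check the destination size:
hadoop fs -ls /path/on/hdfs/mylargefile.txt
```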

Anan
  • 11
  • 1