Copy entire file system hierarchy from one drive to another

100

71

I would like to copy the entire file system hierarchy from one drive to another..i.e contents of each directory as well as regular files in Linux platform. Would be gratefull to know the best way to do that with possibly Linuxes in-built functions. The file system is a ext family.

Juggler

Posted 2011-07-07T03:30:57.140

Reputation: 1 101

1Umm... where is the love for dd? dd if=/dev/sda1 of=/dev/sdb1 bs=4096 – juniorRubyist – 2017-03-25T05:08:53.123

@juniorRubyist +1 for dd. I always use that. But which flags to use? I use conv=notrunc,noerror,sync. – BeniBela – 2017-08-30T12:10:47.180

2-1 for dd for 2 reasons: firstly it's a bad idea to perform a block-level copy of a filesystem that is mounted (which is the case for /) and secondly dd won't copy data from sources mounted within the filesystem like /boot and /home. – Eric – 2019-08-16T10:37:47.863

2-1 for dd also because whatever fragmentation has occurred on the source is copied as well; and a different sized destination isn't automatically handled – duanev – 2019-10-30T17:53:24.507

Answers

35

I often use

> cp -ax / /mnt

Presuming /mnt is the new disk mounted on /mnt and there are no other mounts on /.

the -x keeps it on the one filesystem.

This of course needs to be done as root or using sudo.

This link has some alternatives, including the one above

http://linuxdocs.org/HOWTOs/mini/Hard-Disk-Upgrade/copy.html

WolfmanJM

Posted 2011-07-07T03:30:57.140

Reputation: 844

while this is dead old answer it is still worth noting that you usually do NOT want to copy all the stuff present in /, excluding i.e. /dev, /sys, /proc etc. Therefore before issuing that cp I suggest searching for better approaches (also using rsync) – Marcin Orlowski – 2019-06-13T08:17:24.237

2@MarcinOrlowski WolfJM's use of the -x flag means that the synthetic filesystems you mention will not be copied. – Jim L. – 2019-07-31T21:37:24.447

208

What you want is rsync.

This command can be used to synchronize a folder, and also resume copying when it's aborted half way. The command to copy one disk is:

rsync -avxHAX --progress / /new-disk/

The options are:

-a  : all files, with permissions, etc..
-v  : verbose, mention files
-x  : stay on one file system
-H  : preserve hard links (not included with -a)
-A  : preserve ACLs/permissions (not included with -a)
-X  : preserve extended attributes (not included with -a)

To improve the copy speed, add -W (--whole-file), to avoid calculating deltas/diffs of the files. This is the default when both the source and destination are specified as local paths, since the real benefit of rsync's delta-transfer algorithm is reducing network usage.

Also consider adding --numeric-ids to avoid mapping uid/gid values by user/group name.

Michael Aaron Safyan

Posted 2011-07-07T03:30:57.140

Reputation: 2 645

33--info=progress2 instead of --progress is useful for large transfers, as it gives overall progress, instead of (millions of lines for) individual files. – Florian – 2015-09-27T08:43:38.210

3I had to replace X and A with E, because extended attributes and ACLs are covered by E on my mac. Tested: rsync version 2.6.9 protocol version 29 – Jonathan Komar – 2016-08-04T05:17:47.510

let me also link to this answer: http://superuser.com/a/709224/265964

– Frederick Nord – 2016-12-07T13:42:07.050

1In addition to > ~/rsync.out, 2> ~/rsync.err will save any errors in a separate file. – Aneel – 2017-03-05T00:45:26.237

This doesn't handle sparse files properly. – Ilia Sidorenko – 2017-03-05T06:52:09.610

1-S, --sparse handle sparse files efficiently – Ken Sharp – 2017-03-11T12:55:36.443

3If you're copying from some other folder than /, note that having a trailing slash (or not) on the source directory makes a difference: rsync source/ dest/ copies everything inside source/ to dest/, while rsync source dest/ copies the folder source and everything inside into dest/. – semi-extrinsic – 2017-03-20T08:22:03.130

EC2: copied from running EBS to mounted, after that attached mounted as a sda1 - and cannot connect through ssh :( – Vitaly Zdanevich – 2018-02-27T23:55:00.153

Can you be more specific on what rsync does? – Hello71 – 2011-07-07T14:42:26.093

rsync copies the contents of one path to another, and has lots of options that can control its behavior (like compression, excluding certain file patterns, optionally deleting the originals after transfer, optionally deleting the target files before transfer, etc.). – Michael Aaron Safyan – 2011-07-08T03:16:22.777

1Genius, thank you. Btw, I ended up using rsync -avxHAWX --numeric-ids --progress / mnt/ but I should have done rsync -avxHAWX --numeric-ids --progress / mnt/ > ~/rsync.out. I suspect pouring the output to the terminal slowed the process. :D – Chris K – 2014-01-15T10:15:33.157

56

Michael Aaron Safyan's answer doesn't account for sparse files. -S option fixes that.

Also this variant doesn't spam with the each file progressing and doesn't do delta syncing which kills performance in non-network cases.

Perfect for copying filesystem from one local drive to another local drive.

rsync -axHAWXS --numeric-ids --info=progress2

Ilia Sidorenko

Posted 2011-07-07T03:30:57.140

Reputation: 661

1Amazing. This is really doing a good job – Gildas – 2018-07-26T08:13:32.127

1This should be the accepted answer, works great. Example 55,431,669,792 57% 97.47MB/s 0:06:56 xfr#2888, ir-chk=5593/8534) – Drew – 2018-08-04T03:11:13.073

1<3 this is perfect – Tim Strijdhorst – 2019-01-21T15:10:26.193

6

For a one shot local copy from one drive to another, I guess cp suffices as described by Wolfmann here above.

For bigger works like local or remote backups for instance, the best is rsync.

Of course, rsync is significantly more complex to use.

Why rsync :

  • this allows you to copy (synchronized copy) all or part of your drive A to drive B, with many options, like excluding some directories from the copy (for instance excluding /proc).

  • Another big advantage is that this native tool monitors the file transfer: eg for massive transfers, if the connection is interrupted, it will continue from the breakpoint.

  • And last but not least, rsync uses ssh connection, so this allow you to achive remote synchronized secured "copies". Have a look to the man page as well as here for some examples.

hornetbzz

Posted 2011-07-07T03:30:57.140

Reputation: 201

5

Like Michael Safyan suggests above, I've used rsync for this purpose. I suggest using some additional options to exclude directories that you probably don't want to copy.

This version is fairly specific to Gnome- and Debian/Ubuntu-based systems, since it includes subdirectories of users' home directories which are specific to Gnome, as well as the APT package cache.

The last line will exclude any directory named cache/Cache/.cache, which may be too aggressive for some uses:

rsync -WavxHAX --delete-excluded --progress \
  /mnt/from/ /mnt/to/
  --exclude='/home/*/.gvfs' \
  --exclude='/home/*/.local/share/Trash' \
  --exclude='/var/run/*' \
  --exclude='/var/lock/*' \
  --exclude='/lib/modules/*/volatile/.mounted' \
  --exclude='/var/cache/apt/archives/*' \
  --exclude='/home/*/.mozilla/firefox/*/Cache' \
  --exclude='/home/*/.cache/chromium'
  --exclude='home/*/.thumbnails' \
  --exclude=.cache --exclude Cache --exclude cache

Dan

Posted 2011-07-07T03:30:57.140

Reputation: 240

2

As mentioned in the comments by juniorRubyist, the preferred approach here should be to use dd. The main reason is performance, it's a block-by-block copy instead of file-by-file.

Cloning a partition

# dd if=/dev/sda1 of=/dev/sdb1 bs=64K conv=noerror,sync status=progress

Cloning an entire disk

# dd if=/dev/sdX of=/dev/sdY bs=64K conv=noerror,sync status=progress

References

  1. https://wiki.archlinux.org/index.php/disk_cloning

Rikard Söderström

Posted 2011-07-07T03:30:57.140

Reputation: 181

dd is a very bad idea for 2 reasons: firstly performing a block-level copy of a filesystem that is mounted (which is the case for /) will more than likely result in target filesystem errors, and secondly dd won't copy data from sources mounted within the filesystem like /boot and /home. Your link is valid for disk cloning, not "file hierarchy" cloning – Eric – 2019-08-16T10:42:09.957

2

Adding two useful bits to the thread re rsync: changing cypher, and using --update:

As per Wolfman's post, cp -ax is elegant, and cool for local stuff.

However, rsync is awesome also. Further to Michael's answer re -W, changing the cypher can also speed things up (read up on any security implications though).

rsync --progress --rsh="ssh -c blowfish" / /mnt/dest -auvx

There is some discussion (and benchmarks) around the place about a slow CPU being the actual bottleneck, but it does seem to help me when machine is loaded up doing other concurrent things.

One of the other big reasons for using rsync in a large, recursive copy like this is because of the -u switch (or --update). If there is a problem during the copy, you can fix it up, and rsync will pick up where it left off (I don't think scp has this). Doing it locally, cp also has a -u switch.

(I'm not certain what the implications of --update and --whole-file together are, but they always seem to work sensibly for me in this type of task)

I realise this isn't a thread about rsync's features, but some of the most common I use for this are:

  • --delete-after etc (as Michael mentioned in follow-up), if you want to sync the new system back to the original place or something like that. And,
  • --exclude - for skipping directories/files, for instances like copying/creating a new system to a new place whilst skipping user home directories etc (either you are mounting homes from somewhere else, or creating new users etc).

Incidentally, if I ever have to use windows, I use rsync from cygwin to do large recursive copies, because of explorer's slightly brain-dead wanting to start from the beginning (although I find Finder is OS X even worse)

herdingofthecats

Posted 2011-07-07T03:30:57.140

Reputation: 333

2

rsync

"This approach is considered to be better than disk cloning with dd since it allows for a different size, partition table and filesystem to be used, and better than copying with cp -a as well, because it allows greater control over file permissions, attributes, Access Control Lists (ACLs) and extended attributes."

From:

https://wiki.archlinux.org/index.php/Full_system_backup_with_rsync

Man Page Here

Mauro Colella

Posted 2011-07-07T03:30:57.140

Reputation: 121

1

'dd' is awesome, but ddrescue (apt install gddrescue) is even better. If dd gets interrupted, there is no way to restart (another good reason to use rsync). When you use ddrescue with a logfile, it keeps track of which blocks have been copied.

When backing up a dual boot Windows/Linux system, I use ntfsclone for the Windows partitions and ddrescue for the Linux partition and dd for the MBR. (I haven't tried to back up a dual boot system using GPT/UEFI.)

What I'd love to see is a ddrescue tool that can create files like ntfsclone where unallocated space is marked with control characters. This makes the image not directly mountable, but allows it to be only as big as the contained data.

Someone please come up with the ntfsclone "special image format" for ddrescue...

ECJB

Posted 2011-07-07T03:30:57.140

Reputation: 21

1

I tried the rsync commands proposed here but eventually I got much cleaner and faster results with partclone. Unmount source and target partitions and then run the following:

partclone.ext4 -b -s /dev/sd(source) -o /dev/sd(target)
e2fsck -f /dev/sd(target)
resize2fs /dev/sd(target)

This performs the following steps:

  1. Clone (only the used parts of) the partition
  2. make sure the file system is o.k. (resize2fs enforces this step)
  3. resize the partition to the new file system

The above works in case the target partition is the same size or larger than the source. If your target is smaller than the source (but fits all data) then do the following:

e2fsck -f /dev/sd(target)
resize2fs -M /dev/sd(target)
partclone.ext4 -b -s /dev/sd(source) -o /dev/sd(target)
resize2fs /dev/sd(target)

resize2fs -M shrinks the filesystem to the minimum size before cloning the data.

Note that partclone is not installed by default on most systems. Use a live distro like clonezilla or install partclone from your distros packet manager (apt-get install partclone on debian-based sytsems).

Thawn

Posted 2011-07-07T03:30:57.140

Reputation: 246

0

rsync is the perfect solution as explained above.

I'd just add -S to "handle sparse files efficiently" in case there is a docker devicemapper volume or similar to be copied.

Jürgen Weigert

Posted 2011-07-07T03:30:57.140

Reputation: 101