Bcachefs

Bcachefs is a next-generation CoW filesystem that aims to provide features from Btrfs and ZFS with a cleaner codebase, more stability, greater speed and a GPL-compatible license.

It is built upon Bcache and is mainly developed by Kent Overstreet.

Installation

Bcachefs is not in the upstream Kernel yet but the linux-bcachefs-gitAUR kernel can be installed from the AUR.

The Bcachefs userspace tools are available from bcachefs-tools-gitAUR.

Setup

Single drive

# bcachefs format /dev/sda
# mount -t bcachefs /dev/sda /mnt

Multiple drives in RAID0/1

Bcachefs defines a replica as any instance of data, so 1 replica with 2 drives is equivalent to RAID0, 2 replicas with 2 drives is equivalent to RAID1, etc.

# bcachefs format /dev/sda /dev/sdb --replicas=n
# mount -t bcachefs /dev/sda1:/dev/sdb1 /mnt

RAID0/1 with SSD caching

Bcachefs has 3 categories of storage: background, foreground, and promote. Writes to the filesystem prioritize the foreground drives, which are then moved to the background over time. Reads are cached on the promote drives.

A recommended configuration is to use an ssd group for the foreground and promote, and an hdd group for the background, as in the following example.

Note: These are not separated "tiers" of storage. They are just guidelines for a single large pool. Writes will go directly to the background if the foreground is full, or to promote if they both are. Metadata can be written to any of them. In this configuration, metadata_replicas should be at least 2, so that a cache drive may be able to fail without causing data loss.
# bcachefs format \
    --label=ssd.ssd1 /dev/sda --label=ssd.ssd2 /dev/sdb \
    --label=hdd.hdd1 /dev/sdc --label=hdd.hdd2 /dev/sdd --label=hdd.hdd3 /dev/sde --label=hdd.hdd4 /dev/sdf \
    --data_replicas=1 --metadata_replicas=2 \
    --foreground_target=ssd \
    --background_target=hdd \
    --promote_target=ssd
# mount -t bcachefs /dev/sda:/dev/sdb:/dev/sdc:/dev/sdd/dev/sde:/dev/sdf /mnt

Configuration

Most options can be set at either during bcachefs format, at mount time (mount -o option=value), or through sysfs (echo X > /sys/fs/bcachefs/UUID/options/option). Setting the option during format or changing it through sysfs saves it in the filesystem's superblock, making it the default for those drives. Mount options override those defaults.

Note: The filesystem must be mounted for sysfs to be available. All operations except fsck are possible on a live filesystem.
  • data_checksum, metadata_checksum (none, crc32c, crc64)
  • (foreground) compression, background_compression (none, lz4, gzip, zstd)
  • foreground_target, background_target, promote_target

The following can also be set on a per directory or per file basis with bcachefs setattr file --option=value

  • data_replicas
  • data_checksum
  • compression, background_compression
  • foreground_target, background_target, promote_target
Note: Disk usage reporting currently shows uncompressed size. Compression is otherwise complete.

Changing a device's group

# echo group > /sys/fs/bcachefs/filesystem_uuid/dev-X/label

Adding a device

# bcachefs device add --group=group /mnt /dev/device

If this is the first drive in a group, you will need to change the target settings to make use of it. This example is for adding a cache drive.

# echo new_group > /sys/fs/bcachefs/filesystem_uuid/options/promote_target
# echo new_group > /sys/fs/bcachefs/filesystem_uuid/options/foreground_target
# echo old_group > /sys/fs/bcachefs/filesystem_uuid/options/background_target

Removing a device

First make sure there are at least 2 metadata replicas (Evacuate does not appear to work for metadata). If your data and metadata are already replicated, you may skip this step.

# echo 2 > /sys/fs/bcachefs/UUID/options/metadata_replicas
# bcachefs data rereplicate /mnt
# bcachefs device set-state device readonly
# bcachefs device evacuate device

To remove the device:

# bcachefs device remove device
# bcachefs data rereplicate /mnt

Tips and tricks

Documentation

Up-to-date documentation is only available via . The man page, for instance, includes the now-useless --tier option.

Check dmesg for more useful error messages.

gollark: Fascinating.
gollark: I assumed Macron had a builtin to.
gollark: <@!319753218592866315> Do so.
gollark: Has anyone published a breakdown of what algorithms are being used yet?
gollark: Mine too!

See also

This article is issued from Archlinux. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.