Questions tagged [deduplication]

94 questions
3
votes
1 answer

Production-ready disk deduplication for KVM (Linux)?

Multiple Windows VMs, all of them with identical software installed (they're clones in fact) that I have to run for administrative+licensing reasons, give me a headache storage-wise. KVM on Linux runs great but ext4 does not provide any features re…
LetMeSOThat4U
  • 1,159
  • 2
  • 14
  • 29
3
votes
2 answers

Deduplication of tar archives

I'm trying to configure ZFS with deduplication on my Linux backup server and have a question. For example, I have 1k files in backup1.tar, and the same 1k files plus 1 new one in backup2.tar. Will these files be deduplicated or not? As far as I can see, it doesn't…
cpt.Buggy
  • 279
  • 2
  • 9
3
votes
2 answers

Accessing Windows Server deduplicated files from other operating systems

If you enable data deduplication on Windows Server 2012, it will change certain files (according to the policy) and introduce reparse points. If the drive containing deduplicated volumes is detached from Windows Server 2012 and attached to…
Iravanchi
  • 320
  • 3
  • 11
3
votes
2 answers

Mixing Volume Shadow Copy and Data Deduplication in Windows Server

I'm using Windows Server 2012 and I'm creating a scheduled backup for multiple Hyper-V guests (all of them are Windows Server 2012) to a dynamic VHDX on a removable USB drive (so that I can alternate two drives and have two sets of backups). I…
3
votes
1 answer

Extracting and saving attachments from maildir, deduplicated

I would like to know if there exists a solution for archiving email, stripping out the attachments and saving them separately, deduplicated? The ideal would be a maildir-setup, with a script running over the messages, extracting the attachments,…
grojo
  • 419
  • 1
  • 7
  • 18
3
votes
1 answer

Cheap/fast deduplication with hardlinks?

I've got shared hosting with a few thousand WordPress installs and I've wanted for ages to have a nice way of removing all the duplicate files in a sensible and secure way. I'm looking for better disk cache hit ratios and simpler backups. I'm just…
Nick
  • 287
  • 1
  • 10
3
votes
1 answer

Set up RAID 10 for backup store

We have an HP DL380p Gen8 with a Smart Array P420i and 10 × 6 TB HDDs in two RAID 10 arrays (6 and 4 HDDs), used as a backup store via Arcserve UDP, with deduplicated backup configured for 16 KB blocks. We are not satisfied with the recovery speed and want a better option. How do we…
mgkmgk
  • 31
  • 1
2
votes
0 answers

Windows 2016 deduplication and application compatibility

I'm re-evaluating the Windows 2016/2019 deduplication engine, which is way better (faster/more capable) than what shipped in previous Windows versions. I understand how it works (sparsifying files via holes + reparse point + compression), the…
shodanshok
  • 44,038
  • 6
  • 98
  • 162
2
votes
1 answer

Puppet code design: how to collect arguments into a string (avoiding possible duplicate declaration errors)

I'm having trouble with Puppet (version 5.5) code design. I've written a component module to deal with ucarp. It makes use of the module eyp-systemd to register a ucarp service within systemd. Now I would like to consume the ucarp module from two…
2
votes
1 answer

How to read fully deduplicated files over SMB on macOS?

I have a volume in Windows Server 2016 that is a target of the data deduplication role. It works very well (~60% savings rate), with many files appearing as zero bytes on the disk. The volume is shared over SMB and mounts fine on Mac, Windows and…
2
votes
0 answers

2012 R2 DPM: massive increase in data transferred during synchronisation of a deduplicated protected volume

I have been using 2012 R2 DPM to protect a 46 TB 2012 R2 deduplicated volume for the past 4 months without too much issue. The volume is a low access archive server. Current usage on the volume is 23 TB (if not deduplicated, this would be more like…
Steve
  • 133
  • 2
  • 6
2
votes
2 answers

Windows Server 2012R2 NTFS data deduplication and SVN clients

There seems to be an issue with using SVN on an NTFS deduped volume. To save space on my SSD for some huge development checkouts (> 5 GB each) I enabled data deduplication on the volume. This seems to cause SVN all kinds of grief and madness. The…
schlenk
  • 183
  • 5
2
votes
1 answer

ZFS for offline backups

I'm thinking of using ZFS to back up files to single drives that are going to be kept mostly offline. Does anyone know if the FreeBSD ZFS implementation still uses memory if a deduped pool is exported?
Cyclone
  • 557
  • 4
  • 15
2
votes
3 answers

Fastest way to copy large file directory over slow internet

I need to copy a directory from one of our servers several hundred miles away. Both sites, where I am and where the files are, have business cable. The cable is 5 Mbps (down) / 768 Kbps (up). This poses a small problem considering I need to copy about…
ianc1215
  • 1,965
  • 7
  • 34
  • 55
2
votes
1 answer

ZFS: interpreting the output of zdb -S tank

I wanted to know whether it would pay off for me to enable ZFS deduplication, so I ran the command zdb -S tank, but now I need some help interpreting the output. Simulated DDT histogram: bucket allocated …
White
  • 25
  • 2