8

In the Backup and Recovery book, they write it is good practice to make a full backup every month, and then increments or differential back up each week.

What if I have 800GB data and ~10GB changes per week.

Should I still make a full backup each month?

I mean, on LTO tapes they guarantee data integrability for 30 years.

So why make full back ups each time?

Sandra
  • 9,973
  • 37
  • 104
  • 160

4 Answers4

11

That is generic guidance. Specific guidance is much better.

The big questions you need to have an answer to before you start setting up your backup retention schedule is:

How much data am I willing to lose, and how long am I willing to take to recover what I can?

Tape backup is near the bottom of the backup/disaster-recovery hierarchy. Very roughly, that is (and I'm sure I'll forget a few steps):

  1. RAID (data-loss prevention)
  2. Traditional data backup
  3. Multi-site data backup
  4. Data replication
  5. Cold fail-over services
  6. Hot fail-over services
  7. Load-balanced replicated services
  8. Multi-site replication
  9. Multi-site cold failover services
  10. Multi-site hot failover services
  11. Multi-site load-balanced replicated services

We're talking about steps 2 and 3 here. How fast you want your data back depends on several factors:

  • How much of it you have
  • How many backup sets you have to go through to get it all back
  • What those backup sets are stored on
  • How fast the hardware supporting all of this (both servers, network, and backup hardware) can run
  • Whether or not the backup system can do a 'differential' backup, or is it just Full/Incremental

In case you hadn't run into the term before a Differential backup is defined as "everything that has changed since the last full backup". I think the term originated with BackupExec and has since been adopted elsewhere. But I digress.

In the book's backup scheme, one full a month, net-change daily the rest of it, the worst-case disaster recovery scenario is a data-loss event the day before the full backup is taken. Recovering in that case will require:

  • The last Full backup, 29 days ago
  • Every single tape since then, all 28 of them.

Depending on the aforementioned variables, this could take a really long time to recover.

Take an alternate scenario, Full on Friday, net-change the other 6 days. The worst-case recovery here is a loss event Friday afternoon. Recovering in that case will need:

  • Last Friday's tape
  • The other 6 tapes

This should take a lot less time.

One thing that hasn't been covered is what happens when a backup tape is bad. With the 30-day between fulls scenario, a bad tape can cost you anywhere from 1 to 59 days of data-loss. If that's unacceptable, run your full backups more often.

One thing some backup to disk vendors are selling these days is something called a synthetic full backup. How it works is that you take an initial full backup and then do net-change forever more. On a set schedule you do a synthetic full backup which coalesces a week/two-weeks/months worth of net-change with the last full backup to come up with a virtual full backup. This is handy for staying within backup windows.

When doing a hybrid disk/tape system, you do your weekly/monthly backups to disk, and then spool archive sets off to tape to sit on a shelf for 3/5/7/10 years. When used in combination with something that can do a synthetic full, a synthetic full can be spun to tape and sent off-site on a regular schedule. Hybrid systems offer the most flexibility these days, and I recommend them whenever possible. Disk for short-term, tape for long term.

sysadmin1138
  • 131,083
  • 18
  • 173
  • 296
5

(what mailq said) plus: Doing incremental forever is not common practice with tapes, as you can lose the tape with full backup on it and render your entire backup useless.

The shift right now is to do full backup + incermental forever against disk backup with deduplication.. this can basically run forever, and you normally run RAID6 in the bottom wich can tolerate 2 disks failing. That, plus weekly/monthly/quartely/yearly tape backups stored in some vault far, far below ground.

pauska
  • 19,532
  • 4
  • 55
  • 75
5

I mean, on LTO tapes they guarantee data integrability for 30 years.

I strongly suspect there is no meaningful "guarantee." If you need to restore from tape and the tape turns out to be bad and your company loses $10M during the extra downtime or goes out of business completely, what is the tape provider going to do? Nothing.

Monthly fulls are valuable even if the data doesn't change.

  1. All your data is read so you verify that it is still readable.
  2. Since tape drives do a read after write you have some indication that the backup is readable.
  3. Your backup and restore process is tested. (You do test restores, right?)
Mark Wagner
  • 17,764
  • 2
  • 30
  • 47
4

It's just a matter of disaster recovery time.

When you can afford to recover from the tape back in January and then replay all the incremental back ups from then to now there is no problem in doing just a yearly full-back-up. But what happens if the January tape gets destroyed? Do you then have the January tape from one year before to do the replay?

The recommendations are not because of integrity but of having enough possibilities to recover from the worst case in a matter of time you can live with.

mailq
  • 16,882
  • 2
  • 36
  • 66