BACKUP AND DATA RECOVERY

DATA RECOVERY

Backup refer to making copies of data so that the copies may be used to restore the original data once there is the data loss event. I’m majorly going to focus on telling you what backups are and what types of backups we usually have in market. Talking about tools there are number of backup tools in the market and every tool has some or the other kind of functionality that every company brings in with them for us to choose depending upon the kind of architecture we will be having in our environment. The most commonly used tools are Symantec NetBackup. The companies keep on changing but NetBackup is one big and vast tool which is mostly used. Then we have some of the EMC products also that can be used for backup and data recovery which include networker FMR and so on. Coming back to what backup is and what backup technologies are. It’s like backup refers to making copies of data. So, you know not backing up is lot like wearing a seat belt. You can go months without backups and it won’t matter. But the moment there is a disaster, then you will have nothing to recover your data from so backups are useful primarily for restoring. You know small number of files or some important files that you accidentally probably delete before the data gets onto the backup Drive. It must be extracted from the system. So, there are various techniques for doing that and they include by dealing with open files like data sources including compression, encryption and duplication. 

Now let’s discuss about the types of backup majorly that are there are full backups, differential backups, incremental backups and synthetic backups. Now let’s talk in basic detailing about each of these. But so primarily, you know this is the major chunk how backup and data recovery can be divided and how your architectures can be decided based on your requirements for data. Starting with the full backup, it is the easiest one and as the name suggests a full backup makes a copy of all data from the selected partition to the destination storage each full backup will consist of a previous full backup and new data that appeared so far. Full backup is a basis for each other. The pros of full backup are – it is easy to setup. You simply choose the entire thing and ask the tool that you are using to back it up. It is reliable so you know you are just mirroring everything to another side and once your current data loses you can just go and ask that entire data to come. Now let’s discuss about its cons. Since, it is full backup, every time you run it, it is going to take everything from that particular source file or source server so it takes long time and of course because it’s taking the entire stuff from a particular source server or source file then it is going to consume a lot of storage space. You know these days taken on a lot of things. Earlier it was just tapes that were used for backups, but with the growing technologies we came up with the idea of virtual tape libraries disk-based backups where it has appliances, EMC data domain. You can also dump your backup on EMC storage tools or any other storage tools. So, lot-and-lot of things have changed in backups over last few years.

Backups can be integrated with your Oracle databases or your sequel databases. You can also take backup for flat file systems, a separate snapshot backup for VMs. So, backup has evolved to a large extent in the last few years. Now coming to the next kind of backup that is differential backup, a differential backup happens after a full or another differential backup, so every differential backup will run and generate a restore point and content consists of changes made since last full backup. For example, you’ve had a full backup today that is Sunday and your incremental backup runs on Monday and Tuesday and it is just saving the changes from Sunday so your Wednesday backup will be a differential backup which is also known as cumulative backup, which is not going to backup changes from Tuesday but it is going to backup all the changes in the backup from the source server from the last full backup which was on Sunday. So, a differential backup essentially also known as a cumulative backup will backup all the changes that have happened on the source side from last full backup to this differential backup or from the last differential backup only to this differential backup.

The pros are that obviously it is less space consuming because it is only storing the changes from last full or differential backup to this current differential backup. Its reliable in comparison to incremental backup because it consists of all changes from last particular full backup. But the cons are that you know storage consumption is not very effective in the long run because every database roll subsequent changes after the initial one and that can cause duplicated data. But we have some duplication stuff. But differential backup does not serve that purpose because we always ask it to backup everything that has changed from last full backup and data recovery is a painful process. If you know you get a full system restore because you have to restore the full backup that ever happened before this and then restore this. One restoring only differential backup is not going to bring your entire system back in case of all data loss.

Difference between incremental and differential is that incremental backup only stores the changes since last any backup, it can be your last backup which could be your incremental backup or your cumulative or differential backup or even your full backup. So, this one is only going to have the changes done. The pros are – that it uses very less space on storage in comparison to the cumulative and the full backups. It is faster than full backup because obviously it is storing less data. The cons are – for example say that you have planned your backup schedules and because you know this is less space consuming and all and I am going to just backup only different incremental stuff and you don’t even setup a full backup so tomorrow if you lose one of your incremental backup and data recovery files or data then your entire incremental backup images are of no use. Because they were always dependent on the changes on the previous one so let’s say that I backed up ABC on Wednesday C changed a bit on Thursday I stored that and I added D on Friday but I lost that C changes on Thursday’s data. So, when I restored it later, I will never have what C changed to. So essentially when you have rink it is always suggested to have one full backup scheduled. It needs to get backup once a week so that even if you lose something on the incremental front, you atleast have a full backup image to take back you to the main data because at the end of the day the only purpose of backup is to serve you when your data loop is lost. 

Coming to synthetic full backup now essentially this was a breakthrough in backup technology because synthetic full backup works just like incremental. It backs only the change file from the last hold. But it merges itself to get the full backup and the changes since the full backup. So this kind of backup like while the active full backup brings 100% of the data from source i.e.; a synthetic full backup can change this by creating an entire point reusing the existing data in Full backup and just storing the changes so what this is going to do is this is going to backup and data recovery the change data and make a pointer to the previous full backup. So, when you restore only from the synthetic full backup you will be able to restore the entire system. The pros are that it’s faster. You can use it in low bandwidth networks and it uses less bandwidth obviously. But the cons are that it makes a track file on your source server. So, it is going to basically use your source server space which other types of backups do not solve. So, now you may get an idea about what the full backup looked like how incremental just took up the changed blocks and how synthetic basically just picked the changed blocks and made a pointer to the full backup soon.

Leave a Reply

Your email address will not be published. Required fields are marked *