Implement enterprise back-up solutions using Linux and open source
Linux and Open Source have allowed companies to save big on traditional back up systems.
Bacula is an Enterprise grade solution operational in large data centers around the world.
A new paradigm in archiving technology has emerged from large web companies that have relied for years on low cost disk-equipped servers for all their data storage requirements. The HDFS (Hadoop Distributed File System) is a new paradigm allowing data to be distributed in a fault-tolerant manner across servers. This approach scales out while providing no disruption to the computing infrastructures as the file system installs on top of the native Linux file system. A mature set-up for the Hadoop Distributed File Systems would comprises several very large clusters of inexpensive servers, taking full advantage of data replication features across servers, racks or servers and even across datacenters. Installed as a software layer providing redundancy while increasing scale-out storage capacity is an extremely low cost and all purpose storage usage for inexpensive disks.
Back-up management & recovery operation
Designing an effective organization’s archiving & back-up program requires a business impact assessment in order to identify the information systems that are most critical to business mission, and the cost that downtime will inflict to the organization. This comparative approach is a fundamental planning tool for apportioning resources and selecting technology appropriate to the organization downtime tolerance. Information systems whose loss will have a higher financial impact will benefit from a more significant technology investment in back up resources. Whereas systems with lower criticality will have a lesser portion of back up technological investment. Accordingly the policies and controls for back up should adapt to these realities
Bacula is a data center capable back up solution that is fully open source. It is very flexible and inexpensive when deployed on off-the-shelf Linux powered hardware. Granularity allows it to define pools of storage volumes to write data to as any combination of storage media, optical, disks, tapes. Deployed across the network, it integrates well with Mac and Windows clients.
Bacula can implement any backup strategies flexibly over specified pools of storage volumes.
Explosion in magnetic disk storage capacity and significant drop in price makes such open source/COTS-based back up solution very compelling.
Low cost archiving with fault tolerance with HDFS.
HDFS highlights data corruption self-healing features for inexpensive disks to archive and preserve your data.
Data replication: HDFS constantly perform error detection of processed data looking for corrupted blocks. The mechanism uses check sums to verify data integrity. Logs of check sums are produced and stored. The operation is conducted by the data node prior to storing the data and by clients during read operation. The file system is built to leverage on regular disks and incorporates additional features to detect corrupted data block due to degraded disk.In the event that a block is found to be corrupted HDFS will direct read operation to a previously stored replica before creating a copy of the replica on an other node.