Storage Expo Catalogue 08

ADVERTORIAL
land in full on disk before initiating the deduplication process.

7. How does deduplication improve off-site replication and DR?
The effect deduplication has on replication and disaster recovery windows can be profound. To start, deduplication means far less data has to be transmitted to keep the DR site up to date, so much less expensive WAN links can be used. Second, replication goes a lot faster because there is less data to send. The length of the deduplication process, from beginning to end, depends on many variables, including the deduplication approach, the speed of the architecture and the DR process. For the most efficient time-to-DR, inline deduplication combined with inline replication of the deduplicated data yields the most aggressive and efficient results. In an inline deduplication approach, replication happens during the backup, which significantly shortens the time until a complete restore point exists at the DR site, that is, the time to DR readiness.

8. How will data deduplication affect my backup and restore performance?
Restore access time will be faster than tape, since disk is online and random access. Throughput will vary by vendor. Data deduplication is a resource-intensive process. Before it writes data to disk, the system needs to find out whether some new small sequence of data has been stored before, often across hundreds of prior terabytes of data. A simple index of this data is too big to fit in RAM unless it is a very small deployment, so the system has to seek on disk, and disk seeks are notoriously slow (and not getting better).
The easiest ways to make data deduplication go fast are (1) to be worse at data reduction, e.g. look only for big sequences, so you don’t have to perform disk seeks as frequently; and (2) to add more hardware, e.g. so there are more disks across which to spread the load. Both have the unfortunate side effect of raising system price, so disk becomes less attractive against tape from a cost perspective. Vendors vary in their approaches.

9. How much “upfront” capacity does deduplication require?
This is not a question for inline deduplication systems, but it is for post-process systems. Post-process methods require additional capacity to temporarily store duplicate backup data before the deduplication process begins.

10. What are best practices in choosing a deduplication solution?
• Ensure ease of integration with the existing environment.
• Get customer references - in your industry.
• Pilot the product/technology - in your environment.
• Understand the vendor's roadmap.

See Data Domain on stand 280
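
The lookup described under question 8 (checking whether a small sequence of data has been stored before) can be illustrated with a minimal Python sketch. It is not any vendor's implementation: fixed-size chunking, SHA-256 fingerprints, the tiny CHUNK_SIZE and the function names dedup_store and restore are all assumptions chosen for readability. The in-memory dictionary stands in for the index that, at hundreds of terabytes, no longer fits in RAM and forces disk seeks.

```python
import hashlib

CHUNK_SIZE = 8  # bytes; hypothetical and deliberately tiny so the example is readable


def dedup_store(stream: bytes, index: dict, store: list) -> list:
    """Split a backup stream into fixed-size chunks and store only unseen ones.

    Returns a "recipe": the store positions needed to rebuild the stream.
    """
    recipe = []
    for offset in range(0, len(stream), CHUNK_SIZE):
        chunk = stream[offset:offset + CHUNK_SIZE]
        fingerprint = hashlib.sha256(chunk).digest()
        # This lookup is the step that turns into a disk seek once the
        # index is too large to hold in RAM.
        position = index.get(fingerprint)
        if position is None:
            position = len(store)
            store.append(chunk)            # new data: write it once
            index[fingerprint] = position  # remember where it lives
        recipe.append(position)
    return recipe


def restore(recipe: list, store: list) -> bytes:
    """Rebuild a backup from its recipe by random access into the store."""
    return b"".join(store[position] for position in recipe)


index, store = {}, []
monday = b"AAAAAAAABBBBBBBBCCCCCCCC"
tuesday = b"AAAAAAAABBBBBBBBDDDDDDDD"  # only the last chunk changed

monday_recipe = dedup_store(monday, index, store)
tuesday_recipe = dedup_store(tuesday, index, store)

print(len(store))                                  # 4 chunks stored, not 6
print(restore(tuesday_recipe, store) == tuesday)   # True
```

In this toy run the two backups share two of their three chunks, so four chunks are stored instead of six. Raising CHUNK_SIZE would mean fewer index lookups but also fewer matches, which is the speed-versus-reduction trade-off that question 8 describes.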