Archive for September, 2009
No SAN is an island…
by Jesse on Sep.16, 2009, under Best Practices, Cisco, Fibrechannel, Switches
Ok, that was too cutsey for such a classy establishment.
When you’re building a SAN, everything should play together, in the same SAN box if you will (with my apologies to QLogic.)
When you start putting in multiple stand-alone SAN islands you increase your maintenance overhead exponentially. You also prevent the very thing that make a SAN a huge advantage over DAS.
Everything can see everything.
If you have a host and need to throw a certain type of storage at it, you can do that easily. (and if it’s already cabled you can do it from your living-room)
However, if SAN A/B are connected to one group of hosts/storage, and SAN C/D are connected to a second group of storage, and SAN E (no redundancy) is connected to even a third, you run into a problem.
*NOW* if I want Host_A (Connected to SAN A/B) to see Storage_C (Connected to SAN C/D) I have to do much more than a simple zoning change.
In the end, this is where a few well placed ISL connections can come in handy. VSAN them off so they don’t cause the fabrics to merge, create an IVR zone to route across them, and then presto. Host_A can see Storage_C with a minimum of fuss.
Or maybe a *REAL* core-edge topology even. Where you put a core switch with 24-port (fully subscribed) blades ISL’d to an edge switch, which maybe has the 4/4/40 configuration (4 8-Gig ports, 4 dedicated 4Gig ports, and 40 shared 4Gig ports)
And put one person in charge of it. Preferably someone with a touch of OCD.
Backup Vs. Archive
by Jesse on Sep.15, 2009, under "Cloud", Archive, Backup, Best Practices, Centerra, Deduplciation, Gripe
The fundamental difference between BACKUP and ARCHIVE.
A backup is there to help you deal with a crisis such as “My datacenter is a smoking hole in the ground now what do I do?” or something not quite as dramatic like “A virus ate my data.” You recover from the backup to the last known good and all is happy, right? Well except for the two or three days that might have gone since your last good backup… (Was in one lawfirm that lost a drive only to find out their backups hadn’t been running for two months.. came back two weeks later to find a COMPLETE change in personnel had gone on while I was gone – lawyers are not very forgiving when they lose two months worth of email.)
An archive is data that, while not “Active” still might be required on a day-to-day basis. Film / Video / Image archives are a good candidate for and example of that.
So on a disk-based archive you have some platform, ostensibly EMC/Legato DiskExtender or Rainfinity or something along those lines – that will move the data from “Active” storage to “Archive” storage. In some applications you can even set up a true HSM, moving data that hasn’t been accessed to Tier-2(Enterprise SATA) and even Tier-3(yes, tape) as it ages, only to be recalled to Tier-1 when it’s accessed.
More often than not I’m brought face to face with people who don’t understand that very subtle difference. One of my recent customers is actually doing it appropriately, using DX and a smallish Centerra to archive data that, while retention is required, is almost never actually accessed.
Then there are the people who use backup technology for archival purposes.
I’m pretty “old school” when it comes down to it.
Tape is for backup. Tape is *NOT* supposed to be used as nearline storage when there are equally inexpensive (and more reliable) disk methods out there.
My main complaint about tape as archive: You don’t know if it’s bad until you try to read it. And time you read it the simple act of moving the tape into a tape drive that was manufactured under less than ideal conditions means you are putting your data at risk.
Spending millions of dollars on a new Room-Sized tape library doesn’t make sense when Centerra storage is fairly inexpensive *AND* provides redundancy of the data automatically.
Spending more millions of dollars on three of them is lunacy when one EMC Atmos set up could provide redundancy and a single namespace for recall. (and if you go whole hog, geographically relevant retrieval is an option to, so you automatically get it from the closest copy.)
It pains me to see it done wrong. Especially when it involves trying to shoe-horn two more STK monsters into an already cramped datacenter when the work of it could be done in a couple of floor-tiles of spinning disks.
I’ve got the power….
by Jesse on Sep.10, 2009, under General
But apparently the brain-trust that runs the facilities for my customer doesn’t.
If you’ve ever seen a datacenter go completely black, it’s a scary thought.
If you’ve ever seen someone actually manage to take out the power to *JUST* the servers (leaving the lights and AC functional)
Well that’s just a work of art…
That we’re *STILL* recovering from.