Category Archives: Hardware

expensive equipment, a hammer, backups, and disaster recovery; A good mix

I found out yesterday that apparently using a hammer and a phillips head screw driver to drive a SCSI cable through a maybe 1/8 inch opening between my desk and the cube wall it is screwed into is a bad idea.

I spent a couple hours between yesterday afternoon, later on that night, and some time this morning trying to figure out why my linux box refused to acknowledge the existence of the Sun StorEdge L8 LTO tape autoloader I hooked up to it. I didn't think the screwdriver actually went into the cable at all. It just looked like it busted into the magnet that surrounds the cable near the end. That thing really needed to be driven through the desk. On the good side, it gave Bill and I a good excuse to use a hammer and a bunch of prying tools to "install" a tape autoloader.
I have been trying to implement a fairly reliable backup system for a few small file servers we have at the office. The previous group of people that managed the backups for these systems had a disaster recovery plan that involved having a rotation of backups that traveled through 3 separate physical locations. It seemed like a bit overkill, but then again, it is better to be safe. The funny thing is that the backups were all on a bunch of 4mm 20 gig (uncompressed) tapes. The 3 servers that were being backed up totaled somewhere around 500 gigs…maybe a bit less. The best part was that between the 3 servers there where only 2 tape drives. 2 very slow tape drives. Plus, the majority of the data that was being backed up was uncompressable. movies, audio, and pictures mostly. So this involved a lot of tapes. It took a good 3 hours for 1 tape to get filled. They would get no notification it was ready for the next tape, so every couple hours they would go and log into the machine, or just check if the tape drive ejected a tape, then switch it, and rinse and repeat for the 2 day (or more) long backup. Luckily incremental backups weren't as bad, but most of the time I don't think they could even happen given how long a full backup would take. If you forget to change the tape for a while, you just might have wasted a whole days worth of time that the backup could have been chugging along. The tapes would get put into a plastic tape case that looked like it was supposed to be rushed to the hospital for a life saving organ transplant. Then that would get carted off to the first off site location in the big 3 location backup plan.

Then the group that had been handling these backups..plus a bunch of other tasks got moved to another location because of "streamlining" how their group worked. Which is when My co-worker and I got stuck with all the fun. Neither one of us had the time to keep checking to see when the next tape needed to be changed, so a full backup would take maybe 2 weeks to finish.

Anyway, that is a bunch of back story that doesn't really matter. I really wanted to just complain about Backup Exec, and some oddness associated with the Arkeia trial installation I have been working on. The whole old backup system for these 3 machines used Backup Exec. I really really really don't like Backup Exec. The UI was poorly designed, the server has to run on a windows machine, and Veritas/Symantec decided to screw over their customer base and not offer any encryption option unless you upgraded to their $20,000 Enterprise "we screwed you" 2.0 package (i made that price up). I didn't realize that until I was going to upgrade the 3 client installs, and the 1 Backup Exec server to their most recent version. But, I did get a chance to try out the Sun StorEdge L8 autoloader we have had laying around for who knows how long. The L8 uses 200gig LTO tapes (400 compressed), and when I tried the first backup on the trial of the new Backup Exec, The entire backup of the 3 systems took around 4 hours to finish, and everything fit on a tape and a half. On the bad side, the L8 only holds 8 tapes, one of which is a cleaning tape, so really 7. That isn't a safe number for a full mostly automated backup strategy, but it is still much better than the previous setup.

After I found out about the lack of encryption support, that got weighed in with the crappy UI, and the need for a windows 2003 server, we decided to try something else, and since my co-worker loved Arkeia so much, I figured I would give that a try.

For a test install, I hooked the Storedge autoloader up to a Sun V120 running Solaris 10, and got a bunch of trial licenses for Arkeia. The installation was completely painless, everything was pretty straight forward. The only part that took any time was getting the v120 to recognize the autoloader, but that can't be blamed on the software. It was more my lack of knowledge.

Arkeia has a really well thought out X interface that everything can be setup from, and you can install the server on a variety of platforms. Solaris, Linux, FreeBSD..etc. Most installs involve just typing rpm -i, or dpkg -i or ./install, depending on the packaging system on the server. I was pretty surprised on how well thought out everything was.

After I got everything going, i tried the first backup. I left encryption off, and figured I would try the best (compression wise) compression method, which was LZ3. The backup gets started, and I looked at the fun little speedometer the X interface displays during an interactive backup. You can see a bunch of differant metrics, like MB/h, MB/min, MB/s, KB/s for both the network and the backup speed. This is when things started to go downhill. The max backup speed I was getting was 5 gigs an hour. Then I thought maybe the compression was too much for a v120. The load on the machine was a little over 1, but still, something didn't seem right. I tried the backup again with no compression this time, and left work for the weekend (this was on Friday). Some time Saturday I log in to see how things are going. and in 33 hours it has backed up a whopping 144 gigs. This was never going to finish. I tried a bunch of differant things, then on Monday, we tried doing an scp of a large file from the v120 to various other machines. I was getting the same crappy throughput. The port on the switch was set to auto negotiated, so I tried forcing it to 100/full duplex, but no difference. It must be a misconfiguration of some kind either on the switch or with the interface on the server, but it was happening on a couple of the other servers on that same bank of switches, so I figured I would just try a more localized test install on my Sun Ultra 20, which is running OpenSuse 10.0/64bit. Arkeia had an rpm for Suse enterprise 64 bit, and that installed without a problem.

I really didn't want to shove the autoloader under my desk, and I found a SCSI cable that was long enough to let me put the autoloader on the corner of my cube against a wall. The only problem was that the hole in the desk for cables to pass through can't fit the whole SCSI cable end. Which left me with 2 options. Leave it under the desk, or figure out a way to get the cable up behind the desk. Which is where the hammer and a bunch of large screw drivers came in. My co-worker pried from the top, and I was prying with another screwdriver from the bottom while trying to push the cable through the little opening. I was thinking how funny it would be if we ripped the desk out of the cube wall by accident and the whole thing crashed on top of me (including my co-worker) but the cable got through. Except for that damn metal cylinder at the end of the cable. This was going to take some finesse. After trying everything. I decided to use a philips head as a wedge, and just smacked it as hard as I could until the stupid metal/plastic/rubber thing went up through the crack….with the screwdriver inside. The cable looked fine, but apparently it wasn't.

This morning, after trying everything I could think of to get my system to recognize the new scsi device, I figured I would try another cable, which all I could find was a little 3 foot long cable. So under the desk the autoloader went. It is actually just balancing on top of a little terastation NAS device. If I touch it with my foot by accident, I am sure it will flip on its side, but that is part of the fun.

So, I plug in the autoloader, reload the scsi card module, and low and behold, there it is in all its glory. So I set Arkeia up real quick and get a backup going. No compression or encryption which is the same as the last backup I did on the Solaris install. The backup speed now is averaging 30-40 gigs an hour.

I have no idea what was up with the v120, but if you saw our network closet, our network…actually, any of our stuff, you would run in horror. So now I can add that to my never decreasing list of tasks.

"figure out why throughput on half the equipment sucks"

The funny part I guess is that my Ultra 20 is my main workstation. I wrote this post on it, in KDE, with a bunch of other stuff running all during the backup.

Reason #306 why I hate Windows and Microsoft

Late last week, one of the computers in the design department died, and it turned out to be because the hard drive went.

This particular person had a bunch of documents and such saved in her My Documents folder that would be sorely missed. A couple months back, I came accross a application suite that is supposed to be able to recover data from “fucked” drives. (my words not theirs). Even to the point were the hard drive is making that deathly clicking sound. I have used it with much success both of the times I have needed it before this.

The downside is that the application ONLY works in windows. The suite lets you read data from a variety of file systems, NTFS, FAT(32), Mac crap, UFS, ext2..etc. Basically it can read anything that has Windows, Linux, Mac crap, or Solaris on it. Why the manufacturer chose windows as the platform of choice? no idea, but hey, it works so get over it right?

So I put the IDE drive in a USB case and plug it into the USB port on the computer I had to install windows on, and let it run. The program takes a while to weed through the disk looking for stuff, and since by the time I had found a computer to use, installed windows plus all 567 patches from windows update, it was already friday afternoon, so the program was still analyzing the disk when I left work at 5:30.

I come in today after the long weekend, and the computer is sitting at the login screen. After it starts up, I see a pretty litty information box informing me that Windows has to install an important update, and automatically rebooted the computer for me.

Oh thank you! I definately couldn’t have done that myself. It’s not like I updated the computer with every patch available before I even started analyzing the drive on friday. So, now I get to start from square one again.

It takes a good 3-4 hours for this program to analyze the drive.

Microsoft is a bunch of dirty dirty whores.

And yes, I know that turning off automatic updates is an option, but that is not my point. My point is that if an application is open, maybe it shouldn’t automatically reboot the damn machine.

Also, the application suite is called Stellar Phoenix. I highly recommend it. It hasn’t failed me once, and one time I had a 200 gig drive filled with everything I could think of, and I got almost all of it back after the drive died. It’s a bit pricey though

A massive mesh network

Since I obviously have a major say in the business decisions of companies like Clear Channel. I decided that since all their billboards are usually around normal building height, and almost all of them have power to light up the stupid signs at night, they should install WiFi routers, or any preferrably a better high speed wireless technology in all the signs. They could make a massive mesh network out of property they already own or lease, They could probably throw a couple cell phone repeaters in there as well, but for all I know they might be too low.

From the 30 seconds of research I did, Clear Channel alone ownes over 775,000 billboards nationwide. If I spent more time writing this I could probably mention how many other companies owne, but I figured this was enough information for someone to steal my idea.

In the end my only goal is for comcast to actually have some competition in the broadband internet access market so the market will force them to lower theirridiculously outrageus prices on their over-exaggerated bandwidth claims

A quick tour of the Sun Ultra 20

The other day I got my new Sun Ultra 20 workstation for my desk at work. Sun did a pretty good job putting it together, and if I wasn’t poor, I would probably buy one for my house. However, since I have no money, i’ll just pretend I own things through what my work provides me. It came with a gig of memory, onboard RAID, sound, ethernet, SATA, and an AMD Opteron 148 CPU. The hard drive is something like 74 gigs, and this model also included the optional nVidia 3D digital dual head video card.

Ultra 20 front

The case is all metal, and has 2 firewire, 2 USB ports, a mic jack, and a heaphone jack. I noticed already that the headphone jack on the front of the computer (there is also the normal one on the back of the computer) has a ton of interference. You can hear hard drive noise, and even interference from the mouse every time you move the mouse or do anything.

Ultra 20 back

On the back of a the case, there is nothing at all special going on, its as normal looking as you would expect. The fans, are real quite and are some of those fancy smanshy smart fans that powers up and down depending on how busy or hot the computer is, so during normal operation, you can barely notice that it’s even on.

Ultra 20 back, PCI slots

The picture kind of sucks, but you can see that I got the optional 3D digital graphics card, which comes with an adapter that supports a dual head display. Also, the workstation has 3 PCI-X slots, and 3 normal PCI slots. It’s kind of dissapointing that there are no DB9 serial ports, which I often need, but I didn’t pay for it, so I guess I don’t really care. Also, I could always get a USB-DB9 serial port converter and be just fine.

Ultra 20 - HD

I was real happy with the fact that Sun used the same hard drive mounting rails they use on virtually all of their servers/workstations nowadays. The motherboard has built in support for hardware base RAID, and uses those new spiffy SATA drives.

Ultra 20 - SATA

This is so much cleaner looking than old IDE or SCSI cables.

Ultra 20 - CDROM

To further going along with the “no screws involved” plan, the CD/DVD-ROM uses some quick-release contraption for those days when you are on the move.

Ultra 20 - CPU

You can see there is a ton of empty space in the case. You can actually see right through from the front to the back of the case from the outside, and look at the fancy baffling setup that the CPU fan has.

Ultra 20 - BIOS Code

Look at the 2 character LCD display in the middle of the picture. Instead of using horrible speaker beeps or those dumb color coded LED’s to tell you about hardware problems, it just displays the error code for you to go cry about

That’s my little tour of the Ultra 20.