"The network is down"

December 12, 2000, 03:17 PM, ITworld.com

Somewhere, deep in the bowels of the data center, a mission-critical server goes
belly-up. This server is as bulletproof as they come. It has redundant hard drives;
should a drive fail, the others take over. It has multiple NICs, all dual-homed to
redundant switches. Those switches are, in turn, dual-homed to a redundant core
network.

One morning, the server disappears from the network. Users arriving at work attempt
to log in and can't. Mission-critical applications and databases are offline. Work
comes to a screeching halt. Soon word begins to circulate: "The network is down!"

This cannot be considered a network failure in any way, shape, or form -- the server
has abended and needs to be rebooted. However, to your users, anything attached to the
network is "the network." This extends to Internet sites and even to users' own
workstations.

The more you argue the network is perfectly fine, the more convinced users become
there is a network problem. ("Methinks thou dost protest too much!")

Stop arguing. There's a better way to correct this misconception, serve your users,
and keep your network's good PR in place.

  • Let your attitude be "I am guilty until proven innocent." (In
    other words, do exactly the opposite of what telephone carriers do when they
    have a network failure.) Assure your users that you know this is a serious situation
    that must be taken care of quickly. Assume responsibility even if you aren't convinced
    it is a network problem.
  • Communicate and work closely with the server administrators.
    This may not always be easy to do, but it pays off in times of crisis. A good working
    relationship allows you to assuage user complaints while solving their problems as
    quickly as possible.
  • Educate your users. You can't expect ordinary users to
    distinguish between a server abend and a router crash, so don't push this too far.
    However, you can help them understand that failing to reach a certain host or
    Web site does not necessarily mean the network is at fault.
  • Monitor your network closely so you know what actually went
    wrong. Most servers can run an SNMP agent that signals your management station
    if there is a problem. If you have hundreds of servers, you might not want to run
    an agent on every one, but you should include your mission-critical servers in
    your SNMP system.
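
To make the last point concrete, here is a minimal sketch of how a monitoring script might guess whether an outage lies with the server or with the network path. It is not SNMP: it swaps in plain TCP reachability checks, which are simpler to show. The function names, the idea of "checkpoints" (other hosts reached over the same path), and the result labels are all assumptions made for illustration.

```python
import socket
from typing import Callable, List, Tuple

Endpoint = Tuple[str, int]            # (host, port)
Probe = Callable[[str, int], bool]    # returns True if the endpoint answers


def tcp_probe(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:  # covers refused connections, timeouts, and DNS failures
        return False


def classify_outage(server: Endpoint,
                    checkpoints: List[Endpoint],
                    probe: Probe = tcp_probe) -> str:
    """Make a rough guess at where a fault lies.

    server      -- the mission-critical service being watched
    checkpoints -- hypothetical other endpoints reached over the same
                   network path (a neighbouring host, a switch's
                   management port, and so on)
    """
    if probe(*server):
        return "ok"
    if not checkpoints:
        return "unknown"          # nothing to compare against
    reachable = [cp for cp in checkpoints if probe(*cp)]
    if len(reachable) == len(checkpoints):
        return "server"           # path is fine, only the server is silent
    if not reachable:
        return "network"          # nothing on that path answers
    return "partial"              # mixed results, needs a human
```

A result of "server" is only a hint that the network path is healthy, not proof, so it is best used to decide whom to call first rather than to close the argument with users.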

If you follow these guidelines, you may still hear those feared words, "the network is
down," but you will hear them less often and with less venom.

» posted by abennett
