Five Lessons from a Data Center's Crisis of Capacity

By Robert Lemos, CIO

In 2005, problems in the data center at Pacific Northwest National Laboratory came to a head.

Unscheduled outages were occurring almost monthly, bringing down the data center for hours at a time. Groups were buying an increasing number of rack-mounted servers - which had recently become cheaper - to boost their computing resources, says Ralph Wescott, data center services manager for the government laboratory, which is managed by the U.S. Department of Energy. In July 2005, the server room reached its capacity limit.

"Groups would go buy a server and throw it over the wall to me, saying, 'Hey, install this,'" Wescott says. "But I didn't have any space, power or cooling (capacity) left. If I installed (one more), the whole room would go dark."

Wescott and PNNL embarked on a broad project to revamp their data center without breaking the budget. Every quarter for three years, the data center group spent a weekend shutting down the server room and replacing a row of old servers and tangled network cables under the floor with more efficient, yet more powerful servers connected by fewer cables running in the ceiling. The new configuration allowed for more efficient cooling under the floor.

The result? PNNL moved from 500 applications on 500 servers to 800 applications running on 150 servers.
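As a rough illustration of what those figures imply, the arithmetic below works out the average number of applications per server before and after the project. It uses only the numbers quoted above; the function name is made up for this sketch, and the result is an overall average across all servers, not a measure of how any single machine is loaded.

```python
def apps_per_server(applications: int, servers: int) -> float:
    """Average number of applications hosted per physical server."""
    if servers <= 0:
        raise ValueError("servers must be positive")
    return applications / servers


# Figures quoted in the article: 500 apps on 500 servers, then 800 apps on 150.
before = apps_per_server(500, 500)
after = apps_per_server(800, 150)
server_reduction = 1 - 150 / 500

print(f"before: {before:.1f} applications per server")
print(f"after: {after:.1f} applications per server")
print(f"physical servers cut by {server_reduction:.0%}")
```

In other words, the room ended up hosting more applications on less than a third of the hardware.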

In a tight economy, tackling such information-technology projects requires a tight grip on the purse strings, says Joseph Pucciarelli, the program director of technology, financial and executive strategies for analyst firm IDC, a sister company to CIO.com.

"The situation is a very common one," he says. "Companies are making just-in-time investments. They have a problem, and they are looking at the problem in a constrained way."

Here are some lessons PNNL learned in bringing their data center back from the brink.

1. Plan, don't react

The first problem Wescott needed to solve was the data center group's habit of reacting to each small problem as it arose, rather than seeing the systemic issues and planning for a sustainable service. In addition to the 500 servers, the data center had some 33,000 cables connecting those servers to power, networking and security systems.

"We decided what the data center should look like and what its capacity should be," he says.

The group concluded that the current trajectory would result in 3,000 applications, each running on its own server, in 10 years. Now, the data center has 81 percent of applications virtualized - an average of 17 per server - and Wescott plans to reach the 90 percent mark.
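The projection can be turned into an implied growth rate. The sketch below assumes smooth compound growth from the roughly 500 applications of 2005 to the projected 3,000 ten years later. The article does not say how the group modeled the trend, so the compounding assumption and the function name are illustrative only.

```python
def implied_annual_growth(start: float, end: float, years: int) -> float:
    """Compound annual growth rate that takes `start` to `end` in `years`."""
    if start <= 0 or end <= 0 or years <= 0:
        raise ValueError("start, end and years must all be positive")
    return (end / start) ** (1 / years) - 1


# 500 applications in 2005, projected 3,000 in 10 years (figures from the article).
rate = implied_annual_growth(500, 3000, 10)
print(f"implied growth: {rate:.1%} per year")

# Application count every second year under that assumed constant rate.
for year in range(0, 11, 2):
    print(year, round(500 * (1 + rate) ** year))
```

Under that assumption the application count grows by a little under 20 percent a year, which is the kind of trend that a one-server-per-application model cannot absorb in a room already at capacity.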

Companies should focus on three areas to increase capacity, says IDC's Pucciarelli. Reducing the number of physical servers and running applications on virtual systems helps reduce power requirements, as do more efficient cooling systems and improvements in electrical distribution.

"That's typically the one-two-three that you go to when updating the data center," he says.

Pucciarelli has encountered many companies that have replaced up to 50 servers with just two or three larger capacity systems and used virtualization to run their applications.

2. Measure to manage

Data center managers need ways to monitor the state of the data center, but all too frequently they don't have the right tools, PNNL's Wescott says. Prior to the changes, Pacific Northwest National Laboratory had no way to measure the efficiency of its data center. Power problems were discovered when the room went dark, or through a more seat-of-your-pants method.

"If there was too much amperage through our power supplies, the way I found out was to put my hand on the circuit breaker and if it was warm, then I knew we had a problem," he says. "That's proof that you need tools."

Now, PNNL has sensors in place on every fourth cabinet at the low, medium and high points to create a 3-D heat map of the server room. The data allowed Wescott to change the way he cools the data center, increasing overall temperatures and applying cooling where he needed it.
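A minimal sketch of how such readings might be organized follows. It is not PNNL's system: the data layout, the function names, the sample temperatures and the 27-degree threshold are all invented for illustration. It only shows the idea of indexing each sensor by row, cabinet and height so that hot spots can be located.

```python
from collections import defaultdict

HEIGHTS = ("low", "mid", "high")

# Invented sample readings in degrees Celsius: (row, cabinet, height) -> temp.
# Cabinets 0, 4 and 8 stand in for "every fourth cabinet".
readings = {
    (0, 0, "low"): 19.0, (0, 0, "mid"): 22.5, (0, 0, "high"): 26.0,
    (0, 4, "low"): 20.0, (0, 4, "mid"): 24.0, (0, 4, "high"): 31.5,
    (0, 8, "low"): 18.5, (0, 8, "mid"): 21.0, (0, 8, "high"): 24.5,
}


def build_heat_map(readings):
    """Group readings into heat_map[row][cabinet] = [low, mid, high]."""
    heat_map = defaultdict(dict)
    for (row, cabinet, height), temp in readings.items():
        column = heat_map[row].setdefault(cabinet, [None] * len(HEIGHTS))
        column[HEIGHTS.index(height)] = temp
    return heat_map


def hot_spots(readings, threshold):
    """Return (position, temp) pairs above the threshold, hottest first."""
    hot = [(pos, temp) for pos, temp in readings.items() if temp > threshold]
    return sorted(hot, key=lambda item: item[1], reverse=True)


heat_map = build_heat_map(readings)
print(heat_map[0][4])                        # vertical profile of one cabinet
print(hot_spots(readings, threshold=27.0))   # threshold is an arbitrary example
```

With the sample data, only the top sensor on cabinet 4 exceeds the threshold, which is the sort of localized signal that lets cooling be directed where it is needed instead of chilling the whole room.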

"I think that is going to save me a lot of money, and wear and tear, on my air conditioners," he says, adding that current estimates are that the data center will be 40 percent more efficient with cooling.

3. Take small steps

Radically reconfiguring the data center without disrupting operations is a major problem, says Wescott. He advocated taking small steps to minimize outages, but left the decision to his managers, he says.
