How the New York Times covered the election with Amazon

The New York Times cobbled together a Ruby application to run on Amazon to report on live election results

By Joab Jackson, IDG News Service |  Software, Amazon EC2 Add a new comment

Anticipating an immense number of visitors seeking live data, The New York Times turned to Amazon, as well as to a set of open source software, to deliver nearly instantaneous results of the U.S. elections held last November.

With its setup, the Times was able to handle bursts of traffic up to "thousands of requests per second" with a total cost of only "a few hundred dollars," explained Times developer Ben Koski in a blog post Monday afternoon.

The Times maintains an interactive news department to serve website readers with graphs, charts and other enhanced content. Over time, the department has built up a cloud software stack to serve these projects, Koski said. Typically, the department's projects are written using the Ruby on Rails framework, with the data stored on a MySQL database running on Amazon's RDB (Relational Database Service). The Web apps themselves are run on Amazon's EC2 (Elastic Compute Cloud).

Election night generates more traffic than the usual news story, however, so the team had to rethink its approach slightly.

"Publishing live election results requires a carefully tuned system: the setup must be able to withstand some of the most intense traffic levels seen all year ... but at the same time, it needs to get information to our readers quickly," Koski said.

For the coverage, the site had posted 184 separate pages covering election results, each of which needed to be updated every few minutes from an Associated Press news feed.

To ensure uptime, the team decided to forgo its usage of Varnish, an open source Web accelerator application that holds dynamically-created pages in memory so they don't have to be recreated for new users. While Koski noted the software had posed no problems in the past, the team "decided it was too risky to lean entirely on an ephemeral cache for an evening where seconds of downtime matter," Koski explained.

Instead, the team resorted to a tried-and-true method of server Web pages quickly -- serving them as flat files. The team set up a pool of four application servers behind a bank of Amazon servers running Apache that assembled pages from the latest data and periodically uploaded them.

Joab Jackson covers enterprise software and general technology breaking news for The IDG News Service. Follow Joab on Twitter at @Joab_Jackson. Joab's e-mail address is Joab_Jackson@idg.com

ITworld LIVE

SoftwareWhite Papers & Webcasts

White Paper

Activities Streams Base An Integrated Social Layer

The enterprise social software market is exploding thanks to converging trends of consumerization, cloud, and mobile. In this must-read report, "The Forrester Wave: Activities Streams, Q2 2012", Forrester Research Inc. evaluated five social software vendors with core strengths in the stream based on the overall strength of vendors' current offerings, a clear product strategy, and vendor market presence. In a detailed look at the space, Forrester named Yammer as a leader.

White Paper

ESG Lab Review: HP 3PAR Peer Motion Software

This ESG Lab review sponsored by HP + Intel documents hands-on testing of HP 3PAR Peer Motion Software's distributed volume.Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.

White Paper

ESG Lab Review: HP 3PAR Peer Motion Software

This ESG Lab review documents hands-on testing of HP 3PAR Peer Motion Software's distributed volume management with a focus on federated workload balancing, asset management, and thin provisioning.Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.

White Paper

Deliver Cost-Effective Business Continuity with Extreme Capacity

IBM DB2 provides application cluster transparency technology that equips organizations running OLTP applications with the ability to deliver high availability and continuous uptime for transactional data, plus the flexibility and capacity they need to remain competitive.

White Paper

What Developers Want: The End of Application Redeploys

Eliminate application restarts in Java with JRebel! JRebel is a JVM plugin that eliminates application redeploys from the Java development cycle, a process that takes over 10 minutes of coding time away from developers each working hour, according to a recent survey. Just code, refresh and see everything instantly.

See more White Papers | Webcasts

Ask a question

Ask a Question