Apache moves on traffic server, machine learning projects

For the first time, the open source organization is announcing six top-level efforts

By Paul Krill, InfoWorld |  Software, Apache Add a new comment

The Apache Software Foundation, developer of open source software, on Tuesday is announcing the creation of six Top-Level Projects, including the Apache Traffic Server for caching and Apache Mahout, implementing machine-learning algorithms atop the Apache Hadoop distributed computing platform.

This is the first time Apache has announced six Top-Level Projects at the same time; Top-Level Project status signifies the highest level a project can reach at the organization.

[ See InfoWorld's report on the recent hack of an Apache project server. ]

Traffic Server is a former commercial project from Yahoo, submitted as an Apache incubator project last year. Suitable for providing edge services in cloud computing, it can serve static content, such as images and JavaScript. Able to process more than 75,000 requests per second, Traffic Server also can route requests for dynamic content to a Web server.

"Becoming a Top-Level Project is a vote of confidence from the foundation at-large, demonstrating a project has proven its ability to be properly self-governed," said ASF chairman Jim Jagielski in a statement released by the foundation.

Mahout, a former Apache sub-project, offers collaborative filtering, clustering, classification, and data mining algorithms.

Other former sub-projects moving to Top-Level status include:

  • Tika, which is an embeddable toolkit for content detection and analysis.
  • Nutch, a modular Web searching engine.
  • Avro, a fast data serialization system.
  • HBase, a distributed database modeled after Google's Bigtable distributed storage system.

HBase and Avro are former subprojects of Hadoop, while Mahoot, Nutch, and Tika formerly were sub-projects of the Lucene search engine effort.

Other Top-Level Projects formed at Apache this year include UIMA (Unstructured Information Management Architecture), providing a framework for analyzing unstructured information; Cassandra, a second-generation "NoSQL" distributed data store; and Click, a Java EE Web application framework.

Apache this year also has accepted the Subversion versioning control system as a Top-Level Project this year, along with Shindig, a container for hosting OpenSocial applications.

This article, "Apache moves on traffic server, machine learning projects," was originally published at InfoWorld.com. Follow the latest developments in business technology news and get a digest of the key stories each day in the InfoWorld Daily newsletter and on your mobile device at infoworldmobile.com.

Read more about data management in InfoWorld's Data Management Channel.


Originally published on InfoWorld |  Click here to read the original story.

ITworld LIVE

SoftwareWhite Papers & Webcasts

White Paper

Activities Streams Base An Integrated Social Layer

The enterprise social software market is exploding thanks to converging trends of consumerization, cloud, and mobile. In this must-read report, "The Forrester Wave: Activities Streams, Q2 2012", Forrester Research Inc. evaluated five social software vendors with core strengths in the stream based on the overall strength of vendors' current offerings, a clear product strategy, and vendor market presence. In a detailed look at the space, Forrester named Yammer as a leader.

White Paper

ESG Lab Review: HP 3PAR Peer Motion Software

This ESG Lab review sponsored by HP + Intel documents hands-on testing of HP 3PAR Peer Motion Software's distributed volume.Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.

White Paper

ESG Lab Review: HP 3PAR Peer Motion Software

This ESG Lab review documents hands-on testing of HP 3PAR Peer Motion Software's distributed volume management with a focus on federated workload balancing, asset management, and thin provisioning.Intel and the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries.

White Paper

Deliver Cost-Effective Business Continuity with Extreme Capacity

IBM DB2 provides application cluster transparency technology that equips organizations running OLTP applications with the ability to deliver high availability and continuous uptime for transactional data, plus the flexibility and capacity they need to remain competitive.

White Paper

What Developers Want: The End of Application Redeploys

Eliminate application restarts in Java with JRebel! JRebel is a JVM plugin that eliminates application redeploys from the Java development cycle, a process that takes over 10 minutes of coding time away from developers each working hour, according to a recent survey. Just code, refresh and see everything instantly.

See more White Papers | Webcasts

Ask a question

Ask a Question