With hundreds of projects and thousands of committers, the Apache Foundation has found stunning success without knuckling under to the software titans The Apache Software Foundation recently released its 28-page annual report for its 2015-2016 year, but here’s the TL;DR in one word: amazing. What started as a simple HTTP server supported by a handful of developers in 1995 has become an army of 3,425 ASF committers and 5,922 Apache code contributors building 291 top-level projects. Of course, during this same time, open source in general has grown exponentially. But the ASF has seen particularly impressive growth as it propels big data forward with dozens of popular projects, along with dev tools and more general fare. The reason, as board member Jim Jagielski explained in an interview, is the ASF’s emphasis on neutral, community-focused development. Not bad for an organization that costs less than $1 million to run each year — especially compared to other open source foundations that put the needs of corporate interests above those of the developer community. First, the data By any metric, the ASF’s growth in 2015 is impressive: 20 new Apache Top-Level Projects (TLPs) A record 55 podlings under development in the Apache Incubator, plus 39 initiatives in the Apache Labs 743 repositories managed 33 percent increase in signed Individual Contributor License Agreements (CLAs) 3,425 ASF Committers and 5,922 Apache code contributors (a 21 percent year-over-year increase) added nearly 20 million lines of code, with an average 18,000 code commits per month 315,533,038 lines of code changed (65 percent year-over-year increase) Apache services ran 24/7/365 at near 100 percent uptime on an annual budget of less than $5,000 per project Viewed in terms of code, it looks like this: This is even more impressive when you consider how little money the ASF needs to operate. In its most recent fiscal year, the ASF required $874,000 to run, with much of its budget paid for by sponsors. The ASF has seven Platinum sponsors (Cloudera, Facebook, Google, LeaseWeb, Microsoft, Pivotal, and Yahoo) and eight Gold sponsors (ARM, Bloomberg, Comcast, Hewlett-Packard, Hortonworks, IBM, ODPi, and PhoenixNAP), not to mention a host of smaller ones. But unlike other open source organizations, the strength of the ASF is its independence from corporate interests, as Jagielski told me in an interview. This independence has created a safe haven for a burgeoning open source developer population: The Switzerland of open source When I asked Jagielski to identify the primary reasons for the ASF’s growth, he made it clear that neutrality matters: The ASF’s growth emphasizes the need, and the acknowledgement of the need, for a neutral, community-focused environment where everyone can work on, and contribute to, a project, without a pay-to-play governance model. In an ecosystem where companies are jockeying for positions of control over open source projects, the ASF provides a safe space where it is the community itself which has, and always will have, control over its own destiny. Of course, some ASF projects are dominated by a single commercial entity. For example, though Cassandra was initially developed by Facebook, DataStax now accounts for nearly all of its ongoing development. When I asked Jagielski about this phenomenon, he stressed that “even though some projects have a large, singular corporate ‘association,’ the ASF in general and the PMC [project management committee] take extreme care in ensuring that the influence and control rests on the shoulders of the individuals within the community, and not on the wants or needs of any companies.” His comments are backed up by the ASF’s governing principles: A PMC’s actions within their Apache project community and management of their project must be in the interest of that consensus and consistent with the ASF’s mission of producing software for the public good. There are also certain expectations of diversity within a PMC; the board may apply extra scrutiny to PMCs with low diversity (that is, PMCs that are dominated by individuals with a common employer). Similarly, the ASF does not allow corporations to participate directly in Apache project management or other governance activities at the ASF, only individuals. This requirement that “Apache projects … govern themselves independently of undue commercial influence” may not always live up to its ideal, but it’s telling that the ideal exists. In my experience ASF projects tend to live up to it, particularly as the projects attain a higher profile. Hadoop, for example, has broad participation that prevents any particular company from controlling its destiny (despite a years-long PR war over who contributed most). This same ideal isn’t necessarily followed by all open source foundations, a fact called out by Simon Phipps. Far too many foundations are simply facades for corporations that want incoming code but don’t want to share that code freely. This isn’t surprising. After all, open source has become big business. As Jake Flomenberg, an Accel partner styles it, “There is a massive shift going on in the ways technology is bought. Open source has gone from the exception to the rule.” Fortunately, that “rule” keeps getting support from the ASF, an organization that shepherds more and more of the industry’s most critical projects. Related content news ActiveState's Python taps Intel MKL to speed data science and machine learning The MKL libraries for accelerating math operations debuted in Intel's own Python distribution, but now other Pythons are following suit By Serdar Yegulalp May 18, 2017 3 mins Data Science Machine Learning Open Source news CrateDB 2.0 Enterprise stresses security and monitoring—and open source The open source database for processing high-speed freeform data with SQL queries now has enterprise features, available as open source for faster developer uptake By Serdar Yegulalp May 16, 2017 3 mins NoSQL Databases Technology Industry Databases news analysis Waah! WannaCry shifts the blame game into high gear Every security crisis presents the opportunity to point fingers, but that's just wasted energy. The criminals are at fault—and we need to work together to stop them By Fahmida Rashid May 16, 2017 7 mins Small and Medium Business Technology Industry Malware news Faster machine learning is coming to the Linux kernel The addition of heterogenous memory management to the Linux kernel will unlock new ways to speed up GPUs, and potentially other kinds of machine learning hardware By Serdar Yegulalp May 15, 2017 3 mins Technology Industry Machine Learning Open Source Resources Videos