Contributors to Drupal 7.x - Code Freeze Looming Update

Greg's picture

The code freeze for Drupal 7.x is looming large on the horizon. From that point on we will be limited in what kinds of changes we can get into Drupal core. For some the code freeze is a time of relief: it means we are down to bug fixes and the final release should be coming soon. For others it is a hard time - bug fixing isn't always as fun as adding new features.

So, as we head into feature freeze it seemed like a good time to run some statistics on who has been contributing the most to Drupal 7.x so far.

Contributors to Drupal 7.x. Through August 10th

Following on from previous times that I've run these stats, I've published documentation of the process to get the data on groups.drupal.org. This time I went straight to the commit messages stored in database tables on drupal.org This has the benefit of counting new files as well as old files (the last times I did this it only counted changes to existing files).

So, who are the top 10 people based on the number of times their name is in a commit message?

Name Commit mentions
Damien Tournoud 192
catch 179
chx 123
pwolanin 113
Dave Reid 109
boombatower 95
yched 77
c960657 57
drewish 56
Berdir 56

The total number of mentions is 3133, so those top 10 are responsible for roughly 33% of the code. On the flip side, people with 3 or fewer mentions are responsible for roughly 15% of the code. We still have a long tail of 222 people who are mentioned in only one message. We see a fairly typical "long tail" distribution: the people who are most involved do a lot of the work, but the people who only get mentioned a few times each are still responsible for a large number of commits when aggregated together.

Commit mentions Count of people with that number
1 222
2 80
3 38
4 17
5 15
6 13
7 8
8 7
9 5
10 6
11 1
12 3
13 3
14 5
15 2
16 2
17 1
18 1
19 2
20 1
21 1
22 5
23 2
24 3
26 3
27 2
31 1
33 1
35 1
36 1
39 1
40 2
45 2
52 1
54 1
56 2
57 1
77 1
95 1
109 1
113 1
123 1
179 1
192 1

The current number is at 469 contributors to Drupal 7. That's good, but we'll need about 300 more to beat the number for Drupal 6.x One concern from adding simpletests to Drupal 7 was that it might lock people out of the process. I don't think that's the case, but it will be interesting to see how things go as the code freeze passes.

Mashup the data

I've posted the parsed data as a CSV file to groups.drupal.org and a Google Spreadsheet so that other people can download it and look for other bits of information.

Previous Reports:

In January 2009 I did a similar analysis: Drupal 7: Who is Providing Patches for the Next Release?.

Comments

Do you have documentation

Do you have documentation anywhere on your methodology for doing this analysis?

"[T]hose top 10 are

"[T]hose top 10 are responsible for roughly 33% of the code" -- technically that's 33% of the commits. Major contributions like DBTNG, which probably counted as one or two of Crell's commits, account for a huge percentage of the code. Just a technical nitpick. :) This is fun data to look at -- thanks for putting it together.

Absolutely - it's a very

Absolutely - it's a very rough analysis of the contribution but it is the best one we have.

It also doesn't count people who do doc reviews or provide the ideas that eventually become contributions. All of those things are valuable, but it's much harder to quantify them. The analysis is only as good as the data we have available ;)

Perhaps collating all

Perhaps collating all usernames from all issue ques that do include simply "subscribe" or short ones that include "duplicate" and do a simple weighting based on number of comments and attached files.

Just an idea...

For instance, I lurk and offer code reviews, but have never written a patch that has made it into Drupal. I spend 10-15 hours a week on these activities.

Josh

Interesting that it is also

Interesting that it is also spot-on the apocryphal "80/20 rule" of powerlaws. 20% of the contributors make 80% of the patches, and vice versa. In this case, I've got 20% of patches coming from 78.9% of contributors.

The 80/20 rule is a common occurrence in economics, particularly regarding wealth and poverty ratios (ie 20% of the people hold 80% of the wealth), but Clay Shirky likes to use it to talk about online collaboration.

(by the way, Damien Tournoud and his 192 patches are missing from the second table)

Hmm. Misuse of the word

Hmm. Misuse of the word "apocryphal." Not sure what I was thinking.

GVS projects

The Hyperlocal News installation profile is an "internal project" for some of the folks at GVS. Profiles are ways to bundle together Drupal, some contributed modules, and the configuration necessary to make the site actually do something cool. Users are presented with an wizard that sets up...

GVS is now part of Acquia.

Acquia logo

Contact Acquia if you are interested in a Drupal Support or help with any products GVS offered such as the Conference Organizing Distribution (COD).

We Wrote the Book On Drupal Security:

Cracking Drupal Book Cover