How many actual developers work on the Linux kernel?
It could be interesting to clone the git repository of linux and query it immediately.
Cloning the repo
Beware it's a large file! (~1.5G)
Install git
and run the following (in a new directory):
git clone http://github.com/torvalds/linux.git
Querying the repo
Once you've cloned it, you can analyze the log of commits with git log
.
Since the log is so long, you may want to limit your research to a smaller period of time:
git log <since>..<to>
for instance
git log v3.4..v3.5
This has theoretically a lot of info you could use. For example, that command prints the 20 most prolific committers along with their number of commits and their email address.
$ git log v3.4..v3.5 | grep Author | cut -d ":" -f 2 | sort | uniq -c | sort -nr | head -n 20
417 Linus Torvalds <[email protected]>
257 Greg Kroah-Hartman <[email protected]>
196 Mark Brown <[email protected]>
191 Axel Lin <[email protected]>
172 David S. Miller <[email protected]>
138 Daniel Vetter <[email protected]>
132 H Hartley Sweeten <[email protected]>
128 Al Viro <[email protected]>
117 Stephen Warren <[email protected]>
113 Tejun Heo <[email protected]>
111 Paul Mundt <[email protected]>
104 Johannes Berg <[email protected]>
103 Shawn Guo <[email protected]>
101 Arnd Bergmann <[email protected]>
100 Thomas Gleixner <[email protected]>
96 Eric Dumazet <[email protected]>
94 Hans Verkuil <[email protected]>
86 Chris Wilson <[email protected]>
85 Sam Ravnborg <[email protected]>
85 Dan Carpenter <[email protected]>
The email address can give you an idea about the employers of the developers (google.com, cisco.com, oracle.com).
Go to Kernel coverage at LWN.net and do a search for "Releases", and "Contributor statistics". Also do a search for "Who". There are various articles in that index with titles like (most recently) Who wrote 3.5.
While these articles may not directly answer your question, they are as detailed an answer as you are likely to find on the net, without trying to collect information first hand. In particular, they should provide at least a partial answer to 3.
The statistics gathering is done by gitdm (LWN article announcing it: gitdm v0.10 available). Thanks to vonbrand for pointing this out. The repository can currently (January 2015) be obtained with
git clone git://git.lwn.net/gitdm.git
As for 1 and 2, they are not so well defined. In the case of 1, I imagine the answer is almost certainly yes, some of the time. But it is not clear what you are looking for - anecdotal evidence, or some statistics. If statistics, in what form? In the case of 2, by "3rd party groups" it is not clear what you mean, and what kind of help you are referring to. Would people on an IRC channel count as a third party group, for example? Or are you talking about a more formal contractual relationship where money changes hands? Like an outside company retained for temporary consulting? In any case, such information would be hard to get without talking to kernel developers directly, and even then would likely be anecdotal. I suppose forums like the Linux kernel mailing list would be a possibility in that case.