How to remove gender bias from an academic job search?
Bias exists at many points of the hiring process. You suggested blinding the search committee to applicant gender but, as you point out, this is extremely difficult to do perfectly and completely broken by even small failures. For obvious reasons, blinding will also not be particularly relevant after you start interviewing candidates. I like the other suggestions to provide training to sensitize the committee to issues of gender bias.
Beyond that — and if your university policies allow it — you might also decide now (i.e., before the search) to interview at least one male and at least one female candidate. This way, you will give the best male and female candidates a full chance to convince you that they are right for your department. This ensures that at the top person of each gender makes it through the earlier stages of the process where gender bias may very well play its biggest role. At the interview stage, blinding would not have worked anyway.
This kind of policy is unusual but not unheard of. The most famous example I know if is the Rooney Rule in the US National Football League which requires that all teams interview minority candidates for head coaching and senior football operation jobs. Although this is sometimes cited as an example of affirmative action, it does not mandate any preference or quota to candidates within the pool of those being interviewed. If you're doing it right, it does mean that the very best candidates from under-represented groups will always have an opportunity to show their stuff at the final round.
If you found out that best person from the under-represented groups is really not as good as the best person from the over-represented group, at least you'll know that you gave the best member from each group a full hearing.
Update: I will point out that this answer basically assumes that all of your candidates will present as either male or female. As a result, is it very limited in the case of non-gender conforming candidates. These candidates may also be subject to even greater discrimination and this approach will not solve (and could even aggravate) those problems.
I think there are two approaches. The approach your question focuses on is blinding of the panel to the gender of the applicant. Doing this, may actually increase gender bias. By blinding the search panel to the gender of the applicant, it becomes very difficult for things like maternity leave to be taken into consideration. The better way to remove, or limit gender bias, is to provide training to the search panel about gender bias in academia and help them become aware of any biases they might have.
One simple and extremely effective step is to start tracking metrics on the candidate pool at every stage of the process. Let's say you're looking at how your department hires assistant professors. Then you might track:
- What percentage of Ph.D. holders in the field are women?
- What percentage of the applications you receive are from women?
- What percentage of the short-listed candidates are women?
- What percentage of the interviewed candidates are women?
- What percentage of the offers made are to women?
- What percentage of the accepted offers are taken by women?
- What percentage of the professors who advance toward tenure are women?
Now you've got actual data on what your pipeline looks like and can look for where the leaks are. If the fraction of females in the pool changes significantly at any particular stage, then that's where to focus your energy. Likewise, if the base fraction in the field is lower than you want in your institution, you can use your metrics to decide where to try to enrich the pool with good candidates. Obviously, the same approach can be applied for other disadvantaged groups as well.
I personally think this type of approach is a critical addition to the toolbox of addressing bias, because it lets you scientifically study your institution's process. You may discover things that surprise you. For example, the colleagues who I learned about this from discovered that the later stages of the hiring pipeline they were dealing with were actually OK, but that the percentage of women applying in the first place was much lower than the percentage of women in the field. That meant (to everybody's surprise) that the problem was primarily in the way that positions were being advertised and recruited for, rather than in the interviews themselves, and so that was the process that fixes were targeted at.