Are there instances where citing Wikipedia is allowed?
Are there instances where citing Wikipedia is allowed?
I'm not going to try to answer the question of "is allowed" or "isn't allowed" but instead I'm going to try and answer if citing Wikipedia is, in general, a good idea or not for research papers.
Here's five points I can think of for and five points against.
Good idea:
- If Wikipedia is your source, and you learn something from what Wikipedia's editors have provided, then citing Wikipedia is the Nice Thing To Do™.
- Wikipedia is a convenient, well-known, easily and freely accessible source (vs. final-copy published papers that are pay-per-view if not out of your wallet, then indirectly out of your university's).
- Content aims to be written in an accessible manner targeted at a wide audience (vs. arguably the bulk of technical articles published in journals and conferences). Plus there's clickable links for jargon! Sweet!
- Notable or Featured Articles will probably have undergone more scrutiny from experts (w.r.t. bias, technical correctness, sources cited, etc.) than your average peer-reviewed article on a similar topic.
- At least with Wikipedia, readers should know where they stand with such a citation. Compare this with the artifice of citing papers for claims made in poor-quality journals or conferences that can be considered authoritative solely on the basis of being "peer-reviewed".
Bad idea:
- A Wikipedia article is subject to frequent editing and the content you cite it for may not be there when the reader goes looking for it (adding a date-accessed only partly addresses the issue since it will be difficult for the reader to start going through versions edit but as per the comments below, you can use a citing service within Wikipedia to cite a specific version of a page).
- Relatedly, the content may have been correct at the time of citing but errors may be edited into the content at a later point, making you look bad by association (edit, again, citing a specific version could work around that)
- There is no minimal guarantee of the quality for most Wikipedia articles (which peer-review, at least in theory, should provide, depending perhaps on the venue).
- Wikipedia is not a first-party source: in theory, no new knowledge is "created" in Wikipedia. Where feasible, you should try to attribute knowledge to those who create it (and optionally thereafter add a reference to those who report it or better describe it in a book or survey)
- Attribution for the content of a Wikipedia article is difficult: multiple editors may be involved in a certain piece of content, many editors are anonymous, etc. This means that there is no culpability for information (unlike peer-reviewed papers where authors have something to lose by publishing crap). This opens up further possibilities, such as authors anonymously adding content to Wikipedia that they can cite to support the claims in their paper.
Okay, back to academic convention and "isn't alloweds" and "is alloweds" ...
Maybe it's not very well known or something but suppose there is a small Wikipedia page with the correct mathematical proof. How much effort are people going to go to to find a proof in a book when there's a (correct (which we know, because it's maths and we can check it)) proof on Wikipedia?
According to Wikipedia policies, it should have a reference to an original source. Cite that.
If it doesn't have a reference, or if you copy how the proof is expressed directly, I would recommend to add a footnote (rather than a formal citation) stating the Wikipedia article you found the proof on and how you verified it (and the CC-BY-SA licence if you copy the expression).
Generally avoid citing web-pages (unless they are normative online standards or something) if only to avoid getting your hand bitten off by a reviewer. Using footnotes is much safer in this regards and serves a similar-ish purpose (though it won't be counted as a formal citation).
In research, if enough reviewers believe something isn't allowed, then de facto it isn't allowed.
Or if you do find it somewhere else why not say: "Proof taken from Wikipedia and verified by the proof shown in "Triangles and their properties, Nature, 2014, p113 etc..."
Why not say "Proof sketched for me by Fred, the guy who sits next to me, who had previously read it in 'Triangles and their properties, Nature, 2014, p113 etc...'"?
In other words, what's important is the reference, not how you found.
You should never use something that isn't your own without a citation. "Copied down without a citation" is not an option.
However, you should also not cite Wikipedia as an authoritative source; you should track down a source that is authoritative and use that.
Why not use Wikipedia as a source?
One reason for not using Wikipedia as an authoritative source is that it is simply not reliable enough in many cases. Wikipedia's founder, Jimmy Wales, himself says
that he gets about 10 e-mail messages a week from students who complain that Wikipedia has gotten them into academic hot water. "They say, 'Please help me. I got an F on my paper because I cited Wikipedia'" and the information turned out to be wrong, he says. But he said he has no sympathy for their plight, noting that he thinks to himself: "For God sake, you’re in college; don’t cite the encyclopedia."
Even if you can verify that the Wikipedia article is correct, you should not use it as an authoritative source. You asked:
Why is it so bad to say: "I found this proof on Wikipedia, and the maths checks out so it doesn't matter who wrote it, it is correct, but that's where I found it."
Citing Wikipedia as a source for an idea that isn't original to Wikipedia does not give credit to the original author of the idea, which is a major reason for citing something in the first place. (The same applies to any tertiary source.)
The Wikipedia entry on Citing Wikipedia itself advises:
It will usually be more acceptable to cite those original sources rather than Wikipedia since it is, by nature, a secondary or tertiary source. At the same time, simple academic ethics require that you should actually read the work that you cite: if you do not actually have your hands on a book, you should not misleadingly cite it as your source.
When should you cite Wikipedia?
It is appropriate to cite Wikipedia if you are commenting on Wikipedia. For example: "The Wikipedia page on X shows that it is a controversial topic, with a large number of rollbacks indicating disagreement on the correct approach."
Similarly, Wikipedia can sometimes be a primary source on popular culture. In this case, you should use it and cite it accordingly. Update: I gave an example of this in another answer.
If you use Wikipedia as a source for finding other sources, then the question, Is there a problem with citing the original source instead of the source where the information was first found? applies.
Further reading
- Appropriate use of Wikipedia (highly recommend reading this one)
- Should I use or cite Wikipedia? Probably not.
- Why you can't cite Wikipedia in my class
- A Stand Against Wikipedia
In my opinion a lot of the answers are too negative about wikipedia, at least when applied to the part of wikipedia that applies to mathematics (my academic field, and the field the OP asked about).
I am a little surprised to hear people describe wikipedia as "unreliable", including links to university websites which say rather snootily to avoid it. This is how I felt about the mathematics on wikipedia circa 2006. It has gotten so much better in the intervening years, for the obvious reason: a lot of very mathematically experienced people (including at least one Fields Medalist, and also including me, for a period from about 2006 to 2008) put in a lot of time writing and vetting the articles. Where it stands now is that wikipedia is the best single repository of mathematical information in the world. It has been several years since I have seen anything that was wrong on a wikipedia math article. Some of these articles contain content that is difficult to find in other places, and some of the content is new: it is far from unheard of that someone just put in their own proof of a theorem. Many feel in principle that this sort of thing should not be done (I think I do; it's been a while since I've really thought about it), but in practice when someone writes a nice self-contained proof of a mathematical result, why delete it? So there is some really great stuff there: I think that most research mathematicians who are frequent users of the internet have by now learned mathematics from wikipedia.
As others have correctly pointed out, the question of "when to cite" is more complicated. Let me consider several of the alternatives:
1) Should you refer to wikipedia for standard proofs?
I think I believe that sometimes you should but I have never actually done it in a "serious research paper", in part because of exactly the sort of internet-phobic practices that Paul Garrett refers to in his answer. Recently I was writing a broad-audience article, and I wanted to say that a certain aspect of a classical construction -- the Galois connection between ideals of a polynomial ring over an algebraically closed field k and subsets of affine n-space over k -- worked verbatim with k replaced by an arbitrary integral domain. I ended up referring to Lang's Algebra for this. That is really not (ahem) ideal: this is one of the most "standard texts" in the sense that a large percentage of professional mathematicians have a copy in their office. On the other hand it is not free and even more mathematicians and math students don't have it. But billions of people have internet access, and surely wikipedia (for instance) does a perfectly good job of explaining the point. I wussed out and didn't give an explicit electronic reference, and I rarely do in formal writing. (In fact I have myself written many, many pages of mathematical writing -- as has Paul Garrett, by the way -- and I usually wuss out and do not refer to it in my formal writing either, even though I know exactly where I would like to point and a student would understand my research paper much more easily with that reference included.) At this point, when I say that something is "well known" I assume that students will look for it on the internet, and as a code between me and myself, at least, I try never to say that in papers except in cases where a student who looked for it on the internet would quickly and easily find it (and when that happens I don't worry so much about tracking down a print reference).
In the above case, the big advantage of wikipedia is its ease and convenience: it has almost exactly what any text would have but is much quicker, easier and freer to access.
2) Should you refer to wikipedia for non-standard proofs?
In other words, if a wikipedia article has a proof which is different from the one that you would find in any expensive math text, should you refer to that? If you want the reader to read that proof, I think you have to do refer to it or try to track down the source of the proof that made it into the wikipedia article. However, the latter brings me to my biggest complaint about the math articles on wikipedia: they're great for mathematical content. They can be really bad as references: e.g. they can be taken out of some standard source without referring back to that source. Or an article on the X-Y Theorem will have a statement of the theorem, motivation for the statement, the proof of the theorem, and then talk about further work and generalizations. That would make for a great lecture about the X-Y Theorem, but for an encyclopedia article there's a lot missing: who are X and Y? (Sometimes they don't even try to tell you, even when there are wikipedia articles on X and Y.) Where was the X-Y Theorem first published? (I'm sorry to tell you that many mathematically rock-solid articles don't contain this kind of primary source material.) Is the proof included in the article the original proof of X-Y? If not, where does it come from?
When I was involved with it, the culture of mathematical wikipedians was not good at addressing the above issues: if I asked for this information about an article, someone would usually nicely tell me that I was more than welcome to add it myself. I would mention that unfortunately I didn't know the source material that led to most of what other people included in the article...and there the matter usually got dropped.
So it may very well be the case that wikipedia has a proof of something for which it is not trivial to discern where the proof comes from. As an example, wikipedia has a really nice proof of the Schwartz-Zippel Lemma. It is not the original proof, I think -- it's slicker. Where does it come from? I couldn't tell from the article itself. This is not a hypothetical example: I wrote a brief expository note including this proof. As you can see, I did refer to the wikipedia article. However, I should say that this is an article in the informal sense of the term: I wrote it up for myself, spoke in a colleague's seminar about it, and kept the document for myself. I have not tried to publish it anywhere, nor would I, since it is "just an exposition" of a proof of Zeev Dvir's resolution of the Finite Field Kakeya Problem. This brings me to my last point:
3) When should you include proofs from wikipedia in your articles?
If you use a wikipedia proof in your article in a critical way, then you should include a reference to it (or where it comes from, if possible). However, if you are using a wikipedia proof in a critical way in your article, is your article a research article or even a "serious expository" article? Why would a journal want to republish something that is available in a standard source?
In the OP's example he mentions including a proof of the Pythagorean Theorem. No math journal I know is going to allow you to include (any one of; I'm sure it gives several) wikipedia's proof of the Pythagorean Theorem, but not because it comes from wikipedia: they're just not going to want you to rehash such old-hat stuff. To be honest, the introductory passage "For example, suppose you're writing an article about triangles..." raises some eyebrows in this regard: are you trying to formally publish an article about triangles? Good luck with that: it's going to be tough. Such articles are published, but for every one that is, probably a hundred are rejected.
I also think that in a formal article -- even, perhaps even especially, if it's an expository article -- the burden falls more highly on you to investigate primary source material. If you're teaching a class or something, then it's helpful to say exactly where you got the material from. But if you're writing an article, it becomes more important to track down the provenance of the intellectual content itself: that is a much more challenging thing to do. Still though I think there are cases where the answer really will be that the argument appeared for the first time on wikipedia, in which case you should cite it there.
4) What about the "unreliability" of wikipedia articles?
This is really "weak sauce" for math articles, because unlike most encyclopedia articles, mathematics articles are self verifying by any sufficiently qualified reader. So saying "Don't include this proof from wikipedia because wikipedia is full of errors" sounds silly to me: on the one hand lots of published books have a higher density of errors than wikipedia math articles; on the other hand, every proof you read you're supposed to check anyway. So don't worry about whether it's correct: see whether it's correct. Most proofs in wikipedia articles are no more than a page or so in length, so they can be checked in a relatively short time. If it's not correct, fix it or tell someone about it!
@SteveJessop asked:
Since you mention not being able to work out where a proof comes from: suppose you found a proof spray-painted under a bridge, that was slicker than the best proof you can find published. Are the concerns over such proofs on Wikipedia essentially the same as for that found on a wall? That is, there's no problem with verifying it correct, but if you can't print it or refer to it without giving credit then it's difficult to use?
Wikipedia seems better than "bridge proofs" because any interested party can access wikipedia and see the proof there as well as whatever documentation or lack thereof exists. In the bridge scenario, one can question whether this is really where I found the argument or whether I did my due diligence in trying to track down its provenance.
In practice: in the mathematical community, the scholarly task of giving the correct attribution -- either in the sense of where you found it or, still more so, the primary source -- is not taken all that seriously by many (compared to other branches of academia). We agree that you shouldn't be passing off others' ideas as your own, and that if you know who the "other" is you should cite them, but "I learned this trick from somewhere, and now I can't remember where" is pretty common in mathematics. In fact math papers tend to be written in logical sequence rather than psychological sequence, so that there is a key part of the mathematical writing process in which history is removed or rewritten, so to speak. I can only hope you understand what I mean by that. It's a subtle phenomenon and not an inherently negative one, but we do it in mathematics more than in almost any other field (and I do it more than the average amount for a mathematician: a big part of my research process is to take others' ideas and writing and rewriting it in one way or another). In general I have grown "more scholarly" over the years, but usually with the suspicion that at best the referees will not really care one way or the other. I ended one recent paper with a section surveying the history and the literature of a certain problem. That is very rare in a math paper. I got zero comments about this section, and if the journal was better I would have expected them to tell me to shorten or remove the section. In my most recent paper, see Theorem 2.1 and Remark 2.2. Remark 2.2 explains the history of Theorem 2.1. It is almost as long as the proof of the theorem!