Bayesian statistics for pure mathematicians
Many hold that Bayesian statistics "from a purely mathematical point of view" is entirely coextensive with probability (however it is that you want to define its boundaries as a mathematical discipline). Nonetheless, if I interpret your request as being for a mathematically sophisticated and rigorous exposition on why the Bayesian approach is a worthy one, three book spring to mind.
- Theory of Statistics by Mark Schervish
- Bayes Theory by John Hartigan
- The Bayesian Choice by Christian Robert
The first of these is a general graduate text in statistics, but the author gives uncommonly complete coverage of both Bayesian and frequentist methods.
The second is a smaller volume and, as I recall, is devoted to some of the more delicate issues surround finite versus countable additivity as relates to using probability distributions as priors in a Bayesian approach.
The final book is more general, but the style is more formal than the Bernardo and Smith book mentioned by PaPiro. (This is, in my experience, true of the style of French Bayesians :)
As I said, the distinctive elements of the Bayesian perspective are more philosophical than technical, but there are some technical areas that have received attention in the Bayesian community that may be of independent mathematical interest. One would be the role of so-called "improper" priors as mentioned above.
Another is the role of conditional distributions as a primitive rather than derived notion, leading to the idea of disintegration, as in this manuscript of Pollard.
Also, because of a keen interest in the application of Monte Carlo methods, Bayesian statisticians have to a lot of work on various aspects of computational methods for sampling from various distributions. Christian Robert is a prominent researcher in this area, and he has a blog. The current post happens to be about Bayesian foundations.
Finally, at the heart of a many arguments in favor of a Bayesian approach (early chapters in Bernardo and Smith and Robert are dedicated to it) are de Finetti type representation theorems, which sanction prior distributions via appeals to exchangeability. You can start with the wiki entry for de Finetti theorems and then look at the work of Persi Diaconis on the topic. In this vein see also Lauritzen's monograph, which (for me anyway) is the last word on the matter.
Suggestions:
- W.M. Bolstad, Introduction to Bayesian Statistics 2nd. Ed., Wiley, 2007.
- J.M. Bernardo, A.F.M. Smith, Bayesian Theory, Wiley, 2000.
- J.K. Ghosh, M. Delampady, T. Samanta, An Introduction to Bayesian Analysis: Theory and Methods, Springer, 2006.
Freely avaliable:
J.F. Kenney, Mathematics of Statistics Part One, Chapman & Hall Ltd. 1939.
J.F. Kenney, Mathematics of Statistics Part Two, Chapman & Hall 1940.
In addition to former proposals, why not the famous two-volume work "Theory of Probability" by Bruno de Finette (In the Wiley classics series)
Quotation: "Probability do not exist"
A more modern book, with quite another take than the above, is "Bayesian Data Analysis", Third Edition by A Gelman, J B Carlin, H S Stern, D B Dunson, A Vehtari, and D B Rubin (CRC press). This a modern book with emphasis on data analysis and computation. I would say that for today (2017) this is the best starting point.