Ben C. Smith wrote:Methodologically speaking, Peter, is it more secure that Origen is not the author than that Clement is?
Sort of.
Yes, there is
more probability behind the conclusion that Origen was not the author ... although that
doesn't imply that "Clement is the author" is improbable, less probable than not (that may be obvious, but I think I should spell that out).
Ben C. Smith wrote:IOW, is it possible that Clement was selected simply because the real author was not on your list, and Clement happened to be closest of those who were? Is that how it works?
That is possible, sure.
The answer as to how 'really' probable it is that Clement is the author requires both stylometric and non-stylometric analysis.
If you asked me how to set that up, I would say that the 'non-stylometric analysis' should be used to create the prior probability that Clement is the author.
Then the stylometric analysis should be used to calculate the consequent probabilities, based on the stylometry, that Clement is or is not the author. This should be done on the basis of empirical/experimental data about the reliability and rate of accuracy of this stylometric method when working with samples thus and thus big and with such and such number of controls, with this number of matches ('N' out of 'T') to the candidate author, etc.
It must be kept in mind that Clement was picked as the most probable author twice with 3000+ word samples. This isn't a large number of times, but it is greater than once. Conclusions based on more than one reading have much higher probability of being correct (than those based on just one).
Also, there was a relatively high absolute level of correspondence with Clement of Alexandria (p-values > 0.1 with text length > 3000), well within range of plausible authorship. So it's not just the best of a bad bunch.
I'm not trying to oversell the conclusion that Clement is the author (and I also don't want to paint myself into a corner), but it does deserve further analysis, including further stylometric analysis. It is definitely not insignificant that Clement was considered the closest match twice. It would be even more significant, for example, if the text were sliced into three smaller samples and if Clement were still the closest match three times. Or if the number of controls were increased. And so on. Or if scholarly analysis might vindicate such a conclusion.
"... almost every critical biblical position was earlier advanced by skeptics." - Raymond Brown