A Chinese Version of an Authorship Attribution Analysis Program
McAnulty College and Graduate School of Liberal Arts
Mark S. Mazur
Java, Chinese, Authorship, cross-entropy, FMM
The thesis will give an introduction and background for the Authorship Attribution problem in Chinese, and how we extend the existing JGAAP framework and make a few modifications to handle the special problems of Authorship Attribution in Chinese. Then varieties of methods have been used to test. The corpus we used for testing includes four authors and 32 Chinese novels. We found that Character or forward maximum matching (FMM)-segmented words in conjunction with the K-Nearest Neighbor calculated using nominal KS worked best in our test.
Zhao, M. (2008). A Chinese Version of an Authorship Attribution Analysis Program (Master's thesis, Duquesne University). Retrieved from https://dsc.duq.edu/etd/1560