About | HeinOnline Law Journal Library | HeinOnline Law Journal Library | HeinOnline

21 J.L. & Pol'y 299 (2012-2013)
A Systemic Functional Approach to Automated Authorship Analysis

handle is hein.journals/jlawp21 and id is 311 raw text is: A SYSTEMIC FUNCTIONAL APPROACH TO
AUTOMATED AUTHORSHIP ANALYSIS
Shlomo Argamon* and Moshe Koppel**
INTRODUCTION
Attribution of anonymous texts, if not based on factors
external to the text (such as paper and ink type or document
provenance, as used in forensic document examination), is
largely, if not entirely, based on considerations of language
style. We will consider here the question of how to best
deconstruct a text into quantitative features for purposes of
stylistic discrimination. Two key considerations inform our
analysis.  First,  such  features  should  support  accurate
classification by automated methods. Second, and no less
importantly, such features should enable a clear explanation of
the stylistic difference between stylistic categories (read:
authors) and why a disputed text appears more likely to fall into
one or another category. The latter consideration is particularly
important when a nonexpert, such as a judge or jury, must
evaluate the results and reliability of the analysis.
We start from the intuitive notion that style is indicated in a
text by those features of the text that indicate the author's choice
of one mode of expression from among a set of equivalent
modes for a given content. There are many ways in which such
choices manifest themselves in a text. Specific words and
phrases may be chosen more frequently by certain authors than
others, such as the phrase cool-headed logician favored by the
Unabomber. Some authors may habitually use certain syntactic
* Linguistic Cognition Laboratory, Department of Computer Science, Illinois
Institute of Technology, argamon@iit.edu.
** Department of Computer Science, Bar-Ilan University, Ramat-Gan, Israel,
moishk@gmail.com.

299

What Is HeinOnline?

HeinOnline is a subscription-based resource containing thousands of academic and legal journals from inception; complete coverage of government documents such as U.S. Statutes at Large, U.S. Code, Federal Register, Code of Federal Regulations, U.S. Reports, and much more. Documents are image-based, fully searchable PDFs with the authority of print combined with the accessibility of a user-friendly and powerful database. For more information, request a quote or trial for your organization below.



Short-term subscription options include 24 hours, 48 hours, or 1 week to HeinOnline.

Contact us for annual subscription options:

Already a HeinOnline Subscriber?

profiles profiles most