Abstract: We present a system based on finite-state technology (FST) that performs metrical scansion of English verse. Scansion is the traditional task of analyzing the lines of a poem, marking the stressed and unstressed elements and dividing each line into metrical feet. The system's workflow consists of several subtasks built around finite-state machines, which analyze verse by performing tokenization, part-of-speech tagging, stress placement, and stress-pattern prediction for unknown words. The scanner also classifies poems according to the predominant type of metrical foot found. We present a brief evaluation of the system on a gold-standard corpus of human-scanned verse, on which it achieves a per-syllable accuracy of 86.78%. The program uses open-source components and is released under the GNU GPL.
Keywords: scansion; English; poetry; out-of-vocabulary words