This module is developed for the NewsReade project[1] and splits a LexisNexis XML file into headers and body text. It writes the output to KAF [2]. The LexisNexis data is structured using the NITF News Industry Text Format[3]
The script can be invoked as: "perl DiscourseModule.pl FILE" where "FILE" is an XML file using the NITF format.