Applying software analysis technology to lightweight semantic markup of document text