A corpus for cross-document coreference