Personalizing Web publishing via information extraction