Introducing and evaluating ukWaC, a very large web-derived corpus of English