Automated construction and evaluation of a Japanese web-based reference corpus