Articles tagged with corpus linguistics

  1. Efficient Querying Of Google Books Ngram Data

    The Google Books Ngram Data (raw data available here) is a pretty amazing resource. Version 2, released in July 2012, contains 1gram through 5gram frequency counts derived from 6% of all books ever published (!).

    There's clearly a lot an enterprising young gentleman or gentlewoman could accomplish with a resource such …