The course will mainly use the following handbook:

McEnery, Tony, and Hardie, Andrew (2012). Corpus Linguistics. Cambridge: Cambridge University Press. (294 pages).

The parts on data analysis and processing will instead be based on sections of:

Gries, Stefan Th. (2009). Quantitative Corpus Linguistics with R. New York and London: Routledge. (248 pages).

Other material, such as case-study articles, will be provided during the course.

A good primer on text manipulation is:

Kenneth W. Church’s “Unix for Poets”, which is easy to find online in several formats. This is the original link to the document (download it and save it as a PostScript .ps file, which can then be converted to PDF or printed):

