By Tony McEnery

Corpus linguistics is the study of language data on a large scale - the computer-aided analysis of very extensive collections of transcribed utterances or written texts. This textbook outlines the basic methods of corpus linguistics, explains how the discipline of corpus linguistics developed and surveys the major approaches to the use of corpus data. It uses a broad range of examples to show how corpus data has led to methodological and theoretical innovation in linguistics in general. Clear and detailed explanations lay out the key issues of method and theory in contemporary corpus linguistics. A structured and coherent narrative links the historical development of the field to current topics in 'mainstream' linguistics. Practical tasks and questions for discussion at the end of each chapter encourage students to test their understanding of what they have read, and an extensive glossary provides easy access to definitions of technical terms used in the text.

Read or Download Corpus Linguistics: Method, Theory and Practice PDF

Similar language & grammar books

Who's Whose? A No-Nonsense Guide to Easily Confused Words

Who's Whose? is an entertaining and simple guide to the most commonly confused words in English today, with real examples of good and bad usage to make the differences crystal-clear. The Embarrassment Rating associated with each mistake is included, together with an explanation of why the confusion occurs and how to avoid it in future.

The Iranian Languages (Routledge Language Family Series)

The Iranian languages form the major eastern branch of the Indo-European group of languages, itself part of the larger Indo-Iranian family. Estimated to have between 150 and 200 million native speakers, the Iranian languages constitute one of the world’s major language families. This comprehensive volume offers a detailed overview of the principal languages which make up this group: Old Iranian, Middle Iranian, and New Iranian.

Handbook of translation studies

2. Adaptation (by Milton, John), p3-6; 3. Applied Translation Studies (by Rabadan, Rosa), p7-11; 4. Audiovisual translation (by Remael, Aline), p12-17; 5. Censorship (by Merkle, Denise), p18-21; 6. Children's literature and translation (by Alvstad, Cecilia), p22-27; 7. Cognitive approaches (by Alves, Fabio), p28-35; 8.

Corpus-Based Contrastive Studies of English and Chinese

This book is concerned with cross-linguistic contrast of major grammatical categories in English and Chinese, two of the most important yet genetically different world languages. This genetic difference has resulted in many subsidiary differences that are, among other things, related to grammar. Compared with typologically related languages, cross-linguistic contrast of English and Chinese is more challenging yet promising.

Extra resources for Corpus Linguistics: Method, Theory and Practice

Sample text

The Manually Annotated Sub-Corpus (MASC; see Ide et al. 2010) of the American National Corpus, for example, contains multiple layers of manually inserted or manually checked annotation and is clearly suitable for this purpose. , Archer 2005). The range of annotations developed and applied is similar for a number of major languages, such as Chinese, French or German. For a host of other languages, however, the availability of programs which can undertake automatic annotation is patchy at best. For example, there is no publicly available part-of-speech tagger for the Bantu language Kinyarwanda that we are aware of.

On the basis of this somewhat brief description of corpus annotation, a reader would be forgiven for thinking that the distinction between annotated and unannotated corpora is based simply on whether or not the corpus has been analysed in a particular way yet. Those corpora which have already been analysed in some way are annotated, those which have yet to be analysed are not. This distinction in itself, however, is so trivial that it would hardly constitute a major dimension along which research in corpus linguistics can vary.

While this would ideally be accomplished using a computer program rather than manual analysis, the automatic alignment of parallel corpora is not a trivial task for some language pairs (Piao 2000, 2002).

8 Summary

By looking at a series of defining features in corpus linguistics, this chapter has explored ways in which the construction and use of corpora vary. In doing so, we have highlighted some of the differences that exist between linguists in their use – and basic conception – of corpus linguistics.
