Designing and Evaluating Language Corpora
Corpora are ubiquitous in linguistic research, yet to date, there has been no consensus on how to conceptualize corpus representativeness and collect corpus samples. This pioneering book bridges this gap by introducing a conceptual and methodological framework for corpus design and representativeness. Written by experts in the field, it shows how corpora can be designed and built in a way that is both optimally suited to specific research agendas, and adequately representative of the types of language use in question. It considers questions such as 'what types of texts should be included in the corpus?', and 'how many texts are required?' – highlighting that the degree of representativeness rests on the dual pillars of domain considerations and distribution considerations. The authors introduce, explain, and illustrate all aspects of this corpus representativeness framework in a step-by-step fashion, using examples and activities to help readers develop practical skills in corpus design and evaluation.
- Surveys the state of corpus design and representativeness.
- Provides a practical framework for conceptualizing and achieving corpus representativeness, and helps readers to understand and apply this framework to the design of new corpora and the evaluation of existing corpora.
- Gives readers examples and activities to help them develop practical skills in corpus design and evaluation.
Product details
March 2022Adobe eBook Reader
9781009254762
0 pages
This ISBN is for an eBook version which is distributed on our behalf by a third party.
Table of Contents
- 1. Introduction
- 2. Approaches to representativeness in previous corpus linguistic research
- 3. Corpus representativeness: a conceptual and methodological framework
- 4. Domain considerations
- 5. Distribution considerations
- 6. The influence of domain and distribution considerations on corpus representativeness – bringing it all together
- 7. Corpus design and representativeness in practice
- Glossary
- Appendix A. Example articles documenting existing corpora
- Appendix B. Survey of corpus design and compilation practices.
Sorry! No results found
Sorry, we couldn't find any results that match your search. Please check the spelling or try different key words.
Chapter 4 Sampling Frame
Answers to End-Of-Chapter Exercises
Request processing
✕Thank you. Your request is now being processed.
You may be contacted by your local representative in order to verify your credentials. We review all requests as quickly as possible, but please allow up to 4 working days for your request to be processed.
Teacher’s restricted resource
✕We’re sorry. The resource you selected is for teachers only. Please browse and select a new resource available to you.
This title is supported by one or more locked resources. Access to locked resources is granted exclusively by Cambridge University Press to instructors whose faculty status has been verified. To gain access to locked resources, inst ructors should sign in to or register for a Cambridge user account.
This title is supported by one or more locked resources. Access to locked resources is granted exclusively by Cambridge University Press to instructors whose faculty status has been verified. Please request an instructor account in order to view this content.
Please use locked resources responsibly and exercise your professional discretion when choosing how you share these materials with your students. Other instructors may wish to use locked resources for assessment purposes and their usefulness is undermined when the source files (for example, solution manuals or test banks) are shared online or via social networks.
Supplementary resources are subject to copyright. Instructors are permitted to view, print or download these resources for use in their teaching, but may not change them or use them for commercial gain.
If you are having problems accessing these resources please contact [email protected].