Designing and Evaluating Language Corpora

A Practical Framework for Corpus Representativeness
Jesse Egbert, Northern Arizona University
Douglas Biber, Northern Arizona University
Bethany Gray, Iowa State University
March 2022
This ISBN is for an eBook version which is distributed on our behalf by a third party.
Adobe eBook Reader
9781009254762
$34.99
USD

    Corpora are ubiquitous in linguistic research, yet to date, there has been no consensus on how to conceptualize corpus representativeness and collect corpus samples. This pioneering book bridges this gap by introducing a conceptual and methodological framework for corpus design and representativeness. Written by experts in the field, it shows how corpora can be designed and built in a way that is both optimally suited to specific research agendas, and adequately representative of the types of language use in question. It considers questions such as 'what types of texts should be included in the corpus?', and 'how many texts are required?' – highlighting that the degree of representativeness rests on the dual pillars of domain considerations and distribution considerations. The authors introduce, explain, and illustrate all aspects of this corpus representativeness framework in a step-by-step fashion, using examples and activities to help readers develop practical skills in corpus design and evaluation.

    • Surveys the state of corpus design and representativeness.
    • Provides a practical framework for conceptualizing and achieving corpus representativeness, and helps readers to understand and apply this framework to the design of new corpora and the evaluation of existing corpora.
    • Gives readers examples and activities to help them develop practical skills in corpus design and evaluation.

    Product details

    March 2022
    Adobe eBook Reader
    9781009254762

    Table of Contents

    • 1. Introduction
    • 2. Approaches to representativeness in previous corpus linguistic research
    • 3. Corpus representativeness: a conceptual and methodological framework
    • 4. Domain considerations
    • 5. Distribution considerations
    • 6. The influence of domain and distribution considerations on corpus representativeness – bringing it all together
    • 7. Corpus design and representativeness in practice
    • Glossary
    • Appendix A. Example articles documenting existing corpora
    • Appendix B. Survey of corpus design and compilation practices

    This title is supported by one or more locked resources. Access to locked resources is granted exclusively by Cambridge University Press to instructors whose faculty status has been verified. To gain access to locked resources, instructors should sign in to or register for a Cambridge user account.

    Please use locked resources responsibly and exercise your professional discretion when choosing how you share these materials with your students. Other instructors may wish to use locked resources for assessment purposes and their usefulness is undermined when the source files (for example, solution manuals or test banks) are shared online or via social networks.

    Supplementary resources are subject to copyright. Instructors are permitted to view, print or download these resources for use in their teaching, but may not change them or use them for commercial gain.

      Authors
    • Jesse Egbert

      Jesse Egbert is Associate Professor of Applied Linguistics at Northern Arizona University. He is a co-founding General Editor of Register Studies, and his recent books focus on online register variation (2018), methodological triangulation (2016, 2020), and corpus linguistics methods (2020).

    • Douglas Biber

      Douglas Biber is Regents' Professor of Applied Linguistics at Northern Arizona University. Previous books include Register, Genre, and Style (2009/2019), Grammar of Spoken and Written English (2021), and studies of register variation (1988, 1995, 2018).

    • Bethany Gray

      Bethany Gray is Associate Professor of Applied Linguistics and Technology at Iowa State University. Her publications include monographs on academic research articles (2015) and historical change in writing (2016). She is a co-founding General Editor of Register Studies.
