Programming for Corpus Linguistics with Python and Dataframes | Cambridge University Press & Assessment


Programming for Corpus Linguistics with Python and Dataframes

Series: Elements in Corpus Linguistics
Author: Daniel Keller, Western Kentucky University
Published: June 2024
Availability: Available
Format: Hardback
ISBN: 9781009486781
Price: £55.00 GBP (Hardback)

This Element offers intermediate or experienced programmers algorithms for corpus linguistic (CL) programming in Python using dataframes, which provide a fast, efficient, and intuitive set of methods for working with large, complex datasets such as corpora. It demonstrates principles of dataframe programming applied to CL analyses and presents complete algorithms for creating concordances; producing lists of collocates, keywords, and lexical bundles; and performing key feature analysis. An additional algorithm for creating dataframe corpora is also presented, including methods for tokenizing, part-of-speech tagging, and lemmatizing with spaCy. Together, these provide a set of core skills that can be applied to a range of CL research questions, as well as to original analyses not possible with existing corpus software.
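To give a rough sense of what a dataframe corpus and a concordance built on it might look like, here is a minimal sketch. It is not taken from the Element and should not be read as the author's algorithm. It assumes pandas as the dataframe library (the description says only "dataframes"), uses plain whitespace tokenization in place of spaCy, and the names `make_corpus`, `join_columns`, and `concordance` are hypothetical.

```python
import pandas as pd


def make_corpus(texts):
    """Build a one-token-per-row dataframe from a {text_id: raw_text} dict.

    Assumption: whitespace tokenization only; a real pipeline would use a
    proper tokenizer and add part-of-speech and lemma columns.
    """
    rows = []
    for text_id, raw in texts.items():
        for position, token in enumerate(raw.split()):
            rows.append({"text_id": text_id, "position": position, "token": token})
    return pd.DataFrame(rows, columns=["text_id", "position", "token"])


def join_columns(frame, offsets):
    """Concatenate the given context columns row by row, separated by spaces."""
    joined = pd.Series("", index=frame.index, dtype="object")
    for offset in offsets:
        joined = joined + " " + frame[offset]
    return joined.str.strip()


def concordance(corpus, node, window=2):
    """Return key-word-in-context rows for `node` (case-insensitive match)."""
    grouped = corpus.groupby("text_id")["token"]
    # Column k holds the token k positions to the right of each row
    # (negative k means to the left). Shifting within each text keeps
    # context from leaking across text boundaries.
    context = pd.DataFrame(
        {offset: grouped.shift(-offset) for offset in range(-window, window + 1)}
    ).fillna("")
    hits = context[context[0].str.lower() == node.lower()]
    return pd.DataFrame(
        {
            "text_id": corpus.loc[hits.index, "text_id"],
            "left": join_columns(hits, range(-window, 0)),
            "node": hits[0],
            "right": join_columns(hits, range(1, window + 1)),
        }
    ).reset_index(drop=True)


if __name__ == "__main__":
    sample = {"t1": "the cat sat on the mat", "t2": "a cat is a cat"}
    print(concordance(make_corpus(sample), "cat", window=2))
```

Storing one token per row means the context window can be built with column shifts rather than an explicit loop over tokens, which is the kind of vectorized operation that makes dataframes attractive for large corpora.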

Product details

Published: June 2024
Format: Hardback
ISBN: 9781009486781
Length: 114 pages
Dimensions: 229 × 152 × 8 mm
Weight: 0.306kg
Availability: Available

This title is available for institutional purchase via Cambridge Core

Table of Contents

1. Data frame corpora
2. Python basics for corpus linguistics
3. Working with data frames
4. Algorithms for common corpus linguistic tasks
5. Creating data frame corpora
6. Conclusion
References.

Keller - Supp Mat
Size: 359.8 MB
Type: application/zip

Additional Information

About the authors

Author: Daniel Keller, Western Kentucky University