A gentle introduction to programming for students and researchers interested in conducting computer-based analysis in linguistics, this book is an ideal starting point for linguists approaching programming for the first time. Assuming no background knowledge of programming, the author introduces basic notions and techniques needed for linguistics programming and helps readers to develop their understanding of electronic texts.The book includes many examples based on diverse topics in linguistics in order to demonstrate the applicability of the concepts at the heart of programming. Practical examples are designed to help the reader to:*Identify basic issues in handling language data, including Unicode processing*Conduct simple analyses in morphology/morphosyntax, and phonotactics*Understanding techniques for matching linguistic patterns*Learn to convert data into formats and data structures suitable for linguistic analysis*Create frequency lists from corpus materials to gather basic descriptive statistics on texts*Understand, obtain and 'clean up' web-based data*Design graphical user interfaces for writing more efficient and easy-to-use analysis tools.Two different types of exercise help readers to either learn to interpret and understand illustrative sample code, or to develop algorithmic thinking and solution strategies through turning a series of instructions into sample programs. Readers will be equipped with the necessary tools for designing their own extended projects.Key Features:*Ideal introduction for students of linguistics attempting to process corpus materials or literary texts for dissertations, theses or advanced research work*Linguistic examples throughout the text clearly demonstrate the application of programming theory and techniques*Coverage ranging from basic to more complex topics and methodologies enables the reader to progress at their own pace*Two chapters on the advantages of modularity and associated issues provid
This handbook is a comprehensive practical resource on corpus linguistics. It features a range of basic and advanced approaches, methods and techniques in corpus linguistics, from corpus compilation principles to quantitative data analyses. The Handbook is organized in six Parts. Parts I to III feature chapters that discuss key issues and the know-how related to various topics around corpus design, methods and corpus types. Parts IV-V aim to offer a user-friendly introduction to the quantitative analysis of corpus data: for each statistical technique discussed, chapters provide a practical guide with R and come with supplementary online material. Part VI focuses on how to write a corpus linguistic paper and how to meta-analyze corpus linguistic research. The volume can serve as a course book as well as for individual study. It will be an essential reading for students of corpus linguistics as well as experienced researchers who want to expand their knowledge of the field.
Assuming no prior knowledge, this text provides a concise, practical and accessible introduction to using, analysing and interpreting statistics and methodologies in empirical work using R or JASP. It takes an activity-based approach, in which students are first stimulated to consider a problem or question and find their own solution before the conventional way of solving that problem or question is introduced. 'How To' guides on R and JASP, which take students step-by-step through statistical analyses and practical assignments, are available on the book's companion website. This book will be an invaluable resource for undergraduate and postgraduate students of applied linguistics. New to this Edition: - Now focuses on R and JASP, two increasingly popular open source softwares used by linguists - 'How To' guides to help students strengthen their understanding and apply what they have learnt available on a companion site - Contains more comprehensive coverage of effect sizes, basic methods, regression theory, relationship studies and non-parametric tests
Linguistic research increasingly relies on large electronic corpora for its primary data. While off-the-shelf programs can perform a set of standard searches, specialized questions usually require a custom-written program to find their answers. Essential Python for Corpus Linguistics uses the programming language Python to explain how to write simple programs that extract linguistically useful information, such as the frequency of a given utterance in a particular context within a corpus, or instances of certain phrasal structures in a Treebank. Assuming no prior programming background, the book provides numerous example programs that search for phonological, morphological and syntactic constructions in corpora, and the associated web site provides sample data and programs, which make it easy to start working independently. This book is a valuable resource for linguists who use corpus methods but have no programming training.
Corpus linguistics is the study of language data on a large scale - the computer-aided analysis of very extensive collections of transcribed utterances or written texts. This textbook outlines the basic methods of corpus linguistics, explains how the discipline of corpus linguistics developed and surveys the major approaches to the use of corpus data. It uses a broad range of examples to show how corpus data has led to methodological and theoretical innovation in linguistics in general. Clear and detailed explanations lay out the key issues of method and theory in contemporary corpus linguistics. A structured and coherent narrative links the historical development of the field to current topics in 'mainstream' linguistics. Practical tasks and questions for discussion at the end of each chapter encourage students to test their understanding of what they have read and an extensive glossary provides easy access to definitions of technical terms used in the text.
This book is an innovative guide to quantitative, corpus-based research in historical and diachronic linguistics. Gard B. Jenset and Barbara McGillivray argue that, although historical linguistics has been successful in using the comparative method, the field lags behind other branches of linguistics with respect to adopting quantitative methods. Here they provide a theoretically agnostic description of a new framework for quantitatively assessing models and hypotheses in historical linguistics, based on corpus data and using case studies to illustrate how this framework can answer research questions in historical linguistics. The authors offer an in-depth explanation and discussion of the benefits of working with quantitative methods, corpus data, and corpus annotation, and the advantages of open and reproducible research. The book will be a valuable resource for graduate students and researchers in historical linguistics, as well as for all those working with linguistic corpora.
This book constitutes the thoroughly refereed post-proceedings of the Second International Conference on Logical Aspects of Computational Linguistics, LACL '97, held in Nancy, France in September 1997. The 10 revised full papers presented were carefully selected during two rounds of reviewing. Also included are two comprehensive invited papers. Among the topics covered are type theory, various types of grammars, linear logic, parsing, type-directed natural language processing, proof-theoretic aspects, concatenation logics, and mathematical languages.
Author: Management Association, Information Resources
Publisher: IGI Global
In a globalized society, effective communication is critical, and study of language from a mathematical perspective can shed light on new ways in which to express meaning across cultures and nations. Computational Linguistics: Concepts, Methodologies, Tools, and Applications explores language by dissecting the phonemic aspects of various communication systems in order to identify similarities and pitfalls in the expression of meaning. With applications in a variety of areas, from psycholinguistics and cognitive science to computer science and artificial intelligence, this multivolume reference work will be of use to researchers, professionals, and educators on the cutting edge of language acquisition and communication science.
Including a brief review of classical logic and its major assumptions, this textbook provides a guided tour of modal, many valued and substructural logics. The textbook starts from simple and intuitive concepts, clearly explaining the logics of language for linguistics students who have little previous knowledge of logic or mathematics.