This volume highlights the ways in which recent developments in corpus linguistics and natural language processing can engage with topics across language studies, humanities and social science disciplines. New approaches have emerged in recent years that blur disciplinary boundaries, facilitated by factors such as the application of computational methods, access to large data sets, and the sharing of code, as well as continual advances in technologies related to data storage, retrieval, and processing. The "march of data" denotes an area at the border region of linguistics, humanities, and…mehr
This volume highlights the ways in which recent developments in corpus linguistics and natural language processing can engage with topics across language studies, humanities and social science disciplines. New approaches have emerged in recent years that blur disciplinary boundaries, facilitated by factors such as the application of computational methods, access to large data sets, and the sharing of code, as well as continual advances in technologies related to data storage, retrieval, and processing. The "march of data" denotes an area at the border region of linguistics, humanities, and social science disciplines, but also the inevitable development of the underlying technologies that drive analysis in these subject areas. Organized into 3 sections, the chapters are connected by the underlying thread of linguistic corpora: how they can be created, how they can shed light on varieties or registers, and how their metadata can be utilized to better understand the internal structure of similar resources. While some chapters in the volume make use of well-established existing corpora, others analyze data from platforms such as YouTube, Twitter or Reddit. The volume provides insight into the diversity of methods, approaches, and corpora that inform our understanding of the "border regions" between the realms of data science, language/linguistics, and social or cultural studies.
Steven Coats is a Lecturer in English at the University of Oulu, Finland. Veronika Laippala is Professor of Digital Language Studies at the University of Turku, Finland.
Inhaltsangabe
List of Figures List of Tables List of Contributors Acknowledgements Introduction Steven Coats (University of Oulu Finland) and Veronika Laippala (University of Turku Finland) Part I. Methods for Data Collection Analysis and Visualization 1. Using Automatic Speech Recognition Transcripts for Linguistic Research Steven Coats (University of Oulu Finland) 2. Low-code Data Science Tools for Linguistics: Swiss Army Knives or Pretty Black Boxes? Jukka Tyrkkö and Daniel Ihrmark (Linnaeus University Sweden) 3. The Visualisation and Evaluation of Semantic and Conceptual Maps Gerold Schneider (University of Zurich Switzerland) Part II. Corpus Construction Registers and Genres 4. Toward Automatic Register Classification in Unrestricted Databases of Historical English Liina Repo (University of Turku Finland) Brett Hashimoto (Brigham Young University USA) Aatu Liimatta (University of Helsinki Finland) Lassi Saario (University of Helsinki Finland) Tanja Säily (University of Helsinki Finland) Iiro Tiihonen (University of Helsinki Finland) Mikko Tolonen (University of Helsinki Finland) and Veronika Laippala (University of Turku Finland) 5. Exploring the Interplay of Registers and Topicality in a Web-Scale Corpus Valtteri Skantsi Veronika Laippala and Aku Kyroläinen (University of Turku Finland) 6. Towards 'Large and Tidy': Establishing Internal Structure in Mega-Corpora Axel Bohmann (University of Freiburg Germany) Part III. Social Media Discourse and Meanings 7. Multi-Modal Considerations for Social Media Discourse Analysis: A Specialised Corpus of Twitter Commentary on 'Working from Home' Christopher Fitzgerald (Mary Immaculate College Ireland) Geraldine Mark (Cardiff University Wales) Anne O'Keeffe (Mary Immaculate College Ireland) Dawn Knight (Cardiff University Wales) Justin McNamara (Mary Immaculate College Ireland) Svenja Adolphs (University of Nottingham England) Benjamin Cowan (University College Dublin Ireland) Tania Fahey Palma (University of Aberdeen Scotland) Fiona Farr (University of Limerick Ireland) and Sandrine Peraldi (University College Dublin Ireland) 8. Exploring Patterns of Self-Identification in the LGBTQ+ Reddit Corpus Laura Hekanaho (University of Helsinki/Tampere University Finland) Turo Hiltunen (University of Helsinki Finland) Minna Palander-Collin (University of Helsinki Finland); and Helmiina Hotti (University of Helsinki Finland) Index
List of Figures List of Tables List of Contributors Acknowledgements Introduction Steven Coats (University of Oulu Finland) and Veronika Laippala (University of Turku Finland) Part I. Methods for Data Collection Analysis and Visualization 1. Using Automatic Speech Recognition Transcripts for Linguistic Research Steven Coats (University of Oulu Finland) 2. Low-code Data Science Tools for Linguistics: Swiss Army Knives or Pretty Black Boxes? Jukka Tyrkkö and Daniel Ihrmark (Linnaeus University Sweden) 3. The Visualisation and Evaluation of Semantic and Conceptual Maps Gerold Schneider (University of Zurich Switzerland) Part II. Corpus Construction Registers and Genres 4. Toward Automatic Register Classification in Unrestricted Databases of Historical English Liina Repo (University of Turku Finland) Brett Hashimoto (Brigham Young University USA) Aatu Liimatta (University of Helsinki Finland) Lassi Saario (University of Helsinki Finland) Tanja Säily (University of Helsinki Finland) Iiro Tiihonen (University of Helsinki Finland) Mikko Tolonen (University of Helsinki Finland) and Veronika Laippala (University of Turku Finland) 5. Exploring the Interplay of Registers and Topicality in a Web-Scale Corpus Valtteri Skantsi Veronika Laippala and Aku Kyroläinen (University of Turku Finland) 6. Towards 'Large and Tidy': Establishing Internal Structure in Mega-Corpora Axel Bohmann (University of Freiburg Germany) Part III. Social Media Discourse and Meanings 7. Multi-Modal Considerations for Social Media Discourse Analysis: A Specialised Corpus of Twitter Commentary on 'Working from Home' Christopher Fitzgerald (Mary Immaculate College Ireland) Geraldine Mark (Cardiff University Wales) Anne O'Keeffe (Mary Immaculate College Ireland) Dawn Knight (Cardiff University Wales) Justin McNamara (Mary Immaculate College Ireland) Svenja Adolphs (University of Nottingham England) Benjamin Cowan (University College Dublin Ireland) Tania Fahey Palma (University of Aberdeen Scotland) Fiona Farr (University of Limerick Ireland) and Sandrine Peraldi (University College Dublin Ireland) 8. Exploring Patterns of Self-Identification in the LGBTQ+ Reddit Corpus Laura Hekanaho (University of Helsinki/Tampere University Finland) Turo Hiltunen (University of Helsinki Finland) Minna Palander-Collin (University of Helsinki Finland); and Helmiina Hotti (University of Helsinki Finland) Index
Es gelten unsere Allgemeinen Geschäftsbedingungen: www.buecher.de/agb
Impressum
www.buecher.de ist ein Internetauftritt der buecher.de internetstores GmbH
Geschäftsführung: Monica Sawhney | Roland Kölbl | Günter Hilger
Sitz der Gesellschaft: Batheyer Straße 115 - 117, 58099 Hagen
Postanschrift: Bürgermeister-Wegele-Str. 12, 86167 Augsburg
Amtsgericht Hagen HRB 13257
Steuernummer: 321/5800/1497
USt-IdNr: DE450055826