Via the twitterings of local hero @laureng:
https://direct.mit.edu/books/book/5244/The-Open-Handbook-of-Linguistic-Data-Management
This book is aMAZing. 56 open access chapters on data management in many linguistic disciplines. Documentation is well-represented! And they have a really nice website with an online course. [Update: Also available at https://linguisticdatamanagement.org/, thanks @laureng!]
Several local heroes on the very website you are currently perusing also have chapters (tell me if I missed anyone):
(And my better half may or may not have a chapter in here… by “may or may not” I mean “does”…(https://direct.mit.edu/books/book/5244/chapter/3537387/Data-Management-Practices-in-an-Ethnographic-Study) .))
Congrats to all y’all! My phone will be heavier after I have downloaded all these PDFs (and I won’t even complain that there are no HTML
versions. )
I was going to go through this list and point out the articles that looked particularly relevant to our community but… almost all of them are. (I wonder if we could get some of these folks to come and talk to us!)
- 1: Data, Data Management, and Reproducible Research in Linguistics: On the Need for The Open Handbook of Linguistic Data Management - Andrea L. Berez-Kroeker, Bradley McDonnell, Lauren B. Collister, Eve Koller
- 2: Situating Linguistics in the Social Science Data Movement - Lauren Gawne, Suzy Styles
- 3: The Scope of Linguistic Data - Jeff Good
- 4: Indigenous Peoples, Ethics, and Linguistic Data - Gary Holton, Wesley Y. Leonard, Peter L. Pulsifer
- 5: The Linguistic Data Life Cycle, Sustainability of Data, and Principles of Solid Data Management - Eleanor Mattern
- 6: Transforming Data - Na-Rae Han
- 7: Archiving Research Data - Helene N. Andreassen
- 8: Developing a Data Management Plan - Susan Smythe Kung
- 9: Copyright and Sharing Linguistic Data - Lauren B. Collister
- 10: Linguistic Data in the Long View - Laura Buszard-Welcher
- 11: Guidance for Citing Linguistic Data - Philipp Conzett, Koenraad De Smedt
- 12: Metrics for Evaluating the Impact of Data Sets - Robin Champieux, Heather L. Coates
- 13: The Value of Data and Other Non-traditional Scholarly Outputs in Academic Review, Promotion, and Tenure in Canada and the United States - Juan Pablo Alperin, Lesley A. Schimanski, Michelle La, Meredith T. Niles, Erin C. McKiernan
- 14: Managing Sociolinguistic Data with the Corpus of Regional African American Language (CORAAL) - Tyler Kendall, Charlie Farrington
- 15: Managing Data for Integrated Speech Corpus Analysis in SPeech Across Dialects of English (SPADE) - Morgan Sonderegger, Jane Stuart-Smith, Michael McAuliffe, Rachel Macdonald, Tyler Kendall
- 16: Data Management at the uOttawa Sociolinguistics Laboratory - Shana Poplack
- 17: Managing Legacy Data in a Sociophonetic Study of Vowel Variation and Change - James Grama
- 18: Managing Sociophonetic Data in a Study of Regional Variation - Valerie Fridland, Tyler Kendall
- 19: Data Management Practices in an Ethnographic Study of Language and Migration - Lynnette Arnold
- 20: Managing Conversation Analysis Data - Elliott M. Hoey, Chase Wesley Raymond
- 21: Managing Sign Language Data from Fieldwork - Nick Palfreyman
- 22: Managing Data in a Language Documentation Corpus - Christopher Cox
- 23: Managing Data for Writing a Reference Grammar - Nala H. Lee
- 24: Managing Lexicography Data: A Practical, Principled Approach Using FLEx (FieldWorks Language Explorer) - Christine Beier, Lev Michael
- 25: Managing Data from Archival Documentation for Language Reclamation - Megan Lukaniec
- 26: Managing Data for Descriptive and Historical Research - Don Daniels, Kelsey Daniels
- 27: Managing Historical Data in the Chirila Database - Claire Bowern
- 28: Managing Historical Linguistic Data for Computational Phylogenetics and Computer-Assisted Language Comparison - Tiago Tresoldi, Christoph Rzymski, Robert Forkel, Simon J. Greenhill, Johann-Mattis List, Russell D. Gray
- 29: Managing Computational Data for Models of Language Acquisition and Change - Matthew Lou-Magnuson, Luca Onnis
- 30: Managing Sign Language Acquisition Video Data: A Personal Journey in the Organization and Representation of Signed Data - Julie A. Hochgesang
- 31: Managing Acquisition Data for Developing Large Sesotho, English, and French Corpora for CHILDES - Katherine Demuth
- 32: Managing Phonological Development Data within PhonBank: The Chisasibi Child Language Acquisition Study - Yvan Rose, Julie Brittain
- 33: Managing Oral and Written Data from an ESL Corpus from Canadian Secondary School Students in a Compulsory, School-Based ESL Program - Philippa Bell, Laura Collins, Emma Marsden
- 34: Managing Second Language Acquisition Data with Natural Language Processing Tools - Scott A. Crossley, Kristopher Kyle
- 35: Managing Data Workflows for Untrained Forced Alignment: Examples from Costa Rica, Mexico, the Cook Islands, and Vanuatu - Rolando Coto-Solano, Sally Akevai Nicholas, Brittany Hoback, Gregorio Tiburcio Cano
- 36: Managing Transcription Data for Automatic Speech Recognition with Elpis - Ben Foley, Daan van Esch, Nay San
- 37: Managing Data and Statistical Code According to the FAIR Principles - Laura A. Janda
- 38: Managing Synchronic Corpus Data with the British National Corpus (BNC) - Stefan Th. Gries
- 39: Managing Data in Sign Language Corpora - Onno Crasborn
- 40: Managing Sign Language Video Data Collected from the Internet - Lynn Hou, Ryan Lepic, Erin Wilkinson
- 41: Managing Data from Social Media: The Indigenous Tweets Project - Kevin P. Scannell
- 42: Managing Semantic Norms for Cognitive Linguistics, Corpus Linguistics, and Lexicon Studies - Bodo Winter
- 43: Managing Treebank Data with the Infrastructure for the Exploration of Syntax and Semantics (INESS - Victoria Rosén, Koenraad De Smedt
- 44: Managing Data in a Formal Syntactic Study of an Under-Investigated Language (Uzbek) - Vera Gribanova
- 45: Managing Data for Theoretical Syntactic Study of Underdocumented Languages - Philip T. Duncan, Harold Torrence, Travis Major, Jason Kandybowicz
- 46: Managing Experimental Data in a Study of Syntax - Matthew Wagers
- 47: Managing Web Experiments for Psycholinguistics: An Example from Experimental Semantics/Pragmatics - Judith Degen, Judith Tonhauser
- 48: Managing, Sharing, and Reusing fMRI Data in Computational Neurolinguistics - Hiroyuki Akama
- 49: Managing Phonological Data in a Perception Experiment - Rory Turnbull
- 50: Managing Speech Perception Data Sets - Anne Cutler, Mirjam Ernestus, Natasha Warner, Andrea Weber
- 51: Managing and Analyzing Data with Phonological CorpusTools - Kathleen Currie Hall, J. Scott Mackie, Roger Yu-Hsiang Lo
- 52: Managing Phonological Inventory Data in the Development of PHOIBLE - Steven Moran
- 53: Managing Data in a Typological Study - Volker Gast, Łukasz Jędrzejowski
- 54: Managing Data for Descriptive Morphosemantics of Six Language Varieties - Malin Petzell, Caspar Jordan
- 55: Managing Data in TerraLing, a Large-Scale Cross-Linguistic Database of Morphological, Syntactic, and Semantic Patterns - Hilda Koopman, Cristina Guardiano
- 56: Managing AUTOTYP Data: Design Principles and Implementation - Alena Witzlack-Makarevich, Johanna Nichols, Kristine A. Hildebrandt, Taras Zakharko, Balthasar Bickel