I recently attended a symposium titled Models for capacity development in language documentation and conservation hosted by ILCAA at the Tokyo University of Foreign Studies. The symposium brought together a group of people involved in supporting language work in the Asia-Pacific region in various ways (see the website for a full list): academic (Institute of Linguistics, Minhsiung, Taiwan, Beijing, China, Goroka, PNG, Batchelor, Australia, Bangkok, Thailand) and community-based (Manokwari, West-Papua; Tshanglalo, Bhutan; Bhasha Research Centre and Adivasi Academy, Gudjarat, India; Miromaa, Australia), using film (Sorosoro, France), or archiving language records (PARADISEC). The aim of the meeting was to build a network that would continue to link between training activities to support language work, the Consortium on Training in Language Documentation and Conservation (CTLDC), whose planning group members are listed here.
Every time I revisited my fieldsite I was asked for copies of photos or recordings and I wanted some way that these could be accessed without me having to be present. When I started visiting Erakor village in central Vanuatu there was intermittent electricity available, usually only in the evenings in the house I lived in.
The problem: you have text files and audio files, but the text files are not aligned to the audio files.
I imagine there are a few readers out there who have transcriptions of audio files that never made it past an utterance per line text file. This is a post for you, if you'd like to know how to import and time-align those files in ELAN.
A lot of work has been happening at the University of Sydney over the past six months, and at the end of last year the top floor of the Transient Building, which houses Linguistics, Paradisec and a few other offices, got renovated. Unfortunately, since the entire exterior of the building is composed of fibrous asbestos, it's unlikely that the University will outlay the mammoth insurance costs to do any exterior work. But anyone who knows the Transient building knows that the best option would be to demolish the whole thing and start again from scratch.
Last Friday was a bit of a milestone for me, since, in the 6 or so months that I have been involved in the audio preservation side of things at PARADISEC, I hadn't yet actually cleaned a damaged audio tape. Unfortunately for me, the process isn't quite as straight-forward as it is for a CD - warm soapy water, a non-abrasive cloth, wipe across the grain - rather, the entire process can take weeks, depending on how badly affected the tapes are.
Regular summary of PARADISEC’s ever growing digital repository of sound and video recordings, images and text files, currently totalling 3,003 items representing 54 countries and 598 languages.
Regular summary of PARADISEC’s ever growing digital repository of sound and video recordings, images and text files, currently totalling 2,779 items representing 54 countries and 593 languages.
Dear ELAN Workshop attendees, and anyone who might find this of interest,
There were a few loose ends left at the end of the ELAN workshop last week. I'd particularly like to address one, the question as to whether we should aim for a standard set of ELAN templates which everyone uses.
I wandered into the office today to see Jane and Mark with a large map of part of the northern territory rolled out on the floor, discussing the issue of iso-glosses, and boundaries. Maps maps maps. They're just everywhere at the moment!
Biwa is a generic term for long-necked, plucked lutes in Japan (written with the same characters as the Chinese pipa, also a plucked lute). Almost all biwa traditions involve oral narrative or poetic recitation with biwa accompaniment. Frank Davey, PARADISEC’s Audio Preservation Officer, who has been carefully digitising and listening to Hugh de Ferranti’s vast collection of field recordings of various biwa traditions (HDF1) over the past few months, describes the music as having a raw, powerful quality that speaks directly to the listener’s emotions. I asked Hugh about his long-standing involvement with the biwa tradition.
Our December conference is almost full, so if you were thinking of coming along, now is the time to register! The preliminary schedule is up, papers have been reviewed, everything is going along nicely (touch wood).
The third day of the conference is a workshop, with sections on audio and video recording, transcribing and managing your data, and producing outputs from this data. If this is more your thing you can come to just that. If you're interested in ELAN for transcribing or shoebox/toolbox, I thoroughly recommend it, but there'll be plenty of other useful stuff.
The Australian Research Council's website today has survived the pressure of everyone wanting to know whether they've got winning tickets. I was in a few syndicates (PARADISEC, continuing the Aboriginal Child Language Acquisition (ACLA project), and a new project on Indonesian). And the lucky winners are...
Regular summary of PARADISEC’s ever growing digital repository of sound and video recordings, images and text files, currently totalling 2,732 items representing 54 countries and 593 languages.
The preliminary schedule for the conference "Sustainable data from digital fieldwork: from creation to archive and back" is now up. There looks to be some really interesting projects on display. I had a sneak peek at EOPAS, a project to create a workflow and display interlinearised texts, and annodex, a project to display multiple streams of visual, audio and textual data, both of which look great. I'll also be talking about the FieldHelper tool I've been working on this year, a tool to add in the tagging of arbitrary metadata to field work data, amongst other things.
Our registration quota of 40 places is fast filling up. Please register now if you wish to come, also note that you can choose to come to the third day workshop if your interest in more in practical experience with current digital field work tools.
RNLD in collaboration with the conference "Sustainable data from fieldwork" is offering a day-long session on the creation, organisation, annotation and display of digital media. I highly recommend this to anyone interested in making digital recordings and annotating them. If you're new to shoebox or ELAN and have any questions about using it, and you have your own data, then bring along your laptop. The workshop will be held at Sydney University on Wednesday, December 6, 2006.
Read on for the specifics
There's an interesting post on slashdot today, on a product that will geo-tag your photos. Geo-tagging a photo means recording some geographic information at the time you take your photo, typically the longitude and latitude.
At first glance I thought it might be another on of these data-loggers, but actually, with a minor addition, it's a pretty nifty bit of hardware.
Is it possible to reduce the intrusiveness of video taping someone?
Before I launch into this... let me just say: "flashing lights and ethical alarm bells!". What I'm going to talk about is the paradox of fully informing your informants that you're going film them, and then trying your hardest to seem like you're not there!
Regular summary of PARADISEC’s ever growing digital repository of sound and video recordings, images and text files, currently totalling 2,696 items representing 54 countries and 600 languages.
In the field last year I meticulously gathered photos with audio recordings of many plants in the area I was working in PNG. I certainly don't like creating lexicon entries all with a gloss of "tree/plant species" and I figured in this digital age, including a picture and audio recording of each plant was one way of increasing the identifiability of each plant (and animal... but they're not so photogenic). Pictures are a much more salient identifier for speakers of the language than anything else. Never-the-less, scientific name are a good universal identifier for a plant, but they're hard to get if you don't have a botanist with you.
So earlier this year I sat down with Barry Conn at the National Herbarium of New South Wales to discuss interdisciplinary work between linguists and botanists. One of my questions was "what does a linguist need to do in the field to get a plant identified?".
Here are some of my notes from the meeting, with some comments from Barry:
I'm sure we've all done it from time to time: somehow, despite carefully trying to do something else altogether, we delete a critical and unique recording on our flash recorder... never to be heard again.
But all is not lost, in fact its often really quite simple to get it back... but only if you've taken the necessary precautions.
Many academic disciplines depend on analysis of primary data captured during fieldwork. Increasingly, researchers today are using digital methods for the whole life cycle of their primary data, from capture to organisation, submission to a repository or archive, and later access and dissemination in publications, teaching resources and conference presentations. This conference and workshop will showcase a number of projects that have been developing innovative and sustainable ways of managing such data.
Thanks to Linda and Frank for a very interesting workshop last Friday. I picked up some great tricks on using transcriber and audacity. Perhaps the best trick I learnt was how to remove cicada noise from my PNG recordings.
Maybe you already knew how to do this (or maybe you just fiddle with the EQ), but I always thought this was in the realm of Hollywood fantasy (like when the forensics guy magically "enhances" a grainy image to reveal the killer's face in a thriller). Removing the noise allows me to more easily transcribe a busy recording, and seeing as insect noise was pretty constant and loud through all my recordings, this is quite handy. There are some caveats, but here's how to do this using freely available, cross platform software.
Pacific and Regional Archive for Digital Sources in Endangered Cultures / Sydney Humanities and Social Sciences e-Research Initiative Workshop
Presenters: Dr Linda Barwick, Director, PARADISEC and Frank Davey, Audio Preservation Officer, PARADISEC.
A free workshop covering: the range of research applications for recording and analysis of digital audiovisual media; questions of sustainability and archiving of audiovisual data; tools and resources for archiving, analysis and presentation of digital audio; the role of recordings in humanities disciplines; and using audio recordings in presentations and teaching. Includes hands-on sessions using Audacity sound editing software and Transcriber speech annotation software.
- Amanda Harris
- Aidan Wilson
- Hilario de Sousa
- Ian Smith
- Joe Blythe
- Jane Simpson (This is a multi-authored blog, and the views expressed are those of the authors, not of PARADISEC or the University of Sydney. If you'd like to contribute, please let us know!)
- James McElvenny
- Linda Barwick (PARADISEC)
- Nick Thieberger (PARADISEC)
- Bill Foley
About the Blog
Recently commented on
- Deidre Ikin on Black Day for Indigenous Languages in NT - Felicity Meakins
- Teagan Ness on Wiradjuri language revival
- Peter Austin on Small and strong - Peter K. Austin
- Tim Brookes on Endangered genres - Peter K. Austin
- Claire on Small and strong - Peter K. Austin
- Andrew Garrett on Small and strong - Peter K. Austin
- Libby Edwards on Wiradjuri language revival
- Andz on Streaming access to transcribed media - from Nick Thieberger, University of Melbourne
- nitGreen on Sovereignty over languages and land
- gfsoulstaff on Sovereignty over languages and land
- Archiving (44 entries)
- Fieldwork (56 entries)
- General News (69 entries)
- Indigenous Australia News (65 entries)
- Indigenous language education (56 entries)
- Linguistics (125 entries)
- Murriny Patha Song Project (2 entries)
- Music (10 entries)
- PARADISEC (25 entries)
- RNLD (10 entries)
- October 2013
- February 2011
- January 2011
- December 2010
- November 2010
- October 2010
- September 2010
- August 2010
- July 2010
- June 2010
- May 2010
- April 2010
- March 2010
- February 2010
- January 2010
- December 2009
- November 2009
- October 2009
- September 2009
- August 2009
- July 2009
- June 2009
- May 2009
- April 2009
- March 2009
- February 2009
- January 2009
- December 2008
- November 2008
- October 2008
- September 2008
- August 2008
- July 2008
- June 2008
- May 2008
- April 2008
- March 2008
- February 2008
- January 2008
- December 2007
- November 2007
- October 2007
- September 2007
- August 2007
- July 2007
- June 2007
- May 2007
- April 2007
- March 2007
- February 2007
- January 2007
- December 2006
- November 2006
- October 2006
- September 2006
- August 2006
- July 2006
- June 2006
- Consortium on Training in Language Documentation and Conservation (CTLDC)
- How can we get the material we have used in our research back to the people we recorded?
- How to import a basic transcript into ELAN
- Renovations, Repairs and Repositories
- PARADISEC Open Day
- 'Baking Tapes' or Analogue Audio Restoration
- PARADISEC News and Repository Report
- PARADISEC News and Repository Report
- Suzzy Data Workshop - Guest blogger Bruce Birch
- Vectors, aboriginal kitsch and isoglosses
Writing systems and languages of the world
Linguistics news and commentary
Linguistics news and commentary on a range of languages
Blog with news items and discussion of endangered languages
Notices of recent work on the Indigenous languages of Australia
That Munanga linguist
Community linguist blog
Claire Bowern's linguistics and fieldwork blog
A group blog on Anthropology
Language on the Move
Intercultural communication and multilingualism
Culture matters: applying anthropology
Australian anthropology blog: postgraduates and staff
ethnography and anthropology blog - including about Australia
Blog on Australian linguistics, fieldwork, politics and the environment.
Group blog on language and linguistics
The E-MELD School of Best Practices in Digital Language Documentation
Website in Swedish with links to sites on and in many languages
Hans Rausing Endangered Languages Project: Language Documentation: What is it?
Information on equipment, formats, and archiving, and examples of documentation
Indigenous Peoples Issues & Resources
a worldwide network of organizations, academics, activists, indigenous groups, and others representing indigenous and tribal peoples
Technology-enhanced language revitalization
Include ILAT (Indigenous Languages and Technology) discussion list.
Information on the people of Kamchatka
Linguistic fieldwork preparation: a guide for field linguists
syllabi, funding, technology, ethics, readings, bibliography
Papua New Guinea Language Resources
Phonologies, grammars, dictionaries, literacy, language maps for many PNG languages
Resource network for linguistic diversity
Networking practitioners working to record,retrieve & reintroduce endangered languages
child language acquisition in three Australian Aboriginal communities
The Digital Endangered Languages and Musics Archives Network
The Pacific And Regional Archive for Digital Sources in Endangered Cultures
Murriny-Patha Song Project
Documenting the language and music of public songs and dances composed and performed by Murriny Patha-speaking people
The Project for Free Electronic Dictionaries
Endangered language documentation and archiving, funded by the Volkswagen Foundation and sponsored by the Max Planck Institute, Nijmegen.
Documenting endangered languages at the University of Sydney
Exploring methods and technology for streaming media and interlinear text