> August 2010 - Transient Languages & Cultures
business learning training articles new learning business training opportunities finance learning training deposit money learning making training art loan learning training deposits make learning your training home good income learning outcome training issue medicine learning training drugs market learning money training trends self learning roof training repairing market learning training online secure skin learning training tools wedding learning training jewellery newspaper learning for training magazine geo learning training places business learning training design Car learning and training Jips production learning training business ladies learning cosmetics training sector sport learning and training fat burn vat learning insurance training price fitness learning training program furniture learning at training home which learning insurance training firms new learning devoloping training technology healthy learning training nutrition dress learning training up company learning training income insurance learning and training life dream learning training home create learning new training business individual learning loan training form cooking learning training ingredients which learning firms training is good choosing learning most training efficient business comment learning on training goods technology learning training business secret learning of training business company learning training redirects credits learning in training business guide learning for training business cheap learning insurance training tips selling learning training abroad protein learning training diets improve learning your training home security learning training importance

« July 2010 | Blog home | September 2010 »

August 2010

If you are interested in Indigenous education in Australia or what happens when Governments get worried about minority groups not reading and writing the dominant language, check out Prioritising Literacy and Numeracy: A strategy to improve literacy and numeracy outcomes 2010–2012. Darwin: Northern Territory Government Department of Education and Training. There are some good things in it, but there are some worrying things. Take this paragraph:

"The language and cognitive skills domain includes basic literacy; basic numeracy; interest in literacy, numeracy and memory; and advanced literacy. The percentage of Northern Territory children vulnerable and at risk in the Language and cognitive skills domain at the commencement of full-time schooling is significantly greater than the national average, as indicated below." (p.6)

Now it may be that the NT has lots of children from all backgrounds who are at risk. But I bet this is code for "Indigenous children". Looking further - what is literacy? Literacy=English literacy. How are the cognitive skills tested? Almost certainly in English. This calls into question the reliability of the information on which this claim is made.

A lot of people in the NT are worried about the NT Government's approach to Indigenous education - Australian Society for Indigenous Languages (AuSIL) , Association of Teachers of English to Speakers of Other Languages, NT branch (ATESOL NT) , Uniting Church in Australia, Northern Synod, Darwin Anglican Church of Australia, Diocese of the Northern Territory, Darwin and the Top End Linguistics Circle (TELC). They've got together to sponsor a seminar. So, if you're in Darwin on Thursday 9 September, hop along to Indigenous languages in education Do current policies match our needs?

7:30pm, Thursday, 9 September 2010
Mal Nairn Auditorium,
Charles Darwin University,
Casuarina Campus
Contact: Phil Glasgow, 8931-3133

2 comments |

I've been meaning to express my love and gratitude for the excellent Hugo Schuchardt Archiv at the Uni Graz for a while now. I was thinking of maybe saying a little something about Schuchardt for his birthday or Todestag, but the dates passed and in any case I come to exhume Schurchardt, not to praise him.

You can read all about Schuchardt yourself at the archive. There's freely accessible scans of all his published works, a growing full-text searchable database of some of the correspondence he received, some secondary materials, and pointers to further resources. More online archives like this would be great!

It was sad and moving to read in the Tennant and District Times (Vol.33 No. 26 23 July 2010 p.8), of the tribute concert for the Alekarenge/Ali Curung singer and songmaker, B. Murphy of Band Nomadic, who recently died in Adelaide. Too young. Their most recent album Freedom Road [1] has been shortlisted for the 2010 Indigenous Music Awards. Too late.

The songs on the web are in English and Warlpiri, but apparently Murphy wrote in Kaytetye and Alyawarr as well. You can hear Freedom Road, Stolen Generations, Mungamunga (I think this is Munga Munda, i.e. Mangkamarnda, old Phillip Creek Mission), Kurdu, In and out of prison, Love will never die, and Jiparunpa on the Freedom Road album site here. You can hear Kurdu, Drink-Drive and Freedom Road on the Winanjjikari podcast site here. On Band Nomadic's MySpace page are Munga Munda (with Warlpiri lyrics), and LarnirliLu Parngka Jarlngk (about first contact - spelling is haywire, but it is wonderful that so much Warlpiri language could survive the onslaught of more than fifty years of monolingual English education at Ali Curung school).

In an interview, Murphy says he started the band with a couple of blokes he met in gaol. 'Freedom road' makes most painfully clear the ricochet between gaol and freedom, between exile and homeland, between family and grog, between doing right and going wrong, between fleeting delight and the bleak assurance that you will lose everything that matters. The only 'baby' in these songs is 'run baby run/they're comin to get you baby' in the 'Stolen Generations 'song. The last song on the website of Freedom Road, Jiparunpa (Jiparanpa) is a fusion of Murphy's modern vocals with women singing traditional songs (cf Myf's post), probably linked to the remote place Jiparanpa. Jiparanpa is the traditional country of some Warlpiri living at Ali Curung. It's too far to get to, and has become the homeland of dreams, of imagination, of the golden past and the unattainable future; it's the end of the freedom road.

Travel safely, travel lightly, wiyarrpa.


Peter K. Austin
Linguistics Department, SOAS
8th August 2010

Forty-five years ago the annual fieldwork reports of some of the researchers funded by the then Australian Institute of Aboriginal Studies (now AIATSIS) included specifications of how much research had been completed in terms of the number of feet of tapes that had been recorded during the project year ("this year was especially productive with 45 feet 3 inches of tape being recorded"). The modern measure of this kind of quantitative nonsense is the number of gigabytes of digital files (soon to be terabytes) created by the researcher. Don't mind the quality, it's the length/bytes that count.

My colleague David Nathan, Director of the Endangered Languages Archive (ELAR) at SOAS, has been approached on several occasions by researchers (both those funded by ELDP and those not (yet)) asking how much data they would be allowed to deposit in the archive. "Would it be OK if I deposit 500 gigabytes of data?" they ask. When you think about it for a moment or two, this is a truly odd request, but one driven by part of what David (in Nathan 2004, see also Dobrin, Austin and Nathan 2007, 2009) has termed "archivism". This is the tendency for researchers to think that an archive should determine their project outcomes. Parameters stated in terms of audio resolution and sampling rate, file format, and encoding standards take the place of discussions of documentation hypotheses, goals, or methods that are aligned with a project's actual needs and intentions. David's response to such a question is usually: if the material to be deposited is "good quality" (stated in terms of some parameters (not volume!) established by the project in discussion with ELAR) then the archive will be interested in taking it.

Another quantity that comes up in this context (and in the context of grant applications as well) is the statement that "10% of the deposited archival data will be analysed". The remainder of the archive deposit will be, in the worst case, a bunch of media files, or in the best case, media files plus transcription (and/or translation). Where does this magical 10% come from? It seems to have originated around 10 years ago with the DOBES project which established a set of guidelines for language documentation during its pilot phase in 2000. As Wittenburg and Mosel (2004:1) state:

"During a pilot year intensive discussions ... took place amongst the participants. The participants agreed upon a number of basic guidelines for language documentation projects. ... For some material a deep linguistic analysis should be provided such that later researchers will be able to reconstruct the (grammar of the) language"

Similarly, the guidelines for ELDP grant applications (downloadable here) include the following:

"Note that audio and video are not usable, accessible or archivable without accompanying textual materials such as transcription, annotation, or notes about content and participants. While you are encouraged to transcribe and annotate as much of the material as possible, we recognise that this is very time-consuming and you may not be able to do this for all recorded materials. However, you must provide some text indication of the content of all recordings. This does not have to be the linguistic content and could include, for example, description of the topics or events (e.g. names of songs), or names of participants, preferably with time alignment (indication of where they occur in the recording)."

No actual figure is given of how much "some material" (for DOBES) or "as much of the material as possible" (for ELDP) amounts to. In earlier published versions of advice to applicants both DOBES and ELDP did mention 10%.

Interestingly, Wittenburg (2009, slide 34) has done an analysis of the language documentation data collected by DOBES projects between 2000 and 2009, and he notes that the average project team has recorded 131 hours of media (59 hours of audio, 72 hours of video), transcribed 50 hours of this, and translated 29 hours. Linguistic analysis on average exists for 14 hours of recordings -- strikingly this is exactly 10.68% of the average corpus!!

How much of the corpus needs to be linguistically annotated so that "later researchers will be able to reconstruct the (grammar of the) language" or indeed so that the rest of the corpus can be parsed? Well, it depends on a range of factors, including the nature of the language(s) being documented. Some Austronesian languages, like Sasak or Toratan, have relatively little morphology with pretty straightforward morpho-phonemics of such morphology that does exist, and so a relatively small amount of morpheme-by-morpheme glossed materials in conjunction with a lexicon would enable users to bootstrap the morphological analysis of other parts of a transcribed corpus in those languages. Other languages, like Athapaskan tongues with their fiendishly complex verb morphology, might need more annotated data to help the user deal with the whole corpus.

This is however an empirical question, and one that to my knowledge has not been addressed so far. There are now a number of documentary corpora available, with more coming on stream, and it should be possible to establish whether the "magical 10%" is a real goal to be aimed for, or just a figure that researchers have created and continue to repeat to one another.

3 comments |

The Authors

About the Blog

The Transient Building, symbolising the impermanence of language, houses both the Linguistics Department at Sydney University and PARADISEC, a digital archive for endangered Pacific languages and music.


Papua New Guinea FAQs from Eva Lindstrom Papua New Guinea (New Ireland): Eva Lindstrom's tips for fieldworkers

Australian Languages Answers to some frequently asked questions about Australian languages

Papua Web Information network on Papua, Indonesia (formerly Irian Jaya)

Hibernating blogs

Indigenous Language SPEAK

Langguj gel Australian linguistics and fieldwork blog

Interesting Blogs

Omniglot Writing systems and languages of the world

LingFormant Linguistics news

Language hat Linguistics news and commentary

Jabal al-Lughat Linguistics news and commentary on a range of languages

Living languages Blog with news items and discussion of endangered languages

OzPapersOnline Notices of recent work on the Indigenous languages of Australia

That Munanga linguist Community linguist blog

Anggarrgoon Claire Bowern's linguistics and fieldwork blog

Savage Minds A group blog on Anthropology

Fully (sic)

Language on the Move Intercultural communication and multilingualism

Talking Alaska: Reflections on the native languages of Alaska

Culture matters: applying anthropology Australian anthropology blog: postgraduates and staff

Long Road ethnography and anthropology blog - including about Australia

matjjin-nehen Blog on Australian linguistics, fieldwork, politics and the environment.

Language Log Group blog on language and linguistics


E-MELD The E-MELD School of Best Practices in Digital Language Documentation

Tema Modersmål Website in Swedish with links to sites on and in many languages

Hans Rausing Endangered Languages Project: Language Documentation: What is it? Information on equipment, formats, and archiving, and examples of documentation

Indigenous Peoples Issues & Resources a worldwide network of organizations, academics, activists, indigenous groups, and others representing indigenous and tribal peoples

Technorati Profile

Technology-enhanced language revitalization Include ILAT (Indigenous Languages and Technology) discussion list.

Endangered languages of Indigenous Peoples of Siberia

Koryak Net Information on the people of Kamchatka

Linguistic fieldwork preparation: a guide for field linguists syllabi, funding, technology, ethics, readings, bibliography

On-line resources for endangered languages

Papua New Guinea Language Resources Phonologies, grammars, dictionaries, literacy, language maps for many PNG languages

Resource network for linguistic diversity Networking practitioners working to record,retrieve & reintroduce endangered languages


ACLA child language acquisition in three Australian Aboriginal communities

DELAMAN The Digital Endangered Languages and Musics Archives Network

PARADISEC The Pacific And Regional Archive for Digital Sources in Endangered Cultures

Murriny-Patha Song Project Documenting the language and music of public songs and dances composed and performed by Murriny Patha-speaking people

PFED The Project for Free Electronic Dictionaries

DOBES Endangered language documentation and archiving, funded by the Volkswagen Foundation and sponsored by the Max Planck Institute, Nijmegen.

DELP Documenting endangered languages at the University of Sydney

Ethno EResearch Exploring methods and technology for streaming media and interlinear text