sitemap VIII Encontro de Linguística de Corpus
 
 
 

Dia 13 de novembro 2009

8:15 – 9:30 Conferência de abertura: Steven Bird (University of Melbourne, Austrália; University of Pennsylvania, USA)

Corpus Linguistics and Language Preservation


  1. There is a pressing need to document the world's linguistic heritage

  2. while there is still time. The consequence of language shift is that

  3. many genres -- and many whole languages -- are quickly falling out of

  4. use. Digital technologies speed up the task, yet the work is not

  5. covering a sufficient number of languages, in sufficient depth, at a

  6. sufficient rate. What would it take to compile a million-word corpus

  7. consisting of speech recordings and transcriptions, for 5,000

  8. languages within the space of a decade?


  9. This presentation will describe a new approach to corpus creation

  10. called "basic oral language documentation" (BOLD).  I will describe

  11. BOLD and report on a pilot study with Usarufa, a moribund language

  12. spoken by approximately 1200 people in the Eastern Highlands Province

  13. of Papua New Guinea. Local literacy teachers were trained in the use

  14. of digital voice recorders for capturing linguistic events, and then

  15. adding spoken transcriptions and interpretations into a language of

  16. wider communication.  I will describe a variety of technical and

  17. sociological challenges, and speculate on how the BOLD methodology

  18. could be adopted for preserving the languages of Brazil.


  19. The presentation will also describe the Open Language Archives

  20. Community (OLAC), an international partnership of institutions and

  21. individuals who are creating a worldwide virtual library of language

  22. resources by: (i) developing consensus on best current practice for

  23. the digital archiving of language resources, and (ii) developing a

  24. network of interoperating repositories and services for housing and

  25. accessing such resources.  The software services of OLAC will be

  26. demonstrated, with an emphasis on the languages of Brazil.


 

Plenárias