archiving

Changing the Conduct of Science in the Information Age

The National Science Foundation has posted a workshop report entitled Changing the Conduct of Science in the Information Age. While it doesn't appear to contain direct input from linguists, many of the issues it discusses will be familiar to those interested in promoting a cyberlinguistics infrastructure.

From the executive summary:

RELISH-Symposium „Rendering Endangered Lexicons Interoperable through Standards Harmonization”, Frankfurt, October 10, 2011 “RELISH meets LOEWE”

The RELISH project promotes language-oriented research by addressing a two-pronged problem: (1) the lack of harmonization between digital standards for lexical information in Europe and America, and (2) the lack of interoperability among existing lexicons of endangered languages, in particular those created with the Shoebox lexicon building software. The cooperation partners in the RELISH project are the University of Frankfurt (FRA), the Max Planck Institute for Psycholinguistics (MPI), and Eastern Michigan University, the host of the Linguist List (ILIT).

Access to lexical databases: discussion

Claire Bowern has started a discussion on her blog, Anggarrgoon, about access to aggregated lexical data: how to protect the rights of the various stake holders while encouraging as much sharing as possible. I enjoyed her tongue-in-cheek suggestion that linguist-contributors should, in game-theoretic fashion, get access to data in proportion to the data they share.

“Open and shut: Digital repatriation and the circulation of indigenous knowledge”

Kimberly (“Kim”) Christer (Department of Critical Culture, Gender and Race Studies, Washington State University), an anthropologist by training, presented “Open and shut: Digital repatriation and the circulation of indigenous knowledge” on 4-14-11 at the University of Washington, Seattle campus. With the support of NEH, she is working on the development of the “Mukurtu software tool” (http://www.mukurtuarchive.org/documentation.html), a user-customizable tool for the creation of archives.

Data provenance and data aggregation

Peter Austin, over at Endangered Languages and Cultures, has initiated a discussion on citation practices (with James McElvenny also participating), and it was prompted (at least partly) by some data I have had a role in processing as part of the LEGO project.

Digital repatriation

Tomorrow Kimberly Christen (WSU) will give a talk at the University of Washington on the Mukurtu Indigenous Archive Tool and repatriation of indigenous knowledge in digital form.

Abstract, from the UW Simpson Center for the Humanities calendar:

Beyond the PDF?

While looking for something on this blog http://cameronneylon.net/category/blog/ (which I recommend in general), I stumbled on the fact that an interesting workshop recently took place entitled Beyond the PDF. The workshop goal is described as follows:

A Grand Challenge for Linguistics: Scaling Up and Integrating Models

In response to NSF's call for White Papers in the SBE 2020 Initiative, Jeff Good and I have submitted a paper outlining our take on Cyberinfrastructure for Linguistics, why its necessary, and how it can come about. The abstract:

The Endangered Languages Archive (ELAR) at SOAS

The Endangered Languages Archive (ELAR) at SOAS preserves and disseminates digital documentation of endangered languages around the world, especially (but not limited to) the outcomes of ELDP-funded projects. ELAR's recently re-launched website is designed specifically to suit the needs of endangered languages archiving, using "Web 2.0" methods to implement a nuanced access control system and make the site user-friendly for a range of audiences.

Abney & Bird's Grand Challenge: The Human Language Project

Steven Abney and Steven Bird published a provocative paper (.pdf) at ACL 2010 calling on the computational linguistics community to work to create a "Universal Corpus", an undertaking that they compare in both scale and potential impact to the Human Genome Project. Here is the abstract:

Syndicate content
Powered by Drupal, an open source content management system
foo