Economic and Social Research Council
This website will look much better in a web browser that supports web standards, but it is accessible to any browser or Internet device. Go to main content.
2007 Conference
2006 Conference
2005 Conference
2004 Conference

An Infrastructure for Linguistic Data on the Web

Scott Farrar, William D. Lewis, D. Terence Langendoen

Scott Farrar
University of Arizona, USA

William D. Lewis
University of Washington and California State University at Fresno, USA

D. Terence Langendoen
National Science Foundation, USA

Email address of corresponding author: langendt@u.arizona.edu (Langendoen)

The GOLD Community of Practice is proposed as a model for managing on-line linguistic data. The key components of the model include the linguistic data resources themselves and those focused on the knowledge derived from data. Data resources include the ever-increasing amount of linguistic field data and other descriptive language resources being migrated to the Web. The knowledge resources capture generalizations about the data and are anchored in the General Ontology for Linguistic Description, or GOLD. It is argued that such a model is in the spirit of the vision for a Semantic Web and, thus, provides a concrete methodology for rendering highly divergent resources interoperable. Furthermore, a methodology is given for creating specific communities of practice within the overall scientific domain of linguistics. Finally, a number of applications are proposed including those aimed at knowledge acquisition and those aimed at putting the knowledge to use.