An Infrastructure for Linguistic Data on the Web
Scott Farrar, William D. Lewis, D. Terence Langendoen
Scott Farrar
University of Arizona, USA
William D. Lewis
University of Washington and California State University at Fresno, USA
D. Terence Langendoen
National Science Foundation, USA
Email address of corresponding author: langendt@u.arizona.edu (Langendoen)
The GOLD Community of Practice is proposed as a model for managing on-line linguistic data. The key components of the model include the linguistic data resources themselves and those focused on the knowledge derived from data. Data resources include the ever-increasing amount of linguistic field data and other descriptive language resources being migrated to the Web. The knowledge resources capture generalizations about the data and are anchored in the General Ontology for Linguistic Description, or GOLD. It is argued that such a model is in the spirit of the vision for a Semantic Web and, thus, provides a concrete methodology for rendering highly divergent resources interoperable. Furthermore, a methodology is given for creating specific communities of practice within the overall scientific domain of linguistics. Finally, a number of applications are proposed including those aimed at knowledge acquisition and those aimed at putting the knowledge to use.
