linguana.net home | resources | docs | discussion | database | about



NOTE: The Linguana project is defunct as of early 2003. My efforts to secure the disk space required were not successful, and development has moved into private projects. Sometime in 2004, these will be opened for contribution and community development. Until then ...

The Linguana project provides a forum for Open Source developers to discuss and collaborate on systems for the processing of Natural Language. At present, the principle effort of the project is to enable joint contribution to a large psycholinguistic lexicon, based on Wordnet, and available through this website. The goal of this work is to combine the efforts of the thousands of persons on the Internet who desire to take part in the evolution of a controlled, quality lexicon for use in a miriad of applications.

PLEASE NOTE: Currently, the project is in a planning stage, although the new lexicon interface is nearly complete. Please subscribe to the mailing lists to help.

The Linguana Lexicon Interface

There is room for contribution from programmers and non-programmers alike. The development goals include:

  • Define a standard storage for the lexicon, or a storage-independent way of storing and accessing the content.
  • Define a standard schema for the lexicon, which could be adopted across files and RDBs.
  • Define an accompanying XML DTD or schema for the storage of data and its remote transfer.
  • Define a process for the addition of data types, fields, and so on.
  • Create an XML transport for querying the database.
  • Create a permissions and revision control system for the changes to the database.
  • Create an interface to the database and its administration.
  • Create tools for exchanging and merging database changes

For non-programmers, the lexicon needs quite a bit of attention:

  • Editing of the database entries
  • Addition of new database entries
  • Creation of new databases links for topic association, part relations, noun attributes and functions
  • Addition of new database fields for the storage of dismabiguation data, pronunciation data, images, 3D models, web links, and other content, and populating those fields with content



All materials are the copyright of their owners and submitters.
Linguana Copyright © 2001, 2002 by Dan Brian. Wordnet Copyright © 2001, 2002 by Princeton University. All rights reseved.