Home
Science
I.T.
Arts

Computational Linguistics and Lexicography: English Monolingual General Purpose Doctionaries Freely Available on the Internet  


Abstract Category: Arts
Course / Degree: Modern Languages
Institution / University: Sacred Heart Catholic University, United Kingdom
Published in: 2004


Thesis Abstract / Summary:

Monolingual general purpose dictionaries have been chosen as a subject of this thesis because of their special position in all societies, and in the English-speaking community in particular. Owing to the numerous influences on the language in different parts of the world, English has probably the richest vocabulary of all languages, but it has never been possible to establish a standard. Italy and France have academies to provide guidance and authority in linguistic questions; English speakers have turned to dictionaries. Dictionaries have been important for keeping track of the development of the language, especially since English gained the status of a global language. Cyberspace is a conceptual place where people from all around the world meet to collaborate; to be able to communicate they need a common language: this language is English. Many people use the Internet in order to learn English: they need valuable reference works, and their free availability is essential especially for those who live in the less rich parts of the world.

In order to evaluate the dictionaries found during the meticulous Internet research over an extended period of time, we have first laid foundations for the methodology to be adopted. After a short explanation of the terms and description of the main steps in the development of English and American lexicography in chapters 1 and 2, in chapter 3 we have studied the main principles of lexicography, and analyzed in detail the structure of the printed monolingual dictionary. Particular attention was paid to the microstructure, especially to the composition of information within the entries because online dictionaries should provide at least the same information as their printed counterparts.

In chapter 4 we have looked at the all-embracing effect that the advent of computers has had on dictionary making. Large computer corpora have enabled lexicographers to take a completely different view on the language: instead of relying on their fine linguistic sensitivity only, they suddenly have quantitative evidence on how the language is actually used. Creating an adequate corpus is a very long and expensive process; consequently, standards of encoding have been created that allow exchange of data between different systems in order to put several resources together. In this chapter we have described main markup languages: SGML, XML, HTML, XHTML.

The macrostructure of an online dictionary described in chapter 5 is completely different from the printed one as it does not refer to an artifact, but rather to a virtual dictionary which only exists in a particular moment on the screen of a particular user, and which may be accessed in different ways. The main characteristic of hypertext is that it is not permanent: any user creates their own route to information; they are active in acquiring knowledge. Information retrieval facility is a distinguishing feature of electronic dictionaries and the possibility to add multimedia information is another special feature of electronic medium.

After having outlined a detailed typology of online dictionaries in chapter 6, in chapter 7 we have examined how the English monolingual general purpose dictionaries make use of the electronic medium in different ways and to a different extent. We have identified two kinds of monolingual general purpose dictionaries online: digitized printed dictionaries and original electronic dictionaries. We have created a table of parameters for equal formal validation and validation of content of the dictionaries and listed their main features in order to draw a conclusion about the usefulness of the English monolingual general purpose dictionaries freely available on the Internet.

The future of the publishing industry has been frequently questioned after the advent of the Internet. The financial part of the enterprise plays an important role in the development of original dictionaries, and has crucial implications for the future of the dictionary making. The ideal situation would be close collaboration between Natural Language Processing researchers and lexicographers to produce electronic dictionaries which would have both, high lexicographic quality and sophisticated computing functions.


Thesis Keywords/Search Tags:
computational linguistics, lexicography, natural language processing, corpus linguistics, dictionaries online, markup language, typology, thesaurus, monolingual

This Thesis Abstract may be cited as follows:
No user preference. Please use the standard reference methodology.


Submission Details: Thesis Abstract submitted by Jitka Horcickova from United Kingdom on 15-Apr-2006 14:00.
Abstract has been viewed 4612 times (since 7 Mar 2010).

Jitka Horcickova Contact Details: Email: jitkahorcickova@yahoo.co.uk



Disclaimer
Great care has been taken to ensure that this information is correct, however ThesisAbstracts.com cannot accept responsibility for the contents of this Thesis abstract titled "Computational Linguistics and Lexicography: English Monolingual General Purpose Doctionaries Freely Available on the Internet". This abstract has been submitted by Jitka Horcickova on 15-Apr-2006 14:00. You may report a problem using the contact form.
© Copyright 2003 - 2024 of ThesisAbstracts.com and respective owners.


Copyright © Thesis Abstract | Dissertation Abstracts Thesis Library 2003-2024.
by scope.com.mt @ website design