Abstract
Ontology plays an important role in semantic Web technology since it can effectively represent the domain knowledge. We develop a novel framework for automatically generating the domain knowledge by analyzing different Web sites in a given domain. The idea of our approach is to consider two kinds of information from the Web sites. The first kind of information is the text fragments corresponding to the concepts in the ontology. The other kind of information is the header labels corresponding to the concepts. We design a method for generating the domain ontology by measuring the similarity between the concepts in different Web sites. We have conducted extensive experiments to demonstrate the effectiveness of our approach.
Get full access to this article
View all access options for this article.
