We introduce lemonUby, a new lexical resource integrated into the Semantic Web. It is the result of converting data extracted from the existing large-scale linked lexical resource UBY to the lemon lexicon model. The following data from UBY were converted: WordNet, FrameNet, VerbNet, the English and German Wiktionary, the English and German entries of OmegaWiki, and the links between pairs of these lexicons at the word sense level (links between VerbNet and FrameNet, VerbNet and WordNet, WordNet and FrameNet, WordNet and Wiktionary, and WordNet and German OmegaWiki). We have linked lemonUby to other lexical resources and linguistic terminology repositories in the Linguistic Linked Open Data cloud, and we outline possible applications of this new dataset.