Please use this identifier to cite or link to this item: http://localhost:8080/xmlui/handle/123456789/2643
Full metadata record
DC Field | Value | Language
dc.contributor.author | Ibrahim, Mohammed | -
dc.contributor.author | Gauch, Susan | -
dc.contributor.author | Salman, Omar | -
dc.contributor.author | Alqahtani, Mohammed | -
dc.date.accessioned | 2022-10-17T22:27:27Z | -
dc.date.available | 2022-10-17T22:27:27Z | -
dc.date.issued | 2021-08-09 | -
dc.identifier.uri | http://localhost:8080/xmlui/handle/123456789/2643 | -
dc.description.abstract | Background: Clear language makes communication easier between any two parties. A layman may have difficulty communicating with a professional due to not understanding the specialized terms common to the domain. In healthcare, it is rare to find a layman knowledgeable in medical terminology, which can lead to poor understanding of their condition and/or treatment. To bridge this gap, several professional vocabularies and ontologies have been created to map laymen medical terms to professional medical terms and vice versa. Objective: Many of the presented vocabularies are built manually or semi-automatically, requiring large investments of time and human effort and consequently slowing the growth of these vocabularies. In this paper, we present an automatic method to enrich laymen’s vocabularies that can be applied to vocabularies in any domain. Methods: Our entirely automatic approach uses machine learning, specifically Global Vectors for Word Embeddings (GloVe), on a corpus collected from a social media healthcare platform to extend and enhance consumer health vocabularies. Our approach further improves the consumer health vocabularies by incorporating synonyms and hyponyms from the WordNet ontology. The basic GloVe and our novel algorithms incorporating WordNet were evaluated using two laymen datasets from the National Library of Medicine (NLM): Open-Access Consumer Health Vocabulary (OAC CHV) and MedlinePlus Healthcare Vocabulary. Results: The results show that GloVe was able to find new laymen terms with an F-score of 48.44%. Our enhanced GloVe approach outperformed basic GloVe with an average F-score of 61%, a relative improvement of 25%. The improvement of the enhanced GloVe was statistically significant on both ground truth datasets, with P < 0.001. Conclusions: This paper presents an automatic approach to enrich consumer health vocabularies using the GloVe word embeddings and an auxiliary lexical source, WordNet. Our approach was evaluated using healthcare text downloaded from MedHelp.org, a healthcare social media platform, and two standard laymen vocabularies, OAC CHV and MedlinePlus. We used the WordNet ontology to expand the healthcare corpus by including synonyms, hyponyms, and hypernyms for each layman term occurrence in the corpus. Given a seed term selected from a concept in the ontology, we measured our algorithms’ ability to automatically extract synonyms for those terms that appeared in the ground truth concept. We found | en_US
dc.language.iso | en | en_US
dc.publisher | PeerJ Computer Science | en_US
dc.subject | Ontologies | en_US
dc.subject | Consumer health vocabulary | en_US
dc.subject | Vocabulary enrichment | en_US
dc.subject | Word embedding | en_US
dc.title | health vocabularies using GloVe word embeddings and an auxiliary lexical resource | en_US
dc.type | Article | en_US
Appears in Collections: Department of Computer Science

Files in This Item:
File | Description | Size | Format
peerj-cs-668.pdf | | 3.52 MB | Adobe PDF

