Synonym extraction and abbreviation expansion with ensembles of
Por um escritor misterioso
Last updated 12 janeiro 2025
Background Terminologies that account for variation in language use by linking synonyms and abbreviations to their corresponding concept are important enablers of high-quality information extraction from medical texts. Due to the use of specialized sub-languages in the medical domain, manual construction of semantic resources that accurately reflect language use is both costly and challenging, often resulting in low coverage. Although models of distributional semantics applied to large corpora provide a potential means of supporting development of such resources, their ability to isolate synonymy from other semantic relations is limited. Their application in the clinical domain has also only recently begun to be explored. Combining distributional models and applying them to different types of corpora may lead to enhanced performance on the tasks of automatically extracting synonyms and abbreviation-expansion pairs. Results A combination of two distributional models – Random Indexing and Random Permutation – employed in conjunction with a single corpus outperforms using either of the models in isolation. Furthermore, combining semantic spaces induced from different types of corpora – a corpus of clinical text and a corpus of medical journal articles – further improves results, outperforming a combination of semantic spaces induced from a single source, as well as a single semantic space induced from the conjoint corpus. A combination strategy that simply sums the cosine similarity scores of candidate terms is generally the most profitable out of the ones explored. Finally, applying simple post-processing filtering rules yields substantial performance gains on the tasks of extracting abbreviation-expansion pairs, but not synonyms. The best results, measured as recall in a list of ten candidate terms, for the three tasks are: 0.39 for abbreviations to long forms, 0.33 for long forms to abbreviations, and 0.47 for synonyms. Conclusions This study demonstrates that ensembles of semantic spaces can yield improved performance on the tasks of automatically extracting synonyms and abbreviation-expansion pairs. This notion, which merits further exploration, allows different distributional models – with different model parameters – and different types of corpora to be combined, potentially allowing enhanced performance to be obtained on a wide range of natural language processing tasks.
PDF] Balancing the composition of word embeddings across
Computers, Free Full-Text
PDF] Annotating Mentions of Coronary Artery Disease in Medical
Exploring patterns in dictionary definitions for synonym
PDF] Balancing the composition of word embeddings across
Apollo 24, 7's CDSS solution, built with Google Cloud
PDF) PLOD: An Abbreviation Detection Dataset for Scientific
Table 3 from Alignment-HMM-based Extraction of Abbreviations from
Exploring patterns in dictionary definitions for synonym
Exploring patterns in dictionary definitions for synonym
Exploring patterns in dictionary definitions for synonym
PDF] Finding Synonyms in Medical Texts – Creating a system for
ALICE: an algorithm to extract abbreviations from MEDLINE
Recomendado para você
-
15 Synonyms for I Think: Professional, Academic, and Casual12 janeiro 2025
-
Alli User Guide - Synonym & Antonym Dictionary12 janeiro 2025
-
Creating Synonyms12 janeiro 2025
-
20+ alternative job titles for digital marketeers12 janeiro 2025
-
Creating a new Synonym12 janeiro 2025
-
PDF] Query Rewriting using Automatic Synonym Extraction for E12 janeiro 2025
-
Synonyms for Common Resume Verbs & Adjectives (2023)12 janeiro 2025
-
Table 5 from Finding Synonyms Using Automatic Word Alignment and12 janeiro 2025
-
Synonyms and Antonyms List for English Language, Download Synonyms and Antonyms PDF for SSC12 janeiro 2025
-
Synonyms and Antonyms List for English Language, Download Synonyms12 janeiro 2025
você pode gostar
-
GMAT Prep Test 2: Incomplete Question? : r/GMAT12 janeiro 2025
-
Conversa Com A Autora Edith Rubinstein Sobre Psicopedagogia12 janeiro 2025
-
Copa do Mundo: plataforma interativa feita por estatísticos prevê12 janeiro 2025
-
Pou Gameplay jogar POU no sábado #612 janeiro 2025
-
Online Flying Games at12 janeiro 2025
-
Guilty Gear Strive Season Pass 2 kicks off with Bridget coming to12 janeiro 2025
-
Fatalities List - All Character Button Inputs and Codes - Mortal12 janeiro 2025
-
Sonic the Hedgehog 2 Review: Video Games' Fastest Hero Trudges12 janeiro 2025
-
Oshi no ko chapter 73 coloring by me : r/OshiNoKo12 janeiro 2025
-
GamZui YT12 janeiro 2025