Clustering for clarity: improving word sense disambiguation through multilevel analysis

Szczegóły
Abstrakt

Tytuł:: Clustering for clarity: improving word sense disambiguation through multilevel analysis
Autorzy:: Dubey, Shiv Kishan
Kohli, Narendra
Data publikacji:: 2024
Słowa kluczowe:: embeddings
center embedding
multilevel clustering
word sense disambiguation
WSD
polysemy
homonymy
Język:: angielski
Dostawca treści:: BazTech
: Artykuł

Existing Word Sense Disambiguation (WSD) techniques have limits in exploring word-context relationships since they only deal with the effective use of word embedding, lexical-based information via WordNet, or the precision of clustering algorithms. In order to overcome this limitation, the study suggests a unique hybrid methodology that makes use of context embedding based on center-embedding in order to capture semantic subtleties and utilizing with atwo-level K-means clustering algorithm. Such generated context embedding, producing centroids that serve as representative points for semantic information inside clusters. Additionally, go with such captured cluster- centres in the nested levels of clustering process, locate groups of linked context words and categorize them according to their word meanings that effectively manage polysemy/homonymy as well as detect minute differences in meaning. Our proposed approach offers a substantial improvement over traditional WSD methods by harnessing the power of center-embedding in context representation, enhancingthe precision of clustering and ultimately overcoming existing limitations incontext-based sense disambiguation.

Opracowanie rekordu ze środków MNiSW, umowa nr SONP/SP/546092/2022 w ramach programu "Społeczna odpowiedzialność nauki" - moduł: Popularyzacja nauki i promocja sportu (2024).

Informacja

Clustering for clarity: improving word sense disambiguation through multilevel analysis