Personalized Jargon Identification for Enhanced Interdisciplinary Communication
TLDRThis research investigates features representing domain, subdomain, and individual knowledge to predict individual jargon familiarity and compares supervised and prompt-based approaches, finding that prompt-based methods yield the highest accuracy, though the task remains difficult and supervised approaches have lower false positive rates.
How do people cite this paper?
(generated 2 months ago)This paper's dataset of 10K+ term familiarity annotations has been directly adopted as a benchmark for training and evaluating improved personalized jargon detection models, its finding that jargon familiarity varies by individual background has motivated personalized knowledge support in LLM-based reading tools and audience-aware text simplification systems, its prompt-based methodology for incorporating researcher metadata has been adapted for identifying unfamiliar phrases in document processing pipelines and extended to jargon identification for non-scientist audiences, and it is frequently referenced to justify the importance of addressing jargon as a barrier in interdisciplinary communication and to contextualize efforts in plain language summarization of scientific texts.