Personalized Jargon Identification for Enhanced Interdisciplinary Communication
TLDRThis research investigates features representing domain, subdomain, and individual knowledge to predict individual jargon familiarity and compares supervised and prompt-based approaches, finding that prompt-based methods yield the highest accuracy, though the task remains difficult and supervised approaches have lower false positive rates.
How do people cite this paper?
(generated 20 days ago)This paper's dataset of 10K+ term familiarity annotations has been directly adopted as a benchmark for training and evaluating improved personalized jargon detection models, its finding that jargon familiarity varies by individual background has motivated personalized knowledge support in LLM-based reading tools and audience-aware text simplification systems, its prompt-based methodology for incorporating researcher metadata has been adapted for identifying unfamiliar phrases in document processing pipelines and extended to jargon identification for non-scientist audiences, and it is frequently referenced to justify the importance of addressing jargon as a barrier in interdisciplinary communication and to contextualize efforts in plain language summarization of scientific texts.