Contextualized Evaluations: Judging Language Model Responses to Underspecified Queries

Chaitanya Malaviya·Joseph Chee Chang·Dan Roth

TACL·2024·10 citations

Abstract Language model users often issue queries that lack specification, where the context under which a query was issued—such as the user’s identity, the query’s intent, and the criteria for a response to be useful—is not explicit. For instance, a good r...

Code & Resources

Code: github.com/allenai/ContextEval

How do people cite this paper?

(generated 5 months ago)

This paper's framing of underspecified queries and contextualized evaluation has been used to motivate research on accounting for social context in LLM assessments, to support arguments that human preferences are inherently context-dependent, to inform work on contextual faithfulness in reasoning models, and to underscore the need for alignment procedures that address the full spectrum of real-world, contextualized user interactions.

Loading PDF...

Loading PDF reader...