Update of publication documents...
Done.
LINQS @ UMD

LINQS

STATISTICAL RELATIONAL LEARNING GROUP @ UMD



 

Personal Name Resolution in Email: A Heuristic Approach

Technical Report LAMP-TR150, University of Maryland, College Park, Number LAMP-TR150 - March 2008
Download the publication : LAMP_150.pdf [407Ko]  
Much of the work to date on searching email has focused on personal information management. Archival access poses new challenges, including automatic association of references to unfamiliar individuals using whatever information is available about those people. This paper describes a computational approach to that task motivated by intuitions about the ways people might explore an email collection to find that information. The proposed approach makes use of context in a flexible and adaptive manner. Two techniques for context expansion are: a mixture model that combines evidence from each context to rank candidates, and cutoff model that ranks candidates based on the closest context in which any suitable evidence was found. Both models rely on mentions that could be resolved to a common identity as evidence of the resolution. Results on three relatively small collections indicate that the accuracy of our approach performs favorable compared to the best known technique and results on the full CMU Enron collection indicate that the approach presented in this paper scales well to larger email collections.

BibTex references

@TechReport\{elsayed:umtr08,
  author       = "Elsayed, Tamer and Oard, Doug and Namata, Galileo Mark and Getoor, Lise",
  title        = "Personal Name Resolution in Email: A Heuristic Approach",
  institution  = "University of Maryland, College Park",
  number       = "LAMP-TR150",
  month        = "March",
  year         = "2008",
  note         = "LAMP-TR-150",
}

Other publications in the database