Invited Speaker
David Yarowsky (
Named Entity (NE) Recognition systems vary widely, from high-speed bulk methods optimized for indexing to deep semantic parsers tuned for specific domains. Optimal ways to combine statistical and symbolic models also vary, depending on applications and tasks. Is it possible to:
· maximize use of knowledge-rich resources (e.g. lexicons, NE grammars, parsing) while permitting corpus-based training for domain or language?
· acquire and share resources (including lexicons and grammars) across languages?
· balance performance speed with reasonable accuracy?
· use specific language patterns while permitting rapid transfer to another language?
· minimize variability in results across language types?
We welcome research on combined models, in which these tradeoffs are calculated in particular ways. We hope that the workshop will bring together work on robust and deep multilingual and mixed-language NE recognition from different perspectives. Possible topics include:
· the role of the lexicon vs. dynamic processing information
· grammars and lexicons shared (or ported) across languages
· acquisition of multilingual resources (e.g. from corpora)
· translating NEs across multiple languages
· domain tuning
Papers may cover one or more of these (or related) areas.
Demonstrations of implemented NE systems are also welcome.
Authors should use the main
Papers should be submitted electronically in Word, PDF or PostScript format. Assign a filename based on the paper’s title, transfer the file to ftp://ftp.research.microsoft.com/incoming/josephp, then email an identification page with the title, author(s), contact details, and filename to molsen@microsoft.com
· Submission deadline:
· Notification of acceptance:
· Deadline for final papers:
· Workshop date:
Program Committee
Roberto Basili (
Robert Gaizauskas (
Ralph Grishman (
Lauri Karttunen (Parc, Inc.)
Kevin Knight (
Gary Geunbae Lee (Pohang University of Science and Technology)
Dekang Lin (University of Alberta)
Boyan Onyshkevich (Department of Defense)
John Prager (
Jeff Reynar (Microsoft Corp.)
Mila Ramos-Santacruz (
Ellen Riloff (University of Utah)
Beth Sundheim (SPAWAR Systems Center, San Diego)
Janine Toole (Gavagai Technology)
Benjamin Tsou (City University of Hong Kong)
Marc Vilain (MITRE Corp.)
Sornlertlamvanich Virach (Thailand National Electronics and Computer Technology)
(Microsoft Corp.)
Mari Broman Olsen
Natural Language Group
Microsoft Corporation
Email: molsen@microsoft.com
Tel: +1-425-705-5019
Fax: +1-425-936-7329