| |
 |
PROCEEDINGS COVER |
| |
 |
PREFACE (GENERAL CHAIR) |
| |
 |
PREFACE (PROGRAM CHAIRS) |
| |
 |
REVIEWERS |
| |
 |
CONFERENCE PROGRAM |
| |
 |
CONFERENCE WEB PAGE (live link) |
| |
 |
CONFERENCE WEB PAGE (archive) |
| |
 |
AUTHOR INDEX |
| |
|
3 |
 |
Invited Talk: Processes that Shape Conversation and their Implications Computational Linguistics |
| Susan E. Brennan |
| |
|
11 |
 |
Invited Talk: Generic NLP Technologies: Language, Knowledge and Information Extraction |
| Jun'ichi Tsujii |
| |
|
19 |
 |
Invited Talk: Spoken Language Technology: Where Do We Go From Here? |
| Roger K. Moore |
| |
|
23 |
 |
Translation with Cascaded Finite State Transducers |
| Stephan Vogel and Hermann Ney |
| |
|
31 |
 |
Phrase-Pattern-based Korean to English Machine Translation using Two Level Translation Pattern Selection |
| Jung-jae Kim, Key-Sun Choi and Young-Soog Chae |
| |
|
37 |
 |
A Maximum Entropy/Minimum Divergence Translation Model |
| George Foster |
| |
|
45 |
 |
Incorporating Compositional Evidence in Memory-Based Partial Parsing |
| Yuval Krymolowski and Ido Dagan |
| |
|
53 |
 |
Tree-gram Parsing: Lexical Dependencies and Structural Relations |
| K. Sima'an |
| |
|
61 |
 |
An Improved Parser for Data-Oriented Lexical-Functional Analysis |
| Rens Bod |
| |
|
69 |
 |
Robust Temporal Processing of News |
| Inderjeet Mani and George Wilson |
| |
|
77 |
 |
Tagging Unknown Proper Names Using Decision Trees |
| Frédéric Béchet, Alexis Nasr and Franck Genet |
| |
|
85 |
 |
The Order of Prenominal Adjectives in Natural Language Generation |
| Robert Malouf |
| |
|
93 |
 |
Spoken Dialogue Management Using Probabilistic Reasoning |
| Nicholas Roy, Joelle Pineau and Sebastian Thrun |
| |
|
101 |
 |
An Unsupervised Approach to Prepositional Phrase Attachment using Contextually Similar Words |
| Patrick Pantel and Dekang Lin |
| |
|
109 |
 |
A Unified Statistical Model for the Identification of English BaseNP |
| Endong Xun, Changning Huang and Ming Zhou |
| |
|
117 |
 |
Rule Writing or Annotation: Cost-efficient Resource Usage for Base Noun Phrase Chunking |
| Grace Ngai and David Yarowsky |
| |
|
126 |
 |
Using Existing Systems to Supplement Small Amounts of Annotated Grammatical Relations Training Data |
| Alexander Yeh |
| |
|
133 |
 |
Distributing Representation for Robust Interpretation of Dialogue Utterances |
| David Milward |
| |
|
142 |
 |
Can Nominal Expressions Achieve Multiple Goals?: An Empirical Study |
| Pamela Jordan |
| |
|
150 |
 |
An Empirical Study of the Influence of Argument Conciseness on Argument Effectiveness |
| Giuseppe Carenini and Johanna D. Moore |
| |
|
158 |
 |
Multi-Agent Explanation Strategies in Real-Time Domains |
| Kumiko Tanaka-Ishii and Ian Frank |
| |
|
166 |
 |
A Computational Approach to Zero-pronouns in Spanish |
| Antonio Ferrández and Jesús Peral |
| |
|
173 |
 |
Coreference for NLP Applications |
| Thomas S. Morton |
| |
|
181 |
 |
Learning Attribute Selections for Non-Pronominal Expressions |
| Pamela Jordan and Marilyn Walker |
| |
|
191 |
 |
Finite-State Non-Concatenative Morphotactics |
| Kenneth R. Beesley and Lauri Karttunen |
| |
|
199 |
 |
A Morphologically Sensitive Clustering Algorithm for Identifying Arabic Roots |
| Anne N. De Roeck and Waleed Al-Fares |
| |
|
207 |
 |
Minimally Supervised Morphological Analysis by Multimodal Alignment |
| David Yarowsky and Richard Wicentowski |
| |
|
217 |
 |
A Constraint-based Approach to English Prosodic Constituents |
| Ewan Klein |
| |
|
225 |
 |
Inducing Probabilistic Syllable Classes Using Multivariate Clustering |
| Karin Müller, Bernd Möbius and Detlef Prescher |
| |
|
233 |
 |
Modeling Local Context for Pitch Accent Prediction |
| Shimei Pan and Julia Hirschberg |
| |
|
241 |
 |
A New Statistical Approach To Chinese Pinyin Input |
| Zheng Chen and Kai-Fu Lee |
| |
|
248 |
 |
Automatic Detecting/Correcting Errors in Chinese Text by an Approximate Word-Matching Algorithm |
| Lei Zhang, Ming Zhou, Changning Huang and Haihua Pan |
| |
|
255 |
 |
Dependency-based Syntactic Analysis of Chinese and Annotation of Parsed Corpus |
| Tom B.Y. Lai and Changning Huang |
| |
|
263 |
 |
Part-of-Speech Tagging Based on Hidden Markov Model Assuming Joint Independence |
| Sang-Zoo Lee, Jun'ichi Tsujii and Hae-Chang Rim |
| |
|
270 |
 |
Language Independent, Minimally Supervised Induction of Lexical Probabilities |
| Silviu Cucerzan and David Yarowsky |
| |
|
278 |
 |
Independence and Commitment: Assumptions for Rapid Training and Execution of Rule-based POS Taggers |
| Mark Hepple |
| |
|
286 |
 |
An Improved Error Model for Noisy Channel Spelling Correction |
| Eric Brill and Robert C. Moore |
| |
|
294 |
 |
Query-Relevant Summarization using FAQs |
| Adam Berger and Vibhu O. Mittal |
| |
|
302 |
 |
An Algorithm for One-page Summarization of a Long Text Based on Thematic Hierarchy Detection |
| Yoshio Nakao |
| |
|
310 |
 |
Summarizing Multilingual Spoken Negotiation Dialogues |
| Norbert Reithinger, Michael Kipp, Ralf Engel and Jan Alexandersson |
| |
|
318 |
 |
Headline Generation Based on Statistical Translation |
| Michele Banko, Vibhu O. Mittal and Michael J. Witbrock |
| |
|
326 |
 |
Named Entity Extraction Based on A Maximum Entropy Model and Transformation Rules |
| Kiyotaka Uchimoto, Qing Ma, Masaki Murata, Hiromi Ozaku and Hitoshi Isahara |
| |
|
336 |
 |
Extracting Causal Knowledge from a Medical Database Using Graphical Patterns |
| Christopher S. G. Khoo, Syin Chan and Yun Niu |
| |
|
344 |
 |
Difficulty Indices for the Named Entity Task in Japanese |
| Chikashi Nobata, Satoshi Sekine and Jun'ichi Tsujii |
| |
|
352 |
 |
Memory-Efficient and Thread-Safe Quasi-Destructive Graph Unification |
| Marcel P. van Lohuizen |
| |
|
360 |
 |
Processing Optimality-theoretic Syntax by Interleaved Chart Parsing and Generation |
| Jonas Kuhn |
| |
|
368 |
 |
A Polynomial-Time Fragment of Dominance Constraints |
| Alexander Koller, Kurt Mehlhorn and Joachim Niehren |
| |
|
376 |
 |
Hidden Markov Model-Based Korean Part-of-Speech Tagging Considering High Agglutinativity, Word-Spacing, and Lexical Correlativity |
| Sang-Zoo Lee, Jun'ichi Tsujii and Hae-Chang Rim |
| |
|
384 |
 |
Synchronous Morphological Analysis of Grapheme and Phoneme for Japanese OCR |
| Masaaki Nagata |
| |
|
392 |
 |
Chinese-Korean Word Alignment Based on Linguistic Comparison |
| Jin-Xia Huang and Key-Sun Choi |
| |
|
400 |
 |
Specifying the Parameters of Centering Theory: a Corpus-Based Evaluation using Text from Application-Oriented Domains |
| M. Poesio, H. Cheng, R. Henschel, J. Hitzeman, R. Kibble and R. Stevenson |
| |
|
408 |
 |
The Role of Centering Theory's Rough-Shift in the Teaching and Evaluation of Writing Skills |
| Eleni Miltsakaki and Karen Kukich |
| |
|
416 |
 |
A Hierarchical Account of Referential Accessibility |
| Nancy Ide and Dan Cristea |
| |
|
425 |
 |
Lexical Transfer Using a Vector-Space Model |
| Eiichiro Sumita |
| |
|
432 |
 |
Using Confidence Bands for Parallel Texts Alignment |
| António Ribeiro, Gabriel Lopes and João Mexia |
| |
|
440 |
 |
Improved Statistical Alignment Models |
| Franz Josef Och and Hermann Ney |
| |
|
448 |
 |
Multi-Component TAG and Notions of Formal Power |
| William Schuler, David Chiang and Mark Dras |
| |
|
456 |
 |
Statistical Parsing with an Automatically-Extracted Tree Adjoining Grammar |
| David Chiang |
| |
|
464 |
 |
Corpus-Based Lexical Choice in Natural Language Generation |
| Srinivas Bangalore and Owen Rambow |
| |
|
472 |
 |
An Information-Theory-Based Feature Type Analysis for the Modeling of Statistical Parsing |
| Zhifang Sui, Jun Zhao and Dekai Wu |
| |
|
480 |
 |
Lexicalized Stochastic Modeling of Constraint-Based Grammars using Log-Linear Measures and EM Training |
| Stefan Riezler, Detlef Prescher, Jonas Kuhn and Mark Johnson |
| |
|
488 |
 |
Utilizing the World Wide Web as an Encyclopedia: Extracting Term Descriptions from Semi-Structured Texts |
| Atsushi Fujii and Tetsuya Ishikawa |
| |
|
496 |
 |
Term Recognition Using Technical Dictionary Hierarchy |
| Jong-Hoon Oh, KyungSoon Lee and Key-Sun Choi |
| |
|
504 |
 |
Mapping WordNets Using Structural Information |
| J. Daudé, L. Padró and G. Rigau |
| |
|
512 |
 |
Automatic Labeling of Semantic Roles |
| Daniel Gildea and Daniel Jurafsky |
| |
|
521 |
 |
Feature Logic for Dotted Types: A Formalism for Complex Word Meanings |
| Manfred Pinkal and Michael Kohlhase |
| |
|
529 |
 |
PENS: A Machine-aided English Writing System for Chinese Users |
| Ting Liu, Ming Zhou, Jianfeng Gao, Endong Xun and Changning Huang |
| |
|
537 |
 |
Diagnostic Processing of Japanese for Computer-Assisted Second Language Learning |
| Jun'ichi Kakegawa, Hisayuki Kanda, Eitaro Fujioka, Makoto Itami and Kohji Itoh |
| |
|
547 |
 |
Word Sense Disambiguation by Learning from Unlabeled Data |
| Seong-Bae Park, Byoung-Tak Zhang and Yung Taek Kim |
| |
|
555 |
 |
Importance of Pronominal Anaphora Resolution in Question Answering Systems |
| José L. Vicedo and Antonio Ferrández |
| |
|
563 |
 |
The Structure and Performance of an Open-Domain Question Answering System |
| Dan Moldovan, Sanda Harabagiu, Marius Pasca, Rada Mihalcea, Roxana Girju, Richard Goodrum and Vasile Rus |
| |
|
571 |
 |
Dimension-Reduced Estimation of Word Co-occurrence Probability |
| Kilyoun Kim and Key-Sun Choi |
| |
|
579 |
 |
Distribution-Based Pruning of Backoff Language Models |
| Jianfeng Gao and Kai-Fu Lee |
| |
|
589 |
 |
Panel: Computational Linguistics Research on Philippine Languages |
| Rachel Edita O. Roxas and Allan Borra |
| |
|
591 |
 |
Panel: Development of Computational Linguistics Research: A Challenge for Indonesia |
| Bobby Nazief |
| |
|
593 |
 |
Panel: Good Spelling of Vietnamese Texts,One Aspect of Computational Linguistics in Vietnam |
| Huy Khanh Phan |
| |
|
595 |
 |
Panel: Computational Linguistics in India: An Overview |
| Akshar Bharati, Vineet Chaitanya and Rajeev Sangal |
| |
|
597 |
 |
Panel: The State of the Art in Thai Language Processing |
| Virach Sornlertlamvanich, Tanapong Potipiti, Chai Wutiwiwatchai and Pradit Mittrapiyanuruk |
| |
|
599 |
 |
Panel: Computational Linguistics in Malaysia |
| Zaharin Yusoff |
| |