(NOTE: Some of the older papers are
hard to find in electronic form. Links marked with an asterisk (*) are as
similar to the published paper as I could find.) Journals 1. Church,
K., and Patil, R. (1982) “Coping with Syntactic Ambiguity or How to Put the
Block in the Box on the Table,” American
Journal of Computational Linguistics, Volume 8, Number 3-4,
July-December. pdf (from LDC) 2. Church,
K. (1987) “Phonological Parsing and Lexical Retrieval” Cognition, Volume 25, pp. 53-69. ps 3. Church,
K., and Hanks, P., “Word Association Norms, Mutual Information and
Lexicography,” Computational
Linguistics, Vol 16:1, pp. 22-29, (1991). ps 4. Church,
K., and Gale, W., “A Comparison of the Enhanced Good-Turing and Deleted
Estimation Methods for Estimating Probabilities of English Bigrams,” Computer Speech and Language, 5:1,
(1991). 5. Church,
K., and Gale, W. (1991) “Probability Scoring for Spelling Correction,”
Statistics and Computing. ps (missing
figures) 6. Church,
K., review of Aarts, J., and Meijs, W. (eds.) (1991) “Theory and Practice in
Corpus Linguistics” Rodopi, Amsterdam, Computational
Linguistics. 7. Gale,
W., and Church, K. (1993) “A Program for Aligning Sentences in Bilingual
Corpora,” Computational Linguistics,
19:1, pp. 75-102. pdf
(via LDC) ps (figures missing) text (containing useful appendix) pdf
(via ACM) 8. Gale,
W., Church, K. and Yarowsky, D. (1993) “A Method for Disambiguating Word
Senses in a Large Corpus,” Computers
and Humanities, 26: 415-439. ps 9. Church,
K. and Helfman, J. (1993) “Dotplot: a Program for Exploring Self-Similarity
in Millions of Lines of Text and Code,” The
Journal of Computational and Graphical Statistics, 2:2, pp. 153-174. ps related
pages 10.Church, K. and Mercer, R.
(1993) “Introduction to the Special Issue on Computational Linguistics Using
Large Corpora,” Computational
Linguistics, 19:1, pp. 1-24. pdf (via LDC) ps pdf
(via ACM) 11.Church, K. and Hovy, E.
(1993) “Good Applications for Crummy Machine Translation,” Machine Translation, vol 8, pp.
239-258. ps 12.Church, K. and Rau, L. (1995)
“Commercial Applications of Natural Language Processing,” Communications of the ACM, pp. 71-79. ps pdf
(via ACM) 13.Church, K. and Gale, W.
(1995) “Poisson Mixtures,” Journal of
Natural Language Engineering. ps 14.Isabelle, P. and Church, K.
(eds.) (1997) special issue of Machine
Translation, vol 12:1/2, Kluwer. 15. Dagan, I. and
Church, K. (1997), “Termight: Coordinating Humans and Machines in Bilingual
Terminology Acquisition,” Machine
Translation, vol 12:1/2, pp. 89-107. 16. Yamamoto, M. and Church, K.
(2001) “Using Suffix Arrays to compute Term Frequency and Document Frequency
for All Substrings in a Corpus,” Computational
Linguistics, vol 27:1, pp. 1-30, MIT Press. pdf
related
links 17.Amir, A., Church, K. and Dar,
E. (2004), “The
submatrices character count problem: an efficient solution using separable
values,” Information and Computation, 190:1, pp. 100-116. 18.Church, K. (2005) “Reviewing the Reviewers,” Computational
Linguistics, 31:4, pp. 575-578. (via ACM) 19. Li, P. and Church, K. (2007) “A sketch algorithm for
estimating two-way and multi-way associations,” Computational Linguistics
33(3), 305-354. 20. Li, P., Hastie, T., and Church,
K. (2007), 21.Church, K. (2008)
“Approximate Lexicography and Web Search,” International Journal of
Lexicography, 21(3):325-336. Patents 1. 4,829,580
“Text analysis system with letter sequence recognition and speech stress
assignment arrangement.”
2. 5,146,405
“Methods for part-of-speech determination and usage.” 3. 5,283,833
“Method and apparatus for speech processing using morphology and rhyming.” 4. 5,541,836
“Word disambiguation apparatus and methods.” 5. 5,572,423
“Method for correcting spelling using error frequencies.” 6. 5,608,622
“System for analyzing translations.” 7. 5,647,023
“Method of nonlinear filtering of degraded document images.” Conference Proceedings 1. Church,
K., (1979) “Co-ordinate Squares: A Solution to Many Chess Pawn Endgames,”
(abbreviated version of B.S. Thesis), International
Joint Conference on Artificial Intelligence, Tokyo, Japan.
2. Church,
K. (1980) “On Parsing Strategies and Closure,” Association for Computational Linguistics. pdf (direct from ACL) 3. Church,
K. (1983) “A Finite-State Parser for Use in Speech Recognition,” Association for Computational Linguistics.
pdf (direct from ACL) pdf
(via ACM) 4. Church,
K. (1983) “Allophonic and Phonotactic Constraints are Useful,” International Joint Conference on
Artificial Intelligence, Karlsruhe, West Germany. 5. Church,
K. (1983) “A Finite-State Parser for Use in Speech Recognition,” Journal of the Acoustical Society of
America, Supplement 1, Vol. 74. 6. Ejerhed, E.,
and Church, K. (1983) “Finite State Parsing,” in Karlsson, F. (ed), Papers
from the Seventh Scandinavian Conference of Linguistics, University of
Helsinki. 7. Church,
K. (1985) “Stress Assignment in Letter to Sound Rules for Speech Synthesis,” Association for Computational Linguistics.
pdf (direct from ACL)
ps 8. Church,
K. (1986) “Morphological Decomposition and Stress Assignment for Speech
Synthesis,” Association for Computational
Linguistics. pdf
(direct from ACL)
9. Church,
K. (1987) “Three Classes of + Boundaries,” The Eleventh International Congress of Phonetic Sciences, Estonia, USSR.
ps 10.Church,
K. (1988) “A Stochastic Parts Program and Noun Phrase Parser for Unrestricted
Text,” Second Conference on Applied
Natural Language Processing, Austin, Texas, pp. 136-143. pdf (direct from ACL)
ps 11. Koskenniemi,
K., and Church, K. (1988) “Complexity, Two-Level Morphology and Finnish,” Coling, Budapest, Hungary.
ps 12.Church,
K. (1989) “Syntactic Parsing May Not Help Speech Recognition Very Much,” in Working Notes of the AAAI Symposium:
Text-Based Intelligent Systems. 13.Church,
K., and Hanks, P. (1989) “Word Association Norms, Mutual Information and
Lexicography,” Association for
Computational Linguistics, Vancouver, Canada.
pdf (direct from ACL) 14.Church,
K., and Gale, W. (1989) “Enhanced Good-Turing and Cat-Cal: Two New Methods
for Estimating Probabilities of English Bigrams,” Second Darpa Workshop on Speech and Natural Language, Cape Cod.
ps 15.Church,
K., and Hindle, D. (1990) “Collocational Constraints and Corpus-Based
Linguistics,” in Working Notes of the
AAAI Symposium: Text-Based Intelligent Systems. 16.Church,
K., and Gale, W. (1990) “Poor Estimates of Context are Worse than None,” Third Darpa Workshop on Speech and Natural
Language, Hidden Valley, PA. ps
17.Kernighan,
M., Church, K., Gale, W. (1990) “A Spelling Correction Program Based on a
Noisy Channel Model,” Coling, Helsinki, Finland.
pdf (via LDC) ps pdf
(via ACM) 18.Coker,
C., Church, K., Liberman, M. (1990) “Morphology and Rhyming: Two Powerful
Alternatives to Letter-to-Sound Rules for Speech Synthesis,” European Speech
Communication Association, Conference on Speech Synthesis. ps 19.Gale,
W. A. and K. W. Church (1990) “Estimation Procedures for Language Context:
Poor Estimates are Worse than None,” Proceedings in Computational Statistics,
1990, pp. 69-74, Physica-Verlag, Heidelberg. 20. Gale,
W., and Church, K. (1991) “Identifying Word Correspondences in Parallel
Text,” Fourth Darpa Workshop on Speech
and Natural Language, Asilomar, pp. 152-157. ps* 21.Gale, W., and Church, K.
(1991) “A Program for Aligning Sentences in Bilingual Corpora,” Association for Computational Linguistics.
pdf (direct from ACL) 22.Church,
K. (1991) “Some Statistical Opportunities in Speech and Language,” 23rd
Symposium on the Interface, Computing Science and Statistics, Seattle, Washington.
ps 23.Church,
K. (1991) “Concordances for Parallel Text,” Seventh Annual Conference of the UW Centre for the New OED and Text
Research, Oxford, England. ps 24. Church,
K. and Hovy, E. (1991) “Good Applications for Crummy Machine Translation,” in
Neal, J. and Walter, S. Natural
Language Processing Systems Evaluation Workshop, University of
California, Berkeley, CA.
25.Church,
K. and Gale, W. (1992) “One Sense per Discourse,” Proceedings of Fifth DARPA Workshop on Speech and Natural Language.
ps 26.Gale,
W., Church, K., Yarowsky, D. (1992) “Estimating Upper and Lower Bounds on the
Performance of Word-Sense Disambiguation Programs,” Association for Computational Linguistics. pdf (via LDC) ps 27.Gale,
W., Church, K., Yarowsky, D. (1992) “Using Bilingual Materials to Develop
Word Sense Disambiguation Methods,” TMI,
Montreal. pdf (direct
from ACL) ps 28.Church,
K. and Helfman, J. (1992) “Dotplot: a Program for Exploring Self-Similarity
in Millions of Lines of Text and Code,” Interface.
29.Gale,
W., Church, K. and Yarowsky, D. (1992) “Work on Statistical Methods for Word
Sense Disambiguation,” Proceedings AAAI
Fall Workshop on Statistics in Natural Language. 30.Church,
K. (1993) “Char_align: A Program for Aligning Parallel Texts at the Character
Level,” Association for Computational
Linguistics, pp. 1-8. pdf (direct from ACL)
ps 31. Dagan, I.,
Church, K., and Gale, W. (1993) “Robust Word Alignment for Machine Aided
Translation,” Proceedings of the
Workshop on Very Large Corpora: Academic and Industrial Perspectives,
available from the Association for
Computational Linguistics, pp. 1-8. pdf (via LDC) ps 32.Church,
K., Dagan, I.,
Gale, W., Fung, P., Helfman, J., Satish, B. (1993) “Aligning Parallel Texts:
Do Methods Developed for English-French Generalize to Asian Languages?” Pacific Asia Conference on Formal and
Computational Linguistics, pp. 1-12. ps 33.Church,
K., Gale, W., Helfman, J., Lewis, D. (1994) “Fax: An Alternative to SGML,” Coling. pdf (via LDC) ps pdf
(via ACM) 34.Fung,
P. and Church, K. (1994) “K-vec: A New Approach for Aligning Parallel Texts,”
Coling. ps 35. Dagan, I. and
Church, K. (1994) “TERMIGHT: Identifying and Translating Technical
Terminology,” 4th Conference on Applied Natural Language Processing. pdf (direct from ACL) 36. Agazzi,
O., Church, K., and Gale, W. (1994), “Using OCR and Equalization to
Downsample Documents,” Proceedings of
the 12th International Conference on Pattern Recognition,
Jerusalem, Israel,
October 10-13, pp. 305-309. 37.Church,
K. (1995) “One Term or Two?” Proceedings
of the 18th Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval, pp. 310-318. pdf
(via ACM) ps 38.Church,
K. and Gale, W. (1995) “Inverse Document Frequency (IDF): A Measure of
Deviation from Poisson,” Proceedings of
the Third Workshop on Very Large Corpora, pp. 121-130. ps 39.Caldwell,
D., Church, K. and Fowler, G. (1998), “Using Quantitative Methods to Compress
Call Detail: From Data Warehousing to Data Publishing,” Symposium on Quantitative Analysis for Decision Making, pp.
71-79. 40.Yamamoto,
M. and Church, K. (1998) “Using Suffix Arrays to Compute Term Frequency and
Document Frequency for All Substrings in a Corpus,” Proceedings of the Sixth Workshop on Very Large Corpora, pp.
28-37. 41.Belanger,
D., Church, K. and Hume, A. (1999) Virtual Data Warehousing, Data Publishing,
and Call Detail, pp. 106-117, W. Jonker (eds.) “Databases in
Telecommunications 1999, International Workshop Co-located with VLDB-99,”
Edinburgh, Scotland, Proceedings. Lecture Notes in Computer Science, Vol.
1819, Springer, 2000, ISBN 3-540-67667-9 http://www.informatik.uni-trier.de/~ley/db/conf/vldbw/vldbw1999.html.
doc* 42.Ozawa,
T., Yamamoto, M., Umemura, K., Church, K. (1999) Japanese word segmentation
using similarity measure for IR. In Proceedings of the First NTCIR Workshop
on Research in Japanese Text Retrieval and Term Recognition. Tokyo, Japan,
August, 1999, pp. 89-96. pdf 43. Buchsbaum,
A., Caldwell, D., Church, K., Fowler, G. and Muthukrishnan, S. (2000),
“Engineering the Compression of Massive Tables: An Experimental Approach,” in
Proc. 11th ACM-SIAM Symp. on Discrete
Algorithms (SODA), pp. 175-184.
pdf
related links pdf
(via ACM) 44.Church,
K. (2000), “Empirical Estimates of Adaptation: The chance of Two Noriega's is
closer to p/2 than p²,” Coling, pp. 173-179. pdf (via LDC) pdf ps pdf
(via ACM) 45.Umemura,
K. and Church, K. (2000), “Empirical Term Weighting and Expansion Frequency,”
Empirical Methods in Natural Language
Processing and Very Large Corpora, pp. 117-123. pdf 46.Yamamoto,
E., Yamamoto, M., Umemura, K. and Church, K. (2000) “Dynamic Programming: A
Method for Taking Advantage of Technical Terminology in Japanese Documents,” Information Retrieval with Asian Languages
(IRAL), pp. 125-132.
pdf
(via ACM) 47. Feng,
L., Umemura, K., Yamamoto, M. and Church, K. (2000) “Using Variable Length
Ngrams for Retrieving Technical Abstracts in Japanese,” Information Retrieval with Asian Languages (IRAL), pp. 213-214. pdf
(via ACM) 48.Sable,
C. and Church, K. (2001) “Using Bins to Empirically Estimate Term Weights for
Text Categorization,” 2001
Conference on Empirical Methods in Natural Language Processing (EMNLP 2001).
ps alternate 49. Sable, C., McKeown, K., and
Church, K. (2002) “NLP Found Helpful (at least for one Text
Categorization Task),” in Proceedings
of the 2002 Conference on Empirical Methods in Natural Language Processing
(EMNLP), Philadelphia, PA. ps 50.Amir, A., Church, K. and Dar,
E. (2002), “Separable attributes: a technique for solving the sub matrices
character count problem,” ACM-SIAM symposium on Discrete algorithms
(SODA), pp. 400-401 pdf
(from ACM) 51.Carey
E. Priebe, David J. Marchette, Youngser Park, Edward J.
Wegman, Jeffrey L. Solka, Diego A. Socolinsky, Damianos Karakos, Ken W. Church,
Roland Guglielmi, Ronald R. Coifman, Dekang Lin, Dennis M. Healy, Marc Q.
Jacobs, Anna Tsao (2004), “Iterative Denoising for Cross-Corpus Discovery,”
COMPSTAT, Prague, Cz. ps (see also, http://www.ams.jhu.edu/~priebe/confri2004.html) 52.Kenneth
Church and Bo Thiesson (2005), “The Wild Thing,” ACL, pdf. 53. Ping
Li and Kenneth Church (2005), “Using Sketches to Estimate Associations,”
HLT/EMNLP-2005, pdf (longer
TR). 54.Li,
P., Hastie, T., Church, K. (2006), “Improving Random Projections Using
Marginal Information,” Conference on Learning Theory (COLT). 55.Li, P.,
Hastie, T., Church, K. (2006), “Very Sparse Random Projections,” KDD. 56.Li, P.,
Hastie, T., Church, K. (2007), “Nonlinear Estimators and Tail Bounds for
Dimension Reduction in L1 Using Cauchy Random Projections,” Conference on
Learning Theory (COLT). 57.Li, P.,
Church, K., Hastie, T. (2007), “Conditional Random Sampling: A Sketch-based
Sampling Technique for Sparse Data,” NIPS. 58. Church,
K., Hart, T. and Gao, J. (2007), “Compressing Trigram Language Models with
Golomb Coding,” Proceedings
of the 2007 Joint Conference on Empirical Methods in Natural Language
Processing and Computational Natural Language Learning (EMNLP-CoNLL), pp.
199-207, http://www.aclweb.org/anthology/D/D07/D07-1021.
59.Surajit
Chaudhuri, Kenneth Church, Arnd Christian König and Liying Sui (2007) “Heavy-Tailed Distributions and Multi-Keyword Queries,”
SIGIR, [paper]. 60. Mei, Q. and Church, K.
(2008) “Entropy of Search Logs: How Hard is Search? With Personalization?
With Backoff?,” The First ACM International Conference on Web Search
and Data Mining (WSDM'08), pages 45-54. [pdf] [slides] [video] 61.Qiaozhu
Mei, Dengyong Zhou, Kenneth Church (2008) “Query Suggestion Using
Hitting Time,” CIKM [pdf] [slides] 62.Li, P., Church, K. and
Hastie, T. (2008) “One sketch for all: Theory and Application of Conditional
Random Sampling,” NIPS. 63.Bhat,
S. and Church, K. (2008) “Variable Selection for Ad Prediction,” http://adlab.microsoft.com/adkdd2008/Program.html.
64.Church, K., Greenberg, A. and Hamilton, J. (2008) “On
Delivering Embarrassingly Distributed Cloud Services,” Seventh
ACM Workshop on Hot Topics in Networks, (HotNets-VII), Calgary, pdf, ppt. 65.Arnd
Christian König, Kenneth Church and Martin Markov (2009) “A Data
Structure for Sponsored Search,” ICDE. 66. Umemura,
K. and Church, K. (2009) “Substring Statistics,” CICLing. 67.Church,
K. (2009) “Has Computational Linguistics Become More Applied?” CICLing.
Books 1. Church,
K. (1988) Phonological Parsing in
Speech Recognition, Kluwer.
2. Armstrong,
S., Church, K., Isabelle, P., Manzi, S., Tzoukermann, E. and Yarowsky, D.
(eds.) (1999), Natural Language
Processing Using Very Large Corpora, Kluwer. Book Chapters 1. Church,
R., and Church, K. (1977) “Plans, Goals, and Search Strategies for the
Selection of a Move in Chess,” in Frey, P. (ed), Chess Skill in Man and
Machine, Springer-Verlag.
2. Martin,
W., Church, K., and Patil, R. (1987) “Preliminary Analysis of a Breadth-First
Parsing Algorithm: Theoretical and Experimental Results,” L.Bolc (ed.), Natural Language Parsing Systems,
Springer-Verlag. 3. Church,
K. (1987) “Phonological Parsing and Lexical Retrieval” in Frauenfelder, U.
and Tyler, L., (ed) Spoken Word
Recognition, MIT Press.
4. Church,
K, Gale, W., Hanks, P., Hindle, D. (1991) “Parsing, Word Associations and
Typical Predicate-Argument Relations,” in Tomita, M.(ed.) Current
Issues in Parsing Technology, Kluwer Academic Publishers, Boston, MA. ps 5. Church,
K., Hanks, P., Hindle, D., Gale, W. (1991) “Using Statistics in Lexical
Analysis,” in Zernik (ed), Lexical
Acquisition: Using On-line Resources to Build a Lexicon, Lawrence
Erlbaum, pp. 115-164. ps 6. Liberman,
M., and Church, K. (1991) “Text Analysis and Word Pronunciation in
Text-to-Speech Synthesis,” in Furui, S., and Sondhi, M. (eds.), Advances in Speech Signal Processing. ps* 7. Church,
K., Hanks, P., Hindle, D., Gale, W., Moon, R., “Substitutability,” in Atkins
and Zampolli (ed.), (1994), Computational
Approaches to the Lexicon Automating the Lexicon II Schema, Oxford
University Press, pp. 153-180. ps 8. Church,
K, “Text Analysis,” in Mellish (ed.), (to appear) Encyclopedia of Language and Linguistics, Pergamon Press,
Aberdeen University Press.
9. Church,
K., “Comments on Computational Learning Model for Metrical Phonology,” Levine,
R. (ed.), Formal Grammar: Theory and
Implementation, Vancouver Studies in Cognitive Science Series, UBC Press,
Vol. 2 (1992). ps 10.Church, K. (1992) “Current
Practice in Part of Speech Tagging and Suggestions for the Future,” in
Simmons (ed.), Sbornik praci: In Honor
of Henry Kucera, Michigan Slavic Studies, pp. 13-48. ps
11.Gale, W. and Church, K.
(1993) “Statistical Approaches to Aligning Sentences and Identifying Word
Correspondences in Parallel Texts: a Report on Work in Progress,” in D. J.
Hand (ed.) Artificial Intelligence
Frontiers in Statistics, AI and Statistics III, pp. 281-294. 12.Gale, W. and Church, K.
(1994) “What's Wrong with Adding One?”
in N. Oostdijk and P. de Haan (eds.), Corpus-Based Research into Language: In honour of Jan Aarts,
Rodopi, Amsterdam, pp. 189-200. ps 13.Gale, W. and Church, K.
(1994) “Discrimination Decisions in 100,000 Dimensional Spaces,” in A. Zampolli,
N. Calzolari and M. Palmer (eds.), Current
Issues in Computational Linguistics: In honour of Don Walker, Kluwer
Academic Publishers, The Netherlands, pp. 429-550. ps 14.Kenneth Church (2005), “The DDI Approach to Morphology,”
in Arppe, Carlson, Heinamaki, Linden,
Miestamo, Piitulainen, Tupakka, Westerlund, Yli-Jyra (eds.), A Finnish
Computer Linguist: Kimmo Koskenniemi Festschrift on the 60th birthday, CSLI
Publications, ps. Tutorials 1. Ngrams,
ACL 1995. ps 2. NLP Techniques and Text Retrieval,
Lugano, Switzerland. ps 3. Unix for Poets ps 4. Applications of Text Analysis
ps 5.
Empirical
Methods in Natural Language Processing: What's Happened Since the First
SIGDAT Meeting? NAACL-2000 abstract, slides & handout
Invited Talks 1. EACL-1993 2. What’s Happened Since the First
SIGDAT Meeting?, SIGDAT-1999 ppt 3. Empiricism from TMI-1992 to
AMTA-2002 to AMTA-2012: Have IBM Models 1-5 failed to solve all the world's
problems? AMTA-2002 ppt 4. Church, K. (2003) “Speech and
Language Processing: Where have we been and where are we going,” Eurospeech,
Geneva, Switzerland. pdf slides
5. LREC 2004 Workshop CFP:
MEMURA-2004 Methodologies and Evaluation of Multiword Units in Real-world
Applications, http://memura2004.di.ubi.pt/
ppt 6. Joint talk to EMNLP-2004 (http://www.cs.ualberta.ca/~lindek/emnlp04/)
and Senseval-2004 (http://www.cs.unt.edu/~rada/senseval/senseval3/workshop.html)
(2004) ppt 7. Church,
K. (2004) “Speech and Language Processing: Can we use the past to predict
the future,” Seventh International Conference on TEXT, SPEECH and DIALOGUE,
Brno, Czech Republic. slides pdf 8. Church,
K. (2008) “Entropy of Search Logs: How
Hard is Search,” Harbin, China, http://ir.hit.edu.cn/airs2008/technicalprogram.asp
Panels
1. KDD-2004. slides