Publications Erik Tjong Kim Sang

Titles of publications usually link to pdf files. Slides of talks associated with recent papers can be found on my talks page. My research profile on Google Scholar also mentions the number of citations per paper and my h-index score.

When citing my work, it is useful to know that my surname consists of the final three words of my name: TJONG KIM SANG. In Latex you can achieve a correct name citation by putting the surname in the author field, followed by a comma and the given name (example).


Papers listed on Google Scholar | Microsoft Academic Search
2017 2016 2015 2014 2013 2012 2011 2010 2009 2008 2007 2006 2005 2004 2003 2002 2001 2000 1999 1998 1996 1995 1993 1992 1991 1988

2017

Determining the function of political tweets, by Erik Tjong Kim Sang, Herbert Kruitbosch, Marcel Broersma and Marc Esteve del Valle. In: Proceedings of the 13th IEEE International Conference on eScience (eScience 2017), IEEE, Auckland, New Zealand, 2017, pages 438-439, ISBN 978-1-5386-2686-3, doi:10.1109/eScience.2017.60. (bibtex)

The CLIN27 Shared Task: Translating Historical Text to Contemporary Language for Improving Automatic Linguistic Annotation, by Erik Tjong Kim Sang, Marcel Bollmann, Remko Boschker, Francisco Casacuberta, Feike Dietz, Stefanie Dipper, Miguel Domingo, Rob van der Goot, Marjo van Koppen, Nikola Ljubešić, Robert Östling, Florian Petran, Eva Pettersson, Yves Scherrer, Marijn Schraagen, Leen Sevens, Jörg Tiedemann, Tom Vanallemeersch and Kalliopi Zervanou. In: Computational Linguistics in the Netherlands Journal, volume 7, pages 53-64, 2017, ISSN 2211-4009. (bibtex)

Finding Interesting Persons in the Dutch Political Twitter Landscape, by Erik Tjong Kim Sang. Internal report Netherlands eScience Center, 13 April 2017, 5 pages. (bibtex)

Identifying Dialect Regions from Syntactic Data, by Erik Tjong Kim Sang. In: From Semantics to Dialectometry - Festschrift in honor of John Nerbonne, editors: Martijn Wieling, Martin Kroon, Gertjan van Noord and Gosse Bouma. College Publications, Milton Keynes, pages 367-373, 2017, ISBN 978-1-84890-230-5. (bibtex)

2016

Finding Rising and Falling Words, by Erik Tjong Kim Sang. In: Proceedings of the COLING 2016 workshop Language Technology Resources and Tools for Digital Humanities, ACL, Osaka, Japan, 2016 (bibtex; software & data)

Verb inflection in the SAND, by Erik Tjong Kim Sang. Internal Report, Meertens Institute, Amsterdam, The Netherlands, 21 October 2016. (bibtex; software)

Nederlab: Towards a Single Portal and Research Environment for Diachronic Dutch Text Corpora, by Hennie Brugman, Martin Reynaert, Nicoline van der Sijs, René van Stipriaan, Erik Tjong Kim Sang and Antal van den Bosch. Proceedings of LREC 2016, Tenth International Conference on Language Resources and Evaluation, ELRA, Portoroz, Slovenia, pages 1277-1281, 2016, ISBN 978-2-9517408-9-1. (bibtex).

Improving Part-of-Speech Tagging of Historical Text by First Translating to Modern Text, by Erik Tjong Kim Sang. 2nd IFIP International Workshop on Computational History and Data-Driven Humanities, editors: Bozic, Mendel-Gleason, Debruyne and O'Sullivan, Springer Verlag, 2016, ISBN 978-3-319-46223-3, doi:10.1007/978-3-319-46224-0. (bibtex)

Visualizing Literary Data, by Erik Tjong Kim Sang. Second Workshop on Visualization as added value in the development, use and evaluation of language resources (VisLR II), LREC 2016, ELRA, Portoroz, Slovenia, pages 30-37, 2016. (bibtex; software access)

2015

Voronoi Diagrams without Bounding Boxes, by Erik Tjong Kim Sang. In: Proceedings of the Joint International Geoinformation Conference 2015, Kuala Lumpur, Malaysia, 2015. (bibtex)

Discovering Dialect Regions in Syntactic Dialect Data, by Erik Tjong Kim Sang. Workshop European Dialect Syntax VIII - Edisyn 2015, Zurich, Switzerland, 2015. (bibtex)

2014

Using Tweets for Assigning Sentiments to Regions, by Erik Tjong Kim Sang. In: Proceedings of the 5th International Workshop on Emotion, Social Signals, Sentiment and Linked Open Data at LREC2014, Reykjavik, Iceland, 2014. (bibtex)

SAND: Relation between the Database and Printed Maps, by Erik Tjong Kim Sang. Internal Report, Meertens Institute, Amsterdam, The Netherlands, 16 May 2014. (bibtex)

Finding Syntactic Characteristics of Surinamese Dutch, by Erik Tjong Kim Sang. Internal Report, Meertens Institute, Amsterdam, The Netherlands, 6 March 2014. (bibtex; see also Nicoline's paper)

Verwerking van achttiende-eeuws Nederlands met Frog, by Erik Tjong Kim Sang. Internal Report, Meertens Institute, Amsterdam, The Netherlands, 13 February 2014. (in Dutch, bibtex)

2013

Dealing with Big Data: the Case of Twitter, by Erik Tjong Kim Sang and Antal van den Bosch. In: Computational Linguistics in the Netherlands Journal, volume 3, ISSN: 2211-4009, pages 121-134, 2013. (bibtex)

Large Scale Syntactic Annotation of Written Dutch: Lassy, by G. van Noord, G. Bouma, F. van Eynde, D. de Kok, J. van der Linde, I. Schuurman, E. Tjong Kim Sang, and V. Vandeghinste. In: Peter Spyns and Jan Odijk (eds.), Essential Speech and Language Technology for Dutch, Springer, 2013, ISBN 978-3-642-30909-0.

Cornetto: a Combinatorial Lexical Semantic Database for Dutch, by P. Vossen, I. Maks, R. Segers, H. van der Vliet, M-F. Moens, K. Hofmann, E. Tjong Kim Sang, and M. de Rijke. In: Peter Spyns and Jan Odijk (eds.), Essential Speech and Language Technology for Dutch, Springer, 2013, ISBN 978-3-642-30909-0.

2012

Predicting the 2011 Dutch Senate Election Results with Twitter, by Erik Tjong Kim Sang and Johan Bos. Proceedings of SASN 2012, the EACL 2012 Workshop on Semantic Analysis in Social Networks, ACL, Avignon, France, 2012, pages 53-60 (bibtex, data.zip). (more than 200 citations)

2011

Extraction of Hypernymy Information from Text, by Erik Tjong Kim Sang, Katja Hofmann and Maarten de Rijke. In: Antal van den Bosch and Gosse Bouma, Interactive Multi-modal Question-Answering. Series: Theory and Applications of Natural Language Processing, Springer-Verlag Berlin Heidelberg, 2011, pages 223-245 (preprint).

Het gebruik van Twitter voor Taalkundig Onderzoek, by Erik Tjong Kim Sang. TABU: Bulletin voor Taalwetenschap, volume 39, number 1/2, 2011, pages 62-72 (in Dutch).

2010

Using a Treebank for Finding Opposites, by Anna Lobanova, Gosse Bouma and Erik Tjong Kim Sang. Proceedings of TLT9, Tartu, Estonia, 2010, pages 139-150.

A Baseline Approach for Detecting Sentences Containing Uncertainty, by Erik Tjong Kim Sang. Proceedings of CoNLL-2010 (shared task paper), Uppsala, Sweden, 2010, pages 148-150.

GikiCLEF: Crosscultural issues in multilingual information access, by Diana Santos, Luís Miguel Cabral, Corina Forascu, Pamela Forner, Fredric Gey, Katrin Lamm, Thomas Mandl, Petya Osenova, Anselmo Peñas, Álvaro Rodrigo, Julia Schulz, Yvonne Skalban and Erik Tjong Kim Sang. Proceedings of LREC (poster), Malta, 2010, pages 2346-2353.

2009

Computational Linguistics and the History of Science, by John Nerbonne, John Kizito, Ismail Fahmi, Erik Tjong Kim Sang and Gosse Bouma. In: Storia della Scienza e Linguistica Computazionale, editor Liborio Dibattista, FrancoAngeli, 2009, pages 55-73 (preprint).

Lexical Patterns or Dependency Patterns: Which Is Better for Hypernym Extraction?, by Erik Tjong Kim Sang and Katja Hofmann. In: Proceedings of CoNLL-2009, Boulder, CO, USA, 2009, pages 174-182. (bibtex)

Automatic Relation Extraction - Can Synonym Extraction Benefit from Antonym Knowledge?, by Anna Lobanova, Jennifer Spenader, Tim Van de Cruys, Tom van der Kleij and Erik Tjong Kim Sang. In: Proceedings of WordNets and other Lexical Semantic Resources - between Lexical Semantics, Lexicography, Terminology and Formal Ontologies (NODALIDA2009 workshop), Odense, Denmark, 2009, pages 17-20.

To Use a Treebank or Not - Which Is Better for Hypernym Extraction?, by Erik Tjong Kim Sang. In Proceedings of the Seventh International Workshop on Treebanks and Linguistic Theories (TLT 7), Groningen, The Netherlands, 2009, pages 171-176 (short paper).

2008

Overview of the CLEF 2008 Multilingual Question Answering Track, by Pamela Forner, Anselmo Peñas, Iñaki Alegria, Corina Forascu, Nicolas Moreau, Petya Osenova, Prokopis Prokopidis, Paulo Rocha, Bogdan Sacaleanu, Richard Sutcliffe and Erik Tjong Kim Sang. In Working Notes for the CLEF 2008 Workshop, Aarhus, Denmark, 2008.

The University of Amsterdam's Question Answering System at QA@CLEF 2007, by Valentin Jijkoun, Katja Hofmann, David Ahn, Mahboob Alam Khalid, Joris van Rantwijk, Maarten de Rijke and Erik Tjong Kim Sang. In Lecture Notes in Computer Science, volume 5152, Springer Berlin / Heidelberg, 2008.

2007

Practical applications of stand-off annotation, by Martha Larson, Valentin Jijkoun, Jobst Löffler and Erik Tjong Kim Sang. In Sprache und Datenverarbeitung, volume 31, number 1-2, pp. 115-129, 2007.

Automatic Extraction of Dutch Hypernym-Hyponym Pairs, by Erik Tjong Kim Sang and Katja Hofmann. In Proceedings of CLIN-2006, Leuven, Belgium, 2007. (bibtex)

The University of Amsterdam at the TREC 2007 QA Track, by Katja Hofmann, Valentin Jijkoun, Mahboob Alam Khalid, Joris van Rantwijk and Erik Tjong Kim Sang. In Notebook of the Sixteenth Text Retrieval Conference (TREC 2007), NIST, 2007.

The University of Amsterdam at CLEF@QA 2007, by Valentin Jijkoun, Katja Hofmann, David Ahn, Mahboob Alam Khalid, Joris van Rantwijk, Maarten de Rijke and Erik Tjong Kim Sang. In Working Notes for the CLEF 2007 Workshop, Budapest, Hungary, 2007.

Entity Retrieval, by Sisay Fissaha Adafre, Maarten de Rijke and Erik Tjong Kim Sang. In: Proceedings of RANLP 2007, Borovets, Bulgaria, 2007.

Automatic Extension of Non-English WordNets, by Katja Hofmann and Erik Tjong Kim Sang. In Proceedings of SIGIR'07 (poster), Amsterdam, The Netherlands, 2007.

An Experiment in Automatic Classification of Pathological Reports by Janneke van der Zwaan, Erik Tjong Kim Sang and Maarten de Rijke. In Proceedings of AIME 2007, Amsterdam, The Netherlands, 2007.

A Constraint Satisfaction Approach to Dependency Parsing, by Sander Canisius and Erik Tjong Kim Sang. In Proceedings of EMNLP-CoNLL 2007 (CoNLL shared task paper), Prague, Czech Republic, 2007.

Extracting Hypernym Pairs from the Web, by Erik Tjong Kim Sang. In Proceedings of ACL 2007 (poster), Prague, Czech Republic, 2007. (bibtex, pdf poster)

The Cornetto Database: Architecture and User-Scenarios, by Piek Vossen, Katja Hofmann, Maarten de Rijke, Erik Tjong Kim Sang and Koen Deschacht. In Proceedings of DIR 2007, Leuven, Belgium, 2007.

2006

The University of Amsterdam at QA@CLEF 2006, by Valentin Jijkoun, Joris van Rantwijk, David Ahn, Erik Tjong Kim Sang and Maarten de Rijke. In Working Notes for the CLEF 2006 Workshop, Alicante, Spain, 2006.

Dependency Parsing by Inference over High-recall Dependency Predictions, by Sander Canisius, Toine Bogers, Antal van den Bosch, Jeroen Geertzen and Erik Tjong Kim Sang. In Proceedings of the Tenth Conference on Natural Language Learning (CoNLL-X), New York City, USA, 2006.

Towards a Multi-Stream Question Answering-As-XML-Retrieval Strategy, by David Ahn, Sisay Fissaha, Valentin Jijkoun, Karin Müller, Maarten de Rijke and Erik Tjong Kim Sang. In Proceedings of the Fourteenth Text Retrieval Conference (TREC 2005), NIST, 2006.

2005

The University of Amsterdam at QA@CLEF 2005, by David Ahn, Valentin Jijkoun, Karin Müller, Maarten de Rijke and Erik Tjong Kim Sang. In Working Notes for the CLEF 2005 Workshop, Vienna, Austria, 2005.

Developing Offline Strategies for Answering Medical Questions, by Erik Tjong Kim Sang, Gosse Bouma and Maarten de Rijke. In Proceedings of the AAAI-05 Workshop on Question Answering in Restricted Domains, Pittsburgh, PA, USA, pp. 41-45. [data]

Applying spelling error techniques for improving semantic role labelling, by Erik Tjong Kim Sang, Sander Canisius, Antal van den Bosch and Toine Bogers. In Proceedings of CoNLL-2005, Ann Arbor, MA, USA, pp. 229-232.

2004

Reduction of Dutch Sentences for Automatic Subtitling, by Erik F. Tjong Kim Sang, Walter Daelemans and Anja Höthker. In: Proceedings of CLIN-2003, University of Antwerp, Antwerp Papers in Linguistics, 111, 2004, pp. 109-123.

Automatic Sentence Simplification for Subtitling in Dutch and English, by Walter Daelemans, Anja Höthker and Erik Tjong Kim Sang. In: Proceedings of LREC-2004, Lisbon, Portugal, 2004, (more than 50 citations) pp. 1045-1048.

Using a Parallel Transcript/Subtitle Corpus for Sentence Compression, by Vincent Vandeghinste and Erik Tjong Kim Sang. In: Proceedings of LREC-2004, Lisbon, Portugal, 2004, pp. 231-234.

Memory-based semantic role labeling: Optimizing features, algorithm, and output, by Antal van den Bosch, Sander Canisius, Walter Daelemans, Iris Hendrickx and Erik Tjong Kim Sang. In: Proceedings of CoNLL-2004, Boston, MA, USA, 2004, pp. 102-105. [bibtex]

2003

Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition, by Erik F. Tjong Kim Sang and Fien De Meulder. In: Proceedings of CoNLL-2003, Edmonton, Canada, 2003, pp. 142-147. [bibtex] (more than 1000 citations)

Generating Subtitles from Linguistically Annotated Text, by Erik F. Tjong Kim Sang. Internal report Atranos project, WP4-12, University of Antwerp, 2003, 19 pages.

2002

Memory-Based Named Entity Recognition, by Erik F. Tjong Kim Sang. In: Proceedings of CoNLL-2002, Taipei, Taiwan, 2002, pp. 203-206. [bibtex]

Introduction to the CoNLL-2002 Shared Task: Language-Independent Named Entity Recognition, by Erik F. Tjong Kim Sang. In: Proceedings of CoNLL-2002, Taipei, Taiwan, 2002, pp. 155-158. [bibtex]

Memory-Based Shallow Parsing, by Erik F. Tjong Kim Sang. In Journal of Machine Learning Research, volume 2 (March), 2002, pp. 559-594. [bibtex] (more than 100 citations)

2001

Transforming a Chunker to a Parser, by Erik F. Tjong Kim Sang. In: Walter Daelemans, Khalil Sima'an, Jorn Veenstra and Jakub Zavrel (eds.), Computational Linguistics in the Netherlands 2000, Rodopi, 2001, pp. 177-188. [bibtex]

Learning Computational Grammars, by John Nerbonne, Anja Belz, Nicola Cancedda, Hervé Déjean, James Hammerton, Rob Koeling, Stasinos Konstantopoulos, Miles Osborne, Franck Thollard and Erik F. Tjong Kim Sang. In: Walter Daelemans and Rémi Zajac (eds.), Proceedings of CoNLL-2001, Toulouse, France, 2001, pp. 97-104. [bibtex]

Memory-Based Clause Identification, by Erik F. Tjong Kim Sang. In: Walter Daelemans and Rémi Zajac (eds.), Proceedings of CoNLL-2001, Toulouse, France, 2001, pp. 67-69. [bibtex]

Introduction to the CoNLL-2001 Shared Task: Clause Identification, by Erik F. Tjong Kim Sang and Hervé Déjean. In: Walter Daelemans and Rémi Zajac (eds.), Proceedings of CoNLL-2001, Toulouse, France, 2001, pp. 53-57. [bibtex] (more than 50 citations)

Combining a self-organising map with memory-based learning, by James Hammerton and Erik F. Tjong Kim Sang. In: Walter Daelemans and Rémi Zajac (eds.), Proceedings of CoNLL-2001, Toulouse, France, 2001, pp. 9-14. [bibtex]

2000

Noun Phrase Recognition by System Combination, by Erik F. Tjong Kim Sang. In Antal van den Bosch and Hans Wiegaard (eds.), Proceedings of the Twelfth Belgium-Netherlands Artificial Intelligence Conference (BNAIC'00), Tilburg, The Netherlands, 2000, pp. 335-336 (extended abstract of ANLP-NAACL 2000 paper). [bibtex]

Meta-Learning for Phonemic Annotation of Corpora, by Véronique Hoste, Walter Daelemans, Erik Tjong Kim Sang and Steven Gillis. In Antal van den Bosch and Hans Wiegaard (eds.), Proceedings of the Twelfth Belgium-Netherlands Artificial Intelligence Conference (BNAIC'00), Tilburg, The Netherlands, 2000, pp. 331-332 (extended abstract of ICML 2000 paper). [bibtex]

Learning the Logic of Simple Phonotactics, by Erik F. Tjong Kim Sang and John Nerbonne. In James Cussens and Saso Dzeroski (eds), Learning Language in Logic, Lecture Notes in Computer Science, vol. 1925, Springer Verlag, 2000, pp. 110-124. [bibtex]

Introduction to the CoNLL-2000 Shared Task: Chunking, by Erik F. Tjong Kim Sang and Sabine Buchholz 2000e. In Proceedings of CoNLL-2000, Lisbon, Portugal, 2000, pp. 127-132. [bibtex] (more than 500 citations)

Text Chunking by System Combination, by Erik F. Tjong Kim Sang. In Proceedings of CoNLL-2000, Lisbon, Portugal, 2000, pp. 151-153. [bibtex] (more than 50 citations)

Applying System Combination to Base Noun Phrase Identification, by Erik F. Tjong Kim Sang, Walter Daelemans, Hervé Déjean, Rob Koeling, Yuval Krymolowski, Vasin Punyakanok and Dan Roth, 2000c. In Proceedings of COLING 2000, Saarbrücken, Germany. Morgan Kaufman Publishers, 2000, pp. 857-863. [bibtex]

Meta-Learning for Phonemic Annotation of Corpora, by Véronique Hoste, Walter Daelemans, Erik Tjong Kim Sang and Steven Gillis. In Proceedings of Seventeenth International Conference on Machine Learning, Stanford University, USA. Morgan Kaufman Publishers, 2000, pp. 375-382. [bibtex]

Noun Phrase Recognition by System Combination, by Erik F. Tjong Kim Sang. In Proceedings of ANLP-NAACL 2000, Seattle, Washington, USA. Morgan Kaufman Publishers, 2000, pp. 50-55. [bibtex] (more than 50 citations)

1999

Learning Simple Phonotactics by Erik F. Tjong Kim Sang and John Nerbonne. In C. Lee Giles and Ron Sun, eds. Proceedings of the Workshop on Neural, Symbolic, and Reinforcement Methods for Sequence Processing, ML2 workshop at IJCAI'99, Stockholm, Sweden, 1999, pp. 41-46. [bibtex]

CoNLL-99, Computational Language Learning (workshop proceedings), by Miles Osborne and Erik Tjong Kim Sang. Association for Computational Linguistics, 1999.

Representing Text Chunks, by Erik F. Tjong Kim Sang and Jorn Veenstra. In Proceedings of EACL'99, Bergen, Norway. Morgan Kaufman Publishers, 1999, pp. 173-179. [bibtex] (more than 200 citations)

1998

Machine Learning of Phonotactics, by Erik F. Tjong Kim Sang. PhD thesis, University of Groningen, 1998. [bibtex]

CATCH: A Program for Developing World Wide Web CALL Material, by Erik F. Tjong Kim Sang. In Proceedings of the 11th Nordic Conference on Computational Linguistics, Copenhagen, Denmark. Center for Sprogteknologi, 1998.

1996

Converting the Scania Framemaker Documents to TEI SGML, by Erik F. Tjong Kim Sang. Internal report, Department of Linguistics, Uppsala University, 1996.

Aligning the Scania Corpus by Erik F. Tjong Kim Sang. Internal report, Department of Linguistics, Uppsala University, 1996.

1995

The Limitations of Modeling Finite State Grammars with Simple Recurrent Networks, by Erik F. Tjong Kim Sang. In Toine Andernach and Anton Nijholt editors, Computational Linguistics in The Netherlands (CLIN) 1994, University of Twente, 1995.

1993

Acquiring Digital Phonology by Erik F. Tjong Kim Sang. In Wietske Sijtsma and Olga Zweekhorst editors, Computational Linguistics in The Netherlands (CLIN) 1992, ITK Tilburg, 1993.

1992

Machine Learning of Natural Language, by Erik F. Tjong Kim Sang. In Dicky Gilbers and Sietze Looyenga editors, Language and Cognition 2, University of Groningen, 1992.

Strategieën voor LINGO (Strategies for the Lingo Game) by Erik F. Tjong Kim Sang. In H. de Swaan Arons and H. Koppelaar and E.J.H. Kerckhoffs editors, Proceedings Nederlandstalige AI Conferentie 1992, Delft University of Technology, 1992. In Dutch.

A Connectionist Representation for Phrase Structure, by Erik F. Tjong Kim Sang. In M. Drossaers and A. Nijholt editors, Proceedings of the Twente Workshop on Language Technology III, University of Twente, 1992.

1991

A Connectionist View at Knowledge Representation, by Erik F. Tjong Kim Sang. In Mark Kas and Eric Reuland and Co Vet editors, Language and Cognition 1, University of Groningen, 1991.

1988

Partiële evaluatie van Prolog en haar toepassing bij het genereren van abstracte Prolog machines (Partial Evaluation of Prolog) by Erik F. Tjong Kim Sang. Masters thesis, Department of Technical Computing Science, Delft University of Technology, 1988. In Dutch.


Home page
Last update: November 16, 2016. erikt(at)xs4all.nl