Parse tree of the phrase 'The Riddle of Literary Quality'

Academic homepage of Andreas van Cranenburgh

I am a postdoc at Heinrich Heine Universität Düsseldorf in the Beyond CFG project. I was previously a PhD candidate in the project The Riddle of Literary Quality. My primary interests are statistical parsing and syntactic patterns, with particular interest in tree fragments and discontinuous constituents.

Mail: cranenburgh@phil.hhu.de
Code: https://github.com/andreasvc and https://gist.github.com/andreasvc/
Profiles: Google Scholar; Semantic Scholar.

Education

Peer reviewed publications (bibtex)

Corina Koolen, Andreas van Cranenburgh (2017).
Blue eyes and porcelain cheeks: Computational extraction of physical descriptions from Dutch chick lit and literary novels.
Digital Scholarship Humanities, to appear.
https://doi.org/10.1093/llc/fqx016

Corina Koolen, Andreas van Cranenburgh (2017).
These are not the Stereotypes You are Looking For: Bias and Fairness in Authorial Gender Attribution.
Proceedings of the First Ethics in NLP workshop, pp. 12-22.
http://aclweb.org/anthology/W17-1602 (notebook)

Andreas van Cranenburgh, Rens Bod (2017).
A Data-Oriented Model of Literary Language.
Proceedings of EACL, pp. 1228-1238.
http://aclweb.org/anthology/E17-1115 (code; slides; Q&A)

Andreas van Cranenburgh, Remko Scha, Rens Bod (2016).
Data-Oriented Parsing with Discontinuous Constituents and Function Tags.
Journal of Language Modelling, vol. 4, no. 1, pp. 57-111.
http://dx.doi.org/10.15398/jlm.v4i1.100 (code; grammars)

Kim Jautze, Andreas van Cranenburgh, Corina Koolen (2016).
Topic Modeling Literary Quality.
Digital Humanities 2016, Krakow, Poland, 11-16 July.
http://andreasvc.github.io/dh2016.pdf

Andreas van Cranenburgh, Corina Koolen (2015).
Identifying Literary Novels with Bigrams.
Proceedings of the Fourth Workshop on Computational Linguistics for Literature, pp. 58-67.
http://aclweb.org/anthology/W15-0707 (poster)

Federico Sangati, Andreas van Cranenburgh (2015).
Multiword Expression Identification with Recurring Tree Fragments and Association Measures.
Proceedings of the 11th Workshop on Multiword Expressions, pp. 10-18.
http://aclweb.org/anthology/W15-0902 (slides)

Andreas van Cranenburgh (2014).
Extraction of Phrase-Structure Fragments with a Linear Average Time Tree Kernel.
Computational Linguistics in the Netherlands Journal, vol. 4, pp. 3-16.
paper

Dirk Roorda, Gino Kalkman, Martijn Naaijer, Andreas van Cranenburgh (2014).
LAF-Fabric: a data analysis tool for Linguistic Annotation Framework with an application to the Hebrew Bible.
Computational Linguistics in the Netherlands Journal, vol. 4, pp. 105-120.
paper

Andreas van Cranenburgh, Rens Bod (2013).
Discontinuous Parsing with an Efficient and Accurate DOP Model.
Proceedings of the International Conference on Parsing Technologies, Nara, Japan, 27-29 November.
paper; slides; code; notes.

Kim Jautze, Corina Koolen, Andreas van Cranenburgh, Hayco de Jong (2013).
From high heels to weed attics: a syntactic investigation of chick lit and literature.
Proceedings of the Computational Linguistics for Literature workshop, Atlanta, Georgia, June 14.
http://aclweb.org/anthology/W13-1410 (slides)

Andreas van Cranenburgh (2012).
Literary authorship attribution with phrase-structure fragments.
Proceedings of the Computational Linguistics for Literature workshop, pp. 59-63.
http://aclweb.org/anthology/W12-2508 (code, slides, revised paper—includes results on Federalist papers).

Andreas van Cranenburgh (2012).
Efficient parsing with linear context-free rewriting systems.
Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Avignon, France, April 23–27.
http://aclweb.org/anthology/E12-1047 (poster, errata, corrected version, code).

Maria Aloni, Andreas van Cranenburgh, Raquel Fernández, Marta Sznajder (2012).
Building a Corpus of Indefinite Uses Annotated with Fine-grained Semantic Functions.
The eighth international conference on Language Resources and Evaluation (LREC), Istanbul, Turkey, May 23–25.
http://www.lrec-conf.org/proceedings/lrec2012/pdf/362_Paper.pdf

Andreas van Cranenburgh, Remko Scha, Federico Sangati (2011).
Discontinuous Data-Oriented Parsing: A mildly context-sensitive all-fragments grammar.
Proceedings of the 2nd Workshop on Statistical Parsing of Morphologically-Rich Languages (SPMRL), pages 34–44, Dublin, Ireland, October 6.
http://aclweb.org/anthology/W11-3805 (slides, template for slides, code).

Andreas van Cranenburgh, Galit Sassoon, Raquel Fernández (2010).
Invented antonyms: Esperanto as a semantic lab.
Proceedings of the 26th Annual Meeting of the Israel Association for Theoretical Linguistics (IATL 26).
http://dare.uva.nl/en/record/371912

Reports

Andreas van Cranenburgh (2012).
Extracting tree fragments in linear average time.
ILLC technical report. http://dare.uva.nl/en/record/421534

Teaching

Talks