Education

  • Ph.D. student in Computational Linguistics   2017 -- Expected 2022

    Georgetown University

  • Ph.D. student in Linguistics   2016 -- 2017

    SUNY - Stony Brook University

  • M.A. in Linguistics   2015-2016

    Leiden University

  • B.A. in Applied Mathematics, French & Linguistics   2011-2015

    University of California - Berkeley

Interests

  • Natural Language Processing
  • Computational Linguistics
  • Corpus Linguistics
  • Syntax-Semantics Representations
  • Discourse Theories
  • Entities & Coreference

Publications

. GUMBY – A Free, Balanced, and Rich EnglishWeb Corpus . In Proceedings of the International Conference on Language Resources and Evaluation (LREC 2020), Marseille, France, 2020.

. A Corpus of Adpositional Supersenses for Mandarin Chinese . In Proceedings of the International Conference on Language Resources and Evaluation (LREC 2020), Marseille, France, 2020.

Preprint

. Modeling Long-Range Context for Concurrent Dialogue Acts Recognition. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management (CIKM 2019), Beijing, China, 2019.

Preprint Poster Slides Code

. GumDrop at the DISRPT2019 Shared Task: A Model Stacking Approach to Discourse Unit Segmentation and Connective Detection. In Proceedings of the Workshop on Discourse Relation Parsing and Treebanking (DISRPT 2019) at NAACL-HLT 2019, 133-143, Minneapolis, MN, 2019.

PDF

. Adpositional Supersenses for Mandarin Chinese. In Proceedings of the Society for Computation in Linguistics (SCiL 2019), vol. 2, 334–337, New York, NY, 2019.

Preprint Poster

. All roads lead to UD: Converting Stanford and Penn parses to English Universal Dependencies with multilayer annotations. In Proceedings of the Joint Workshop on Linguistic Annotation, Multiword Expressions and Constructions at COLING 2018, 167–177, Santa Fe, NM, 2018.

PDF Poster

Other presentations

. Validating and Merging a Growing Multilayer Corpus: the Case of GUM. Abstract presented at the 14th American Association for Corpus Linguistics (AACL 2018) Conference, Atlanta, GA, 2018.

Slides

Teaching

Georgetown University

  • LING-462/COSC-482: Statistical Machine Translation
    Teaching AssistantSpring 2020
  • LING-469: Analyzing Language Data with R
    Teaching AssistantSpring 2020
  • LING-469: Analyzing Language Data with R
    Teaching AssistantSpring 2019
  • LING-362: Intro: Natural Language Processing
    Teaching AssistantFall 2018

SUNY - Stony Brook University

  • LIN-230: Languages of the World
    Teaching AssistantSpring 2017
  • LIN-200: Language in the U.S.
    Teaching AssistantFall 2016

Reviewing

Primary reviewer

  • SRM@ACL2020    ACL 2020 Student Research Workshop
  • MASC-SLL2020    The 8th Mid-Atlantic Student Colloquium on Speech, Language and Learning
  • WiNLP@ACL2020   Widening NLP Workshop 2020
  • ACL 2020       2020 Annual Conference of the Association for Computational Linguistics
  • WiNLP@ACL2019   Widening NLP Workshop 2019

Secondary reviewer

  • DMR@ACL2019    The First International Workshop on Designing Meaning Representations

Personal