Linguistics 567: Grammar Engineering

Lab 7 Due 2/22

Preliminaries

As usual, you'll need to turn in before and after snapshots of your test suite. If you end up modifying your test suite in the course of working on this lab, it can be edifying to run your previous grammar (as submitted for Lab 6) over the new test suite when you're all done and then comparing that to the Lab 7 grammar.

Background

The goal of this lab is to be able to parse the two sentences I can eat glass. It doesn't hurt me., assign them appropriate semantics, and generate back. You have already done some of the work: from previous labs, your grammar should already handle pronouns, case (if applicable), and transitive verbs. You should already have most of the vocabulary required (except can and possibly not). You may need to add the appropriate verb forms, and may get inspired to add some lexical rules for verbal agreement in the process (if applicable, and if you haven't already). In addition, depending on how the sentences translate in your language, you might need to consider a new valence pattern for verbs and a new type of nouns (mass nouns).

The two key components (modals and negation) may have already been handled by the customization system. In that case, you'll be asked to identify something else to improve about your grammar for this week's lab. The instructions below cover a few strategies for negation and modals, in case things aren't working as expected. If your language does something not covered here and not covered by the customization system, talk to me :)

Requirements

  1. Your grammar should be able to parse I can eat glass and It doesn't hurt me and return semantic representations like those below.
  2. Your grammar should be able to generate from the semantic representations of each of these sentences without returning any ungrammatical strings.
  3. If your Lab 6 grammar can't do the above, your assignment for Lab 7 is to get it working so it can.
  4. If your Lab 6 grammar already has the right behavior for these strings, then you should find something else to clean up or extend. In that case, send me a note by Wednesday 2/20 describing what you intended to do.
  5. Either way, your write up will need to describe how negation and modals are working in your language.

Semantic representations

Your semantic representations for the two sentences should look approximately like this, modulo the relations showing up in a different order, the variables (e's, x's, and h's) showing up with different numbers, the SEMSORT information showing up in different places. Also, if your language tends to use prodrop rather than overt pronouns, you might end up without any representation of the pronouns in these sentences. Finally, if you need a complex predicate in place of, say, "hurt", then you'll also have some differences.

can as an auxiliary verb

Use this version if in your language the morpheme expressing the same notion as can is a separate word which takes a VP complement and a subject.

can as a bound morpheme

Use this version if the morpheme expressing the same meaning as can in your language attaches morphologically to the main verb of the sentence.

Negation: markers on either end

This option is for languages that mark negation with particles on either end of the clause or VP (or alternatively, with intonation or [in signed languages] non-manual signs which extend the length of the clause/constituent and are represented in transcription with markers on either end).

If the two markers show up immediately adjacent to the verb (rather than VP or S), consider whether it might be more appropriate to treat them as inflection.

The first thing is to consider whether there is any evidence for attaching the markers one at a time. In these instructions I focus on the case where there is not, so please contact me if you think the markers should attach one at a time in your language. Rather than attach one of these markers before the other, the most straight-forward thing appears to be to create a ternary rule. I've added some types supporting ternary rules to the matrix (included in the patch provided last week).

We're going to take a construction-y approach to analysis, creating a phrase structure rule which calls for specific elements in two of the three daughters and does the right thing in the semantics itself. Specifically, do the following:

Negation as an adverb modifier

If your language uses an adverbial strategy, the customization script probably did the right thing. This is included just in case.

Use this version if your language expresses sentential negation via an adverb which modifies the V, VP or S.

(Note: English has two forms of sentential negation "contracted", which is actually an affix on the verb, cf. Zwicky and Pullum 1983, and the full-form adverb. This adverb is not actually treated syntactically as a modifier in sentential negation, but rather selected by auxiliary verbs, including the do of do-support. For the details of this analysis, see Sag, Wasow and Bender 2003 chapter 13 and Kim and Sag 1995. I would be surprised if another language being treated in this class had a system very similar to the English one, as it seems like a pretty quirky part of English grammar. Further, it's a subtle matter to establish what is actually going on in English, and I don't think anyone would have time in one week to show the same about another language.)

Negation as a verbal affix

If your language uses an adverbial strategy, the customization script probably did the right thing. This is included just in case.

Use this version if your language expresses sentential negation by adding a morpheme to the main verb.

Two-part negation

If your language uses an adverbial strategy, the customization script probably did the right thing. This is included just in case.

Use this version if your language expresses negation with both an affix on the verb and an adverb (e.g., French ne ... pas). If both elements are arguably affixes, you probably just want to write a pair of lexical rules, i.e., take the "Negation as a verbal affix" route, but write two rules and make sure you can require that they both apply or neither apply.

Write up

  1. Describe the facts of sentential negation in your language. Used glossed examples from your test suite.
  2. Describe how you implemented sentential negation in your grammar and/or how the negation analysis provided by the customization system works.
  3. Describe the current coverage of your grammar wrt to the sentential negation facts of the language (syntactic and semantic: are you getting the right strings and only the right strings? Do they get the right meaning?). If you don't have complete coverage, speculate as to what you need to do to get there. If there are particular problematic strings, please include them in the write up so I can try parsing them.
  4. Describe the facts of how the equivalent of can is expressed in your language. Use glossed examples from your test suite.
  5. Describe how you implemented modal "can" in your grammar and/or how the auxiliary analysis provided by the customization system works.
  6. Describe the current coverage of your grammar wrt to the facts of how can is expressed in your language (syntactic and semantic: are you getting the right strings and only the right strings? Do they get the right meaning?). If you don't have complete coverage, speculate as to what you need to do to get there. If there are particular problematic strings, please include them in the write up so I can try parsing them.
  7. If you added anything else to your grammar or refined anything about it, describe what you did, including glossed examples and references to the relevant bits of tdl.

Submit your assignment


Back to main course page