The goals for this lab are to reduce the MT output, by working with the VPM (variable property mapping) and doing further morphological clean up and/or adding constraints to semantically empty auxiliaries, and to get one more phenomenon working (preferably adnominal possession, clausal complements, or wh questions if you don't already have these working). We will primarily be working by editing tdl this week, but might consult the customization system for inspiration on how to implement things. You will also use [incr tsdb()] to compare the initial and final state of your grammar for the week over the testsuites.
For tdl editing, please practice incremental development: Test as frequently as you possibly can, both by compiling the grammar and by testing specific sentences.
This lab entails the following general steps, which are not (fully) ordered with respect to each other.
You may have noticed that you get many variants on generation if you start with a form that is underspecified for e.g., aspect or evidentiality. We can get a handle on this by using variable property mapping to supply default values in the unmarked case (either in monolingual generation or in the MT scenario). The basic strategy is to take any underspecified values in variable properties and translate them, via vpm, to something that conflicts with any more specific values your grammar can produce.
The file semi.vpm provides a mapping between grammar-external features of indices (referential indices and events) and their values, and grammar-internal ones. For background on VPM, see the DELPH-IN wiki. As soon as you start using a VPM file, then only variable properties (features on indices) that are handled in the file are actually preserved.
(mt:read-vpm (lkb-pathname (parent-directory) "semi.vpm") :semi)
E.TENSE : E.TENSE present <> present past <> past future <> future * <> *To look like this:
E.TENSE : TENSE present <> present past <> past future <> future * <> *
no-aspect := aspect.
* >> no-aspect no-aspect << [e]
perfective << [e]
Test your semi.vpm file by doing self-translation (i.e. translating from your language to your language). You should see fewer strings coming out.
Here you'll be doing tdl editing to add constraints to morphology that is underconstrained (so showing up all over the place...) If the problem is with semantically empty words, we'll be adding constraints to the words or possibly refining trigger rules. Note that with semantically empty words, the problem might be that they are appearing where they shouldn't be or that they aren't appearing at all (and causing generation to fail).
Please post to Canvas Tuesday with:
I'll reply with suggestions about how to proceed.
For the write-up for this section, I'd like to see both a description of what changes you made and some quantitative observations about how the number of generator outputs is affected.
Here, we'd like to get wh questions OR adnominal possession OR clausal complements working, i.e. the equivalents SOME of these sentences from the eng.txt:
The dog s car sleeps My dogs sleep I think that you know that dogs chase cars I ask whether you know that dogs chase cars Who sleeps What do the dogs chase What do you think the dogs chase Who asked what the dogs chase I asked what the dogs chased
Please post to Canvas Tuesday with:
Depending on which phenomenon you are working on, we might be primarily doing tdl fixes or I might suggest that you get an initial anlaysis from the customization system and merg it with your current grammar.
If the above sentences are already all working, document that. You can optionally post to Canvas with something you'd like to work on instead. OPTIONALLY.
I will advise on Canvas how to proceed in each case!
Following the same procedure as usual, do a test runs over your testsuite.
Collect the following information to provide in your write up:
Your write up should be a plain text file (not .doc, .rtf or .pdf) which includes the following:
tar czf iso-lab6.tgz iso-lab6