Ling 573 - Natural Language Processing Systems and Applications
Spring 2015
Deliverable #4: Final Summarization Systems
Code and Results due: May 29, 2015: 23:59
Final Report due: June 9, 2015: 17:00


Goals

In this deliverable, you will complete development of your summarization system.

System Enhancement

This final deliverable must include substantive enhancements beyond your baseline system and further extensions over your D3 system.

Content Realization

For this deliverable, one focus will be on improving your systems through enhanced content realization. Content realization can address:

You may build on techniques presented in class, described in the reading list, and proposed in other research articles.

We will also be conducting a manual readability evaluation in addition to the ROUGE content scoring, to give improvements in this area due credit.

Data

We will be focusing on the TAC summarization shared task. Since this is the final deliverable, you will evaluate not only on the 2010 devtest data you have used all term, but also on held-out test data.

Document Collection

Devtest Corpus

The AQUAINT and AQUAINT-2 corpora were used for the summarization task for a number of years and form the devtest corpus. The collections can be found on patas in /corpora/LDC/LDC02T31/ (AQUAINT, 1996-2000) and /corpora/LDC/LDC08T25/ (AQUAINT-2, 2004-2006).

Evaluation Corpus

The held-out document sets for the final evaluation are drawn from the English Gigaword corpus, from years 2007 and 2008. This collection may be found on patas in /corpora/LDC/LDC11T07. (Note: Given the size of this corpus, it's still fine if you use the main corpus as your background corpus.)

Training Data

You may use any of the DUC or TAC summarization data through 2009 for training and developing your system. For previous years, there are prepared document sets and model summaries to allow you to train and tune your summarization system.

All model files appear in /dropbox/14-15/573/Data/models.

All document specification files appear in /dropbox/14-15/573/Data/Documents.

Training data appear in the training subdirectories and devtest data in the devtest subdirectories.

Development Test Data

You should evaluate on the TAC-2010 topic-oriented document sets and their corresponding model summaries. You should only evaluate your system on the 'A' sets. Development test data appear in the devtest subdirectories.

Held-out Evaluation Test Data

You should also evaluate on the TAC-2011 topic-oriented document sets and their corresponding model summaries, again only on the 'A' sets. This evaluation test data appears in the evaltest subdirectories.
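As a rough illustration of how the two evaluation splits might be located, the sketch below builds the Documents and models paths for a named split. The root path and the "Documents", "models", "devtest", and "evaltest" names come from the text above; the helper names split_dirs and list_files are hypothetical, and the assumption that each split is a direct subdirectory of both directories should be checked against the actual layout on patas.

```python
from pathlib import Path

# Root taken from the assignment text; override it when working elsewhere.
DATA_ROOT = Path("/dropbox/14-15/573/Data")


def split_dirs(split, root=DATA_ROOT):
    """Return (documents_dir, models_dir) for a split such as 'devtest' or 'evaltest'.

    Assumption: each split is a direct subdirectory of Documents/ and models/.
    """
    root = Path(root)
    return root / "Documents" / split, root / "models" / split


def list_files(directory):
    """Return the sorted regular files in a directory, or [] if it does not exist."""
    directory = Path(directory)
    if not directory.is_dir():
        return []
    return sorted(p for p in directory.iterdir() if p.is_file())


if __name__ == "__main__":
    for split in ("devtest", "evaltest"):
        docs_dir, models_dir = split_dirs(split)
        print(split, len(list_files(docs_dir)), len(list_files(models_dir)))
```

Keeping the split name as a parameter lets the same pipeline run unchanged on the devtest and evaltest data.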

Scoring

You will employ the standard automatic ROUGE method to evaluate the results from your summarization system.
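For intuition about what ROUGE measures, here is a minimal sketch of ROUGE-N recall against multiple model summaries. It is not the official ROUGE toolkit and should not be used for reported scores: it assumes lowercased whitespace tokenization and applies no stemming, stopword removal, or jackknifing. The function names ngrams and rouge_n_recall are illustrative only.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams (as tuples) in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n_recall(candidate, references, n=2):
    """N-gram recall of a candidate summary against one or more model summaries.

    Matches are clipped: an n-gram in a model summary is matched at most as
    many times as it occurs in the candidate. Counts are pooled over all models.
    """
    cand_counts = ngrams(candidate.lower().split(), n)
    matched = 0
    total = 0
    for ref in references:
        ref_counts = ngrams(ref.lower().split(), n)
        matched += sum(min(count, cand_counts[gram]) for gram, count in ref_counts.items())
        total += sum(ref_counts.values())
    return matched / total if total else 0.0


if __name__ == "__main__":
    system = "the cat sat on the mat"
    models = ["the cat lay on the mat"]
    print(rouge_n_recall(system, models, n=1))
    print(rouge_n_recall(system, models, n=2))
```

Because the measure is recall-oriented, longer summaries tend to score higher, which is why summaries are evaluated under a fixed length limit.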

Outputs

Create two directories under the outputs directory containing the summaries produced by running your final summarization system, as follows:

Completing the project report

This final version should include all required sections, as well as a complete system architecture description and proper bibliography including all and only the papers you have actually referenced. See this document for full details. The project report must also include a substantive error analysis. Please name your report D4.pdf.

Presentation

Your presentation may be prepared in any computer-projectable format, including HTML, PDF, PPT, and Word. Your presentation should take about 10 minutes to cover your main content.

Your presentation should be deposited in your doc directory, but it is not due until the actual presentation time. You may continue working on it after the main deliverable is due.

Summary