Towards Large Scale Summarization

dc.contributor.advisorMausam, .en_US
dc.contributor.authorChristensen, Janara Mariaen_US
dc.date.accessioned2015-02-24T17:33:20Z
dc.date.available2015-02-24T17:33:20Z
dc.date.issued2015-02-24
dc.date.submitted2014en_US
dc.descriptionThesis (Ph.D.)--University of Washington, 2014en_US
dc.description.abstractAs the Internet grows and information is increasingly available, it is more and more difficult to understand what is most important without becoming overwhelmed by details. We need systems which can organize this information and present it in a coherent fashion. These systems should also be flexible, enabling the user to tailor the results to his or her own needs. Current solutions such as summarization are static and lack coherent organization. Even structured solutions such as timelines are inflexible. These problems become increasingly important as the size of the information grows. I propose a new approach to scaling up summarization called hierarchical summarization, which emphasizes organization and flexibility. In a hierarchical summary, the top level gives the most general overview of the information, and each subsequent level gives more detail. Hierarchical summarization allows the user to understand at a high level the most important information, and then explore what is most interesting to him or her without being overwhelmed by information. In this work, I formalize the characteristics necessary for good hierarchical summaries and provide algorithms to generate them. I perform user studies which demonstrate the value of hierarchical summaries over competing methods on datasets much larger than used for traditional summarization.en_US
dc.embargo.termsOpen Accessen_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.otherChristensen_washington_0250E_13808.pdfen_US
dc.identifier.urihttp://hdl.handle.net/1773/27448
dc.language.isoen_USen_US
dc.rightsCopyright is held by the individual authors.en_US
dc.subject.otherComputer scienceen_US
dc.subject.othercomputer science and engineeringen_US
dc.titleTowards Large Scale Summarizationen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Christensen_washington_0250E_13808.pdf
Size:
1.7 MB
Format:
Adobe Portable Document Format