Honours Project Title:
Automatic Text Summarization for English Language
In this day and age with a vast stream of information, most of the data around us are in the form of text. Textual summarization has become a necessity in order to provide more compact information to users, thereby eliminating the need to read more text than is required. This report provides a comprehensive overview of the problem and methods of textual summarization as it pertains to English language. For this project, two extractive summarization techniques were implemented, and compared with each other (along with other summarization engines publicly available) to discover the relationships, dierences, strengths and weaknesses of these summarization techniques on the various data that they summarized. Consequentially, a summarization engine was also created and made publicly available on the web.
Please find in the zip file, the final report in PDF along with the full source code for the project. In order to run the project successfully you must have the nltk library installed. For a demo of the engine, please visit: http://dabbyndubisi.me:3001