This is a read-only archive. Find the latest Linux articles, documentation, and answers at the new Linux.com!

Linux.com

Condensing with Open Text Summarizer

Posted by: Anonymous [ip: 209.30.144.213] on December 15, 2008 08:58 PM
Summarized at a 30% ratio:

Properly speaking, Nadav Rotem's Open Text Summarizer (OTS) is not a summarizer at all. True summaries generally involve rewording contents at a higher level of generality while preserving the meaning, not just producing a condensed version of the original the way that OTS does. However, within its limits, OTS is an efficient tool for automatically producing abstracts of non-fiction, that, in the last 15 months, has received favorable mention from at least four academic publications, including one in which it outperformed similar utilities, including commercial ones such as Copernic and Subject Search Summarizer. You can use the command-line version of OTS for plain text files, including HTML files, although the output for HTML files inevitably includes tags. If you prefer, you can save the output to a file with ots --out=outputfile inputfile. By default, the output file is 20% the length of the input file, based on the number of sentences in the input file. With the AbiWord plugin, you have fewer options, but all you need to do is select Tools -> Summarize, and choose the percentage length of the output file, and the result is entered into a new, unnamed file. You might want to tweak the results to provide a true summary rather than a condensation, but, even so, using OTS requires less time and involves less active thinking than writing a summary from scratch.

#

Return to Condensing with Open Text Summarizer