Posted by: Anonymous
on December 19, 2007 03:13 PM
I have run into the problem mentioned in your article:
"I've found that these utilities sometimes can't process documents that are saved with Word's Fast Save feature, which quickly saves a file by tacking any changes to the end of the file"
I used Catdoc on Linux to convert Word documents to text and then save the text in a database. But after processing more than 13,000 documents, 87 of them produced the error "This was fast-saved N times. Some information is lost". That is an error rate of over 0.6%, which is a little high.
I know the reason is that these documents were saved with the "fast save" option, which Catdoc cannot handle well.
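For anyone who wants to at least isolate the affected files, here is a rough Python sketch of how the warning could be detected during a batch run. It assumes the `catdoc` command is on the PATH and that the warning text matches the message quoted above. The function names are made up for illustration, and I am not sure whether the warning is written to stdout or stderr, so the sketch checks both.

```python
import re
import subprocess

# Pattern based on the message quoted above; the exact wording
# may differ between catdoc versions (assumption).
FAST_SAVE_RE = re.compile(r"fast-saved (\d+) times", re.IGNORECASE)


def fast_save_count(output):
    """Return the fast-save count found in the output, or 0 if there is no warning."""
    match = FAST_SAVE_RE.search(output)
    return int(match.group(1)) if match else 0


def convert(path):
    """Run catdoc on one file and return (text, fast_save_count)."""
    result = subprocess.run(
        ["catdoc", path],
        capture_output=True,
        text=True,
        errors="replace",  # do not crash on bytes that fail to decode
        check=False,
    )
    # Check both streams, since where the warning appears is an assumption here.
    return result.stdout, fast_save_count(result.stdout + result.stderr)
```

Files for which `convert` returns a non-zero count could be put on a separate list and re-saved in Word with fast save turned off, or handled by another tool, instead of going into the database with text missing.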
Has anyone had the same problem, and if so, how did you solve it?
Or does anyone know of a good forum where I could find a solution?
This is very urgent. Thanks.