Multi Commander > Support and Feedback

MC doesn't find phrases and sentences any more

<< < (2/2)

Mathias (Author):
MC do not and have never decoded doc files.  doc files are binaries files with the own special format.
If you are lucky it might find the wanted text in the file, What it does is a raw binary search of the files. And it depends on how words have saved the file.
You can try to force the binary matching where you select what type of content it is. If it detects is as UTF8 it will try to convert text and then it will go wrong.


Ulfhednar:
Thanks for that info Mathias. 

Could MC be able to read pdf? (i.e if they have had OCR or input text contents.)
I'm curious to know if MC would see & use this http://www.pdflib.com/products/tet-pdf-ifilter/ if installed or is your search process totally discrete from Win search processes.
Thanks

Skinman:

--- Quote from: Mathias (Author) on January 05, 2017, 22:32:16 ---MC do not and have never decoded doc files.  doc files are binaries files with the own special format.


--- End quote ---

Both wrong. There is lots of plain text in doc files and it used to work perfectly with earlier versions, as I wrote. Still does with some files, as I wrote.

Mathias (Author):
It depends on word format and it also depends and what format MC thinks the file is(text,utf8, unicode, binary), and it also depends on how the text in the word doc is formated. Modern Word files are actually zip files. and with them you will not find anything since there is no chance the text exists as plain text.

That you find it before is pure luck.

But if you force it to do a binary check it might find the text as long as the text only have A-Z , 0-9 and no special characters.
and that the text document is not encoded with newer word versions.


Navigation

[0] Message Index

[*] Previous page

Go to full version