textract icon indicating copy to clipboard operation
textract copied to clipboard

antiword error in "is not a word document"

Open Hari-Guhan-s opened this issue 8 years ago • 4 comments

i tried to parse a .doc file through textract latest versio with antiword latest version in ubuntu but is fails saying"filename is not a Word Document " i checked the files MIME Type its application/msword so can you please help me out.

Hari-Guhan-s avatar Nov 24 '17 07:11 Hari-Guhan-s

Hello, I am also facing the same issue.Any lead for this would be appreciated.

Thanks kadir khan

kadir-calvid avatar Feb 22 '18 12:02 kadir-calvid

Can you provide more information such as your python version, textract version, the complete error message and a test doc file?

jpweytjens avatar Aug 27 '19 15:08 jpweytjens

Guys, any solution for this issue? I'm dealing the same problem with that file test1.doc.tar.gz

luishenriquesb avatar Jun 17 '20 13:06 luishenriquesb

me too

Gaoyongxian666 avatar Oct 20 '22 13:10 Gaoyongxian666