Convert to text plugin bug?
I think we've come across a bug in the Convert to Text plugin.
In the plugin, line 336, you are trying to determine the installed version of Java
list($release,$version,$build) = explode(".", $versionString);
$version was returning 0 because the installed version was 11.0.8.
You were looking for the middle value, which is wrong. You should be looking at the first value to determine the version.
I can see the plugin was last updated back in 2017 so maybe the Java version was different back then...
We've updated the plugin at our end but thought we should let you know.
Overall this plugin is working well for us however we are noticing that some .docx and .xlsx files are not being converted to text.
For example, when I upload .xlsx files these are never converted to text however it works for .xls files. I am also able to create and upload .docx files and these are converted to text correctly but when I use an existing .docx file from a sample file website (e.g. https://file-examples-com.github.io/uploads/2017/02/file-sample_100kB.docx) this never converts.
Are there any known issues with .xlsx and .docx file types or could there be some code that needs checking over?
Any help would be hugely appreciated.
Upon checking this issue, I realized that upgrading your version of Tika to the most recent version fixes the parsing of text for .xlsx files and .docx files that were taken from the internet.
I have made a copy of my modified converttotext plugin folder that has been modified in order to make them work.
Simply download the link, go to your plugins folder, and replace the "convertToText" folder with the "convertToText" folder that is the zip file in the link.
If you have any more questions, feel free to message me.
PHP Programmer (In Training)
Ok great, glad it's working. I've updated the plugin online here as well for anyone else who needs it: