Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Test xpdf with php
#25
(11-29-2012, 12:49 AM)Daniel Wrote: Hi Uwe,

Sounds a little bit like rocket science to me :-)

I tried to do various changes, including just using your suggestion. The fulltext index goes trough, but the documents are not indexed. I tested them before in the command line to ensure that they can be read out and used for a fulltext index, they are OK.

Sometimes (but unfortunately not always) the httpd error logs shows me these lines:

sh: catdoc: not found
sh: pdftotext: not found

One for each document, depending on their types.

Is there a debug option in LetoDMS where i can see what the fulltext indexer does and what he accesses during the process? The logs on my system are very poor and messages like the above ones are not really helpful.

Also, as tested above, PHP can access both tools, but there may be just a small bit wrong and i want to find this out.

Indexing in general works by calling a programm which turns the content of the document into a plain text file (basically a list of words). Such a programm can be configured for each document mimetype. A '%s' in the command will be replaced by the documents filename. LetoDMS takes the output of that programm, removes stop words and indexes the remaining words. On Unix this is a quite common solution. The crutial part in your case seems to be the path to those programms like catdoc and pdftotext. They are called by the web server and therefore you will have to call them with their full path or make sure the path is in the $PATH variable.
This has nothing todo with php, it is the shell environment.

The only way to debug this, is to dig into code an place some echos
at the right place. But I doubt this is needed in your case.

Uwe
Reply


Messages In This Thread
Test xpdf with php - by Daniel - 11-19-2012, 12:04 AM

Forum Jump:


Users browsing this thread: 1 Guest(s)