Hi, I need Sphider to index all documents, but when searching I need to somehow pass a 'user' parameter so that only documents owned by the current user are searched.
What is Xpdf? Xpdf is a free PDF viewer and toolkit that includes a text extractor, an image converter, an HTML converter, and more. Most of the tools are available as open source.
One of my customers has an insane amount of PDF and Microsoft Word DOC files on their website. These files are core to their online services, so it's not as though they're garbage files sitting on the server. My customer wanted their website's search engine (Sphider) to read these PDF and DOC files so that their clients could get at the documents they needed without going through a bunch of summary pages first. I was successful in the task, so let me show you how to read PDF and DOC files using PHP.
Reading PDF Files
To read PDF files, you will need to install the Xpdf package, which includes 'pdftotext'. Once you have Xpdf/pdftotext installed, you run the following PHP statement to get the PDF text:
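The statement itself did not survive in this copy, so here is a minimal sketch of what such a call might look like rather than the original code. It assumes the pdftotext binary is on the server's PATH and relies on pdftotext writing to standard output when the output file is given as "-". The function names pdftotext_command and read_pdf_text are hypothetical.

```php
<?php
// Hypothetical helper: builds the pdftotext command line for one file.
// Assumes pdftotext is on PATH; "-" as the output file sends text to stdout.
function pdftotext_command(string $pdfPath): string
{
    // escapeshellarg() guards against spaces and shell metacharacters in the path.
    return 'pdftotext ' . escapeshellarg($pdfPath) . ' -';
}

// Hypothetical helper: returns the extracted text, or '' if nothing could be read.
function read_pdf_text(string $pdfPath): string
{
    if (!is_readable($pdfPath)) {
        return '';
    }
    $output = shell_exec(pdftotext_command($pdfPath));
    // shell_exec() does not return a string when the command fails or prints nothing.
    return is_string($output) ? $output : '';
}
```

Calling read_pdf_text() with the path to a PDF on disk would give you the plain text to hand to the indexer. Keeping the command builder separate makes it easy to log or inspect exactly what is being run.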
Reading DOC Files
Like the PDF example above, you'll need to download another package. This package is called Antiword. Here's the code to grab the Word DOC content:
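The Antiword snippet is also missing from this copy. As a rough sketch under similar assumptions (the antiword binary is on the PATH, and it prints the document text to standard output by default), the call might look something like this. The function names antiword_command and read_doc_text are hypothetical.

```php
<?php
// Hypothetical helper: builds the antiword command line for one .doc file.
// Assumes antiword is on PATH; by default it prints plain text to stdout.
function antiword_command(string $docPath): string
{
    return 'antiword ' . escapeshellarg($docPath);
}

// Hypothetical helper: returns the extracted text, or '' if nothing could be read.
function read_doc_text(string $docPath): string
{
    if (!is_readable($docPath)) {
        return '';
    }
    $output = shell_exec(antiword_command($docPath));
    // shell_exec() does not return a string when the command fails or prints nothing.
    return is_string($output) ? $output : '';
}
```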
The above code does NOT read DOCX files and does not (and purposely so) preserve formatting. There are other libraries that will preserve formatting but in our case, we just want to get at the text.
A special thank you to Jeremy Parrish for his help and insight with this task.
I am trying to configure our server for a client's website that we host. Their site is PHP, which I'm not terribly familiar with, so I don't know if it works in some weird way. (All our sites are in ASP.)
They manage their own site generally, via FTP. They are trying to add sphiderplus to the site, which is where these issues started.
After logging in to sphiderplus, you receive numerous copies of this error (for a number of files and folders under the sphiderplus root):
I'm not sure what I need to do to give PHP the proper permissions for sphiderplus to run properly. Here are the steps I've taken so far after googling around, none of which worked:
- created a separate application pool for the PHP site
- created a separate local user for the above app pool
- gave that local user Full Control rights on the sphiderplus root directory (with propagation)
- gave IUSR Full Control rights on the sphiderplus root directory (with propagation)
- gave IIS_IUSRS Full Control rights on the sphiderplus root directory (with propagation)
- gave NETWORK SERVICE Full Control rights on the sphiderplus root directory (with propagation)
...and then just for fun, when it still wouldn't work, to test:
- gave Everyone Full Control rights on the sphiderplus root directory (with propagation)
I don't know what else to try. Is there maybe some sort of ini or config file that I need to change some lines in or something? I really would've thought that, if nothing else, giving Everyone full control on the whole sphiderplus directory would've worked. I'm not really sure what else to do.
We are running:
- PHP 7.0.5 installed from Microsoft's Web Platform Installer
- FastCGI installed from Microsoft's Web Platform Installer
- IIS 7.5.7600.16385
- Windows Server 2008 R2 SP1 (v 6.1 b 7601).
1 Answer
For what it is worth, I got around this by changing how it was checking the writability of the files/folders.
The sphiderplus code was checking like this:
I changed that code to this:
Seems to work fine.
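Neither snippet survived in this copy, so the exact change is unknown. Purely as an illustration of one way to check writability differently, here is a sketch that tests a directory by actually trying to create a file in it instead of asking PHP for a yes/no answer up front. This is not the sphiderplus code and not necessarily what was changed above; the function name can_write_to_dir is hypothetical.

```php
<?php
// Hypothetical helper, not the sphiderplus code: tests whether a directory
// is writable by creating and then removing a small probe file in it.
function can_write_to_dir(string $dir): bool
{
    if (!is_dir($dir)) {
        return false;
    }
    // Build a unique probe file name inside the target directory.
    $probe = rtrim($dir, '/\\') . DIRECTORY_SEPARATOR
           . uniqid('write_probe_', true) . '.tmp';

    // Suppress the warning; a failed open simply means "not writable".
    $handle = @fopen($probe, 'w');
    if ($handle === false) {
        return false;
    }
    fclose($handle);
    @unlink($probe); // clean up the probe file
    return true;
}
```

The idea is that a real write attempt reflects whatever permissions the web server process actually has, which can be handy when a permission check and the real behaviour seem to disagree.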