Skip to main content

This site requires you to update your browser. Your browsing experience maybe affected by not having the most up to date version.

General Questions

General questions about getting started with SilverStripe that don't fit in any of the categories above.

Moderators: martimiz, Sean, biapar, Willr, Ingo, swaiba, simon_w

Search inside PDF


6 Posts   1954 Views


15 January 2010 at 9:15pm (Last edited: 16 January 2010 5:19pm), Community Member, 10 Posts

Is this possible with the SS Search?


16 January 2010 at 2:15am (Last edited: 16 January 2010 5:19pm), Community Member, 901 Posts

I don't think it's possible.
You could build a SS wrapper around the Zend Lucene Engine (, as described in this Blog-Post:

This could get tricky though :)
I also think you're in the wrong forum with that question.


18 January 2010 at 4:11pm Community Member, 89 Posts

Also I think you can relatively easily extract the PDF contents onAfterWrite using `pdftotext` and chuck it into the database as Content so the search can pick it up. This works pretty well.


25 January 2010 at 12:50pm Forum Moderator, 801 Posts

For a more robust and performant solution (rather than MySQL fulltext), have a look at the 'sphinx' module:

Its a fairly new module, and requires you to use SilverStripe 2.4 alpha1, but the underlying technology (sphinx search) is quite stable.
This changeset (committed a couple of days ago) explains how to work with PDFs in sphinx:

BTW, it uses pdftotext as well :)


25 January 2010 at 8:32pm Community Member, 901 Posts

Hey Ingo that sounds really neat.
The sphinx binaries have to be installed on the server to use this, right?


26 January 2010 at 9:28am Community Member, 89 Posts

Yeah, you need to install that on the server and run it as a daemon. You probably won't be able to run it on shared hosting. But it's worth the effort, very quick :)