21309 Posts in 5738 Topics by 2603 members
|
Page:
1
|
Go to End | |
| Author | Topic: | 1679 Views |
-
Search inside PDF

15 January 2010 at 9:15pm Last edited: 16 January 2010 5:19pm
Is this possible with the SS Search?
-
Re: Search inside PDF

16 January 2010 at 2:15am Last edited: 16 January 2010 5:19pm
I don't think it's possible.
You could build a SS wrapper around the Zend Lucene Engine (http://framework.zend.com/manual/en/zend.search.lucene.html), as described in this Blog-Post: http://www.kapustabrothers.com/2008/01/20/indexing-pdf-documents-with-zend_search_lucene/This could get tricky though
I also think you're in the wrong forum with that question. -
Re: Search inside PDF

18 January 2010 at 4:11pm
Also I think you can relatively easily extract the PDF contents onAfterWrite using `pdftotext` and chuck it into the database as Content so the search can pick it up. This works pretty well.
-
Re: Search inside PDF

25 January 2010 at 12:50pm
For a more robust and performant solution (rather than MySQL fulltext), have a look at the 'sphinx' module:
http://open.silverstripe.org/browser/modules/sphinx/trunkIts a fairly new module, and requires you to use SilverStripe 2.4 alpha1, but the underlying technology (sphinx search) is quite stable.
This changeset (committed a couple of days ago) explains how to work with PDFs in sphinx:
http://open.silverstripe.org/changeset/97360/modules/sphinx/trunkBTW, it uses pdftotext as well
-
Re: Search inside PDF

25 January 2010 at 8:32pm
Hey Ingo that sounds really neat.
The sphinx binaries have to be installed on the server to use this, right? -
Re: Search inside PDF

26 January 2010 at 9:28am
Yeah, you need to install that on the server and run it as a daemon. You probably won't be able to run it on shared hosting. But it's worth the effort, very quick
-
Re: Search inside PDF

10 November 2010 at 10:37am
For those who end up on this thread - I've implemented Zend Lucene for Silverstripe, with PDF, Word, and Excel searching. No extra software needed, it's just PHP.
| 1679 Views | ||
|
Page:
1
|
Go to Top |




