On Mon, 29 Nov 2004 11:19:07 +0000 (UTC), funkster
<webforumsuser (AT) macromedia (DOT) com> wrote:
Quote:
Hi,
Is it possible to build a search page that searches the text of PDFs?
Maybe even search a multipage PDF and return the page number the search
text is located on?
Thanks.
Yes, but it is not easy. It relies on Ghostscript's ability to read the text
of a PDF as PostScript. So, for example, if you are using Linux and have
Ghostscript installed, you could run:
less archive.pdf
and it will display the text of the PDF, provided less is set up with an
input preprocessor such as lesspipe. Piping that output through 'grep' will
then locate the word you want:
less archive.pdf | grep (your keyword)
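
For the page-number part of the question, one possible approach is to extract the text with a tool such as pdftotext, which separates pages with a form-feed character, and then check each page in turn. This is only a rough sketch: pdftotext itself is an assumption here (the commands above use only less and grep), and pdf_text and find_pages are hypothetical helper names.

```python
import subprocess


def pdf_text(path):
    """Return the text of a PDF by running `pdftotext path -`.

    Assumes the pdftotext command-line tool is installed and on PATH.
    """
    result = subprocess.run(
        ["pdftotext", path, "-"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout


def find_pages(text, keyword):
    """Return the 1-based page numbers whose text contains keyword.

    The match is case-insensitive. Pages are split on the form-feed
    character that pdftotext writes at the end of each page.
    """
    pages = text.split("\f")
    needle = keyword.lower()
    return [n for n, page in enumerate(pages, start=1)
            if needle in page.lower()]
```

A search page could then call something like find_pages(pdf_text("archive.pdf"), "keyword") and list the returned page numbers next to the file name.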
The other (more modern) way is XML: convert the PDF from PostScript to an
XML-compliant form and write the XML tree to stdout, which is easier to search.
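
As a rough illustration of searching such an XML tree, the sketch below assumes the converter produces one page element per page, carrying a number attribute, with the text in nested text elements (the layout that pdftohtml -xml is understood to produce; other converters may differ). The function name find_pages_xml is hypothetical.

```python
import xml.etree.ElementTree as ET


def find_pages_xml(xml_string, keyword):
    """Return the page numbers whose text contains keyword (case-insensitive).

    Assumes a layout like <page number="1"><text>...</text></page>;
    adjust the tag and attribute names for a different converter.
    """
    root = ET.fromstring(xml_string)
    needle = keyword.lower()
    hits = []
    for page in root.iter("page"):
        # itertext() also collects text inside nested tags such as <b> or <i>
        text = " ".join("".join(t.itertext()) for t in page.iter("text"))
        if needle in text.lower():
            hits.append(int(page.get("number")))
    return hits
```

Because each page element carries its own number, the page where the search text occurs falls out of the tree directly.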
--
Alexandro Colorado
------------------------------
Support Engineer
InterAKT Online
http://www.interaktonline.com
Tel: 40(21) 312.5312