Gamingforce Interactive Forums
85240 35212

Go Back   Exploding Garrmondo Weiner Interactive Swiss Army Penis > Garrmondo Network > Help Desk
Register FAQ GFWiki Community Donate Arcade ChocoJournal Calendar

Notices

Welcome to the Exploding Garrmondo Weiner Interactive Swiss Army Penis.
GFF is a community of gaming and music enthusiasts. We have a team of dedicated moderators, constant member-organized activities, and plenty of custom features, including our unique journal system. If this is your first visit, be sure to check out the FAQ or our GFWiki. You will have to register before you can post. Membership is completely free (and gets rid of the pesky advertisement unit underneath this message).


Searching for words within PDFs
Reply
 
Thread Tools
Hotobu
Good Chocobo


Member 5982

Level 14.90

Apr 2006


Reply With Quote
Old Jul 27, 2007, 07:30 PM #1 of 7
Searching for words within PDFs

I have a collection of 100+ .pdf documents that are interrelated. Sometimes I would like to find a common word within these documents. Is there a way in which I can search within all of them to produce a result? Thank you

Most amazing jew boots
neus
You're getting slower!


Member 512

Level 20.69

Mar 2006


Reply With Quote
Old Jul 29, 2007, 11:20 PM #2 of 7
You could try "find | xargs grep wordswordswords" but that'd require linux command line and text files. I'm pretty sure PDFs encrypt text so that's moot.
Try looking for a program to rip text out of a PDF and then use that command above.
Ah, here we go. Google to the rescue.

There's nowhere I can't reach.

Last edited by neus; Jul 29, 2007 at 11:23 PM.
LiquidAcid
Chocorific


Member 6745

Level 38.97

May 2006


Reply With Quote
Old Jul 30, 2007, 07:13 AM Local time: Jul 30, 2007, 01:13 PM #3 of 7
There are pdf2txt utitilities on linux that extract the text from a pdf, so this shouldn't be the problem when scripting. You only get in trouble when the text in the pdf is in fact no text but a bitmap. The even the acrobat reader will fail searching for text.

This thing is sticky, and I don't like it. I don't appreciate it.
killmoms
Professional Mac-head


Member 277

Level 15.11

Mar 2006


Reply With Quote
Old Jul 30, 2007, 08:23 AM Local time: Jul 30, 2007, 06:23 AM #4 of 7
Macs content-index PDFs automatically, searchable in Spotlight.

I was under the impression that any of the desktop search programs for Windows did the same thing (MSN/Google Desktop Search for XP, or the built-in search in Vista). Do they not?

I am a dolphin, do you want me on your body?
killmoms - Well, don't really.
Makin' trailers er'ry day.
LiquidAcid
Chocorific


Member 6745

Level 38.97

May 2006


Reply With Quote
Old Jul 30, 2007, 05:33 PM Local time: Jul 30, 2007, 11:33 PM #5 of 7
I think not. At least the normal search in win2k interprets any file except standard text files as binary data.

I was speaking idiomatically.
Dyesan
hey YOU!


Member 1790

Level 17.02

Mar 2006


Reply With Quote
Old Aug 1, 2007, 11:25 PM #6 of 7
Use Foxit. Free Download

What kind of toxic man-thing is happening now?
shadoweave
Chocobo


Member 23806

Level 9.35

Aug 2007


Reply With Quote
Old Aug 12, 2007, 01:46 AM Local time: Aug 12, 2007, 02:46 PM #7 of 7
There are pdf2txt utitilities on linux that extract the text from a pdf, so this shouldn't be the problem when scripting. You only get in trouble when the text in the pdf is in fact no text but a bitmap. The even the acrobat reader will fail searching for text.
Like what LiquidAcid said, there's no certain way to search for words if the words are actually image files. But if they ARE text, there's an option to search within pdf files in an entire directory in Adobe Acrobat Reader itself. The function's the Full Reader Search I believe.

FELIPE NO
Reply


Exploding Garrmondo Weiner Interactive Swiss Army Penis > Garrmondo Network > Help Desk > Searching for words within PDFs

Forum Jump


All times are GMT -5. The time now is 03:37 PM.


Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2025, vBulletin Solutions, Inc.