Menu
News
All News
Dungeons & Dragons
Level Up: Advanced 5th Edition
Pathfinder
Starfinder
Warhammer
2d20 System
Year Zero Engine
Industry News
Reviews
Dragon Reflections
White Dwarf Reflections
Columns
Weekly Digests
Weekly News Digest
Freebies, Sales & Bundles
RPG Print News
RPG Crowdfunding News
Game Content
ENterplanetary DimENsions
Mythological Figures
Opinion
Worlds of Design
Peregrine's Nest
RPG Evolution
Other Columns
From the Freelancing Frontline
Monster ENcyclopedia
WotC/TSR Alumni Look Back
4 Hours w/RSD (Ryan Dancey)
The Road to 3E (Jonathan Tweet)
Greenwood's Realms (Ed Greenwood)
Drawmij's TSR (Jim Ward)
Community
Forums & Topics
Forum List
Latest Posts
Forum list
*Dungeons & Dragons
Level Up: Advanced 5th Edition
D&D Older Editions, OSR, & D&D Variants
*TTRPGs General
*Pathfinder & Starfinder
EN Publishing
*Geek Talk & Media
Search forums
Chat/Discord
Resources
Wiki
Pages
Latest activity
Media
New media
New comments
Search media
Downloads
Latest reviews
Search resources
EN Publishing
Store
EN5ider
Adventures in ZEITGEIST
Awfully Cheerful Engine
What's OLD is NEW
Judge Dredd & The Worlds Of 2000AD
War of the Burning Sky
Level Up: Advanced 5E
Events & Releases
Upcoming Events
Private Events
Featured Events
Socials!
EN Publishing
Twitter
BlueSky
Facebook
Instagram
EN World
BlueSky
YouTube
Facebook
Twitter
Twitch
Podcast
Features
Top 5 RPGs Compiled Charts 2004-Present
Adventure Game Industry Market Research Summary (RPGs) V1.0
Ryan Dancey: Acquiring TSR
Q&A With Gary Gygax
D&D Rules FAQs
TSR, WotC, & Paizo: A Comparative History
D&D Pronunciation Guide
Million Dollar TTRPG Kickstarters
Tabletop RPG Podcast Hall of Fame
Eric Noah's Unofficial D&D 3rd Edition News
D&D in the Mainstream
D&D & RPG History
About Morrus
Log in
Register
What's new
Search
Search
Search titles only
By:
Forums & Topics
Forum List
Latest Posts
Forum list
*Dungeons & Dragons
Level Up: Advanced 5th Edition
D&D Older Editions, OSR, & D&D Variants
*TTRPGs General
*Pathfinder & Starfinder
EN Publishing
*Geek Talk & Media
Search forums
Chat/Discord
Menu
Log in
Register
Install the app
Install
Upgrade your account to a Community Supporter account and remove most of the site ads.
Community
General Tabletop Discussion
*Geek Talk & Media
PDF Help
JavaScript is disabled. For a better experience, please enable JavaScript in your browser before proceeding.
You are using an out of date browser. It may not display this or other websites correctly.
You should upgrade or use an
alternative browser
.
Reply to thread
Message
<blockquote data-quote="Brown Jenkin" data-source="post: 1218241" data-attributes="member: 2572"><p>The book is an architectural survey book. With the exception of a few full page photos, most pages are 80% text - 20 % photos (1-3 small photos per page). All the pages were scanned as full resolution tiffs, and have also been saved as 150dpi greyscale jpgs. As a test I have done the first 15 pages (out of 150) without OCR and now have the size down to 7.5 meg. This would equal 75 meg for the enitire book. I have looked at other PDF books out there with roughly equivalent text/image ratio and page numbers and they have been 10-12 Meg in size. These other books are real text and not all image files which acounts for the discrepency. Time is a factor so I will rephrase my request slightly. </p><p></p><p>Would a 75 Meg PDF be unreasonable? </p><p>What if it was availible by chapter as well? </p><p>What about if it was availible by request on CD ($1-$3 to cover materials and postage)? </p><p></p><p>Now if I go OCR and convert it to text how difficult will it be?</p><p>Is there software that would convert the text but leave the layout so it doesn't have to be reformated? What software?</p><p>Even given Hi-res Tiffs how accurate is OCR? Will I have to spend hours looking for mistakes and correcting them manually (And I will have to ensure accuracy)?</p></blockquote><p></p>
[QUOTE="Brown Jenkin, post: 1218241, member: 2572"] The book is an architectural survey book. With the exception of a few full page photos, most pages are 80% text - 20 % photos (1-3 small photos per page). All the pages were scanned as full resolution tiffs, and have also been saved as 150dpi greyscale jpgs. As a test I have done the first 15 pages (out of 150) without OCR and now have the size down to 7.5 meg. This would equal 75 meg for the enitire book. I have looked at other PDF books out there with roughly equivalent text/image ratio and page numbers and they have been 10-12 Meg in size. These other books are real text and not all image files which acounts for the discrepency. Time is a factor so I will rephrase my request slightly. Would a 75 Meg PDF be unreasonable? What if it was availible by chapter as well? What about if it was availible by request on CD ($1-$3 to cover materials and postage)? Now if I go OCR and convert it to text how difficult will it be? Is there software that would convert the text but leave the layout so it doesn't have to be reformated? What software? Even given Hi-res Tiffs how accurate is OCR? Will I have to spend hours looking for mistakes and correcting them manually (And I will have to ensure accuracy)? [/QUOTE]
Insert quotes…
Verification
Post reply
Community
General Tabletop Discussion
*Geek Talk & Media
PDF Help
Top