I am not an expert in PDF internal formats at this point. I may need to start learning. I also have an application, actually a long bash script, that I want to extend it's capabilities to output several scanned pages that have had OCR performed and merge the text with the original image in a PDF. The package is called speedy-ocr. Does having a sandwhiched PDF mean that the text is then editable in Adobe as opposed to just attached as a searchable, structured note? I am writing this script to simplify scanning and OCR functionality for the blind and visually impaired community. Screen readers, Orca in this case, will need structured text so that the text can be read in the appropriate order, if possible. I do not know yet how much of the structure can be retreived from cuneiform, if any. For our purposes, having the font information is not necessary for most users. They just need to be able to retreive and store fairly accurate text, in the correct reading order, for each page. Is this type of merge different than a sandwhiched PDF? Is this simply attached searchable text? We have a distribution of Ubuntu 10.0.4 Lucid that configures several accessibility systems and a group of developers world wide are attempting to fix gnome applications for accessibility. Most of the fixes get sent upstream and incorporated into Ubuntu, partly because Luke is now using the Vinux distribution as a testbed. The distribution is called Vinux, and it's home page is vinux.org.uk. Our repositories are also on LaunchPad.net. Don Marang There is just so much stuff in the world that, to me, is devoid of any real substance, value, and content that I just try to make sure that I am working on things that matter. Dean Kamen -------------------------------------------------- From: "Martin Wildam"