In theory, to the extent that we care about database size, the extracted_text inflation is more severe in the repozitory database, *if* extracted_text is even part of what gets serialized. --Paul On Jul 10, 2014, at 10:15 AM, Chris Rossi