- The unauthorized error happened while downloading a file referenced in a record of the OAI feed. I improve the download step not to fail, but to collect errors and display them at the end of the script, with more information (which URL was concerned about the problem, what is the problem). With this feature you will be able to process all the feed, i.e. make a full backup, and have a summary of the files you could not download.
- Categories are implemented in the document library as OAI sets (standard OAI feature). Those were not saved before. I added the backup of the sets in a file called sets.oai, which list all available categories associated a unique identifier. After each record dump include in the header to which sets they belongs to. This set identifier stays unique and make possible to rename / re-title categories without problems on the client side. All those files (sets, record dump) are valid XML (that means it is easy for you to reuse information from it after).
After you tested the new version of this script, we will know which record (file) in the Document Library triggered the unauthorized error you got, and check why you don't have access rights to that file. (We can't do it now, since we don't know which file triggered that error).
I improved the oaibackup script:
- The unauthorized error happened while downloading a file referenced in a record of the OAI feed. I improve the download step not to fail, but to collect errors and display them at the end of the script, with more information (which URL was concerned about the problem, what is the problem). With this feature you will be able to process all the feed, i.e. make a full backup, and have a summary of the files you could not download.
- Categories are implemented in the document library as OAI sets (standard OAI feature). Those were not saved before. I added the backup of the sets in a file called sets.oai, which list all available categories associated a unique identifier. After each record dump include in the header to which sets they belongs to. This set identifier stays unique and make possible to rename / re-title categories without problems on the client side. All those files (sets, record dump) are valid XML (that means it is easy for you to reuse information from it after).
After you tested the new version of this script, we will know which record (file) in the Document Library triggered the unauthorized error you got, and check why you don't have access rights to that file. (We can't do it now, since we don't know which file triggered that error).