droa
March 7th, 2017, 03:05 AM
We are using Amanda 3.4.3 with an "S3" tape ( actually implemented via EMC ATMOS - so using STORAGE_API "S3" ).
Backups are working fine, and we can recover files using amrecover when the tapes are 1 or 2 blocks ( blocksize set to 50M ).
However if we try to recover files from a tape that has more than 2 blocks, amrecover just hangs and never completes (I've waited for more than 1 hour... ).
These are the last entries in the amidxtaped debug log:
...
Tue Mar 07 11:49:38.583421260 2017: pid 29133: thd-0x1dc1690: amidxtaped: parse_inventory: load slot 16 with label 'ProdSet-16'
Tue Mar 07 11:49:38.602256436 2017: pid 29133: thd-0x1dc1690: amidxtaped: /usr/local/share/perl5/Amanda/Recovery/Scan.pm:307:info:1200000 slot 16
Tue Mar 07 11:49:38.603489703 2017: pid 29133: thd-0x1dc1690: amidxtaped: S3 driver using bucket 'prod-backups', prefix 'ProdSet/slot-16/tape'
Tue Mar 07 11:49:38.603514074 2017: pid 29133: thd-0x1dc1690: amidxtaped: curl version: libcurl/7.53.1 OpenSSL/1.0.1e zlib/1.2.3 c-ares/1.12.0 libssh2/1.8.0 nghttp2/1.6.0
Tue Mar 07 11:49:38.603523865 2017: pid 29133: thd-0x1dc1690: amidxtaped: curl compiled for OPENSSL
Tue Mar 07 11:49:38.608284630 2017: pid 29133: thd-0x1dc1690: amidxtaped: Create 6 threads
Tue Mar 07 11:49:39.080822231 2017: pid 29133: thd-0x1dc1690: amidxtaped: Building type TAPESTART header of 0-52428800 bytes with name='ProdSet-16' disk='' dumplevel=0 and blocksize=0
Tue Mar 07 11:49:39.082250014 2017: pid 29133: thd-0x1dc1690: amidxtaped: /usr/local/share/perl5/Amanda/Recovery/Scan.pm:459:info:1200001 ProdSet-16
Tue Mar 07 11:49:39.082534856 2017: pid 29133: thd-0x1dc1690: amidxtaped: ignoring spurious Amanda::Recovery::Scan abort call
Tue Mar 07 11:49:39.502133244 2017: pid 29133: thd-0x1dc1690: amidxtaped: Amanda::Recovery::Clerk: successfully located first part for recovery
Tue Mar 07 11:49:39.508682900 2017: pid 29133: thd-0x1dc1690: amidxtaped: Building type FILE header of 128-32768 bytes with name='server.example.com' disk='/opt' dumplevel=0 and blocksize=32768
Tue Mar 07 11:49:39.508801525 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL >> HEADER-SEND-SIZE 551
Tue Mar 07 11:49:39.509333878 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL << HEADER-READY
Tue Mar 07 11:49:41.228676865 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL << HEADER-DONE
Tue Mar 07 11:49:41.228870784 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL >> STATE-SEND
Tue Mar 07 11:49:41.229378230 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL << STATE-READY
Tue Mar 07 11:49:41.230058205 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL << STATE-DONE
Tue Mar 07 11:49:41.230139287 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL >> USE-DAR NO
Tue Mar 07 11:49:41.230473653 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL << USE-DAR NO
Tue Mar 07 11:49:41.230585459 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL << AVAIL-DATAPATH AMANDA
Tue Mar 07 11:49:41.231037363 2017: pid 29133: thd-0x1dc1690: amidxtaped: Starting <[email protected] (<[email protected]> -> <[email protected]>)>
Tue Mar 07 11:49:41.231084087 2017: pid 29133: thd-0x1dc1690: amidxtaped: Final linkage: <[email protected]> -(PULL_BUFFER)-> <[email protected]> -(WRITEFD)-> <[email protected]>
Tue Mar 07 11:49:41.231107431 2017: pid 29133: thd-0x1dc1690: amidxtaped: setup_impl: 3, 2
Tue Mar 07 11:49:41.231235341 2017: pid 29133: thd-0x1dc1690: amidxtaped: xfer_queue_message: MSG: <[email protected] type=XMSG_READY elt=<[email protected]> version=0>
Tue Mar 07 11:49:41.231267481 2017: pid 29133: thd-0x2cdf5c0: amidxtaped: pull_and_write
Tue Mar 07 11:49:41.231320900 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL >> USE-DATAPATH AMANDA
Tue Mar 07 11:49:41.232664610 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL << DATAPATH-OK
Tue Mar 07 11:49:41.232778307 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL >> DATA-SEND
Tue Mar 07 11:49:41.233137055 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL << DATA-READY
Tue Mar 07 11:49:41.233220578 2017: pid 29133: thd-0x1dc1690: amidxtaped: Amanda::Recovery::Clerk: starting recovery
Tue Mar 07 11:49:41.235146117 2017: pid 29133: thd-0x1dc1690: amidxtaped: Amanda::Recovery::Clerk: reading file 55 on 'ProdSet-16'
Tue Mar 07 11:49:56.241680180 2017: pid 29133: thd-0x1dc1690: amidxtaped: /usr/local/share/perl5/Amanda/Restore.pm:1698:info:4900000 86755 kb
Tue Mar 07 11:50:11.256969517 2017: pid 29133: thd-0x1dc1690: amidxtaped: /usr/local/share/perl5/Amanda/Restore.pm:1698:info:4900000 102400 kb
Tue Mar 07 11:50:26.271854228 2017: pid 29133: thd-0x1dc1690: amidxtaped: /usr/local/share/perl5/Amanda/Restore.pm:1698:info:4900000 102400 kb
Tue Mar 07 11:50:41.287620042 2017: pid 29133: thd-0x1dc1690: amidxtaped: /usr/local/share/perl5/Amanda/Restore.pm:1698:info:4900000 102400 kb
Tue Mar 07 11:50:56.295778675 2017: pid 29133: thd-0x1dc1690: amidxtaped: /usr/local/share/perl5/Amanda/Restore.pm:1698:info:4900000 102400 kb
Tue Mar 07 11:51:11.304232529 2017: pid 29133: thd-0x1dc1690: amidxtaped: /usr/local/share/perl5/Amanda/Restore.pm:1698:info:4900000 102400 kb
....
I've done a few tests to try to troubleshoot the issue, but haven't got anywhere:
amfetchdump works fine retrieving the the full DLE that we are trying to recover. I can then manually extract our backed up files ok.
enabling the VERBOSE S3 device parameter shows the 2 blocks successfully retrieved, but no attempt to retrieve the third block
increasing the NB_RECOVERY_THREAD S3 device parameter increases the number of blocks retrieved to 2 x the number of threads ( eg 6 threads causes 12 blocks to be retrieved )
Does anyone have any ideas what could be wrong?
Happy to supply more information if that helps.
Kind Regards
Dave
Backups are working fine, and we can recover files using amrecover when the tapes are 1 or 2 blocks ( blocksize set to 50M ).
However if we try to recover files from a tape that has more than 2 blocks, amrecover just hangs and never completes (I've waited for more than 1 hour... ).
These are the last entries in the amidxtaped debug log:
...
Tue Mar 07 11:49:38.583421260 2017: pid 29133: thd-0x1dc1690: amidxtaped: parse_inventory: load slot 16 with label 'ProdSet-16'
Tue Mar 07 11:49:38.602256436 2017: pid 29133: thd-0x1dc1690: amidxtaped: /usr/local/share/perl5/Amanda/Recovery/Scan.pm:307:info:1200000 slot 16
Tue Mar 07 11:49:38.603489703 2017: pid 29133: thd-0x1dc1690: amidxtaped: S3 driver using bucket 'prod-backups', prefix 'ProdSet/slot-16/tape'
Tue Mar 07 11:49:38.603514074 2017: pid 29133: thd-0x1dc1690: amidxtaped: curl version: libcurl/7.53.1 OpenSSL/1.0.1e zlib/1.2.3 c-ares/1.12.0 libssh2/1.8.0 nghttp2/1.6.0
Tue Mar 07 11:49:38.603523865 2017: pid 29133: thd-0x1dc1690: amidxtaped: curl compiled for OPENSSL
Tue Mar 07 11:49:38.608284630 2017: pid 29133: thd-0x1dc1690: amidxtaped: Create 6 threads
Tue Mar 07 11:49:39.080822231 2017: pid 29133: thd-0x1dc1690: amidxtaped: Building type TAPESTART header of 0-52428800 bytes with name='ProdSet-16' disk='' dumplevel=0 and blocksize=0
Tue Mar 07 11:49:39.082250014 2017: pid 29133: thd-0x1dc1690: amidxtaped: /usr/local/share/perl5/Amanda/Recovery/Scan.pm:459:info:1200001 ProdSet-16
Tue Mar 07 11:49:39.082534856 2017: pid 29133: thd-0x1dc1690: amidxtaped: ignoring spurious Amanda::Recovery::Scan abort call
Tue Mar 07 11:49:39.502133244 2017: pid 29133: thd-0x1dc1690: amidxtaped: Amanda::Recovery::Clerk: successfully located first part for recovery
Tue Mar 07 11:49:39.508682900 2017: pid 29133: thd-0x1dc1690: amidxtaped: Building type FILE header of 128-32768 bytes with name='server.example.com' disk='/opt' dumplevel=0 and blocksize=32768
Tue Mar 07 11:49:39.508801525 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL >> HEADER-SEND-SIZE 551
Tue Mar 07 11:49:39.509333878 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL << HEADER-READY
Tue Mar 07 11:49:41.228676865 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL << HEADER-DONE
Tue Mar 07 11:49:41.228870784 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL >> STATE-SEND
Tue Mar 07 11:49:41.229378230 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL << STATE-READY
Tue Mar 07 11:49:41.230058205 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL << STATE-DONE
Tue Mar 07 11:49:41.230139287 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL >> USE-DAR NO
Tue Mar 07 11:49:41.230473653 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL << USE-DAR NO
Tue Mar 07 11:49:41.230585459 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL << AVAIL-DATAPATH AMANDA
Tue Mar 07 11:49:41.231037363 2017: pid 29133: thd-0x1dc1690: amidxtaped: Starting <[email protected] (<[email protected]> -> <[email protected]>)>
Tue Mar 07 11:49:41.231084087 2017: pid 29133: thd-0x1dc1690: amidxtaped: Final linkage: <[email protected]> -(PULL_BUFFER)-> <[email protected]> -(WRITEFD)-> <[email protected]>
Tue Mar 07 11:49:41.231107431 2017: pid 29133: thd-0x1dc1690: amidxtaped: setup_impl: 3, 2
Tue Mar 07 11:49:41.231235341 2017: pid 29133: thd-0x1dc1690: amidxtaped: xfer_queue_message: MSG: <[email protected] type=XMSG_READY elt=<[email protected]> version=0>
Tue Mar 07 11:49:41.231267481 2017: pid 29133: thd-0x2cdf5c0: amidxtaped: pull_and_write
Tue Mar 07 11:49:41.231320900 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL >> USE-DATAPATH AMANDA
Tue Mar 07 11:49:41.232664610 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL << DATAPATH-OK
Tue Mar 07 11:49:41.232778307 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL >> DATA-SEND
Tue Mar 07 11:49:41.233137055 2017: pid 29133: thd-0x1dc1690: amidxtaped: CTL << DATA-READY
Tue Mar 07 11:49:41.233220578 2017: pid 29133: thd-0x1dc1690: amidxtaped: Amanda::Recovery::Clerk: starting recovery
Tue Mar 07 11:49:41.235146117 2017: pid 29133: thd-0x1dc1690: amidxtaped: Amanda::Recovery::Clerk: reading file 55 on 'ProdSet-16'
Tue Mar 07 11:49:56.241680180 2017: pid 29133: thd-0x1dc1690: amidxtaped: /usr/local/share/perl5/Amanda/Restore.pm:1698:info:4900000 86755 kb
Tue Mar 07 11:50:11.256969517 2017: pid 29133: thd-0x1dc1690: amidxtaped: /usr/local/share/perl5/Amanda/Restore.pm:1698:info:4900000 102400 kb
Tue Mar 07 11:50:26.271854228 2017: pid 29133: thd-0x1dc1690: amidxtaped: /usr/local/share/perl5/Amanda/Restore.pm:1698:info:4900000 102400 kb
Tue Mar 07 11:50:41.287620042 2017: pid 29133: thd-0x1dc1690: amidxtaped: /usr/local/share/perl5/Amanda/Restore.pm:1698:info:4900000 102400 kb
Tue Mar 07 11:50:56.295778675 2017: pid 29133: thd-0x1dc1690: amidxtaped: /usr/local/share/perl5/Amanda/Restore.pm:1698:info:4900000 102400 kb
Tue Mar 07 11:51:11.304232529 2017: pid 29133: thd-0x1dc1690: amidxtaped: /usr/local/share/perl5/Amanda/Restore.pm:1698:info:4900000 102400 kb
....
I've done a few tests to try to troubleshoot the issue, but haven't got anywhere:
amfetchdump works fine retrieving the the full DLE that we are trying to recover. I can then manually extract our backed up files ok.
enabling the VERBOSE S3 device parameter shows the 2 blocks successfully retrieved, but no attempt to retrieve the third block
increasing the NB_RECOVERY_THREAD S3 device parameter increases the number of blocks retrieved to 2 x the number of threads ( eg 6 threads causes 12 blocks to be retrieved )
Does anyone have any ideas what could be wrong?
Happy to supply more information if that helps.
Kind Regards
Dave