2

SD-7.4.4 (ubuntu 16) Director-7.4.4(ubuntu 16) FD-5.2.10 (windows)

我在使用 Bacula 备份 Windows 客户端时遇到问题。当备份大小约为 1MB 或 2 时,我可以正常运行备份,但是当运行 500MB 的备份时,每次都会遇到相同的错误

“该工作的主管与 SD 的连接已丢失。”

有些事情要提。当我发出status client

Terminated Jobs:  JobId  Level    Files      Bytes   Status   Finished
======================================================================
    81  Full      5,796    514.8 M  OK       06-Nov-17 12:50 BackupComputerA

当我发出status dir

06-Nov 17:58 acme-director JobId 81: Error: Director's connection to SD for this Job was lost.
06-Nov 17:58 acme-director JobId 81: Error: Bacula acme-director 7.4.4 (202Sep16):
  Build OS:               arm-unknown-linux-gnueabihf debian 9.0
  JobId:                  81
  Job:                    BackupComputerA.2017-11-06_17.41.01_03
  Backup Level:           Full (upgraded from Incremental)
  Client:                 "Computer-A-fd" 5.2.10 (28Jun12) Microsoft  (build 9200), 32-bit,Cross-compile,Win32
  FileSet:                "Full Set" 2017-11-03 22:12:58
  Pool:                   "RemoteFile" (From Job resource)
  Catalog:                "MyCatalog" (From Client resource)
  Storage:                "File1" (From Job resource)
  Scheduled time:         06-Nov-2017 17:40:59
  Start time:             06-Nov-2017 17:41:04
  End time:               06-Nov-2017 17:58:00
  Elapsed time:           16 mins 56 secs
  Priority:               10
  FD Files Written:       5,796
  SD Files Written:       0
  FD Bytes Written:       514,883,164 (514.8 MB)
  SD Bytes Written:       0 (0 B)
  Rate:                   506.8 KB/s
  Software Compression:   100.0% 1.0:1
  Snapshot/VSS:           yes
  Encryption:             yes
  Accurate:               no
  Volume name(s):         
  Volume Session Id:      1
  Volume Session Time:    1509989906
  Last Volume Bytes:      8,045,880,119 (8.045 GB)
  Non-fatal FD errors:    1
  SD Errors:              0
  FD termination status:  OK
  SD termination status:  Error
  Termination:            *** Backup Error ***

备份大约 5 分钟后,我收到一条消息:

Running Jobs:
Console connected at 06-Nov-17 18:08
 JobId  Type Level     Files     Bytes  Name              Status
======================================================================
    83  Back Full          0         0  BackupComputerE   has terminated
====

作业完成并终止,但之后失去连接,我从来没有得到

“好的”

用于状态更新。

我已将其添加"Heartbeat Interval = 1 Minute"到所有守护进程中,但仍然没有运气。Director上使用mysql作为数据库

未来感谢您的帮助

4

1 回答 1

2

对于任何有同样问题的人,我可以通过向客户端添加心跳间隔并调整保持活动时间来解决 SD 和导演之间的这个问题

sysctl -w net.ipv4.tcp_keepalive_time=60

在存储守护进程和导向器上。使用 bconsole 远程连接到 director 也会中断工作,所以我在与 director 相同的机器上运行 bconsole 并通过 ssh 连接。

于 2017-11-08T13:54:52.570 回答