2017-08-29 121 views
0

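My system crashed yesterday and I have not been able to recover it. I did not set it up, so I don't know all the details, but please ask for anything you need. It used to be that after any VM restart the system would come back up and start working on its own, but since this crash it no longer does: the Docker containers do not start automatically with Docker Swarm.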

Here is my docker info output:

Containers: 168 
Running: 0 
Paused: 0 
Stopped: 168 
Images: 241 
Server Version: 1.12.2 
Storage Driver: aufs 
Root Dir: /var/lib/docker/aufs 
Backing Filesystem: extfs 
Dirs: 1228 
Dirperm1 Supported: true 
Logging Driver: json-file 
Cgroup Driver: cgroupfs 
Plugins: 
Volume: local 
Network: overlay bridge host null 
Swarm: pending 
NodeID: bg8sh8m6zm5llezlmcw00nqx6 
Is Manager: true 
ClusterID: 1wfvx3ze7tm1bb56a5zyk9xqs 
Managers: 1 
Nodes: 2 
Orchestration: 
    Task History Retention Limit: 5 
Raft: 
    Snapshot Interval: 10000 
    Heartbeat Tick: 1 
    Election Tick: 3 
Dispatcher: 
    Heartbeat Period: 5 seconds 
CA Configuration: 
    Expiry Duration: 3 months 
Node Address: ADDRESS //hidden for security reasons 
Runtimes: runc 
Default Runtime: runc 
Security Options: apparmor seccomp 
Kernel Version: 4.4.0-91-generic 
Operating System: Ubuntu 16.04.1 LTS 
OSType: linux 
Architecture: x86_64 
CPUs: 2 
Total Memory: 6.804 GiB 
Name: swarm-manager-1 
ID: AXPO:VFSV:TDT3:6X7Y:QNAO:OZJN:U23R:V5S2:FU33:WUNI:CRPK:2E2C 
Docker Root Dir: /var/lib/docker 
Debug Mode (client): false 
Debug Mode (server): false 
Registry: https://index.docker.io/v1/ 
WARNING: No swap limit support 
Insecure Registries: 
127.0.0.0/8 

My docker node ls output:

ID                           HOSTNAME         STATUS   AVAILABILITY  MANAGER STATUS
bg8sh8m6zm5llezlmcw00nqx6 *  swarm-manager-1  Ready    Active        Leader
c21j8nzzy3151vv06m54yyd1u    swarm-worker-1   Unknown  Active
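
For anyone reading along, the listing above can be filtered to show only the nodes that are not Ready. This is just a rough sketch: the function name not_ready_nodes is made up for illustration, and it assumes the plain-text column layout shown above (including the "*" that marks the current node and shifts the columns by one).

```shell
#!/bin/sh
# Hypothetical helper: reads `docker node ls` text on stdin and prints
# "<hostname> <status>" for every node whose STATUS is not "Ready".
not_ready_nodes() {
    awk 'NR > 1 {
        # The current node has a "*" after its ID, so its columns are shifted by one.
        if ($2 == "*") { host = $3; status = $4 } else { host = $2; status = $3 }
        if (status != "Ready") print host, status
    }'
}
```

On a manager it would be used as `docker node ls | not_ready_nodes`. It is meant only as a quick way to spot stale nodes, not as a health check.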

Here is the output of docker node inspect swarm-manager-1:

[ 
    { 
     "ID": "bg8sh8m6zm5llezlmcw00nqx6", 
     "Version": { 
      "Index": 67823 
     }, 
     "CreatedAt": "2016-10-13T23:46:00.580142516Z", 
     "UpdatedAt": "2017-08-29T19:48:35.4197366Z", 
     "Spec": { 
      "Role": "manager", 
      "Availability": "active" 
     }, 
     "Description": { 
      "Hostname": "swarm-manager-1", 
      "Platform": { 
       "Architecture": "x86_64", 
       "OS": "linux" 
      }, 
      "Resources": { 
       "NanoCPUs": 2000000000, 
       "MemoryBytes": 7305609216 
      }, 
      "Engine": { 
       "EngineVersion": "1.12.2", 
       "Plugins": [ 
        { 
         "Type": "Network", 
         "Name": "bridge" 
        }, 
        { 
         "Type": "Network", 
         "Name": "host" 
        }, 
        { 
         "Type": "Network", 
         "Name": "null" 
        }, 
        { 
         "Type": "Network", 
         "Name": "overlay" 
        }, 
        { 
         "Type": "Volume", 
         "Name": "local" 
        } 
       ] 
      } 
     }, 
     "Status": { 
      "State": "ready" 
     }, 
     "ManagerStatus": { 
      "Leader": true, 
      "Reachability": "reachable", 
      "Addr": "ADDRESS" //hidden 
     } 
    } 
] 

Here is the output of docker node inspect swarm-worker-1:

[ 
    { 
     "ID": "c21j8nzzy3151vv06m54yyd1u", 
     "Version": { 
      "Index": 67824 
     }, 
     "CreatedAt": "2017-02-21T05:42:31.467777741Z", 
     "UpdatedAt": "2017-08-29T19:48:35.4252027Z", 
     "Spec": { 
      "Role": "worker", 
      "Availability": "active" 
     }, 
     "Description": { 
      "Hostname": "swarm-worker-1", 
      "Platform": { 
       "Architecture": "x86_64", 
       "OS": "linux" 
      }, 
      "Resources": { 
       "NanoCPUs": 2000000000, 
       "MemoryBytes": 7305609216 
      }, 
      "Engine": { 
       "EngineVersion": "1.12.2", 
       "Plugins": [ 
        { 
         "Type": "Network", 
         "Name": "bridge" 
        }, 
        { 
         "Type": "Network", 
         "Name": "host" 
        }, 
        { 
         "Type": "Network", 
         "Name": "null" 
        }, 
        { 
         "Type": "Network", 
         "Name": "overlay" 
        }, 
        { 
         "Type": "Volume", 
         "Name": "local" 
        } 
       ] 
      } 
     }, 
     "Status": { 
      "State": "unknown", 
      "Message": "Node moved to \"unknown\" state due to leadership change in cluster" 
     } 
    } 
] 

Any ideas on how to get it working again?

+0

Have you tried 'autorestart' in your stack? Take a look at this doc: https://docs.docker.com/docker-cloud/apps/autorestart/#using-the-api-and-cli

+0

@juliansalas I'm not using Docker Cloud. The thing is, it used to work fine, so I assume some restart mechanism was already in place... – Elsban

+2

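Run 'docker swarm leave' on the swarm worker, then run 'docker swarm join-token worker' on the master and execute the join command it prints on the worker. It should start working. The crash probably caused the problem. If this doesn't help, let us know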

Answers

1

Run docker swarm leave on the swarm worker, then run docker swarm join-token worker on the master, and execute the join command it prints back on the worker. It should start working.

The crash probably caused the problem: it corrupted the swarm state, so you need to re-create the worker's membership in the swarm.
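
The steps above could be sketched roughly as follows. This is only an illustration, not a tested recovery procedure: the function names and the DRY_RUN switch are made up here, and the token and manager address must come from whatever docker swarm join-token worker actually prints on your manager. By default the script only echoes the commands instead of running them.

```shell
#!/bin/sh
# Sketch of the leave/rejoin sequence. DRY_RUN is a hypothetical safety switch:
# with DRY_RUN=1 (the default) commands are printed, not executed.
DRY_RUN="${DRY_RUN:-1}"

run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

# Step 1, on the worker (swarm-worker-1): leave the swarm.
worker_leave() {
    run docker swarm leave
}

# Step 2, on the manager (swarm-manager-1): print the join command for workers.
manager_join_token() {
    run docker swarm join-token worker
}

# Step 3, back on the worker: run the join command printed in step 2.
# $1 = token, $2 = manager address (host:port), both copied from that output.
worker_join() {
    run docker swarm join --token "$1" "$2"
}

# Step 4, on the manager: check that the worker is listed as Ready again.
manager_verify() {
    run docker node ls
}
```

To use it, source the file on each machine and call the function for that step, for example worker_leave on the worker, then manager_join_token on the manager. Set DRY_RUN=0 only once the printed commands look right.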