2017-09-04 46 views
0

我有3个节点设置运行马拉松,mesos主,mesos,奴隶和动物园管理员启用HA配置,然后测试使用mesos,执行简单的Hello应用程序的部署和它的工作作为预期。马拉松应用程序部署卡住得到的等待状态

现在一切都很好,所以我连接到Marathon并部署了一个简单的应用程序来测试马拉松:(echo“hello”>> /tmp/output.txt),但应用程序被吸入“等待”状态。

可能是什么问题,防止马拉松使用mesos资源进行部署?

从mesos主

日志:

I0904 11:23:27.064332 19769 master.cpp:2813] Received SUBSCRIBE call for framework 'marathon' at [email protected]:36324 
I0904 11:23:27.064623 19769 master.cpp:2890] Subscribing framework marathon with checkpointing enabled and capabilities [ PARTITION_AWARE ] 
I0904 11:23:27.064669 19769 master.cpp:6272] Updating info for framework cb16118a-2257-4020-a907-63aa6294e11b-0000 
I0904 11:23:27.064697 19769 master.cpp:2994] Framework cb16118a-2257-4020-a907-63aa6294e11b-0000 (marathon) at [email protected]:36324 failed over 
I0904 11:23:27.065032 19770 hierarchical.cpp:342] Activated framework cb16118a-2257-4020-a907-63aa6294e11b-0000 
I0904 11:23:27.065465 19770 master.cpp:7305] Sending 3 offers to framework cb16118a-2257-4020-a907-63aa6294e11b-0000 (marathon) at [email protected]:36324 
I0904 11:23:27.907865 19769 http.cpp:1115] HTTP GET for /files/read?_=1504517007920&jsonp=jQuery17109098185077823333_1504516979864&length=50000&offset=352538&path=%2Fmaster%2Flog from 192.168.40.1:53525 with User-Agent='Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/60.0.3112.113 Safari/537.36' 
I0904 11:23:28.916651 19768 http.cpp:1115] HTTP GET for /files/read?_=1504517008930&jsonp=jQuery17109098185077823333_1504516979865&length=50000&offset=353797&path=%2Fmaster%2Flog from 192.168.40.1:53525 with User-Agent='Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/60.0.3112.113 Safari/537.36' 
E0904 11:23:30.071293 19775 process.cpp:2450] Failed to shutdown socket with fd 39, address 192.168.40.159:58072: Transport endpoint is not connected 
I0904 11:23:30.073277 19768 master.cpp:1430] Framework cb16118a-2257-4020-a907-63aa6294e11b-0000 (marathon) at [email protected]:36324 disconnected 
I0904 11:23:30.073307 19768 master.cpp:3160] Deactivating framework cb16118a-2257-4020-a907-63aa6294e11b-0000 (marathon) at [email protected]:36324 
I0904 11:23:30.073485 19768 master.cpp:3137] Disconnecting framework cb16118a-2257-4020-a907-63aa6294e11b-0000 (marathon) at [email protected]:36324 
I0904 11:23:30.073496 19768 master.cpp:1445] Giving framework cb16118a-2257-4020-a907-63aa6294e11b-0000 (marathon) at [email protected]:36324 1weeks to failover 
I0904 11:23:30.073519 19768 hierarchical.cpp:374] Deactivated framework cb16118a-2257-4020-a907-63aa6294e11b-0000 

卷曲-XGET 'http://mesosphere2:8098/v2/queue?pretty' | JQ

{ 
    "queue": [ 
    { 
     "count": 1, 
     "delay": { 
     "timeLeftSeconds": 0, 
     "overdue": true 
     }, 
     "since": "2017-09-04T13:12:42.024Z", 
     "processedOffersSummary": { 
     "processedOffersCount": 12, 
     "unusedOffersCount": 12, 
     "lastUnusedOfferAt": "2017-09-04T13:14:52.554Z", 
     "rejectSummaryLastOffers": [ 
      { 
      "reason": "UnfulfilledRole", 
      "declined": 3, 
      "processed": 3 
      }, 
      { 
      "reason": "UnfulfilledConstraint", 
      "declined": 0, 
      "processed": 0 
      }, 
      { 
      "reason": "NoCorrespondingReservationFound", 
      "declined": 0, 
      "processed": 0 
      }, 
      { 
      "reason": "InsufficientCpus", 
      "declined": 0, 
      "processed": 0 
      }, 
      { 
      "reason": "InsufficientMemory", 
      "declined": 0, 
      "processed": 0 
      }, 
      { 
      "reason": "InsufficientDisk", 
      "declined": 0, 
      "processed": 0 
      }, 
      { 
      "reason": "InsufficientGpus", 
      "declined": 0, 
      "processed": 0 
      }, 
      { 
      "reason": "InsufficientPorts", 
      "declined": 0, 
      "processed": 0 
      } 
     ], 
     "rejectSummaryLaunchAttempt": [ 
      { 
      "reason": "UnfulfilledRole", 
      "declined": 12, 
      "processed": 12 
      }, 
      { 
      "reason": "UnfulfilledConstraint", 
      "declined": 0, 
      "processed": 0 
      }, 
      { 
      "reason": "NoCorrespondingReservationFound", 
      "declined": 0, 
      "processed": 0 
      }, 
      { 
      "reason": "InsufficientCpus", 
      "declined": 0, 
      "processed": 0 
      }, 
      { 
      "reason": "InsufficientMemory", 
      "declined": 0, 
      "processed": 0 
      }, 
      { 
      "reason": "InsufficientDisk", 
      "declined": 0, 
      "processed": 0 
      }, 
      { 
      "reason": "InsufficientGpus", 
      "declined": 0, 
      "processed": 0 
      }, 
      { 
      "reason": "InsufficientPorts", 
      "declined": 0, 
      "processed": 0 
      } 
     ] 
     }, 
     "app": { 
     "id": "/test03", 
     "acceptedResourceRoles": [ 
      "slave_public" 
     ], 
     "backoffFactor": 1.15, 
     "backoffSeconds": 1, 
     "container": { 
      "type": "DOCKER", 
      "docker": { 
      "forcePullImage": false, 
      "image": "laghao/hello-marathon", 
      "network": "BRIDGE", 
      "parameters": [], 
      "portMappings": [ 
       { 
       "containerPort": 80, 
       "hostPort": 80, 
       "labels": {}, 
       "protocol": "tcp", 
       "servicePort": 10003 
       } 
      ], 
      "privileged": false 
      }, 
      "volumes": [] 
     }, 
     "cpus": 0.1, 
     "disk": 0, 
     "executor": "", 
     "instances": 1, 
     "labels": {}, 
     "maxLaunchDelaySeconds": 3600, 
     "mem": 64, 
     "gpus": 0, 
     "portDefinitions": [ 
      { 
      "port": 10003, 
      "name": "default", 
      "protocol": "tcp" 
      } 
     ], 
     "requirePorts": false, 
     "upgradeStrategy": { 
      "maximumOverCapacity": 1, 
      "minimumHealthCapacity": 1 
     }, 
     "version": "2017-09-04T13:12:41.993Z", 
     "versionInfo": { 
      "lastScalingAt": "2017-09-04T13:12:41.993Z", 
      "lastConfigChangeAt": "2017-09-04T13:12:41.993Z" 
     }, 
     "killSelection": "YOUNGEST_FIRST", 
     "unreachableStrategy": { 
      "inactiveAfterSeconds": 300, 
      "expungeAfterSeconds": 600 
     } 
     } 
    } 
    ] 
} 
+0

你能显示马拉松日志吗? _等待_表示没有资源可用于满足应用程序限制。在最新的Marathon 1.4+中,您可以使用[/ v2/queue endpoint](https://mesosphere.github.io/marathon/docs/generated/api.html#v2_queue_get)调试缺少给定部署的资源。 – janisz

回答

0

documentation

的应用停留在“等待”永远 这意味着马拉松不会Mesos,允许它来启动这个应用程序的任务收到“资源商情”。最简单的失败是集群中没有足够的可用资源,或者另一个框架拥有所有这些资源。您可以查看Mesos UI的可用资源。请注意,所需资源(如CPU,Mem,Disk)必须全部在单个主机上可用。

如果你不自己想办法了,你创建一个GitHub的问题,请Mesos /状态端点的输出附加到bug报告,以便我们可以检查可用的群集资源。

在你的情况下,应用程序角色需求和代理角色有问题。你可以从UnfulfilledRole中推断出来。

马拉松约1.4卡住部署介绍信息。您可以查询/v2/queue并获得统计为什么优惠被拒绝。

+0

以及我读的线程关于“等待”状态,但资源是可用的,因为我可以通过mesos直接部署所以这个问题在某种程度上mesos马拉松之间的沟通,一个线程在马拉松开集团以及:和/ V2 /队列张贴在那里:https://groups.google.com/forum/#!topic/marathon-framework/r1aKkRXIXAE –

+0

它看起来像问题与角色。你可以向你展示应用程序json和代理配置。 – janisz

+0

你是对的 - 我再次更改了部署脚本,你可以在组中检查它,你可以部署它并给我一个反馈? –

相关问题