Commit Graph

8 Commits

Author SHA1 Message Date
wanyaoqi
4484a96c8d feat(region,host): vgpu support, refector host registor (#17879)
* feat(region,host): refactor host register

* feat(region,host): vgpu support

add nvidia/amd vgpu support.
nvidia vgpu need manaul configure vgpu instance before add in host.conf

Signed-off-by: wanyaoqi <d3lx.yq@gmail.com>

---------

Signed-off-by: wanyaoqi <d3lx.yq@gmail.com>
2023-09-01 15:42:22 +08:00
wanyaoqi
a6705c4452 fix(region,host): host health misc fix
Signed-off-by: wanyaoqi <d3lx.yq@gmail.com>
2023-03-13 09:41:27 +08:00
wanyaoqi
02a9d84d6e fix(region, host): misc fix host health checker (#14575)
- hostagent remove option enable host health, default enabled,
  its rely on etcd endpoint register.
- add option auto migrate on host shutdown.
- fix hostman check network is available.
- use hostname instead hostId as etcd key
- init health checker before host instace init.
- refector host_health checker.

Signed-off-by: wanyaoqi <wanyaoqi@yunion.cn>
2022-07-07 10:28:27 +08:00
wanyaoqi
dc92f09d95 fix(region,host): auto migrate on host down (#14497)
* fix(region,host): auto migrate on host down

This patch fix source servers not shutdown on network unreachable.
Restart etcd session add timeout, and shutdown servers add force
option incase host stuck on scriptStop.

Signed-off-by: wanyaoqi <wanyaoqi@yunion.cn>

* host_health: add check network available

On host etcd keep alive failed, add check networks is available.
If networks available, may be is the etcd not work, in this case
we will net shutdowns servers and wait for etcd ready.

Signed-off-by: wanyaoqi <d3lx.yq@gmail.com>
2022-06-23 16:26:18 +08:00
Zexi Li
8423aa9e63 feat(region,host): usb passthrough 2021-12-28 19:05:42 +08:00
zhaoxiangchun
14f0342b36 fix(host): update host metric tags when server or host changeowner 2021-04-08 17:17:45 +08:00
Zexi Li
95a58cc606 implement generic service informer 2020-05-29 19:59:34 +08:00
wanyaoqi
d727248929 fix host health 2020-04-30 11:32:16 +08:00