- リリース ノート
- 要件
- インストール
- 基本情報
- プロジェクト
- データセット
- ML パッケージ
- パイプライン
- ML スキル
- ML ログ
- AI Fabric での Document Understanding
- 基本的なトラブルシューティング ガイド
サポート
このページには、インストール時および製品の使用中に、バグの報告および問題のトラブルシューティングを行うための、関連情報の入手先が詳しく示されています。
AI Fabric の健全性をチェックして、インストールに関する問題を特定できるようにするための、診断ツールが用意されています。この診断ツールを実行するには、AI Fabric のホストに接続して、次のコマンドを実行します。
bash <(curl https://raw.githubusercontent.com/UiPath/ai-customer-scripts/master/platform/generate-report.sh)
bash <(curl https://raw.githubusercontent.com/UiPath/ai-customer-scripts/master/platform/generate-report.sh)
エアギャップ環境で、マシン自体から上記の URL にアクセスできない場合は、新しいファイル generate-report.sh を作成し、そこに上記のファイルをコピーして貼り付けてから、次のコマンドを実行します。
bash generate-report.sh
bash generate-report.sh
これにより、さまざまな AI Fabric サービスのステータスを報告するファイル aifabric-diagnostics-latest.log (以下の例を参照) が生成されます。AI Fabric マシン上で正しいポートが実際に開かれている場合は、ファイルと ML パッケージをアップロードするためのテストを実行します。すると、証明書と GPU のステータスに関する情報が表示されます。
Fetching Core Services Status
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 867 0 867 0 0 3454 0 --:--:-- --:--:-- --:--:-- 3468
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 862 0 862 0 0 3747 0 --:--:-- --:--:-- --:--:-- 3747
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 470 0 470 0 0 13055 0 --:--:-- --:--:-- --:--:-- 13055
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 569 0 569 0 0 14589 0 --:--:-- --:--:-- --:--:-- 14973
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 444 0 444 0 0 12333 0 --:--:-- --:--:-- --:--:-- 12333
Starting Orchestrator Connection Check
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 492 100 492 0 0 5857 0 --:--:-- --:--:-- --:--:-- 5927
Successfully received response from orchestrator: HTTP/2 200
cache-control: no-store, must-revalidate, no-cache, max-age=0
content-type: application/json; charset=utf-8
x-correlation-id: 655adc9a-df94-47b3-8a35-40ffed513acc
api-supported-versions: 10.0
x-content-type-options: nosniff
x-frame-options: DENY
strict-transport-security: max-age=31536000; includeSubDomains
server:
date: Tue, 08 Dec 2020 14:28:24 GMT
content-length: 492
{"keys":[{"alg":"RS256","e":"AQAB","kid":"BA16
...
Checking aifabric ports availability in the Cluster
aif.snvenkat1.xyz (52.178.221.160:31390) open
aif.snvenkat1.xyz (52.178.221.160:31443) open
aif.snvenkat1.xyz (52.178.221.160:6443) open
Open
Fetching Certificate Details from Orchstrator and AIFabric
depth=0 CN = aifabricqaorchtest.northeurope.cloudapp.azure.com
verify error:num=20:unable to get local issuer certificate
verify return:1
depth=0 CN = aifabricqaorchtest.northeurope.cloudapp.azure.com
verify error:num=21:unable to verify the first certificate
verify return:1
DONE
depth=2 C = US, ST = New Jersey, L = Jersey City, O = The USERTRUST Network, CN = USERTrust RSA Certification Authority
verify return:1
depth=1 C = AT, O = ZeroSSL, CN = ZeroSSL RSA Domain Secure Site CA
verify return:1
depth=0 CN = aif.snvenkat1.xyz
verify return:1
DONE
Check if GPU is installed in the Cluster!!
Node: dm-onebox
GPU Capacity : 1
GPU Node Found!
-----Analysis Start
Core Services Status:
Deployer : "UP"
Trainer : "UP"
PkgManagaer : "UP"
Helper : "UP"
AppManager : "UP"
RabbitMQ : "UP"
AIFabric Ports Status:
AIFabric Port (31390) : Open
Storage Port (31443) : Open
Kubernetes Port (6443) : Open
Databases Health:
Deployer DB : "UP"
Trainer DB : "UP"
Helper DB : "UP"
PkgManager DB : "UP"
AppManager DB : "UP"
DockerRegistry Health:
Deployer Registry : "UP"
Trainer Registry : "UP"
Orchestrator Connection Status:
Orchestrator connection is Healthy!
Certificates Check:
Your orchestrator certificate is valid for following IP/Hosts, please make sure it matches the host/IP you are using in AIFabric Setup.
DNS:aifabricqaorchtest.northeurope.cloudapp.azure.com
Expiry Date of the Orchestrator Certificate : Jul 22 12:11:30 2021 GMT
Your AIFabric Ingress Host certificate is valid for following IP/Hosts, please make sure it matches the host/IP you are using for AIFabric Setup in Orchestrator
DNS:aif.snvenkat1.xyz
Expiry Date of the AIFabric Certificate : Dec 24 23:59:59 2020 GMT
Storage Checks:
1. Object Storage - File Upload Test Successful
2. Object Storage - File Deletion Test Successful
GPU Drivers Check:
GPU Available and Working Fine. Total no of nodes with GPU - 1
-----Analysis End
**Report Generated on Tue Dec 8 14:28:31 UTC 2020
Fetching Core Services Status
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 867 0 867 0 0 3454 0 --:--:-- --:--:-- --:--:-- 3468
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 862 0 862 0 0 3747 0 --:--:-- --:--:-- --:--:-- 3747
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 470 0 470 0 0 13055 0 --:--:-- --:--:-- --:--:-- 13055
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 569 0 569 0 0 14589 0 --:--:-- --:--:-- --:--:-- 14973
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 444 0 444 0 0 12333 0 --:--:-- --:--:-- --:--:-- 12333
Starting Orchestrator Connection Check
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 492 100 492 0 0 5857 0 --:--:-- --:--:-- --:--:-- 5927
Successfully received response from orchestrator: HTTP/2 200
cache-control: no-store, must-revalidate, no-cache, max-age=0
content-type: application/json; charset=utf-8
x-correlation-id: 655adc9a-df94-47b3-8a35-40ffed513acc
api-supported-versions: 10.0
x-content-type-options: nosniff
x-frame-options: DENY
strict-transport-security: max-age=31536000; includeSubDomains
server:
date: Tue, 08 Dec 2020 14:28:24 GMT
content-length: 492
{"keys":[{"alg":"RS256","e":"AQAB","kid":"BA16
...
Checking aifabric ports availability in the Cluster
aif.snvenkat1.xyz (52.178.221.160:31390) open
aif.snvenkat1.xyz (52.178.221.160:31443) open
aif.snvenkat1.xyz (52.178.221.160:6443) open
Open
Fetching Certificate Details from Orchstrator and AIFabric
depth=0 CN = aifabricqaorchtest.northeurope.cloudapp.azure.com
verify error:num=20:unable to get local issuer certificate
verify return:1
depth=0 CN = aifabricqaorchtest.northeurope.cloudapp.azure.com
verify error:num=21:unable to verify the first certificate
verify return:1
DONE
depth=2 C = US, ST = New Jersey, L = Jersey City, O = The USERTRUST Network, CN = USERTrust RSA Certification Authority
verify return:1
depth=1 C = AT, O = ZeroSSL, CN = ZeroSSL RSA Domain Secure Site CA
verify return:1
depth=0 CN = aif.snvenkat1.xyz
verify return:1
DONE
Check if GPU is installed in the Cluster!!
Node: dm-onebox
GPU Capacity : 1
GPU Node Found!
-----Analysis Start
Core Services Status:
Deployer : "UP"
Trainer : "UP"
PkgManagaer : "UP"
Helper : "UP"
AppManager : "UP"
RabbitMQ : "UP"
AIFabric Ports Status:
AIFabric Port (31390) : Open
Storage Port (31443) : Open
Kubernetes Port (6443) : Open
Databases Health:
Deployer DB : "UP"
Trainer DB : "UP"
Helper DB : "UP"
PkgManager DB : "UP"
AppManager DB : "UP"
DockerRegistry Health:
Deployer Registry : "UP"
Trainer Registry : "UP"
Orchestrator Connection Status:
Orchestrator connection is Healthy!
Certificates Check:
Your orchestrator certificate is valid for following IP/Hosts, please make sure it matches the host/IP you are using in AIFabric Setup.
DNS:aifabricqaorchtest.northeurope.cloudapp.azure.com
Expiry Date of the Orchestrator Certificate : Jul 22 12:11:30 2021 GMT
Your AIFabric Ingress Host certificate is valid for following IP/Hosts, please make sure it matches the host/IP you are using for AIFabric Setup in Orchestrator
DNS:aif.snvenkat1.xyz
Expiry Date of the AIFabric Certificate : Dec 24 23:59:59 2020 GMT
Storage Checks:
1. Object Storage - File Upload Test Successful
2. Object Storage - File Deletion Test Successful
GPU Drivers Check:
GPU Available and Working Fine. Total no of nodes with GPU - 1
-----Analysis End
**Report Generated on Tue Dec 8 14:28:31 UTC 2020
<machine-ip>:8800
) に移動し、上部のナビゲーション バーにある [トラブルシューティング] をクリックします。このボタンをクリックすると新しいサポート バンドルが生成されるので、そのバンドルをダウンロードします。
UiPath のサポートにご連絡いただければ、提供されたバンドルを基に問題を解決いたします。
何らかの理由で管理コンソールからサポート バンドルを作成できない場合は、以下のコマンドを使用して、Linux ターミナルからサポート バンドルを作成します。
curl https://krew.sh/support-bundle | bash
kubectl support-bundle https://kots.io
curl https://krew.sh/support-bundle | bash
kubectl support-bundle https://kots.io
以下のように、お使いのマシンに specs.yaml ファイルを作成します。
apiVersion: troubleshoot.replicated.com/v1beta1
kind: Collector
metadata:
name: collector-sample
spec:
collectors:
- clusterInfo: {}
- clusterResources: {}
- ceph: {}
- exec:
args:
- "-U"
- kotsadm
collectorName: kotsadm-postgres-db
command:
- pg_dump
containerName: kotsadm-postgres
name: kots/admin_console
selector:
- app=kotsadm-postgres
timeout: 10s
- logs:
collectorName: kotsadm-postgres-db
name: kots/admin_console
selector:
- app=kotsadm-postgres
- logs:
collectorName: kotsadm-api
name: kots/admin_console
selector:
- app=kotsadm-api
- logs:
collectorName: kotsadm-operator
name: kots/admin_console
selector:
- app=kotsadm-operator
- logs:
collectorName: kotsadm
name: kots/admin_console
selector:
- app=kotsadm
- logs:
collectorName: kurl-proxy-kotsadm
name: kots/admin_console
selector:
- app=kurl-proxy-kotsadm
- secret:
collectorName: kotsadm-replicated-registry
includeValue: false
key: .dockerconfigjson
name: kotsadm-replicated-registry
- logs:
collectorName: rook-ceph-agent
selector:
- app=rook-ceph-agent
namespace: rook-ceph
name: kots/rook
- logs:
collectorName: rook-ceph-mgr
selector:
- app=rook-ceph-mgr
namespace: rook-ceph
name: kots/rook
- logs:
collectorName: rook-ceph-mon
selector:
- app=rook-ceph-mon
namespace: rook-ceph
name: kots/rook
- logs:
collectorName: rook-ceph-operator
selector:
- app=rook-ceph-operator
namespace: rook-ceph
name: kots/rook
- logs:
collectorName: rook-ceph-osd
selector:
- app=rook-ceph-osd
namespace: rook-ceph
name: kots/rook
- logs:
collectorName: rook-ceph-osd-prepare
selector:
- app=rook-ceph-osd-prepare
namespace: rook-ceph
name: kots/rook
- logs:
collectorName: rook-ceph-rgw
selector:
- app=rook-ceph-rgw
namespace: rook-ceph
name: kots/rook
- logs:
collectorName: rook-discover
selector:
- app=rook-discover
namespace: rook-ceph
name: kots/rook
apiVersion: troubleshoot.replicated.com/v1beta1
kind: Collector
metadata:
name: collector-sample
spec:
collectors:
- clusterInfo: {}
- clusterResources: {}
- ceph: {}
- exec:
args:
- "-U"
- kotsadm
collectorName: kotsadm-postgres-db
command:
- pg_dump
containerName: kotsadm-postgres
name: kots/admin_console
selector:
- app=kotsadm-postgres
timeout: 10s
- logs:
collectorName: kotsadm-postgres-db
name: kots/admin_console
selector:
- app=kotsadm-postgres
- logs:
collectorName: kotsadm-api
name: kots/admin_console
selector:
- app=kotsadm-api
- logs:
collectorName: kotsadm-operator
name: kots/admin_console
selector:
- app=kotsadm-operator
- logs:
collectorName: kotsadm
name: kots/admin_console
selector:
- app=kotsadm
- logs:
collectorName: kurl-proxy-kotsadm
name: kots/admin_console
selector:
- app=kurl-proxy-kotsadm
- secret:
collectorName: kotsadm-replicated-registry
includeValue: false
key: .dockerconfigjson
name: kotsadm-replicated-registry
- logs:
collectorName: rook-ceph-agent
selector:
- app=rook-ceph-agent
namespace: rook-ceph
name: kots/rook
- logs:
collectorName: rook-ceph-mgr
selector:
- app=rook-ceph-mgr
namespace: rook-ceph
name: kots/rook
- logs:
collectorName: rook-ceph-mon
selector:
- app=rook-ceph-mon
namespace: rook-ceph
name: kots/rook
- logs:
collectorName: rook-ceph-operator
selector:
- app=rook-ceph-operator
namespace: rook-ceph
name: kots/rook
- logs:
collectorName: rook-ceph-osd
selector:
- app=rook-ceph-osd
namespace: rook-ceph
name: kots/rook
- logs:
collectorName: rook-ceph-osd-prepare
selector:
- app=rook-ceph-osd-prepare
namespace: rook-ceph
name: kots/rook
- logs:
collectorName: rook-ceph-rgw
selector:
- app=rook-ceph-rgw
namespace: rook-ceph
name: kots/rook
- logs:
collectorName: rook-discover
selector:
- app=rook-discover
namespace: rook-ceph
name: kots/rook
次に、以下のコマンドを実行します。
kubectl support-bundle /path/to/spec.yaml
kubectl support-bundle /path/to/spec.yaml
UiPath のサポートにご連絡いただければ、提供されたバンドルを基に問題を解決いたします。
Data Manager の問題を報告する場合は、生成されたログを含めてください。ログを取得するには、以下を実行します。
- Data Manager の右上にある疑問符をクリックします。Data Manager のヘルプ メニューが表示されます。
- [エラー報告] セクションで、[エラー報告用の最新のログを収集] をクリックします。[最近のログ] ウィンドウが表示されます。