有關如何完成以下任務的步驟:
注意:如果需要 SupportAssist 記錄,請參閱 KBA 000135669:「如何從 SupportAssist 企業版匯出 SupportAssist 記錄集合?連線或中斷連線 「
A. 偵錯 Metro Node 問題需要哪些記錄?
注意:
以下是這兩個檔名的範例,日期和時間(顯示為 YYYY-MM-DD-HH.MM.SS)將從收集這些檔名的日期和時間開始:
二. 如何在 Metro Node 叢集上擷取收集診斷?
注意:涵蓋過去 30 天的基本檔案足以調查並解決大部分問題。除非支援部門另有指示,否則應使用這些選項。
若要擷取此資料,請使用下列旗標執行 collect-diagnostics 命令:「--noextended」和「--last-logs 30d」。
在導向器節點 Linux 提示字元 (例如 service@director-1-1-a) 建立 SSH 工作階段,然後登入 vplexcli。
範例輸出:
login as: service Keyboard-interactive authentication prompts from server: | Password: End of keyboard-interactive prompts from server Last login: <date and timestamp data> from x.x.x.x service@director-1-1-a:~> service@director-1-1-a:~> vplexcli Trying ::1... Connected to localhost. Escape character is '^]'. VPlexcli:/>
範例輸出:
VPlexcli:/> collect-diagnostics --noextended --last-logs 30d ('WARNING:The collect-diagnostics command was issued with option --noextended.\n',) The following file(s) will NOT be collected: core files fast trace dump files slow trace dump files udcom trace dump files udcom legacy trace files user-defined performance sink files the management console's heap ('WARNING:Only the logs that are generated in the last 30 days are collected.') 2024-02-09 19:55:12 UTC: ****Initializing collect-diagnostics... 2024-02-09 19:55:13 UTC: No cluster-witness server found. 2024-02-09 19:55:13 UTC: Free space = 88G 2024-02-09 19:55:13 UTC: Total space needed = 1907M ================================================================================ Starting collect-diagnostics, this operation might take a while... ================================================================================ Executing cluster collection ..
C. 如何驗證導向器/節點上現有的收集診斷套裝。
當 collect-diagnostics 命令完成並返回到 vplexcli 提示時,請連線至使用 winscp [或同等 SCP 公用程式] 執行命令的導向器,並導覽至資料夾 /diag/collect-diagnostics-out/
識別具有正確時間戳的日誌檔,並將其下載到本地工作站。
D. 如何中止正在進行的收集診斷
注意: 這是非破壞性活動。由於沒有中止收集程序的直接命令,您必須重新啟動管理主控台。不過,在中止執行中的收集診斷程式之前,請先聯絡支援部門,說明您為何要中止執行收集診斷程式,以確保其正常運作,因為可能會遺失資料。中止動作後重新執行收集診斷時,將無法再次收集遺失的資料。
範例輸出:
VPlexcli:/> collect-diagnostics --noextended --last-logs 30d ('WARNING:The collect-diagnostics command was issued with option --noextended.\n',) The following file(s) will NOT be collected: core files fast trace dump files slow trace dump files udcom trace dump files udcom legacy trace files user-defined performance sink files the management console's heap ('WARNING:Only the logs that are generated in the last 30 days are collected.') 2022-02-09 19:55:12 UTC: ****Initializing collect-diagnostics... 2022-02-09 19:55:13 UTC: No cluster-witness server found. 2022-02-09 19:55:13 UTC: Free space = 88G 2022-02-09 19:55:13 UTC: Total space needed = 1907M ================================================================================ Starting collect-diagnostics, this operation might take a while... ================================================================================ Executing cluster collection ..
範例輸出:
login as: service Using keyboard-interactive authentication. Password: Last login: <date and time stamp data> from x.x.x.x service@director-1-1-b:~>
範例輸出:
service@director-1-1-b:~> sudo systemctl restart VPlexManagementConsole.service
"Connection closed by foreign host."
範例輸出 (檢查輸出的最後一行):
VPlexcli:/> collect-diagnostics --noextended --last-logs 30d ('WARNING:The collect-diagnostics command was issued with option --noextended.\n',) The following file(s) will NOT be collected: core files fast trace dump files slow trace dump files udcom trace dump files udcom legacy trace files user-defined performance sink files the management console's heap ('WARNING:Only the logs that are generated in the last 30 days are collected.') 2022-02-09 20:02:03 UTC: ****Initializing collect-diagnostics... 2022-02-09 20:02:04 UTC: No cluster-witness server found. 2022-02-09 20:02:04 UTC: Free space = 88G 2022-02-09 20:02:04 UTC: Total space needed = 1907M ================================================================================ Starting collect-diagnostics, this operation might take a while... ================================================================================ Executing cluster collection .. ERROR Executing SMS log collection .. Connection closed by foreign host. <<<
*如果未省略擴展檔
範例輸出:
service@director-1-1-b:/diag> ll total 32 drwxr-xr-x 2 service groupSvc 4096 Feb 9 20:03 collect-diagnostics-tmp-ext drwxr-xr-x 2 service groupSvc 4096 Feb 9 20:03 collect-diagnostics-jobs drwxr-xr-x 2 service groupSvc 4096 Feb 9 20:04 collect-diagnostics-out drwxr-xr-x 3 service groupSvc 4096 Feb 9 20:02 collect-diagnostics-tmp drwx------ 2 root root 16384 Jan 27 16:54 lost+found drwx--x--x 3 service groupSvc 4096 Dec 17 03:08 share service@director-1-1-b:/diag>
範例輸出:
service@director-1-1-b:/diag> rm -r collect-diagnostics-jobs service@director-1-1-b:/diag> rm -r collect-diagnostics-tmp service@director-1-1-b:/diag> ll total 24 drwxr-xr-x 2 service groupSvc 4096 Feb 9 20:04 collect-diagnostics-out drwx------ 2 root root 16384 Jan 27 16:54 lost+found drwx--x--x 3 service groupSvc 4096 Dec 17 03:08 share service@director-1-1-b:/diag>
注意:擴展檔通常用於調查節點崩潰。如果正在調查節點當機,且支援部門尚未擷取所有必要的記錄,請先諮詢支援部門,再清理 collect-diagnostics-tmp-ext 目錄,因為這樣做可能會刪除必要的核心檔案。