有关如何完成以下任务的步骤:
提醒:如果需要 SupportAssist 日志,请让客户参阅 KBA 000135669“如何从 SupportAssist Enterprise 导出 SupportAssist 日志收集?已连接或已断开连接 “
A. 调试 Metro Node 问题需要哪些日志?
提醒:
以下是这两个文件名的示例,日期和时间(显示为 YYYY-MM-DD-HH.MM.SS)将从收集这些文件名的日期和时间开始:
B。 如何在 Metro Node 群集上捕获收集诊断信息?
提醒:涵盖过去 30 天的基本文件足以调查和解决大多数问题。除非支持人员另有指示,否则应使用这些选项。
要捕获此数据,请使用以下标记“--noextended”和“--last-logs 30d”运行 collect-diagnostics 命令。
在控制器节点 Linux 提示符下建立 SSH 会话(例如 service@director-1-1-a),然后登录 vplexcli。
输出示例:
login as: service Keyboard-interactive authentication prompts from server: | Password: End of keyboard-interactive prompts from server Last login: <date and timestamp data> from x.x.x.x service@director-1-1-a:~> service@director-1-1-a:~> vplexcli Trying ::1... Connected to localhost. Escape character is '^]'. VPlexcli:/>
输出示例:
VPlexcli:/> collect-diagnostics --noextended --last-logs 30d ('WARNING:The collect-diagnostics command was issued with option --noextended.\n',) The following file(s) will NOT be collected: core files fast trace dump files slow trace dump files udcom trace dump files udcom legacy trace files user-defined performance sink files the management console's heap ('WARNING:Only the logs that are generated in the last 30 days are collected.') 2024-02-09 19:55:12 UTC: ****Initializing collect-diagnostics... 2024-02-09 19:55:13 UTC: No cluster-witness server found. 2024-02-09 19:55:13 UTC: Free space = 88G 2024-02-09 19:55:13 UTC: Total space needed = 1907M ================================================================================ Starting collect-diagnostics, this operation might take a while... ================================================================================ Executing cluster collection ..
C。 如何验证控制器/节点上的现有 collect-diagnostics 程序包。
当 collect-diagnostics 命令完成并返回到 vplexcli 提示符时,使用 winscp [或等效的 SCP 实用程序] 连接到您运行该命令的控制器,然后导航到文件夹 /diag/collect-diagnostics-out/
使用正确的时间戳识别日志文件,并将其下载到本地工作站。
D。 如何中止持续收集诊断
注意: 这是一项无中断活动。由于没有直接命令可中止收集过程,因此您必须重新启动管理控制台。但是,在中止正在运行的 collect-diagnostics 之前,请联系支持人员解释中止运行 collect-diagnostics 的原因,以确保它正常,因为可能会丢失数据。在中止操作后重新运行 collect-diagnostics 时,这些丢失的数据将无法再次收集。
输出示例:
VPlexcli:/> collect-diagnostics --noextended --last-logs 30d ('WARNING:The collect-diagnostics command was issued with option --noextended.\n',) The following file(s) will NOT be collected: core files fast trace dump files slow trace dump files udcom trace dump files udcom legacy trace files user-defined performance sink files the management console's heap ('WARNING:Only the logs that are generated in the last 30 days are collected.') 2022-02-09 19:55:12 UTC: ****Initializing collect-diagnostics... 2022-02-09 19:55:13 UTC: No cluster-witness server found. 2022-02-09 19:55:13 UTC: Free space = 88G 2022-02-09 19:55:13 UTC: Total space needed = 1907M ================================================================================ Starting collect-diagnostics, this operation might take a while... ================================================================================ Executing cluster collection ..
输出示例:
login as: service Using keyboard-interactive authentication. Password: Last login: <date and time stamp data> from x.x.x.x service@director-1-1-b:~>
输出示例:
service@director-1-1-b:~> sudo systemctl restart VPlexManagementConsole.service
"Connection closed by foreign host."
输出示例(检查输出的最后一行):
VPlexcli:/> collect-diagnostics --noextended --last-logs 30d ('WARNING:The collect-diagnostics command was issued with option --noextended.\n',) The following file(s) will NOT be collected: core files fast trace dump files slow trace dump files udcom trace dump files udcom legacy trace files user-defined performance sink files the management console's heap ('WARNING:Only the logs that are generated in the last 30 days are collected.') 2022-02-09 20:02:03 UTC: ****Initializing collect-diagnostics... 2022-02-09 20:02:04 UTC: No cluster-witness server found. 2022-02-09 20:02:04 UTC: Free space = 88G 2022-02-09 20:02:04 UTC: Total space needed = 1907M ================================================================================ Starting collect-diagnostics, this operation might take a while... ================================================================================ Executing cluster collection .. ERROR Executing SMS log collection .. Connection closed by foreign host. <<<
*如果未省略扩展文件
输出示例:
service@director-1-1-b:/diag> ll total 32 drwxr-xr-x 2 service groupSvc 4096 Feb 9 20:03 collect-diagnostics-tmp-ext drwxr-xr-x 2 service groupSvc 4096 Feb 9 20:03 collect-diagnostics-jobs drwxr-xr-x 2 service groupSvc 4096 Feb 9 20:04 collect-diagnostics-out drwxr-xr-x 3 service groupSvc 4096 Feb 9 20:02 collect-diagnostics-tmp drwx------ 2 root root 16384 Jan 27 16:54 lost+found drwx--x--x 3 service groupSvc 4096 Dec 17 03:08 share service@director-1-1-b:/diag>
输出示例:
service@director-1-1-b:/diag> rm -r collect-diagnostics-jobs service@director-1-1-b:/diag> rm -r collect-diagnostics-tmp service@director-1-1-b:/diag> ll total 24 drwxr-xr-x 2 service groupSvc 4096 Feb 9 20:04 collect-diagnostics-out drwx------ 2 root root 16384 Jan 27 16:54 lost+found drwx--x--x 3 service groupSvc 4096 Dec 17 03:08 share service@director-1-1-b:/diag>
提醒:扩展文件通常用于调查节点崩溃。如果正在对节点崩溃进行调查,并且支持人员未捕获所有必要的日志,请在清理 collect-diagnostics-tmp-ext 目录之前与支持人员联系,因为这样做可能会删除必要的核心文件。