Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 1 | *** Settings *** |
| 2 | Documentation Utility for RAS test scenarios through HOST & BMC. |
| 3 | Resource ../../lib/utils.robot |
Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 4 | Resource ../../lib/ras/host_utils.robot |
| 5 | Resource ../../lib/resource.robot |
| 6 | Resource ../../lib/state_manager.robot |
| 7 | Resource ../../lib/boot_utils.robot |
| 8 | Variables ../../lib/ras/variables.py |
| 9 | Variables ../../data/variables.py |
| 10 | Resource ../../lib/dump_utils.robot |
| 11 | |
| 12 | Library DateTime |
| 13 | Library OperatingSystem |
| 14 | Library random |
| 15 | Library Collections |
| 16 | |
| 17 | *** Variables *** |
| 18 | ${stack_mode} normal |
| 19 | |
| 20 | *** Keywords *** |
| 21 | |
| 22 | Verify And Clear Gard Records On HOST |
| 23 | [Documentation] Verify And Clear gard records on HOST. |
| 24 | |
| 25 | ${output}= Gard Operations On OS list |
| 26 | Should Not Contain ${output} No GARD |
| 27 | Gard Operations On OS clear all |
| 28 | |
| 29 | Verify Error Log Entry |
| 30 | [Documentation] Verify error log entry & signature description. |
| 31 | [Arguments] ${signature_desc} ${log_prefix} |
| 32 | # Description of argument(s): |
| 33 | # signature_desc Error log signature description. |
| 34 | # log_prefix Log path prefix. |
| 35 | |
Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 36 | |
| 37 | Error Logs Should Exist |
| 38 | |
| 39 | Collect eSEL Log ${log_prefix} |
| 40 | ${error_log_file_path}= Catenate ${log_prefix}esel.txt |
| 41 | ${rc} ${output}= Run and Return RC and Output |
| 42 | ... grep -i ${signature_desc} ${error_log_file_path} |
| 43 | Should Be Equal ${rc} ${0} |
| 44 | Should Not Be Empty ${output} |
| 45 | |
| 46 | Inject Recoverable Error With Threshold Limit |
| 47 | [Documentation] Inject and verify recoverable error on processor through |
Sridevi Ramesh | 9c6ec28 | 2019-03-25 03:35:46 -0500 | [diff] [blame] | 48 | ... BMC/HOST. |
Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 49 | ... Test sequence: |
Sridevi Ramesh | 9c6ec28 | 2019-03-25 03:35:46 -0500 | [diff] [blame] | 50 | ... 1. Inject recoverable error on a given target |
Sridevi Ramesh | 3e2a3bd | 2019-05-09 05:30:53 -0500 | [diff] [blame] | 51 | ... (e.g: Processor core, CAPP, MCA) through BMC/HOST. |
Sridevi Ramesh | 9c6ec28 | 2019-03-25 03:35:46 -0500 | [diff] [blame] | 52 | ... 2. Check If HOST is running. |
| 53 | ... 3. Verify error log entry & signature description. |
Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 54 | ... 4. Verify & clear gard records. |
Sridevi Ramesh | 9617ebd | 2019-11-25 10:57:21 -0600 | [diff] [blame] | 55 | [Arguments] ${interface_type} ${fir_address} ${value} ${threshold_limit} |
Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 56 | ... ${signature_desc} ${log_prefix} |
| 57 | # Description of argument(s): |
| 58 | # interface_type Inject error through 'BMC' or 'HOST'. |
Sridevi Ramesh | 9617ebd | 2019-11-25 10:57:21 -0600 | [diff] [blame] | 59 | # fir_address FIR (Fault isolation register) value (e.g. 2011400). |
| 60 | # value (e.g 2000000000000000). |
Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 61 | # threshold_limit Threshold limit (e.g 1, 5, 32). |
| 62 | # signature_desc Error log signature description. |
| 63 | # log_prefix Log path prefix. |
| 64 | |
Sridevi Ramesh | f6e0886 | 2019-11-11 08:37:08 -0600 | [diff] [blame] | 65 | Run Keyword Inject Error Through ${interface_type} |
Sridevi Ramesh | 9617ebd | 2019-11-25 10:57:21 -0600 | [diff] [blame] | 66 | ... ${fir_address} ${value} ${threshold_limit} ${master_proc_chip} |
Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 67 | |
| 68 | Is Host Running |
| 69 | ${output}= Gard Operations On OS list |
| 70 | Should Contain ${output} No GARD |
| 71 | Verify Error Log Entry ${signature_desc} ${log_prefix} |
Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 72 | |
| 73 | |
Sridevi Ramesh | 9c6ec28 | 2019-03-25 03:35:46 -0500 | [diff] [blame] | 74 | Inject Unrecoverable Error |
| 75 | [Documentation] Inject and verify unrecoverable error on processor through |
| 76 | ... BMC/HOST. |
Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 77 | ... Test sequence: |
Sridevi Ramesh | 9c6ec28 | 2019-03-25 03:35:46 -0500 | [diff] [blame] | 78 | ... 1. Inject unrecoverable error on a given target |
Sridevi Ramesh | 3e2a3bd | 2019-05-09 05:30:53 -0500 | [diff] [blame] | 79 | ... (e.g: Processor core, CAPP, MCA) through BMC/HOST. |
Sridevi Ramesh | 9c6ec28 | 2019-03-25 03:35:46 -0500 | [diff] [blame] | 80 | ... 2. Check If HOST is rebooted. |
| 81 | ... 3. Verify & clear gard records. |
| 82 | ... 4. Verify error log entry & signature description. |
| 83 | ... 5. Verify & clear dump entry. |
Sridevi Ramesh | 9617ebd | 2019-11-25 10:57:21 -0600 | [diff] [blame] | 84 | [Arguments] ${interface_type} ${fir_address} ${value} ${threshold_limit} |
Sridevi Ramesh | f6e0886 | 2019-11-11 08:37:08 -0600 | [diff] [blame] | 85 | ... ${signature_desc} ${log_prefix} ${bmc_reboot}=${0} |
Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 86 | # Description of argument(s): |
Sridevi Ramesh | 9c6ec28 | 2019-03-25 03:35:46 -0500 | [diff] [blame] | 87 | # interface_type Inject error through 'BMC' or 'HOST'. |
Sridevi Ramesh | 9617ebd | 2019-11-25 10:57:21 -0600 | [diff] [blame] | 88 | # fir_address FIR (Fault isolation register) value (e.g. 2011400). |
| 89 | # value (e.g 2000000000000000). |
Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 90 | # threshold_limit Threshold limit (e.g 1, 5, 32). |
| 91 | # signature_desc Error Log signature description. |
| 92 | # (e.g 'mcs(n0p0c0) (MCFIR[0]) mc internal recoverable') |
| 93 | # log_prefix Log path prefix. |
Sridevi Ramesh | f6e0886 | 2019-11-11 08:37:08 -0600 | [diff] [blame] | 94 | # bmc_reboot Do bmc reboot If bmc_reboot is set. |
Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 95 | |
Sridevi Ramesh | f6e0886 | 2019-11-11 08:37:08 -0600 | [diff] [blame] | 96 | Run Keyword Inject Error Through ${interface_type} |
Sridevi Ramesh | 9617ebd | 2019-11-25 10:57:21 -0600 | [diff] [blame] | 97 | ... ${fir_address} ${value} ${threshold_limit} ${master_proc_chip} |
Sridevi Ramesh | f6e0886 | 2019-11-11 08:37:08 -0600 | [diff] [blame] | 98 | |
| 99 | # Do BMC Reboot after error injection. |
| 100 | Run Keyword If ${bmc_reboot} Run Keywords |
| 101 | ... Initiate BMC Reboot |
| 102 | ... Wait For BMC Ready |
| 103 | ... Initiate Host PowerOff |
| 104 | ... Initiate Host Boot |
Sridevi Ramesh | 9c6ec28 | 2019-03-25 03:35:46 -0500 | [diff] [blame] | 105 | ... ELSE |
Sridevi Ramesh | f6e0886 | 2019-11-11 08:37:08 -0600 | [diff] [blame] | 106 | ... Wait Until Keyword Succeeds 500 sec 20 sec Is Host Rebooted |
Sridevi Ramesh | 9c6ec28 | 2019-03-25 03:35:46 -0500 | [diff] [blame] | 107 | |
Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 108 | Wait for OS |
| 109 | Verify Error Log Entry ${signature_desc} ${log_prefix} |
Rahul Maheshwari | a89ff9e | 2020-09-25 05:04:33 -0500 | [diff] [blame] | 110 | |
| 111 | ${dump_service_status} ${stderr} ${rc}= BMC Execute Command systemctl status xyz.openbmc_project.Dump.Manager.service |
| 112 | Should Contain ${dump_service_status} Active: active (running) |
| 113 | |
| 114 | ${resp}= OpenBMC Get Request ${DUMP_URI} |
| 115 | Run Keyword If '${resp.status_code}' == '${HTTP_NOT_FOUND}' |
| 116 | ... Set Test Variable ${DUMP_ENTRY_URI} /xyz/openbmc_project/dump/entry/ |
| 117 | |
Sridevi Ramesh | f6e0886 | 2019-11-11 08:37:08 -0600 | [diff] [blame] | 118 | Read Properties ${DUMP_ENTRY_URI}list |
Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 119 | Delete All BMC Dump |
| 120 | Verify And Clear Gard Records On HOST |
| 121 | |
Rahul Maheshwari | a89ff9e | 2020-09-25 05:04:33 -0500 | [diff] [blame] | 122 | |
Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 123 | Fetch FIR Address Translation Value |
| 124 | [Documentation] Fetch FIR address translation value through HOST. |
Sridevi Ramesh | 9617ebd | 2019-11-25 10:57:21 -0600 | [diff] [blame] | 125 | [Arguments] ${fir_address} ${target_type} |
Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 126 | # Description of argument(s): |
Sridevi Ramesh | 9617ebd | 2019-11-25 10:57:21 -0600 | [diff] [blame] | 127 | # fir_address FIR (Fault isolation register) value (e.g. '2011400'). |
Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 128 | # core_id Core ID (e.g. '9'). |
| 129 | # target_type Target type (e.g. 'EX', 'EQ', 'C'). |
| 130 | |
| 131 | Login To OS Host |
| 132 | Copy Address Translation Utils To HOST OS |
| 133 | |
| 134 | # Fetch processor chip IDs. |
| 135 | ${proc_chip_id}= Get ProcChipId From OS Processor ${master_proc_chip} |
| 136 | # Example output: |
| 137 | # 00000000 |
| 138 | |
| 139 | ${core_ids}= Get Core IDs From OS ${proc_chip_id[-1]} |
| 140 | # Example output: |
| 141 | #./probe_cpus.sh | grep 'CHIP ID: 0' | cut -c21-22 |
| 142 | # ['14', '15', '16', '17'] |
| 143 | |
| 144 | # Ignoring master core ID. |
| 145 | ${output}= Get Slice From List ${core_ids} 1 |
| 146 | # Feth random non-master core ID. |
| 147 | ${core_ids_sub_list}= Evaluate random.sample(${core_ids}, 1) random |
| 148 | ${core_id}= Get From List ${core_ids_sub_list} 0 |
| 149 | ${translated_fir_addr}= FIR Address Translation Through HOST |
Sridevi Ramesh | 9617ebd | 2019-11-25 10:57:21 -0600 | [diff] [blame] | 150 | ... ${fir_address} ${core_id} ${target_type} |
Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 151 | |
| 152 | [Return] ${translated_fir_addr} |
| 153 | |
| 154 | RAS Test SetUp |
| 155 | [Documentation] Validates input parameters. |
| 156 | |
| 157 | Should Not Be Empty |
| 158 | ... ${OS_HOST} msg=You must provide DNS name/IP of the OS host. |
| 159 | Should Not Be Empty |
| 160 | ... ${OS_USERNAME} msg=You must provide OS host user name. |
| 161 | Should Not Be Empty |
| 162 | ... ${OS_PASSWORD} msg=You must provide OS host user password. |
| 163 | |
Sridevi Ramesh | f6e0886 | 2019-11-11 08:37:08 -0600 | [diff] [blame] | 164 | Smart Power Off |
| 165 | |
Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 166 | # Boot to OS. |
Sridevi Ramesh | f6e0886 | 2019-11-11 08:37:08 -0600 | [diff] [blame] | 167 | REST Power On quiet=${1} |
Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 168 | # Adding delay after host bring up. |
| 169 | Sleep 60s |
| 170 | |
| 171 | RAS Suite Setup |
| 172 | [Documentation] Create RAS log directory to store all RAS test logs. |
| 173 | |
| 174 | ${RAS_LOG_DIR_PATH}= Catenate ${EXECDIR}/RAS_logs/ |
| 175 | Set Suite Variable ${RAS_LOG_DIR_PATH} |
| 176 | Set Suite Variable ${master_proc_chip} False |
| 177 | |
| 178 | Create Directory ${RAS_LOG_DIR_PATH} |
| 179 | OperatingSystem.Directory Should Exist ${RAS_LOG_DIR_PATH} |
| 180 | Empty Directory ${RAS_LOG_DIR_PATH} |
| 181 | |
| 182 | Should Not Be Empty ${ESEL_BIN_PATH} |
| 183 | Set Environment Variable PATH %{PATH}:${ESEL_BIN_PATH} |
| 184 | |
| 185 | # Boot to Os. |
Sridevi Ramesh | f6e0886 | 2019-11-11 08:37:08 -0600 | [diff] [blame] | 186 | REST Power On quiet=${1} |
Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 187 | |
| 188 | # Check Opal-PRD service enabled on host. |
| 189 | ${opal_prd_state}= Is Opal-PRD Service Enabled |
| 190 | Run Keyword If '${opal_prd_state}' == 'disabled' |
| 191 | ... Enable Opal-PRD Service On HOST |
| 192 | |
| 193 | RAS Suite Cleanup |
| 194 | [Documentation] Perform RAS suite cleanup and verify that host |
| 195 | ... boots after test suite run. |
| 196 | |
| 197 | # Boot to OS. |
Michael Walsh | 9fbc1f0 | 2019-10-22 13:39:44 -0500 | [diff] [blame] | 198 | REST Power On |
Sridevi Ramesh | 1d85af0 | 2019-02-22 04:08:15 -0600 | [diff] [blame] | 199 | Delete Error Logs |
| 200 | Gard Operations On OS clear all |
Sridevi Ramesh | 3e2a3bd | 2019-05-09 05:30:53 -0500 | [diff] [blame] | 201 | |
| 202 | |
| 203 | Inject Error At HOST Boot Path |
| 204 | |
| 205 | [Documentation] Inject and verify recoverable error on processor through |
| 206 | ... BMC using pdbg tool at HOST Boot path. |
| 207 | ... Test sequence: |
| 208 | ... 1. Inject error on a given target |
| 209 | ... (e.g: Processor core, CAPP, MCA) through BMC using |
| 210 | ... pdbg tool at HOST Boot path. |
| 211 | ... 2. Check If HOST is rebooted and running. |
| 212 | ... 3. Verify error log entry & signature description. |
| 213 | ... 4. Verify & clear gard records. |
Sridevi Ramesh | 9617ebd | 2019-11-25 10:57:21 -0600 | [diff] [blame] | 214 | [Arguments] ${fir_address} ${value} ${signature_desc} ${log_prefix} |
Sridevi Ramesh | 3e2a3bd | 2019-05-09 05:30:53 -0500 | [diff] [blame] | 215 | # Description of argument(s): |
Sridevi Ramesh | 9617ebd | 2019-11-25 10:57:21 -0600 | [diff] [blame] | 216 | # fir_address FIR (Fault isolation register) value (e.g. 2011400). |
| 217 | # value (e.g 2000000000000000). |
Sridevi Ramesh | 3e2a3bd | 2019-05-09 05:30:53 -0500 | [diff] [blame] | 218 | # signature_desc Error log signature description. |
| 219 | # log_prefix Log path prefix. |
| 220 | |
Sridevi Ramesh | 9617ebd | 2019-11-25 10:57:21 -0600 | [diff] [blame] | 221 | Inject Error Through BMC At HOST Boot ${fir_address} ${value} |
Sridevi Ramesh | 3e2a3bd | 2019-05-09 05:30:53 -0500 | [diff] [blame] | 222 | |
| 223 | Wait Until Keyword Succeeds 500 sec 20 sec Is Host Rebooted |
| 224 | Wait for OS |
| 225 | Verify Error Log Entry ${signature_desc} ${log_prefix} |
| 226 | Verify And Clear Gard Records On HOST |