HPE Storage Users Group
https://3parug.com/

Failed disks. How fix errors and replace.
https://3parug.com/viewtopic.php?f=18&t=3279
Page 1 of 1

Author:  arunreddyn [ Fri Sep 13, 2019 5:38 pm ]
Post subject:  Failed disks. How fix errors and replace.

Hello,

I have some failed/degraded disks. There are no amber lights on the front. I know for disk 8 some time back this failed and my admin replaced this with another one however looks like he put in a different media type. Can you please help how to go about these?


cli% showpd -s (pasted only ones with issue)
Id CagePos Type -State-- -------------------------------------------Detailed_State------------------------
8 0:8:0 FC failed vacated,invalid_media,servicing
95 3:23:0 SSD failed vacated,disabled_A_port,disabled_B_port,invalid_media,no_valid_ports,invalid,inquiry_failed,servicing
109 4:13:0? FC failed vacated,missing,invalid_media,no_valid_ports,servicing
126 0:8:0? SSD degraded missing,no_valid_ports,servicing

************************
cli% showpd -failed -degraded

---Size(MB)--- ----Ports----
Id CagePos Type RPM State Total Free A B Capacity(GB)
8 0:8:0 FC 10 failed 417792 0 1:0:1* 0:0:1 450
95 3:23:0 SSD 150 failed 189440 0 1:0:2- 0:0:2- 200
109 4:13:0? FC 10 failed 417792 0 ----- ----- --
126 0:8:0? SSD 150 degraded 189440 189440 ----- ----- --
-----------------------------------------------------------------------
4 total 1214464 189440



-------------------------------------------------
servicemag

cli% servicemag status
Cage 0, magazine 8:
The magazine was successfully brought offline by a servicemag start command.
The command completed Sat Jan 12 20:47:48 2019.
servicemag start -wait -pdid 126 -- Succeeded

Cage 3, magazine 23:
The magazine was successfully brought offline by a servicemag start command.
The command completed Fri May 17 10:19:48 2019.
servicemag start -wait -pdid 95 -- Succeeded

Attachments:
Capture.PNG
Capture.PNG [ 13.72 KiB | Viewed 17346 times ]

Author:  Yavor [ Fri Sep 20, 2019 9:16 am ]
Post subject:  Re: Failed disks. How fix errors and replace.

Hello,
how about you take the SSD from 0:8:0 and put it to 3:23:0
The other disks you have to replace i guess.
Run showpd -i <PD ID Number> to get the model of the disk.
It is a good practice to check :
showpd -space <PD ID Number>
before replacing. It should show all the size in failed.
For the disk in cage4 there should be some more investigation.
Eventually you can manually run servicemag and then replace it once it is finished.

Author:  arunreddyn [ Thu Oct 03, 2019 12:32 am ]
Post subject:  Re: Failed disks. How fix errors and replace.

Hi Yavor,

I replaced the disk on 0:8 location with same 200GB SSD but it gave error resuming serviemag. error is "New disk 146 is smaller than replaced disk 8". Is there a way to force servicemag to use the current disk given its same type? May be the new one is off by few bytes :?:

sanhou01 cli% servicemag status -d 0 8
A servicemag resume command failed on this magazine.
The command completed at Wed Oct 2 16:19:47 2019.
failed to retrieve time that relocation started, no estimate available
The output of the servicemag resume was:
servicemag resume 0 8
... mag 0 8 already onlooped
... firmware is current on pd WWN [5000CCA01331143B]
... firmware is current on pd WWN [5000CCA02231FC5B] Id [ 8]
... firmware is current on pd WWN [5000CCA0131BB443] Id [126]
... checking for valid disks...
... checking for valid disks...
... disks in mag : 0 8
... normal disks: WWN [5000CCA01331143B] Id [146] diskpos [0]
... not normal disks: WWN [5000CCA02231FC5B] Id [ 8]
.................... WWN [5000CCA0131BB443] Id [126]
... verifying spare space for disks 8 and 146
Failed --
New disk 146 is smaller than replaced disk 8
servicemag resume 0 8 -- Failed



sanhou01 cli% showpd -c -failed -degraded
------- Normal Chunklets -------- ---- Spare Chunklets ----
- Used - -------- Unused -------- - Used - ---- Unused ----
Id CagePos Type State Total OK Fail Free Uninit Unavail Fail OK Fail Free Uninit Fail
8 0:8:0? FC failed 408 0 0 0 0 63 326 0 0 0 0 19
95 3:23:0 SSD failed 185 0 0 0 35 0 127 0 0 0 0 23
101 4:5:0 FC failed 408 0 0 0 54 0 336 0 0 0 0 18
109 4:13:0? FC failed 408 0 0 0 90 0 318 0 0 0 0 0
126 0:8:0? SSD degraded 185 0 0 185 0 0 0 0 0 0 0 0
146 0:8:0 SSD degraded 185 0 0 0 0 185 0 0 0 0 0 0
-------------------------------------------------------------------------------------------
6 total 1779 0 0 185 179 248 1107 0 0 0 0 60

Author:  MammaGutt [ Fri Oct 04, 2019 2:37 am ]
Post subject:  Re: Failed disks. How fix errors and replace.

So....

you have 3 failed drives...
450GB 10k FC drives in 0:8:0 and 4:13:0
200GB 150k SSD in 3:23:0

Why are you still trying to push the SSD into 0:8:0? That slot is waiting for a 450GB 10k FC drive... The SSD needs to go into 3:23:0...

Author:  arunreddyn [ Thu Oct 10, 2019 2:25 pm ]
Post subject:  Re: Failed disks. How fix errors and replace.

Thanks Mammagut. I think i got confused with the disk and inserted incorrect one in the slot...I placed the SSD on 3:23:0 and looks like it went fine but the old PD ID is still showing up and new PD is normal. I will post another thread.

Page 1 of 1 All times are UTC - 5 hours
Powered by phpBB © 2000, 2002, 2005, 2007 phpBB Group
http://www.phpbb.com/