HPE Storage Users Group

A Storage Administrator Community




 Post subject: Failed disks. How fix errors and replace.
Posted: Fri Sep 13, 2019 5:38 pm

Joined: Sun Aug 25, 2019 10:11 pm
Posts: 13
Hello,

I have some failed and degraded disks, but there are no amber lights on the front. Disk 8 failed some time ago and my admin replaced it, but it looks like he put in a drive of a different media type. Can you please help me work out how to fix these?


cli% showpd -s (only the disks with issues are pasted)
 Id CagePos Type -State-- -------------------------------------------Detailed_State------------------------
  8 0:8:0   FC   failed   vacated,invalid_media,servicing
 95 3:23:0  SSD  failed   vacated,disabled_A_port,disabled_B_port,invalid_media,no_valid_ports,invalid,inquiry_failed,servicing
109 4:13:0? FC   failed   vacated,missing,invalid_media,no_valid_ports,servicing
126 0:8:0?  SSD  degraded missing,no_valid_ports,servicing

************************
cli% showpd -failed -degraded

                              ---Size(MB)--- ----Ports----
 Id CagePos Type RPM State      Total   Free A      B      Capacity(GB)
  8 0:8:0   FC    10 failed    417792      0 1:0:1* 0:0:1  450
 95 3:23:0  SSD  150 failed    189440      0 1:0:2- 0:0:2- 200
109 4:13:0? FC    10 failed    417792      0 -----  -----  --
126 0:8:0?  SSD  150 degraded  189440 189440 -----  -----  --
-----------------------------------------------------------------------
  4 total                     1214464 189440
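
A quick way to spot the underlying problem in a listing like this is to group the rows by cage position and look for slots claimed by more than one PD ID. The following is only a minimal sketch in Python, not an HPE tool: the rows are copied from the output above, and `parse_rows` and `slots_with_conflicts` are hypothetical helper names. Reading the trailing `?` as "position not confirmed" is an assumption; in the output above it appears only on the disks whose detailed state includes `missing`.

```python
from collections import defaultdict

# Rows copied from the `showpd -failed -degraded` output above.
SHOWPD_ROWS = """\
8 0:8:0 FC 10 failed 417792 0 1:0:1* 0:0:1 450
95 3:23:0 SSD 150 failed 189440 0 1:0:2- 0:0:2- 200
109 4:13:0? FC 10 failed 417792 0 ----- ----- --
126 0:8:0? SSD 150 degraded 189440 189440 ----- ----- --
"""


def parse_rows(text):
    """Turn whitespace-separated showpd rows into dictionaries."""
    disks = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 7 or not fields[0].isdigit():
            continue  # skip headers, separators, and the totals line
        disks.append({
            "id": int(fields[0]),
            # Assumption: a trailing "?" marks a position that is not confirmed.
            "slot": fields[1].rstrip("?"),
            "confirmed": not fields[1].endswith("?"),
            "type": fields[2],
            "state": fields[4],
            "total_mb": int(fields[5]),
        })
    return disks


def slots_with_conflicts(disks):
    """Return {slot: [disks]} for slots claimed by more than one PD ID."""
    by_slot = defaultdict(list)
    for disk in disks:
        by_slot[disk["slot"]].append(disk)
    return {slot: ds for slot, ds in by_slot.items() if len(ds) > 1}


disks = parse_rows(SHOWPD_ROWS)
conflicts = slots_with_conflicts(disks)
for slot, ds in conflicts.items():
    summary = ", ".join(f"PD {d['id']} ({d['type']}, {d['state']})" for d in ds)
    print(f"slot {slot}: {summary}")
```

With the rows above, the only slot reported is 0:8:0, which is claimed by both PD 8 (FC) and PD 126 (SSD).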



-------------------------------------------------
servicemag

cli% servicemag status
Cage 0, magazine 8:
The magazine was successfully brought offline by a servicemag start command.
The command completed Sat Jan 12 20:47:48 2019.
servicemag start -wait -pdid 126 -- Succeeded

Cage 3, magazine 23:
The magazine was successfully brought offline by a servicemag start command.
The command completed Fri May 17 10:19:48 2019.
servicemag start -wait -pdid 95 -- Succeeded


 Post subject: Re: Failed disks. How fix errors and replace.
Posted: Fri Sep 20, 2019 9:16 am

Joined: Tue Feb 06, 2018 11:55 am
Posts: 10
Hello,
How about taking the SSD from 0:8:0 and putting it in 3:23:0?
I guess the other disks will have to be replaced.
Run showpd -i <PD ID Number> to get the model of the disk.
It is good practice to check
showpd -space <PD ID Number>
before replacing a disk. It should show all of the disk's capacity as failed.
The disk in cage 4 needs some more investigation.
If necessary, you can run servicemag manually and then replace the disk once it has finished.


 Post subject: Re: Failed disks. How fix errors and replace.
Posted: Thu Oct 03, 2019 12:32 am

Joined: Sun Aug 25, 2019 10:11 pm
Posts: 13
Hi Yavor,

I replaced the disk in slot 0:8 with the same kind of 200GB SSD, but resuming servicemag failed with the error "New disk 146 is smaller than replaced disk 8". Is there a way to force servicemag to use the current disk, given that it is the same type? Maybe the new one is off by a few bytes?

sanhou01 cli% servicemag status -d 0 8
A servicemag resume command failed on this magazine.
The command completed at Wed Oct 2 16:19:47 2019.
failed to retrieve time that relocation started, no estimate available
The output of the servicemag resume was:
servicemag resume 0 8
... mag 0 8 already onlooped
... firmware is current on pd WWN [5000CCA01331143B]
... firmware is current on pd WWN [5000CCA02231FC5B] Id [ 8]
... firmware is current on pd WWN [5000CCA0131BB443] Id [126]
... checking for valid disks...
... checking for valid disks...
... disks in mag : 0 8
... normal disks: WWN [5000CCA01331143B] Id [146] diskpos [0]
... not normal disks: WWN [5000CCA02231FC5B] Id [ 8]
.................... WWN [5000CCA0131BB443] Id [126]
... verifying spare space for disks 8 and 146
Failed --
New disk 146 is smaller than replaced disk 8
servicemag resume 0 8 -- Failed



sanhou01 cli% showpd -c -failed -degraded
                                ------- Normal Chunklets --------  ---- Spare Chunklets ----
                                - Used -  -------- Unused -------- - Used -  ---- Unused ----
 Id CagePos Type State    Total   OK Fail Free Uninit Unavail Fail   OK Fail Free Uninit Fail
  8 0:8:0?  FC   failed     408    0    0    0      0      63  326    0    0    0      0   19
 95 3:23:0  SSD  failed     185    0    0    0     35       0  127    0    0    0      0   23
101 4:5:0   FC   failed     408    0    0    0     54       0  336    0    0    0      0   18
109 4:13:0? FC   failed     408    0    0    0     90       0  318    0    0    0      0    0
126 0:8:0?  SSD  degraded   185    0    0  185      0       0    0    0    0    0      0    0
146 0:8:0   SSD  degraded   185    0    0    0      0     185    0    0    0    0      0    0
---------------------------------------------------------------------------------------------
  6 total                  1779    0    0  185    179     248 1107    0    0    0      0   60


 Post subject: Re: Failed disks. How fix errors and replace.
Posted: Fri Oct 04, 2019 2:37 am

Joined: Mon Sep 21, 2015 2:11 pm
Posts: 1570
Location: Europe
So....

you have 3 failed drives...
450GB 10k FC drives in 0:8:0 and 4:13:0
200GB 150k SSD in 3:23:0

Why are you still trying to push the SSD into 0:8:0? That slot is waiting for a 450GB 10k FC drive... The SSD needs to go into 3:23:0...
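
To put numbers on that, here is a rough Python sketch using only figures already posted in this thread. It is an illustration, not how servicemag actually works internally: `replacement_problems` and `chunklet_size_mb` are made-up helpers, and comparing drive type and chunklet count is an assumed stand-in for whatever checks the array really performs. PD 146 has no Size(MB) value anywhere in the thread, so its `total_mb` is left empty.

```python
# Figures copied from the showpd outputs in this thread:
# total_mb from `showpd -failed -degraded`, chunklets from `showpd -c`.
PDS = {
    8:   {"type": "FC",  "chunklets": 408, "total_mb": 417792},
    95:  {"type": "SSD", "chunklets": 185, "total_mb": 189440},
    146: {"type": "SSD", "chunklets": 185, "total_mb": None},  # size not posted
}


def chunklet_size_mb(pd):
    """Chunklet size implied by the posted totals (needs total_mb)."""
    return pd["total_mb"] // pd["chunklets"]


def replacement_problems(old, new):
    """List reasons why `new` looks unsuitable as a replacement for `old`.

    Hypothetical check: same drive type and at least as many chunklets.
    """
    problems = []
    if new["type"] != old["type"]:
        problems.append(f"type mismatch: {new['type']} replacing {old['type']}")
    if new["chunklets"] < old["chunklets"]:
        short = old["chunklets"] - new["chunklets"]
        problems.append(f"too small by {short} chunklets")
    return problems


print(chunklet_size_mb(PDS[8]), chunklet_size_mb(PDS[95]))
print("146 replacing 8: ", replacement_problems(PDS[8], PDS[146]))
print("146 replacing 95:", replacement_problems(PDS[95], PDS[146]))
```

Both posted totals work out to 1024 MB per chunklet, so the new SSD is short of PD 8 by 223 chunklets, far more than a few bytes. Against PD 95 in 3:23:0 the same check finds nothing to complain about.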

_________________
The views and opinions expressed are my own and do not necessarily reflect those of my current or previous employers.


 Post subject: Re: Failed disks. How fix errors and replace.
Posted: Thu Oct 10, 2019 2:25 pm

Joined: Sun Aug 25, 2019 10:11 pm
Posts: 13
Thanks Mammagut. I think I got confused about the disks and inserted the wrong one in the slot. I placed the SSD in 3:23:0 and it looks like it went fine, but the old PD ID is still showing up while the new PD is normal. I will post another thread.

