ML
    • Recent
    • Categories
    • Tags
    • Popular
    • Users
    • Groups
    • Register
    • Login

    RAID10 - Two Drive Failure

    Scheduled Pinned Locked Moved IT Discussion
    62 Posts 11 Posters 3.4k Views
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as topic
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • JaredBuschJ
      JaredBusch @scottalanmiller
      last edited by

      @scottalanmiller said in RAID10 - Two Drive Failure:

      @JaredBusch said in RAID10 - Two Drive Failure:

      @DustinB3403 said in RAID10 - Two Drive Failure:

      @aaronstuder What raid controller do you have?

      Exactly this. A real SMB system should be a hot plug. But we have no idea what you bought.

      Even hot plug often requires you to offline a disk before you take it out for the system to be clear as to what is happening. ZFS is hot swap, but still requires that, for example.

      I said it already, but hardware controllers in most systems have no interface to the OS in order to do anything. There is no way to offline a drive while running.

      coliverC scottalanmillerS DashrenderD 3 Replies Last reply Reply Quote 0
      • scottalanmillerS
        scottalanmiller @JaredBusch
        last edited by

        @JaredBusch said in RAID10 - Two Drive Failure:

        @scottalanmiller said in RAID10 - Two Drive Failure:

        @aaronstuder said in RAID10 - Two Drive Failure:

        Some information I am reading says I need to take the drive "offline" first, is this true?

        PowerEdge R720.

        If killing one in predictive failure, yes you generally offline it first. But this depends on the controller in question, the server doesn't matter.

        Obviously, depends on the controller, but the point of blind swap being standard choice for the SMB is that you simply swap the drives.

        Blind swap yes, hot swap no. Even some blind swap struggle with healthy drives being pulled, though.

        1 Reply Last reply Reply Quote 0
        • coliverC
          coliver @JaredBusch
          last edited by

          @JaredBusch said in RAID10 - Two Drive Failure:

          @scottalanmiller said in RAID10 - Two Drive Failure:

          @JaredBusch said in RAID10 - Two Drive Failure:

          @DustinB3403 said in RAID10 - Two Drive Failure:

          @aaronstuder What raid controller do you have?

          Exactly this. A real SMB system should be a hot plug. But we have no idea what you bought.

          Even hot plug often requires you to offline a disk before you take it out for the system to be clear as to what is happening. ZFS is hot swap, but still requires that, for example.

          I said it already, but hardware controllers in most systems have no interface to the OS in order to do anything. There is no way to offline a drive while running.

          Are blind swap and hot swap different? I could have sworn blind swap was the one where you don't offline a drive prior to replacing it.

          scottalanmillerS 1 Reply Last reply Reply Quote 0
          • scottalanmillerS
            scottalanmiller @JaredBusch
            last edited by

            @JaredBusch said in RAID10 - Two Drive Failure:

            @scottalanmiller said in RAID10 - Two Drive Failure:

            @JaredBusch said in RAID10 - Two Drive Failure:

            @DustinB3403 said in RAID10 - Two Drive Failure:

            @aaronstuder What raid controller do you have?

            Exactly this. A real SMB system should be a hot plug. But we have no idea what you bought.

            Even hot plug often requires you to offline a disk before you take it out for the system to be clear as to what is happening. ZFS is hot swap, but still requires that, for example.

            I said it already, but hardware controllers in most systems have no interface to the OS in order to do anything. There is no way to offline a drive while running.

            I'm not aware of any that don't have an OS interface available. Lots of people don't install one, but most have one.

            JaredBuschJ 1 Reply Last reply Reply Quote 1
            • wirestyle22W
              wirestyle22
              last edited by

              Blind Swapping: Generally a unique feature to hardware RAID systems. This is an extension of hot swapping that includes not needing to interact with the operating system first. Hot swapping alone does not imply that a lack of interaction is needed. Blind swapping is popular in large datacenters so that datacenter staff who do not have access to the operating system can replace failed drives without any interaction from the systems administrators.

              as per @scottalanmiller

              coliverC 1 Reply Last reply Reply Quote 2
              • scottalanmillerS
                scottalanmiller @coliver
                last edited by

                @coliver said in RAID10 - Two Drive Failure:

                @JaredBusch said in RAID10 - Two Drive Failure:

                @scottalanmiller said in RAID10 - Two Drive Failure:

                @JaredBusch said in RAID10 - Two Drive Failure:

                @DustinB3403 said in RAID10 - Two Drive Failure:

                @aaronstuder What raid controller do you have?

                Exactly this. A real SMB system should be a hot plug. But we have no idea what you bought.

                Even hot plug often requires you to offline a disk before you take it out for the system to be clear as to what is happening. ZFS is hot swap, but still requires that, for example.

                I said it already, but hardware controllers in most systems have no interface to the OS in order to do anything. There is no way to offline a drive while running.

                Are blind swap and hot swap different? I could have sworn blind swap was the one where you don't offline a drive prior to replacing it.

                Very.

                https://mangolassi.it/topic/6816/hot-swap-vs-blind-swap

                1 Reply Last reply Reply Quote 0
                • JaredBuschJ
                  JaredBusch
                  last edited by

                  That said, if you have properly set up the tools like Dell OMSA, then you can do it there.
                  0_1499273197058_b9fdeb89-f17a-48ca-a65a-015f4dedea1d-image.png

                  1 Reply Last reply Reply Quote 0
                  • coliverC
                    coliver @wirestyle22
                    last edited by

                    @wirestyle22 said in RAID10 - Two Drive Failure:

                    Blind Swapping: Generally a unique feature to hardware RAID systems. This is an extension of hot swapping that includes not needing to interact with the operating system first. Hot swapping alone does not imply that a lack of interaction is needed. Blind swapping is popular in large datacenters so that datacenter staff who do not have access to the operating system can replace failed drives without any interaction from the systems administrators.

                    as per @scottalanmiller

                    That's right.

                    1 Reply Last reply Reply Quote 1
                    • dafyreD
                      dafyre
                      last edited by

                      Isn't there a SMART tool that will scan the drives on the RAID controller and flash the LEDs on ones with errors?

                      scottalanmillerS 1 Reply Last reply Reply Quote 0
                      • scottalanmillerS
                        scottalanmiller @dafyre
                        last edited by

                        @dafyre said in RAID10 - Two Drive Failure:

                        Isn't there a SMART tool that will scan the drives on the RAID controller and flash the LEDs on ones with errors?

                        No, nothing can see through a RAID controller. If you want to do a SMART scan, the RAID controller itself must do it.

                        1 Reply Last reply Reply Quote 0
                        • JaredBuschJ
                          JaredBusch @scottalanmiller
                          last edited by

                          @scottalanmiller said in RAID10 - Two Drive Failure:

                          @JaredBusch said in RAID10 - Two Drive Failure:

                          @scottalanmiller said in RAID10 - Two Drive Failure:

                          @JaredBusch said in RAID10 - Two Drive Failure:

                          @DustinB3403 said in RAID10 - Two Drive Failure:

                          @aaronstuder What raid controller do you have?

                          Exactly this. A real SMB system should be a hot plug. But we have no idea what you bought.

                          Even hot plug often requires you to offline a disk before you take it out for the system to be clear as to what is happening. ZFS is hot swap, but still requires that, for example.

                          I said it already, but hardware controllers in most systems have no interface to the OS in order to do anything. There is no way to offline a drive while running.

                          I'm not aware of any that don't have an OS interface available. Lots of people don't install one, but most have one.

                          That is installing 3rd party software, not an OS interface.

                          scottalanmillerS 1 Reply Last reply Reply Quote 1
                          • JaredBuschJ
                            JaredBusch @scottalanmiller
                            last edited by

                            @scottalanmiller said in RAID10 - Two Drive Failure:

                            @JaredBusch said in RAID10 - Two Drive Failure:

                            @scottalanmiller said in RAID10 - Two Drive Failure:

                            @aaronstuder said in RAID10 - Two Drive Failure:

                            Drive are 1 and 3 are in "predictive failure" , I am assuming the pairs are 0+1 and 2+3.

                            Why?

                            Why what? Assuming? Because he did not document and most hardware RAID controllers are not accessible except during the boot process.

                            Why does he feel that they are in the sets that they are? I hear people say that all the time, and it turns out that most people say that without basing it on anything at all.

                            Because by default, when using the PERC controllers from dell, it makes the array that way. Yes, it is horrible to assume that, but it is how the PERC controller does things by default.

                            @aaronstuder you see the spans in OMSA here.
                            0_1499273736022_b839c31c-6e43-4258-aa1b-77d81d4bcab6-image.png

                            1 Reply Last reply Reply Quote 0
                            • scottalanmillerS
                              scottalanmiller @JaredBusch
                              last edited by

                              @JaredBusch said in RAID10 - Two Drive Failure:

                              @scottalanmiller said in RAID10 - Two Drive Failure:

                              @JaredBusch said in RAID10 - Two Drive Failure:

                              @scottalanmiller said in RAID10 - Two Drive Failure:

                              @JaredBusch said in RAID10 - Two Drive Failure:

                              @DustinB3403 said in RAID10 - Two Drive Failure:

                              @aaronstuder What raid controller do you have?

                              Exactly this. A real SMB system should be a hot plug. But we have no idea what you bought.

                              Even hot plug often requires you to offline a disk before you take it out for the system to be clear as to what is happening. ZFS is hot swap, but still requires that, for example.

                              I said it already, but hardware controllers in most systems have no interface to the OS in order to do anything. There is no way to offline a drive while running.

                              I'm not aware of any that don't have an OS interface available. Lots of people don't install one, but most have one.

                              That is installing 3rd party software, not an OS interface.

                              IF by third party you mean the tools from the vendor, yes. It's basically a driver extension.

                              JaredBuschJ 1 Reply Last reply Reply Quote 0
                              • JaredBuschJ
                                JaredBusch @scottalanmiller
                                last edited by

                                @scottalanmiller said in RAID10 - Two Drive Failure:

                                @JaredBusch said in RAID10 - Two Drive Failure:

                                @scottalanmiller said in RAID10 - Two Drive Failure:

                                @JaredBusch said in RAID10 - Two Drive Failure:

                                @scottalanmiller said in RAID10 - Two Drive Failure:

                                @JaredBusch said in RAID10 - Two Drive Failure:

                                @DustinB3403 said in RAID10 - Two Drive Failure:

                                @aaronstuder What raid controller do you have?

                                Exactly this. A real SMB system should be a hot plug. But we have no idea what you bought.

                                Even hot plug often requires you to offline a disk before you take it out for the system to be clear as to what is happening. ZFS is hot swap, but still requires that, for example.

                                I said it already, but hardware controllers in most systems have no interface to the OS in order to do anything. There is no way to offline a drive while running.

                                I'm not aware of any that don't have an OS interface available. Lots of people don't install one, but most have one.

                                That is installing 3rd party software, not an OS interface.

                                IF by third party you mean the tools from the vendor, yes. It's basically a driver extension.

                                It is 3rd party to the OS. I clearly stated, many times, that there are no OS tools for this.

                                scottalanmillerS 1 Reply Last reply Reply Quote 0
                                • scottalanmillerS
                                  scottalanmiller @JaredBusch
                                  last edited by

                                  @JaredBusch said in RAID10 - Two Drive Failure:

                                  @scottalanmiller said in RAID10 - Two Drive Failure:

                                  @JaredBusch said in RAID10 - Two Drive Failure:

                                  @scottalanmiller said in RAID10 - Two Drive Failure:

                                  @JaredBusch said in RAID10 - Two Drive Failure:

                                  @scottalanmiller said in RAID10 - Two Drive Failure:

                                  @JaredBusch said in RAID10 - Two Drive Failure:

                                  @DustinB3403 said in RAID10 - Two Drive Failure:

                                  @aaronstuder What raid controller do you have?

                                  Exactly this. A real SMB system should be a hot plug. But we have no idea what you bought.

                                  Even hot plug often requires you to offline a disk before you take it out for the system to be clear as to what is happening. ZFS is hot swap, but still requires that, for example.

                                  I said it already, but hardware controllers in most systems have no interface to the OS in order to do anything. There is no way to offline a drive while running.

                                  I'm not aware of any that don't have an OS interface available. Lots of people don't install one, but most have one.

                                  That is installing 3rd party software, not an OS interface.

                                  IF by third party you mean the tools from the vendor, yes. It's basically a driver extension.

                                  It is 3rd party to the OS. I clearly stated, many times, that there are no OS tools for this.

                                  You generally get the RAID drivers in the same way, it's third party hardware to the OS. In many cases, though, I think that there are built in tools. Doesn't the Smart Array controllers get management via OS updates on Linux?

                                  1 Reply Last reply Reply Quote 0
                                  • T
                                    Texkonc
                                    last edited by

                                    How about answering his question.
                                    If you do not know what the pairs are, don't assume. Replace 1, rinse and repeat. General rule of thumb that is, error on side of caution.
                                    Predictive Failure is just a SMART report saying something doesn't add up on sectors. Order and replace ASAP.

                                    1 Reply Last reply Reply Quote 0
                                    • A
                                      Alex Sage
                                      last edited by

                                      RAID Controller is a H710

                                      scottalanmillerS 1 Reply Last reply Reply Quote 0
                                      • scottalanmillerS
                                        scottalanmiller @Alex Sage
                                        last edited by

                                        @aaronstuder said in RAID10 - Two Drive Failure:

                                        RAID Controller is a H710

                                        Definitely blind swap capable.

                                        1 Reply Last reply Reply Quote 0
                                        • DashrenderD
                                          Dashrender @JaredBusch
                                          last edited by Dashrender

                                          @JaredBusch said in RAID10 - Two Drive Failure:

                                          @scottalanmiller said in RAID10 - Two Drive Failure:

                                          @JaredBusch said in RAID10 - Two Drive Failure:

                                          @DustinB3403 said in RAID10 - Two Drive Failure:

                                          @aaronstuder What raid controller do you have?

                                          Exactly this. A real SMB system should be a hot plug. But we have no idea what you bought.

                                          Even hot plug often requires you to offline a disk before you take it out for the system to be clear as to what is happening. ZFS is hot swap, but still requires that, for example.

                                          I said it already, but hardware controllers in most systems have no interface to the OS in order to do anything. There is no way to offline a drive while running.

                                          I'm guessing this is an issue with hypervisors? my older bare metal servers all (HP and IBM - not Lenovo) had utilities that allowed RAID controller management from within the OS. I haven't seen a way to manage then when a hypervisor is in place though.

                                          Anyone know if there is a way, maybe through iLo or iDRAC, etc?

                                          My comments need to be read with the understanding that the OS itself doesn't talk to the RAID controller, but 3rd party drivers do.

                                          That said - the OS doesn't often talk to disks either, they talk via third party drivers.

                                          scottalanmillerS 1 Reply Last reply Reply Quote 0
                                          • scottalanmillerS
                                            scottalanmiller @Dashrender
                                            last edited by

                                            @Dashrender said in RAID10 - Two Drive Failure:

                                            I'm guessing this is an issue with hypervisors? my older bare metal servers all (HP and IBM - not Lenovo) had utilities that allowed RAID controller management from within the OS. I haven't seen a way to manage then when a hypervisor is in place though.

                                            Still do, he's saying that that is third party software, is all. Third party to the OS, first party to the controller.

                                            1 Reply Last reply Reply Quote 0
                                            • 1
                                            • 2
                                            • 3
                                            • 4
                                            • 3 / 4
                                            • First post
                                              Last post