Control Station problem or not?

Question

Hi,

We use a redundant NX4 system, I could not reach a the firs Control station over http, but I can reach it on ssh. (The second work well)

I try to use "/nas/http/nas_ezadm/etc/script restart" command, but nothing happend...

After reboot I got a message:

Message from syslogd@nx4cs at Fri May 13 11:48:19 2011 ...
nx4cs EMCServer: ReportEvent: CCMD 70338543628:CS_PLATFORM(6):MasterControl(129):EMERGENCY(0):12

------------------

The getreason command output is:

10 - slot_0 primary control station
4 - slot_2 configured
4 - slot_3 configured

--------------------------

The /nas/bin/nas_checkup command output is:

[root@nx4cs sbin]# /nas/bin/nas_checkup
Check Version: 5.6.47.11
Check Command: /nas/bin/nas_checkup
Check Log : /nas/log/checkup-run.110511-125415.log

-------------------------------------Checks-------------------------------------
Control Station: Checking if NBS clients are started....................... Fail
Control Station: Checking if NBS configuration exists...................... Pass
Control Station: Checking if NBS devices are accessible.................... Fail
Control Station: Checking if NBS service is started........................ Fail
Control Station: Checking if NAS partitions are mounted.................... Pass
Data Movers : Checking status........................................... Pass
--------------------------------------------------------------------------------

One or more error-level checks have failed. Follow the instructions
below to correct the problem and try again.

-------------------------------------Errors-------------------------------------
Control Station: Check if NBS clients are started
Symptom: NBS clients (nd-clnt 0 1 and nd-clnt 4 5 and nd-clnt 5 6) are
not started
Action : Contact EMC Customer Service and refer to EMC Knowledgebase
emc146016. Include this log with your support request.

Control Station: Check if NBS devices are accessible
Symptom: Failed NBS (nd-clnt processes) devices access check

         NOTE: Several checks depend on NBS device access to run. These checks
               were not run.
Action :
         1. This may occur if NBS is not configured correctly or if the NBS
            service is not started. Look in the "Checks" section to see if the
            following checks passed:

            * Control Station: Check if NBS configuration exists
            * Control Station: Check if NBS service is started
            * Control Station: Check if NBS clients are started

         If either of those checks did not pass, follow the instructions for
         that check to correct the problem, then rerun "nas_checkup" to verify
         that the NBS devices can now be accessed.
         2. This may also occur if Data Movers are powered down or pulled out.
            If you are on the primary Control Station, look in the "Checks"
            section to see if the following check passed:

* Data Movers: Check status

         If this check failed, follow its instructions to correct the
         problem, then rerun "nas_checkup" to verify that the NBS devices can
         now be accessed.
         3. If the problem persists, escalate this issue through your support
            organization. Provide this output and any errors or output that
            occurred running the commands in this procedure in the escalation.

Control Station: Check if NBS service is started
Symptom: NBS (nd-clnt processes) service is not (or not fully) started
Action : Use the command "/sbin/service nbs start" to restart the NBS
service or reboot the Control Station.

----------------------

/sbin/service nbs start not solved our problem....

So is there any idea?

B4FQqUoIrW12094 · Answer

Post nas_checkup and server_checkup

laszlo_ · Answer

nas_checkup reason is above.

vanderson1 · Answer

Hello all, any resolution to this issue? I have the same exact issue with a newly deployed NS-480. Everything was working fine and now can't access the web gui but can access via ssh - same output as above of nas_checkup.

server_checkup output :

NAS_DB environment not defined

getreason output:

10 - slot_0 primary control station
5 - slot_2 contacted
5 - slot_3 contacted

Thanks in advance for any help.

vanderson1 · Answer

Thanks for the reply. FYI it is an NS480 and here is the output when run as nasadmin:

NAS DB is needed to execute the current command but is unavailable at this time

I have rebooted the control station several times and same result.

I opened a live chat and they weren't able to help and escalated to phone support but haven't gotten a call back yet. I will try another method of getting support, just hoping someone else solved this problem already.

Thanks

Rainer_EMC · Answer

what system are you using ? The NX4 never supported more than on control station. Rainer

Rainer_EMC · Answer

that NAS_DB message is an indication that you did NOT login as nasadmin so you dont have the NASDB env and path defined that is necessary for all of the server_* and nas_* commands to work.

login as nasadmin and try nas_checkup again

next thing - if you have a problem with the GUI - just reboot the control station

If you still have a problem then - open a service request with support

Rainer

Rainer_EMC · Answer

Then most likely NAS services aren’t running – i.e. most server_* command wouldn’t work and /(nbs)nas isn’t mounted.

Your best course of action there is to work directly with support – this isn’t really feasible to solve via a forum.

Rainer

vanderson1 · Answer

Thanks Rainer_EMC for your help.

I did find the solution in a semi-related knowledge base article: emc222093

The data mover needed to be reset with the following command (this system only has one DM in slot 2):

/nas/sbin/setup_slot -init 2

Funny thing is the re-initialization completed and I was able to access it normally just as support called...

For those of you that find yourself in this situation in the future, please take notice that this was a recently deployed system with no active data and was *NOT* in production yet. I don't know what the consequences of that command would be on a live system, so check with support before calling it.

VNX

Control Station problem or not?

Was this post helpful?