CEOs Blog

April 17, 2010

Tandem Machine.

Filed under: Uncategorized — admin @ 12:35 pm

I have managed to salvage a Tandem machine running NonStop Kernel (NSK).

My understanding is that it uses a MIPS R4000 processor paired with a second processor in lock-step redundancy. If the two processors' results differ, the entire processor module is flagged as failed and fail-over to another processor module commences. I believe the model here is an “S-series”, an S72000. The architecture is very interesting and relies on concepts like “worm-hole routing”. It implements Non-Uniform Memory Access (NUMA) but, unlike my 24-CPU Onyx2 with its CrayLink hypercube routing, it is not cache coherent. Most code is written in COBOL, TAL, TALC or the like.
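To make the lock-step idea concrete, here is a toy sketch in Python. It is only an illustration of "compare two results, fail the whole module on a mismatch, move on to the next module" and says nothing about how the real hardware does it. The names ProcessorModule and run_with_failover are made up for this example.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class ProcessorModule:
    """Toy model of one module: two CPUs that should always agree."""
    name: str
    cpu_a: Callable[[int], int]
    cpu_b: Callable[[int], int]
    failed: bool = False

    def run(self, x: int) -> int:
        a, b = self.cpu_a(x), self.cpu_b(x)
        if a != b:
            # There is no way to tell which CPU is wrong, so the whole
            # module is flagged as failed.
            self.failed = True
            raise RuntimeError(f"{self.name}: lock-step mismatch")
        return a


def run_with_failover(modules: List[ProcessorModule], x: int) -> int:
    """Try each healthy module in turn, skipping any flagged as failed."""
    for module in modules:
        if module.failed:
            continue
        try:
            return module.run(x)
        except RuntimeError:
            continue  # fail over to the next module
    raise RuntimeError("no healthy processor module left")


def good(x: int) -> int:
    return x * 2


def faulty(x: int) -> int:
    return x * 2 + 1  # simulated hardware fault


modules = [
    ProcessorModule("module0", good, faulty),  # CPUs disagree
    ProcessorModule("module1", good, good),    # CPUs agree
]
result = run_with_failover(modules, 21)
print(result, [m.failed for m in modules])
```

In this toy run the first module is marked failed and the answer comes from the second one.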

tandem3

tandem-back

db25_yost_dce

RJ45toRS232

In order to get the system console output I had to build a special cable. This is done by filing down one end of an RJ-45 (Ethernet connector) until it fits into the socket and soldering a DB9 or DB25 serial (RS-232) connector onto the other end. Then, using a terminal program (19200 baud, 8,n,1), connect it to one or the other of the processor modules. The console follows the cable if you move it from one processor module to another.
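For anyone unfamiliar with the "19200 baud, 8,n,1" shorthand, the small Python sketch below unpacks it: 8 data bits, no parity, 1 stop bit. It assumes the usual asynchronous serial framing with one start bit per character, and that one baud carries one bit, as on a plain RS-232 line. The helper names are mine and not from any terminal program.

```python
def parse_framing(spec: str):
    """Split a spec such as "8,n,1" into (data_bits, parity, stop_bits)."""
    data_bits, parity, stop_bits = (part.strip() for part in spec.split(","))
    return int(data_bits), parity.upper(), int(stop_bits)


def bits_per_character(data_bits: int, parity: str, stop_bits: int) -> int:
    """One start bit, the data bits, an optional parity bit, then stop bits."""
    parity_bits = 0 if parity == "N" else 1
    return 1 + data_bits + parity_bits + stop_bits


BAUD = 19200  # line speed used for the console
framing = parse_framing("8,n,1")
frame_bits = bits_per_character(*framing)
chars_per_second = BAUD // frame_bits

print(framing, frame_bits, chars_per_second)
```

Under those assumptions each character costs 10 bits on the wire, so the console tops out at 1920 characters per second.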

I had inserted the boards into the wrong slots so there was a slew of errors as shown by the output below.

Once I got the machine working, I had to set up another subnet in order to telnet to the machine (via Ethernet).

Connected

*** Starting POST LVL1 ***

Test Basic byte, word, lond word cycle to ram adr 0x2000000

Test passed

Testing batt_wr signal to BBRAM.

Test passed

Test passed

Verify 1K segment ENVIRON bbram area 0.

Verify 1K segment ERRLOG bbram area 1.

Verify 2K segment INITAL bbram area 2.

Verify 2K segment TESTIT bbram area 4.

Verify 2K segment SPARED bbram area 5.

Test passed

Testing protect register at 0x6fff819

Test passed

Testing protect register at 0x6fff825

Test passed

Testing protect register arming.

Test passed

Testing protect register at 0x6fff82a

Test passed

Testing QAGENT Config register at 0x6800004

Test passed

Starting Ram Bus Test at 0x2000000

Test passed

Walking address bit test using longs at 0x2000000 to 0x2400000

Test passed

Resetting IDMA port 2.

Test passed

IDMA port2 BYTE transfer test, 0x2300000 to 0x2310000, bytecnt 0x100

Test passed

IDMA port2 WORD transfer test, 0x2300000 to 0x2310000, bytecnt 0x100

Test passed

IDMA port2 LONG transfer test, 0x2300000 to 0x2310000, bytecnt 0x100

Test passed

Parity Ram Bus Testing using even parity generation checking.

Test passed

Parity Ram Bus Testing using odd parity generation checking.

Test passed

Reset BSM, put into byte mode, verify ID.

Test passed

Ethernet External Loopback Test ...

Test passed

*** POST LVL1 Successfully Completed ***

*** Restarting the system ...

Initializing SP filesystem...

Filesystem initialization complete.

SHEL:    , Executing Whitney SPstrt_norm.sh

spsh:
   , Info ... Kaweah POST level 1 enabled, run "postdisable" to disable.

SHEL:    , Executing SPstrt_norm.sh for FTSP

SHEL:    , ------ Whitney NSK start-up ------ 

SP_versions:

SP Wall Time = 000 00:00:03.00

SW component    Version Proc                                  Checksum 

--------------  --------------------------------------------  -----------------

LLinit          T1088G06^02DEC02^11DEC02^ABI                  337E8AA3:4C4542B5

LLBoot0         T1088G06^02DEC02^11DEC02^ABI                  3AEBCBF8:9B631F42

LLBoot1         T1088G06^02DEC02^11DEC02^ABI                  3667C74F:A309245E

SPsystem        T1089G06^02DEC02^11DEC02^ABI                  f74fdc62         

Millicode       T8461G05^12OCT00^AAE^12Oct00                  N/A              

ISP Firmware    T1067F40^22SEP00^TREV=01^AAO                  N/A              

ISP2 Firmware   T0480G06^11MAY00^TREV=01^AAA                  N/A              

SHEL:    , Starting Common Daemons 

SHEL:    , Starting   - SMBD 

SHEL:    , Starting   - P2P and Portmap 

SHEL:    , Starting   - IPRPC, RSH and TELNET

SHEL:    ,   RSH Disabled 

SHEL:    , Starting   - ALARMD and TNTCFGD 

logd:    , Info ... semaphore name = LOG_sysHostEvLog 

logd:    , Info ... semaphore id = 6 

SHEL:    , Starting   - NFTSPD 

nfts:    , Info ... shgd @ 0x 21245c4 

SHEL:    , Starting   - SPCON 

SHEL:    , Waiting For Sync from Peer 

SP1: mq_remote_send_message error 1

SHEL:    , Sync Completed 

SHEL:    , Starting   - HCD 

HCD: debug printing enabled

SHEL:    , Checking for any previous bus error dump - please wait ...

SHEL:    , Bus error dump check completed

SHEL:    , Running checkhashdata...

No valid hash data file.

Looking for file hash index orphans and mismatches...

Hash orphan checks complete (took 0.400 sec).

SHEL:    , Wait for takeover

SHEL:    , Taking over as primary SP

E-getstackid[040] 0230F582 00:0000019F 0307 : SMB Error                                00000000 00000080

gets: , Error 0040 [prescale:06 speed:892 kHz]

    TX [len= 12] : 07 ff 85 ff d6 ad 92 00 d9 b2 47 00 

    RX [len= 12] : 00 ff 00 00 00 00 00 00 00 00 00 00 

E-getstackid[040] 0231003C 00:0000019F 030A : SMB Error                                00000020 00000085

stackIdGetPMCU: Peer PMCU invalid mreg read.

Stack ID: 1

mq_remote_send_message error 1

SHEL:    , TNETCLUSTER  = 8

SHEL:    , Starting   - M2ED 

SHEL:    , Starting   - CPUMOND 

spsh:    , Info ... ssa_reset on slot = 55 

spsh:    , Info ... mod_su on slot = 1003 

TNT :    , Cluster #  = 8

TNT :    , System #   = 1

TNT :    , Stack ID # = 1

TNT :    , CPU 0 TNetid = 0xe003f

TNT :    , CPU 1 TNetid = 0xe007f

mq_remote_send_message error 1

TNT :    , SP TNetid = 0xe00c0

TNT :
   , Cluster #  = 8

TNT :
   , System #   = 1

TNT :
   , Stack ID # = 1

TNT :
   , CPU 1 TNetid = 0xe007f

TNT :
   , SP TNetid = 0xe00c0

CHK :
   , Checking state of the PMF Leconte CRU in slot 1.1.1.55 

Minimum Required SP Firmware level for CRU 1.01.1.55 = 0

Minimum Required CPU Bootcode level for CRU 1.01.1.55 = 0

not configured.

INI :
   , Initializing PMF Leconte CRU in slot 1.1.1.55...

Minimum Required SP Firmware level for CRU 1.01.1.55 = 0

Minimum Required CPU Bootcode level for CRU 1.01.1.55 = 0

INI :
   , issuing scan for extest to ABTs 

07 06 02 

spsh:    , Info ... RTR Tbl Init/Config for CRU 1.1.1.55  -- total time for 1 asic(s) = 0.0mS 

INI :
   , Resetting margins for main rail

INI :
   , Completed Kaweah initializing

INI :
   , Starting Leconte initializing

INI :
   , Completed initializing

ESC :
   , Starting ESC configuration for PMF leconte CRU in slot 1.1.1.55

ESC :
   , Init MRouters...

ESC :
   , Mrouter 3 

ESC :
   , Mrouter 2 

ESC :
   , Mrouter 1 

ESC :
 1.1.1.55 bulk OK

ESC :
   , PMF_L.v2 PPIN CRU_PRES = 0x0

ESC :
   , completed ESC configuration

 init scsi [0] = 0

    :    , Info ... Init Passed on board 1.1.1.55.

    :    , Info ... Starting TNet Config on board 1.1.1.55.

tntc:    , Info ... cfg for 1.1.1.55 0x8600 1   [    30 mS] 

tntc:    , Info ... cfg for 1.1.1.55 0x8500 1   [    10 mS] 

tntc:    , Info ... cfg for 1.1.1.55 0x8100 1   [     0 mS] 

tntc:    , Info ... cfg for 1.1.1.55 0x8100 2   [     0 mS] 

    :    , Info ... TNET Config Passed on board 1.1.1.55.

CFG :
   , Configuring PMF_L CRU in slot 1.1.1.55

CFG :
   , Beginning KAWEAH configuration

CFG :
   , NSK OS specific.. clean sdcsr3 and place SVT entries

CFG :
   , PASSED KAWEAH configuration

CFG :
   , PMF_L configuration complete

    :    , Info ... Config Passed on board 1.1.1.55.

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 00000000 in slot 50 

spsh:    ,          cru is present in slot 50 

E-rebuildrestable[046] 0230F582 00:00000268 0307 : SMB Error                                00000000 00000080

rebu: , Error 0040 [prescale:06 speed:892 kHz]

    TX [len= 21] : 07 ff 80 ff d6 ad 92 00 d9 b2 47 00 34 ff ff ff 

                   54 00 80 7f ff 

    RX [len= 21] : 04 ff 00 00 00 00 00 00 00 00 00 00 00 00 00 00 

                   00 00 00 00 00 

spsh:    ,          cru not powered on in slot 50 

spsh:    ,          dbgrm for slot 50 

spsh:    ,          dbgadd for slot 50 

enablePmfCru()

E-rebuildrestable[046] 0230F582 00:00000272 0307 : SMB Error                                00000000 00000080

rebu: , Error 0040 [prescale:06 speed:892 kHz]

    TX [len= 21] : 07 ff 80 ff d6 ad 92 00 d9 b2 47 00 34 ff ff ff 

                   54 00 80 7f ff 

    RX [len= 21] : 04 ff 00 00 00 00 00 00 00 00 00 00 00 00 00 00 

                   00 00 00 00 00 

    :    , Info ... Pon Failed on board 1.1.1.50.

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 00000000 in slot 51 

spsh:    ,          cru is present in slot 51 

E-rebuildrestable[046] 0230F582 00:00000273 0307 : SMB Error                                00000000 00000080

rebu: , Error 0040 [prescale:06 speed:892 kHz]

    TX [len= 21] : 04 ff 80 ff d6 ad 92 00 d9 b2 47 00 34 ff ff ff 

                   54 00 80 7f ff 

    RX [len= 21] : 04 ff 00 00 00 00 00 00 00 00 00 00 00 00 00 00 

                   00 00 00 00 00 

spsh:    ,          cru not powered on in slot 51 

spsh:    ,          dbgrm for slot 51 

spsh:    ,          dbgadd for slot 51 

    :    , Info ... Pon Passed on board 1.1.1.51.

INI :    , Initializing IOCE4 CRU in slot 1.1.1.51...

INI :    , Scanning via TLR to ERouter (reset problem)

spsh:    , Info ... RTR Tbl Init/Config for CRU 1.1.1.51  -- total time for 1 asic(s) = 20.0mS 

INI :    , Completed initializing

ESC :    , Starting ESC configuration for IOCE4 CRU in slot 1.1.1.51

ESC :    , completed ESC configuration

    :    , Info ... Init Passed on board 1.1.1.51.

    :    , Info ... Starting TNet Config on board 1.1.1.51.

tntc:    , Info ... cfg for 1.1.1.51 0x8600 1   [    70 mS] 

tntc:    , Info ... cfg for 1.1.1.51 0x8700 1   [     0 mS] 

tntc:    , Info ... cfg for 1.1.1.51 0x8700 2   [     0 mS] 

    :    , Info ... TNET Config Passed on board 1.1.1.51.

CFG :    , Configuring IOCE4 CRU in slot 1.1.1.51

CFG :    , Completed configuration

    :    , Info ... Config Passed on board 1.1.1.51.

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 00000000 in slot 52 

spsh:    ,          cru is present in slot 52 

E-rebuildrestable[046] 0230F582 00:000002BB 0307 : SMB Error                                00000000 00000080

rebu: , Error 0040 [prescale:06 speed:892 kHz]

    TX [len= 21] : 83 ff 80 ff d6 ad 92 00 d9 b2 47 00 34 ff ff ff 

                   54 00 80 7f ff 

    RX [len= 21] : 04 ff 00 00 00 00 00 00 00 00 00 00 00 00 00 00 

                   00 00 00 00 00 

spsh:    ,          cru not powered on in slot 52 

spsh:    ,          dbgrm for slot 52 

spsh:    ,          dbgadd for slot 52 

    :    , Info ... Pon Passed on board 1.1.1.52.

INI :    , Initializing SEB CRU in slot 1.1.1.52...

INI :    , SEB 1.1.1.52 default port: 2

spsh:    , Info ... RTR Tbl Init/Config for CRU 1.1.1.52  -- total time for 1 asic(s) = 0.0mS 

INI :    , Completed initializing

ESC :    , Starting ESC configuration for SEB CRU in slot 1.1.1.52

ESC :    , completed ESC configuration

    :    , Info ... Init Passed on board 1.1.1.52.

    :    , Info ... Starting TNet Config on board 1.1.1.52.

tntc:    , Info ... cfg for 1.1.1.52 0x8600 1   [    60 mS] 

    :    , Info ... TNET Config Passed on board 1.1.1.52.

CFG :    , Configuring SEB CRU in slot 1.1.1.52

CFG :    , Completed configuration

    :    , Info ... Config Passed on board 1.1.1.52.

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 00000000 in slot 53 

spsh:    ,          cru is present in slot 53 

E-rebuildrestable[046] 0230F582 00:000002E2 0307 : SMB Error                                00000000 00000080

rebu: , Error 0040 [prescale:06 speed:892 kHz]

    TX [len= 21] : 02 ff 80 ff d6 ad 92 00 d9 b2 47 00 34 ff ff ff 

                   54 00 80 7f ff 

    RX [len= 21] : 04 ff 00 00 00 00 00 00 00 00 00 00 00 00 00 00 

                   00 00 00 00 00 

spsh:    ,          cru not powered on in slot 53 

spsh:    ,          dbgrm for slot 53 

spsh:    ,          dbgadd for slot 53 

    :    , Info ... Pon Passed on board 1.1.1.53.

INI :    , Initializing SEB CRU in slot 1.1.1.53...

INI :    , SEB 1.1.1.53 default port: 0

spsh:    , Info ... RTR Tbl Init/Config for CRU 1.1.1.53  -- total time for 1 asic(s) = 0.0mS 

INI :    , Completed initializing

ESC :    , Starting ESC configuration for SEB CRU in slot 1.1.1.53

ESC :    , completed ESC configuration

    :    , Info ... Init Passed on board 1.1.1.53.

    :    , Info ... Starting TNet Config on board 1.1.1.53.

tntc:    , Info ... cfg for 1.1.1.53 0x8600 1   [    70 mS] 

    :    , Info ... TNET Config Passed on board 1.1.1.53.

CFG :    , Configuring SEB CRU in slot 1.1.1.53

CFG :    , Completed configuration

    :    , Info ... Config Passed on board 1.1.1.53.

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 00000000 in slot 54 

spsh:    ,          cru is present in slot 54 

E-rebuildrestable[046] 0230F582 00:00000309 0307 : SMB Error                                00000000 00000080

rebu: , Error 0040 [prescale:06 speed:892 kHz]

    TX [len= 21] : 01 ff 80 ff d6 ad 92 00 d9 b2 47 00 34 ff ff ff 

                   54 00 80 7f ff 

    RX [len= 21] : 04 ff 00 00 00 00 00 00 00 00 00 00 00 00 00 00 

                   00 00 00 00 00 

spsh:    ,          cru not powered on in slot 54 

spsh:    ,          dbgrm for slot 54 

spsh:    ,          dbgadd for slot 54 

    :    , Info ... Pon Passed on board 1.1.1.54.

INI :    , Initializing IOCE4 CRU in slot 1.1.1.54...

INI :    , Scanning via TLR to ERouter (reset problem)

spsh:    , Info ... RTR Tbl Init/Config for CRU 1.1.1.54  -- total time for 1 asic(s) = 10.0mS 

INI :    , Completed initializing

ESC :    , Starting ESC configuration for IOCE4 CRU in slot 1.1.1.54

ESC :    , completed ESC configuration

    :    , Info ... Init Passed on board 1.1.1.54.

    :    , Info ... Starting TNet Config on board 1.1.1.54.

tntc:    , Info ... cfg for 1.1.1.54 0x8600 1   [    70 mS] 

tntc:    , Info ... cfg for 1.1.1.54 0x8700 1   [     0 mS] 

tntc:    , Info ... cfg for 1.1.1.54 0x8700 2   [     0 mS] 

    :    , Info ... TNET Config Passed on board 1.1.1.54.

CFG :    , Configuring IOCE4 CRU in slot 1.1.1.54

CFG :    , Completed configuration

    :    , Info ... Config Passed on board 1.1.1.54.

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 0000000c in slot 1 

spsh:    ,          cru is present in slot 1 

    :    , Info ... Pon Passed on board 1.1.1.1.

    :    , Info ... Starting TNet Config on board 1.1.1.1.

    :    , Info ... TNET Config Passed on board 1.1.1.1.

    :    , Info ... Config Passed on board 1.1.1.1.

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 0000000c in slot 2 

spsh:    ,          cru is present in slot 2 

    :    , Info ... Pon Passed on board 1.1.1.2.

    :    , Info ... Starting TNet Config on board 1.1.1.2.

    :    , Info ... TNET Config Passed on board 1.1.1.2.

    :    , Info ... Config Passed on board 1.1.1.2.

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 0000000c in slot 3 

spsh:    ,          cru is present in slot 3 

    :    , Info ... Pon Passed on board 1.1.1.3.

    :    , Info ... Starting TNet Config on board 1.1.1.3.

    :    , Info ... TNET Config Passed on board 1.1.1.3.

    :    , Info ... Config Passed on board 1.1.1.3.

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 0000000c in slot 4 

spsh:    ,          cru is present in slot 4 

    :    , Info ... Pon Passed on board 1.1.1.4.

    :    , Info ... Starting TNet Config on board 1.1.1.4.

    :    , Info ... TNET Config Passed on board 1.1.1.4.

    :    , Info ... Config Passed on board 1.1.1.4.

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 0000000c in slot 5 

spsh:    ,          cru is present in slot 5 

    :    , Info ... Pon Passed on board 1.1.1.5.

    :    , Info ... Starting TNet Config on board 1.1.1.5.

    :    , Info ... TNET Config Passed on board 1.1.1.5.

    :    , Info ... Config Passed on board 1.1.1.5.

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 0000000c in slot 6 

spsh:    ,          cru not present in slot 6 

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 0000000c in slot 7 

spsh:    ,          cru is present in slot 7 

    :    , Info ... Pon Passed on board 1.1.1.7.

    :    , Info ... Starting TNet Config on board 1.1.1.7.

    :    , Info ... TNET Config Passed on board 1.1.1.7.

    :    , Info ... Config Passed on board 1.1.1.7.

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 0000000c in slot 8 

spsh:    ,          cru is present in slot 8 

    :    , Info ... Pon Passed on board 1.1.1.8.

    :    , Info ... Starting TNet Config on board 1.1.1.8.

    :    , Info ... TNET Config Passed on board 1.1.1.8.

    :    , Info ... Config Passed on board 1.1.1.8.

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 0000000c in slot 11 

spsh:    ,          cru is present in slot 11 

    :    , Info ... Pon Passed on board 1.1.1.11.

    :    , Info ... Starting TNet Config on board 1.1.1.11.

    :    , Info ... TNET Config Passed on board 1.1.1.11.

    :    , Info ... Config Passed on board 1.1.1.11.

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 0000000c in slot 12 

spsh:    ,          cru is present in slot 12 

    :    , Info ... Pon Passed on board 1.1.1.12.

    :    , Info ... Starting TNet Config on board 1.1.1.12.

    :    , Info ... TNET Config Passed on board 1.1.1.12.

    :    , Info ... Config Passed on board 1.1.1.12.

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 0000000c in slot 13 

spsh:    ,          cru is present in slot 13 

    :    , Info ... Pon Passed on board 1.1.1.13.

    :    , Info ... Starting TNet Config on board 1.1.1.13.

    :    , Info ... TNET Config Passed on board 1.1.1.13.

    :    , Info ... Config Passed on board 1.1.1.13.

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 0000000c in slot 14 

spsh:    ,          cru is present in slot 14 

    :    , Info ... Pon Passed on board 1.1.1.14.

    :    , Info ... Starting TNet Config on board 1.1.1.14.

    :    , Info ... TNET Config Passed on board 1.1.1.14.

    :    , Info ... Config Passed on board 1.1.1.14.

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 0000000c in slot 15 

spsh:    ,          cru is present in slot 15 

    :    , Info ... Pon Passed on board 1.1.1.15.

    :    , Info ... Starting TNet Config on board 1.1.1.15.

    :    , Info ... TNET Config Passed on board 1.1.1.15.

    :    , Info ... Config Passed on board 1.1.1.15.

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 0000000c in slot 16 

spsh:    ,          cru is present in slot 16 

    :    , Info ... Pon Passed on board 1.1.1.16.

    :    , Info ... Starting TNet Config on board 1.1.1.16.

    :    , Info ... TNET Config Passed on board 1.1.1.16.

    :    , Info ... Config Passed on board 1.1.1.16.

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 0000000c in slot 17 

spsh:    ,          cru is present in slot 17 

    :    , Info ... Pon Passed on board 1.1.1.17.

    :    , Info ... Starting TNet Config on board 1.1.1.17.

    :    , Info ... TNET Config Passed on board 1.1.1.17.

    :    , Info ... Config Passed on board 1.1.1.17.

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 0000000c in slot 18 

spsh:    ,          cru is present in slot 18 

    :    , Info ... Pon Passed on board 1.1.1.18.

    :    , Info ... Starting TNet Config on board 1.1.1.18.

    :    , Info ... TNET Config Passed on board 1.1.1.18.

    :    , Info ... Config Passed on board 1.1.1.18.

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 00000036 in slot 21 

spsh:    ,          cru is present in slot 21 

spsh:    ,          cru is powered on in slot 21 

spsh:    ,          cru pon successful for slot 21 

CHK :
   , Checking state of the PMCU CRU in slot 1.1.1.21 

CHK :
   , board state configured, could be inuse

ESC :
   , Starting ESC configuration for PMCU CRU in slot 1.1.1.21

ESC :
   , completed ESC configuration

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 00000036 in slot 22 

spsh:    ,          cru not present in slot 22 

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 0000000e in slot 25 

spsh:    ,          cru is present in slot 25 

spsh:    ,          cru is powered on in slot 25 

E-rebuildrestable[046] 0231003C 00:0000048D 030A : SMB Error                                00000020 00000085

DEBUG: invalid mreg read at SPIadd 7.5 reg 1

DEBUG: Error setting fan_1 power state -> 1 

spsh:    ,          cru PON failed in slot 25 [-191]

spsh:    ,          dbgrm for slot 25 

spsh:    ,          dbgadd for slot 25 

E-rebuildrestable[046] 0231003C 00:0000048D 030A : SMB Error                                00000020 00000085

DEBUG: invalid mreg read at SPIadd 7.5 reg 1

DEBUG: Error setting fan_1 power state -> 1 

    :    , Info ... Pon Failed on board 1.1.1.25.

DEBUG:cru_init_irqs(): failed sib_getboard():25

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 0000000e in slot 26 

spsh:    ,          cru is present in slot 26 

spsh:    ,          cru is powered on in slot 26 

E-rebuildrestable[046] 0231003C 00:00000492 030A : SMB Error                                00000020 00000085

DEBUG: invalid mreg read at SPIadd 7.5 reg 1

DEBUG: Error setting fan_1 power state -> 1 

spsh:    ,          cru PON failed in slot 26 [-191]

spsh:    ,          dbgrm for slot 26 

spsh:    ,          dbgadd for slot 26 

E-rebuildrestable[046] 0231003C 00:00000492 030A : SMB Error                                00000020 00000085

DEBUG: invalid mreg read at SPIadd 7.5 reg 1

DEBUG: Error setting fan_1 power state -> 1 

    :    , Info ... Pon Failed on board 1.1.1.26.

DEBUG:cru_init_irqs(): failed sib_getboard():26

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 00000003 in slot 23 

spsh:    ,          cru is present in slot 23 

spsh:    ,          cru is powered on in slot 23 

spsh:    ,          cru pon successful for slot 23 

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 00000003 in slot 28 

spsh:    ,          cru not present in slot 28 

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 00000003 in slot 29 

spsh:    ,          cru not present in slot 29 

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 00000003 in slot 30 

spsh:    ,          cru not present in slot 30 

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 0000003c in slot 31 

spsh:    ,          cru not present in slot 31 

------------------------------------------------------------------------------

spsh:    , Info ... working on CRU type = 0000003c in slot 32 

spsh:    ,          cru not present in slot 32 

------------------------------------------------------------------------------

spsh:    , Info ... Sending TNet sync 

------------------------------------------------------------------------------

clntpeer_create failed for proc 217 routed to 1.1.55

ERROR, Function @ 22f6c98, mq_remote_send_message error 1 

spsh:    , Info ... Executing POST for CRU in slot 55 

spsh:    , Info ... Executing POST for CRU in slot 50 

spsh:    , Info ... Executing POST for CRU in slot 51 

spsh:    , Info ... Executing POST for CRU in slot 52 

spsh:    , Info ... Executing POST for CRU in slot 53 

spsh:    , Info ... Executing POST for CRU in slot 54 

*** NOTE: cru_test(): No test script file for slot 50

SHEL:    , File Syncronization 

SHEL:    , Starting   - NSKPFAIL, SYSD, MRINTD and ESCMOND 

escFlags = 0x00000000

TST :
   , Testing PMF Leconte CRU in slot 1.1.1.55 [3]

SHEL:    , Starting   - CMUX 

cmux:    , Info ... cmuxd[0] PEER XFER DISABLED

SHEL:    , Starting   - HOST RPC [Host and ALT] 

TST :
   , Hard test, dont ask peer...

E-MRIMOND[229] 0230DAB2 00:000004AB 0301 : Time Out                                 00000001 00000032 00e50000

SHEL:    , Starting   - NSKEVMD 

TST :
   , Testing PMF CRU in slot 1.1.1.55 (CPU = 1)...

mrid:  ,         : bulk 01 OK (restored) 

ERROR, Function @ 22ebf4c, Error -201 processing msg 144, free 021031E4 

ADC :    , ADC Selection Event for CRU: 01.1.50.

SHEL:    , net info [204.160.19.230 255.255.255.0 204.160.19.1]

SHEL:    , Starting   - NETD 

mrid:  ,         : sending ON_LINE to pfaild 

E-MRIMOND[229] 0231003C 00:00000509 030A : SMB Error                                00000021 00000032

E-MRIMOND[229] 0231003C 00:00000509 030A : SMB Error                                00000021 00000032

E-MRIMOND[229] 0231003C 00:0000050A 030A : SMB Error                                00000021 00000032

E-MRIMOND[229] 0231003C 00:0000050A 030A : SMB Error                                00000021 00000032

mrid:  , error cleanup masked off port 128 at path = NULL                

    :    , attempting to get MAC from Kaweah FIR . - found MAC

SHEL:    , Dumping Memory Statistics

759904 bytes allocated in 764 blocks (max allocated 802400 bytes)

415008 bytes free in 57 blocks (largest free 378560 bytes)

23640 gets, 8392 hits, 235649 loops

22876 puts, 3374 mergeafters, 11054 mergebefores

0 failures

ioctl: M2E_CONN: Host is unreachable

clntm2e_create Failed:  Stack ID = 2, Program = 98, Version = 1

ioctl: M2E_CONN: Host is unreachable

clntm2e_create Failed:  Stack ID = 3, Program = 98, Version = 1

ioctl: M2E_CONN: Host is unreachable

clntm2e_create Failed:  Stack ID = 4, Program = 98, Version = 1

ioctl: M2E_CONN: Host is unreachable

clntm2e_create Failed:  Stack ID = 5, Program = 98, Version = 1

ioctl: M2E_CONN: Host is unreachable

clntm2e_create Failed:  Stack ID = 6, Program = 98, Version = 1

ioctl: M2E_CONN: Host is unreachable

clntm2e_create Failed:  Stack ID = 7, Program = 98, Version = 1

ioctl: M2E_CONN: Host is unreachable

clntm2e_create Failed:  Stack ID = 8, Program = 98, Version = 1

E-CPUM[045] 0230F582 00:00000552 0307 : SMB Error                                00000000 00000080

CPUM: , Error 0040 [prescale:803 speed:97 kHz]

    TX [len= 21] : 07 ff 80 ff d6 ad 92 00 d9 b2 47 00 34 ff ff ff 

                   54 00 80 7f ff 

    RX [len= 21] : 00 ff 00 00 00 00 00 00 00 00 00 00 00 00 00 00 

                   00 00 00 00 00 

E-CPUM[045] 0230F582 00:00000553 0307 : SMB Error                                00000000 00000080

CPUM: , Error 0040 [prescale:803 speed:97 kHz]

    TX [len= 21] : 07 ff 80 ff d6 ad 92 00 d9 b2 47 00 34 ff ff ff 

                   54 00 80 7f ff 

    RX [len= 21] : 00 ff 00 00 00 00 00 00 00 00 00 00 00 00 00 00 

                   00 00 00 00 00 

E-CPUM[045] 0230F582 00:00000557 0307 : SMB Error                                00000000 00000080

CPUM: , Error 0040 [prescale:803 speed:97 kHz]

    TX [len= 21] : 07 ff 80 ff d6 ad 92 00 d9 b2 47 00 34 ff ff ff 

                   54 00 80 7f ff 

    RX [len= 21] : 00 ff 00 00 00 00 00 00 00 00 00 00 00 00 00 00 

                   00 00 00 00 00 

ncmd:    , Error ... ll TNet write error [interrupt packet = fffffeb5, 0001f1c1] 

ncmd:
   , Info ... Putting CPU online [slot = 55]

ncmd:    , Info ... using default limit0Value (0x0f080700)

CFG :
   , Configuring PMF_L CRU in slot 1.1.1.55

clntpeer_create failed for proc 217 routed to 1.1.55

mq_remote_send_message error 1

CFG :
   , Beginning PMF_L configuration...

CFG :
   , Doing HARD reset of R4K

CFG :
   , Doing Reset of the MCs

CFG :
   , Passed Leconte configuration

CFG :
   , PMF_L configuration complete

ncmd:    , Info ... doing nano-loop test

ncmd:    , Info ... using /nskboot/nskmcode.elf for Millicode 

ncmd:    , Info ... using /nskboot/nskexv.elf for Exc Vectors 

ncmd:    , Info ... bdexec of Millicode [imcw = 0x81f00032] 

ncmd:    , Info ... starting millicode  [addr = 0xa0010000] 

CFG :
   , Final Configuration

dumping CPU 1, wait 1, timeout 300, dump only NO

Got CRU number (1.1.1.55) for CPU 1.

Waiting 300 seconds for CPU 1 to halt...

ncmd:    , Info ... rcvd cmd from mc cpu 1 [commandValue = 00000080] 

ncmd:    , Info ... new state from mc is [00800000] 

ncmd:    , Info ... rcvd cmd from mc cpu 1 [commandValue = 00000081] 

nmcd:    , Info ... got a Get CPU config request

ncmd:    , Info ... rcvd cmd from mc cpu 1 [commandValue = 00000080] 

ncmd:    , Info ... new state from mc is [00930000] 

(postdump) cpu 1 is running, diag bits = 00000013 (023), 290 sec left...

SHEL:    , Waiting For Failure of Critical Task

(postdump) cpu 1 is running, diag bits = 00000013 (023), 280 sec left...

(postdump) cpu 1 is running, diag bits = 00000013 (023), 270 sec left...

(postdump) cpu 1 is running, diag bits = 00000013 (023), 260 sec left...

(postdump) cpu 1 is running, diag bits = 00000013 (023), 250 sec left...

ncmd:    , Info ... rcvd cmd from mc cpu 1 [commandValue = 00000080] 

ncmd:    , Info ... new state from mc is [00c08156] 

CPU 1 is halted (state 00000000, halt 1, errfrz 0), done waiting.

Doing TNet read of 64 bytes from CPU 1 (AVT 17)...

CPU 1: Got Long Mailbox transaction record:

     post status  = 1 (Success)

     test num     = 16

     subtest num  = 0

     error (halt) = 0 (00)

Generating event, id = 75 (0x004B)...

  + Kaweah log size = 34, esd length = 168

(postdump) Event sent (75): CPU 1 CRU 1.1.1.55 POST Success (1), Kaweah POST events : 34.

TST :
   , Completed testing - Passed

clntpeer_create failed for proc 217 routed to 1.1.55

mq_remote_send_message error 1

clntpeer_create failed for proc 217 routed to 1.1.55

mq_remote_send_message error 1

clntpeer_create failed for proc 217 routed to 1.1.55

mq_remote_send_message error 1

clntpeer_create failed for proc 217 routed to 1.1.55

mq_remote_send_message error 1

clntpeer_create failed for proc 217 routed to 1.1.55

mq_remote_send_message error 1
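The log above is long, so a quick way to find the boards that did not come up is to filter it for the "Passed"/"Failed" status lines. The Python sketch below does that on a handful of lines copied from the output. The regular expression is my guess at the line format based only on this capture, and failed_boards is a made-up helper name.

```python
import re
from collections import defaultdict

# A few status lines copied from the boot log; a real run would read
# the whole capture from a file instead.
SAMPLE = """\
    :    , Info ... Pon Failed on board 1.1.1.50.
    :    , Info ... Pon Passed on board 1.1.1.51.
    :    , Info ... Config Passed on board 1.1.1.51.
    :    , Info ... Pon Failed on board 1.1.1.25.
    :    , Info ... Pon Passed on board 1.1.1.1.
"""

# Stage name, result, then the dotted board address (without the final full stop).
STATUS_RE = re.compile(
    r"Info \.\.\. (Pon|Init|TNET Config|Config) (Passed|Failed)"
    r" on board (\d+(?:\.\d+)*)"
)


def failed_boards(log_text: str) -> dict:
    """Map each board address to the list of stages that reported Failed."""
    failures = defaultdict(list)
    for line in log_text.splitlines():
        match = STATUS_RE.search(line)
        if match and match.group(2) == "Failed":
            failures[match.group(3)].append(match.group(1))
    return dict(failures)


failures = failed_boards(SAMPLE)
for board, stages in failures.items():
    print(board, "failed:", ", ".join(stages))
```

In the full capture the "Pon Failed" lines are for boards 1.1.1.50, 1.1.1.25 and 1.1.1.26.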

I created a non-RFC-1918 subnet to match the machine's actual address and network.
Telnetting to it, I get:

Himalaya Service Processor

login:

It does not accept super.super as username.
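As a sanity check on the subnet, the boot log reports "net info [204.160.19.230 255.255.255.0 204.160.19.1]". The short Python sketch below works out the network from those values using the standard ipaddress module and confirms it is outside the RFC 1918 private ranges. Reading the three fields as address, netmask and gateway is my assumption; the log does not label them.

```python
import ipaddress

# Values copied from the "net info" line in the boot log.
# Assumed order: address, netmask, gateway.
ADDRESS, NETMASK, GATEWAY = "204.160.19.230", "255.255.255.0", "204.160.19.1"

iface = ipaddress.ip_interface(f"{ADDRESS}/{NETMASK}")
network = iface.network
gateway = ipaddress.ip_address(GATEWAY)

# The three private blocks defined by RFC 1918.
RFC1918 = [
    ipaddress.ip_network(block)
    for block in ("10.0.0.0/8", "172.16.0.0/12", "192.168.0.0/16")
]
in_rfc1918 = any(iface.ip in block for block in RFC1918)

print("network:", network)
print("gateway on same network:", gateway in network)
print("address is RFC 1918 private:", in_rfc1918)
```

So the workstation needs an address in 204.160.19.0/24 to reach the service processor directly.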

tandem2
