| Title: | TurboLaser Notesfile - AlphaServer 8200 and 8400 systems |
| Notice: | Welcome to WONDER::TURBOLASER in it's new home shortly |
| Moderator: | LANDO::DROBNER |
| Created: | Tue Dec 20 1994 |
| Last Modified: | Fri Jun 06 1997 |
| Last Successful Update: | Fri Jun 06 1997 |
| Number of topics: | 1218 |
| Total number of notes: | 4645 |
Hello All,
I have read notes 295.0 and 359.0 about "Hang" problem at Turbolaser.
And I've done all information and sugestion from both notes to solved
the problem such as DECevent, O.S. pathes. And it still monitoring till
I shared this problem.
Our "TL" has the following configuration :
o D-UNIX 3.2G with patches for V3.2G
0 Dual CPU
0 1 GB Memory
o KFTHA consist of :
PCI-PIU#1 (hose#0) :
- 3 units KZPSA (A10) that connected to Internal SBB, TZ875 Autoloader,
and HSZ50 at SW800.
PCI-PIU#2 (hose#1) :
- 2 units KZPSA (A10) and 4 units KZPAA connected to Internal SBB, HSZ50 at
SW800, 3 units TZ87 and CD-ROM.
All disks at Internal SBB and SW800 has configured with LSM and has a mirror.
The problem occured intermittently, and it rather dificult for me to estimate the time.
Everytime the problem occured, we checked that hose-error led was on and KFTHA led off.
We suspect that the problem caused by the KFTHA module at that time.
But after installed the DECevent, the result of diagnostic tool of DECevent make me really
confused because the result mention that there was a problem with the system configuration
and I have to replace all module.
I really need your sugestion..!!
rgrds
doni
SSE DIGITAL-INDONESIA
There is some report from DECevent :
DECevent V2.3
******************************** ENTRY 1 ********************************
Logging OS 2. Digital UNIX
System Architecture 2. Alpha
Event sequence number 0.
Timestamp of occurrence 26-JAN-1997 19:32:20
Host name utpci1
System type register x0000000C AlphaServer 8x00
Number of CPUs (mpnum) x00000001
CPU logging event (mperr) x00000000
Event validity 1. O/S claims event is valid
Event severity 5. Low Priority
Entry type 110. Generalized Machine State Type
SWI Minor class 3. System configuration
--CONFIGURATION SUBPKT--
FRU CLASS x0001 ** TLSB FRU Subpkt **
Device Type x8014 Turbo-Laser Dual CPU, 4meg Bcache
TLSB Node # 0.
FRU Name KN7CE-AB
Serial Number
************************
FRU CLASS x0001 ** TLSB FRU Subpkt **
Device Type x5000 Turbo-Laser Memory Module
TLSB Node # 1.
FRU Name MS7CC
Serial Number ZG64401927
************************
FRU CLASS x0001 ** TLSB FRU Subpkt **
Device Type x5000 Turbo-Laser Memory Module
TLSB Node # 7.
FRU Name MS7CC
Serial Number ZG64401911
************************
FRU CLASS x0001 ** TLSB FRU Subpkt **
Device Type x2000 Turbo-Laser I/O Module
TLSB Node # 8.
FRU Name KFTHA
Serial Number AY62121481
************************
FRU CLASS x0002 * Hose to IO Bus Adptr *
Device Type xEF00 PCIA
Tiop 8.
Hose 0.
Slot 0.
FRU Name DWLPA
Serial Number AY63810750
************************
FRU CLASS x0005 * PCI FRU Subpkt *
Device Type x00091011 DEC_FASTNI
Tiop 8.
Hose 0.
Slot 0.
FRU Name TULIP
PCI Ident Field (LO) x000000C3
PCI Ident Field (HIGH) x00001000
Bar Length x0048
Base Address 0 x0000000004333000
Size 0 x00000100
Base Address 1 x0000000000183000
Size 1 x00000100
Base Address 2 x0000000000000000
Size 2 x00000000
Base Address 3 x00000000FFFFFFFF
Size 3 xFFFFFFFF
Base Address 4 x00000000FFFFFFFF
Size 4 xFFFFFFFF
Base Address 5 x00000000FFFFFFFF
Size 5 xFFFFFFFF
************************
FRU CLASS x0005 * PCI FRU Subpkt *
Device Type x00081011 DEC_KZPSA
Tiop 8.
Hose 0.
Slot 0.
FRU Name KZPSA
PCI Ident Field (LO) x000000C3
PCI Ident Field (HIGH) x00002800
Bar Length x0048
Base Address 0 x0000000004320000
Size 0 x00010000
Base Address 1 x0000000004200000
Size 1 x00100000
Base Address 2 x0000000000182000
Size 2 x00001000
Base Address 3 x0000000004332000
Size 3 x00001000
Base Address 4 x0000000000000000
Size 4 x00000000
Base Address 5 x00000000FFFFFFFF
Size 5 xFFFFFFFF
************************
FRU CLASS x0005 * PCI FRU Subpkt *
Device Type x00081011 DEC_KZPSA
Tiop 8.
Hose 0.
Slot 0.
FRU Name KZPSA
PCI Ident Field (LO) x000000C3
PCI Ident Field (HIGH) x00003800
Bar Length x0048
Base Address 0 x0000000004310000
Size 0 x00010000
Base Address 1 x0000000004100000
Size 1 x00100000
Base Address 2 x0000000000181000
Size 2 x00001000
Base Address 3 x0000000004331000
Size 3 x00001000
Base Address 4 x0000000000000000
Size 4 x00000000
Base Address 5 x00000000FFFFFFFF
Size 5 xFFFFFFFF
************************
FRU CLASS x0005 * PCI FRU Subpkt *
Device Type x00081011 DEC_KZPSA
Tiop 8.
Hose 0.
Slot 0.
FRU Name KZPSA
PCI Ident Field (LO) x000000C3
PCI Ident Field (HIGH) x00004800
Bar Length x0048
Base Address 0 x0000000004300000
Size 0 x00010000
Base Address 1 x0000000004000000
Size 1 x00100000
Base Address 2 x0000000000180000
Size 2 x00001000
Base Address 3 x0000000004330000
Size 3 x00001000
Base Address 4 x0000000000000000
Size 4 x00000000
Base Address 5 x00000000FFFFFFFF
Size 5 xFFFFFFFF
************************
FRU CLASS x0002 * Hose to IO Bus Adptr *
Device Type xEF00 PCIA
Tiop 8.
Hose 1.
Slot 0.
FRU Name DWLPA
Serial Number AY64616950
************************
FRU CLASS x0005 * PCI FRU Subpkt *
Device Type x00081011 DEC_KZPSA
Tiop 8.
Hose 1.
Slot 0.
FRU Name KZPSA
PCI Ident Field (LO) x000000C7
PCI Ident Field (HIGH) x00000800
Bar Length x0048
Base Address 0 x0000000004210000
Size 0 x00010000
Base Address 1 x0000000004100000
Size 1 x00100000
Base Address 2 x0000000000181000
Size 2 x00001000
Base Address 3 x0000000004221000
Size 3 x00001000
Base Address 4 x0000000000000000
Size 4 x00000000
Base Address 5 x00000000FFFFFFFF
Size 5 xFFFFFFFF
************************
FRU CLASS x0005 * PCI FRU Subpkt *
Device Type x00011000 NCR_810
Tiop 8.
Hose 1.
Slot 0.
FRU Name KZPAA
PCI Ident Field (LO) x000000C7
PCI Ident Field (HIGH) x00001800
Bar Length x0048
Base Address 0 x0000000004222300
Size 0 x00000100
Base Address 1 x0000000000182300
Size 1 x00000100
Base Address 2 x0000000000000000
Size 2 x00000000
Base Address 3 x00000000FFFFFFFF
Size 3 xFFFFFFFF
Base Address 4 x00000000FFFFFFFF
Size 4 xFFFFFFFF
Base Address 5 x00000000FFFFFFFF
Size 5 xFFFFFFFF
************************
FRU CLASS x0005 * PCI FRU Subpkt *
Device Type x00081011 DEC_KZPSA
Tiop 8.
Hose 1.
Slot 0.
FRU Name KZPSA
PCI Ident Field (LO) x000000C7
PCI Ident Field (HIGH) x00002800
Bar Length x0048
Base Address 0 x0000000004200000
Size 0 x00010000
Base Address 1 x0000000004000000
Size 1 x00100000
Base Address 2 x0000000000180000
Size 2 x00001000
Base Address 3 x0000000004220000
Size 3 x00001000
Base Address 4 x0000000000000000
Size 4 x00000000
Base Address 5 x00000000FFFFFFFF
Size 5 xFFFFFFFF
************************
FRU CLASS x0005 * PCI FRU Subpkt *
Device Type x00011000 NCR_810
Tiop 8.
Hose 1.
Slot 0.
FRU Name KZPAA
PCI Ident Field (LO) x000000C7
PCI Ident Field (HIGH) x00003800
Bar Length x0048
Base Address 0 x0000000004222200
Size 0 x00000100
Base Address 1 x0000000000182200
Size 1 x00000100
Base Address 2 x0000000000000000
Size 2 x00000000
Base Address 3 x00000000FFFFFFFF
Size 3 xFFFFFFFF
Base Address 4 x00000000FFFFFFFF
Size 4 xFFFFFFFF
Base Address 5 x00000000FFFFFFFF
Size 5 xFFFFFFFF
************************
FRU CLASS x0005 * PCI FRU Subpkt *
Device Type x00011000 NCR_810
Tiop 8.
Hose 1.
Slot 0.
FRU Name KZPAA
PCI Ident Field (LO) x000000C7
PCI Ident Field (HIGH) x00004800
Bar Length x0048
Base Address 0 x0000000004222100
Size 0 x00000100
Base Address 1 x0000000000182100
Size 1 x00000100
Base Address 2 x0000000000000000
Size 2 x00000000
Base Address 3 x00000000FFFFFFFF
Size 3 xFFFFFFFF
Base Address 4 x00000000FFFFFFFF
Size 4 xFFFFFFFF
Base Address 5 x00000000FFFFFFFF
Size 5 xFFFFFFFF
************************
FRU CLASS x0005 * PCI FRU Subpkt *
Device Type x00011000 NCR_810
Tiop 8.
Hose 1.
Slot 0.
FRU Name KZPAA
PCI Ident Field (LO) x000000C7
PCI Ident Field (HIGH) x00005800
Bar Length x0048
Base Address 0 x0000000004222000
Size 0 x00000100
Base Address 1 x0000000000182000
Size 1 x00000100
Base Address 2 x0000000000000000
Size 2 x00000000
Base Address 3 x00000000FFFFFFFF
Size 3 xFFFFFFFF
Base Address 4 x00000000FFFFFFFF
Size 4 xFFFFFFFF
Base Address 5 x00000000FFFFFFFF
Size 5 xFFFFFFFF
************************
******************************** ENTRY 2 ********************************
Logging OS 2. Digital UNIX
System Architecture 2. Alpha
Event sequence number 1.
Timestamp of occurrence 26-JAN-1997 19:32:20
Host name utpci1
System type register x0000000C AlphaServer 8x00
Number of CPUs (mpnum) x00000001
CPU logging event (mperr) x00000000
Event validity 1. O/S claims event is valid
Event severity 5. Low Priority
Entry type 300. Start-Up ASCII Message Type
SWI Minor class 9. ASCII Message
SWI Minor sub class 3. Startup
ASCII Message
Alpha boot: available memory from 0x1800000 to 0x3ffbe000
Digital UNIX V3.2G (Rev. 62); Sun Jan 26 19:29:17 GMT+0700 1997
physical memory = 1024.00 megabytes.
available memory = 999.75 megabytes.
using 3923 buffers containing 30.64 megabytes of memory
Firmware revision: 4.1
PALcode: OSF version 1.21
AlphaServer 8400 Model EV56/440
Master cpu at slot 0.
Created FRU table configuration errorlog packet
tiop0 at tlsb0 node 8
tiop0: cpu interrupt mask being set as 1.
pci0 at tiop0 slot 0
tu0: DECchip 21140-AA: Revision: 1.2
tu0 at pci0 slot 2
tu0: DEC Fast Ethernet Interface, hardware address: 00-00-F8-1E-25-6E
tu0: console mode: selecting 10BaseT (UTP) port: half duplex: no link
pza0 at pci0 slot 5
pza0 firmware version: DEC P01 A10
scsi0 at pza0 slot 0
rz1 at scsi0 bus 0 target 1 lun 0 (DEC RZ28M (C) DEC 0616)
rz2 at scsi0 bus 0 target 2 lun 0 (DEC RZ28M (C) DEC 0616)
rz3 at scsi0 bus 0 target 3 lun 0 (DEC RZ28M (C) DEC 0616)
pza1 at pci0 slot 7
pza1 firmware version: DEC P01 A10
scsi1 at pza1 slot 0
rz9 at scsi1 bus 1 target 1 lun 0 (DEC HSZ50-AX V50Z)
rz10 at scsi1 bus 1 target 2 lun 0 (DEC HSZ50-AX V50Z)
rz11 at scsi1 bus 1 target 3 lun 0 (DEC HSZ50-AX V50Z)
rz12 at scsi1 bus 1 target 4 lun 0 (DEC HSZ50-AX V50Z)
pza2 at pci0 slot 9
pza2 firmware version: DEC P01 A10
scsi2 at pza2 slot 0
tz21 at scsi2 bus 2 target 5 lun 0 (DEC TZ875 (C) DEC 9B3C)
pci1 at tiop0 slot 1
pza3 at pci1 slot 1
pza3 firmware version: DEC P01 A10
scsi3 at pza3 slot 0
rz25 at scsi3 bus 3 target 1 lun 0 (DEC HSZ50-AX V50Z)
rz26 at scsi3 bus 3 target 2 lun 0 (DEC HSZ50-AX V50Z)
rz27 at scsi3 bus 3 target 3 lun 0 (DEC HSZ50-AX V50Z)
rz28 at scsi3 bus 3 target 4 lun 0 (DEC HSZ50-AX V50Z)
psiop0 at pci1 slot 3
Loading SIOP: script c0001900, reg 4222300, data 406e38a0
scsi4 at psiop0 slot 0
rz37 at scsi4 bus 4 target 5 lun 0 (DEC RRD45 (C) DEC 0436)
pza4 at pci1 slot 5
pza4 firmware version: DEC P01 A10
scsi5 at pza4 slot 0
rz41 at scsi5 bus 5 target 1 lun 0 (DEC RZ28M (C) DEC 0616)
rz42 at scsi5 bus 5 target 2 lun 0 (DEC RZ28M (C) DEC 0568)
rz43 at scsi5 bus 5 target 3 lun 0 (DEC RZ28M (C) DEC 0568)
rz44 at scsi5 bus 5 target 4 lun 0 (DEC RZ28D (C) DEC 0010)
psiop1 at pci1 slot 7
Loading SIOP: script c000d900, reg 4222200, data c0019ca0
scsi6 at psiop1 slot 0
tz53 at scsi6 bus 6 target 5 lun 0 (DEC TZ87 (C) DEC 9B3C)
psiop2 at pci1 slot 9
Loading SIOP: script c001f900, reg 4222100, data 406e40a0
scsi7 at psiop2 slot 0
tz58 at scsi7 bus 7 target 2 lun 0 (DEC TZ87 (C) DEC 9B3C)
psiop3 at pci1 slot 11
Loading SIOP: script c002b900, reg 4222000, data 406e44a0
scsi8 at psiop3 slot 0
tz66 at scsi8 bus 8 target 2 lun 0 (DEC TZ87 (C) DEC 9B3C)
TLMEM at node 7
TLMEM at node 1
Dual TLEP at node 0
lvm0: configured.
lvm1: configured.
dli: configured
SuperLAT. Copyright 1993 Meridian Technology Corp. All rights reserved.
| T.R | Title | User | Personal Name | Date | Lines |
|---|---|---|---|---|---|
| 1072.1 | Why 4 KZPAAs...????? | WONDER::MUZZI | Mon Jan 27 1997 10:33 | 6 | |
Why do you have 4 KZPAAs on this system...? Only one is supported...an
that's only support as a connection to the CD-ROM.
| |||||
| 1072.2 | 3 units of KZPAA for TZ87 Tape drive | DAIVC::ENGKOS | Wed Jan 29 1997 01:25 | 12 | |
Thanks for your quick reply.
Do you mean TL support one KZPAA only..?
Actually, we used that KZPAAs for TZ87 Tape drive, because our
customer need it for parallel backup of their application.
How about if we'd like to add on another device that needed KZPAA
for the interface..??
Rgrds
engkos
daivc::engkos
| |||||
| 1072.3 | Only one KZPAA supported.. | WONDER::MUZZI | Wed Jan 29 1997 09:25 | 14 | |
Only ONE KZPAA is supported...and only as a connection to a CDrom only.
It's in to SOC. The supproted connection is thru KZPSA/KFTIA-differential
to DWZZA-VAs. Additional single ended devices need to be connected thru
KZPSA/DWZZA. It's a more costly connection...but that's the way it is.
The problem with the KZPAA is with the SCSI chip it uses (53c710..?).
It run on scripts that live in main memory. So everytime it wants/has
to do something it has to go to main memory to get the scripts.
-Mark-
| |||||
| 1072.4 | What is the root cause ? | DAIVC::AGUSSUSANTO | Thu Jan 30 1997 04:12 | 12 | |
I am just curious, in my understand that if more than one KZPAA
installed it will consumed memory source rather than make the system
hang or crash . Anyway, I heard that the 3 KZPAA was removed and the
problem still exist.
Do you have any idea ? It is very difficult to find out the root cause
since nothing can do but power recycle every time the system hang,
means it is no way to get the latest information which resident in the
memory due to its refreshed every time the system do the initialization.
rgds,
as
| |||||
| 1072.5 | Please check Power Regulator EPU value | LANDO::DROBNER | TurboLaser Engineering - 8200/8400 | Thu Jan 30 1997 09:44 | 16 |
I am going to put my similiar reply here as an early note stream.
Please give us the complete system configuration; what we would like
to see is; 1) 8200 or 8400 style cabinet. 2) Part number and quantity
of power regulators in the cabinet. 3) The modules and where they
are; system bus, PCI bus (DWLPA/B, quantity).
Reading these notes, I would guess you have a 8400 style cabinet and
one power regulator (H7263-AA/AB or H7263-AC/AD) in this cabinet. If
this is the case - please look in the 8400 SOC article and calculate
the "EPU" value that the system is using (JAN-97 update, page 2.191).
If you are close to the EPU value of 80, but not above and you have
only one power regulator in the system - I would recommond adding a
second power regulator or replacing the orginal.
/Howard
| |||||
| 1072.6 | Total EPU value = 68 | DAIVC::AGUSSUSANTO | Thu Jan 30 1997 21:39 | 22 | |
Three power regulater (H7263-AB) were in the system (DA-292FD-BB),
means it was configured as an N+1 redundant power. Table below is the
complete system configuration.
OPTION EPU QTY TOTAL EPU
Base Server 30 1 30
KFTHA-AA 3 1 3
MS7CC-DA 5 2 10
DWLPB-BA 1 2 2
KZPSA-BB 1 7 7
DE500-XA 1 1 1
DWZZB-VW 0 2 0
DWZZA-VA 0 2 0
RZ28M-VW 1 6 6
TZ87-VA 3 3 9
TOTAL 68
Any ideas are welcome
/AS
| |||||
| 1072.7 | check for unix patches...? | WONDER::MUZZI | Fri Jan 31 1997 09:34 | 10 | |
You might want to check to see if there are any patches for unix/tape
problems. It wouldn't be the first time that I've seen unix hang the
system and it be a software issue.
-Mark-
| |||||
| 1072.8 | Already applied | DAIVC::AGUSSUSANTO | Mon Feb 03 1997 01:58 | 4 | |
I have a complete one patches for V3.2G and it was already applied to
the system at installation period. FYI, below is the location of patch
ftp://oskits.zk3.dec.com/patches/osf/v3.2g/v3.2g_bpatch.tar
| |||||
| 1072.9 | I HAVE THE SAME PROBLEMS | NETRIX::"[email protected]" | Cesarato | Wed Feb 19 1997 12:16 | 37 |
Hi, I have the same problems :random system hang. When it happen you can do restart only. I have had two crashes where DIA reported two different memory simm's with ECC error but it was a different problem. The system is a 8400/440 with 4 GB memory (2 board 2GB at node 2 and 6) 2 twin CPU, 2 PCI bus with 8 KZPSA A10, 1 memory channel, 1 de500, 1 de435, 1 defpa. AT THE kzpsa are connected : 2 kzpsa for 1 TL826 4 kzpsa for 4 hsz40 Software configuration: OSF/1 3.2G ADVFS LSM ORACLE 7.2 POLYCENTER NSR 4.2B EBU Some parameters have been changed for oracle as shared memory at 2GB Shared memory seg 32 MAXVAS = MACHINE_PHYSYCAL_MEMORY maxprc =1024 There is LSM configured with mirrorset on internal disks connected at TIA and 60GB mirrorset on HSZ40's. The volumes are used from oracle 7.2 like row devices. I have checked firmwares, installed patches for OSF/1 3.2G, but the problem is still present. Any ideas [Posted by WWW Notes gateway] | |||||