| Title: | ALL-IN-1 (tm) Support Conference | 
| Notice: | Please spell ALL-IN-1 correctly - all CAPITALS! | 
| Moderator: | IOSG::PYE  CE | 
|  | 
| Created: | Fri Jul 01 1994 | 
| Last Modified: | Fri Jun 06 1997 | 
| Last Successful Update: | Fri Jun 06 1997 | 
| Number of topics: | 2716 | 
| Total number of notes: | 12169 | 
2653.0. "NODE1::"73=" Stopped" by BARNA::DSMAIL () Wed Apr 30 1997 14:45
Hi,
A customer has a VMS cluster with two nodes, NODE1 and NODE2.
He ran ALL-IN-1 V3.0 from NODE1 without problems.
Now he runs ALL-IN-1 from NODE2, but he has File Cabinet problems.
    
SM/MFC/MS shows		             Server Name:  NODE1::"73="
                                          Status:  Stopped
        	            DECnet object number:  73
        		      TCP/IP port number:  7373
        
also:
                               Index of Servers
 (Selections: 0  )                                          (New messages: 0  )
   No.   Node   Num TCP/IP Type   Process Name       Startup Queue
 > 1     NODE1   73  7373  LOCAL  NODE1$SRV73        A1V30$BATCH
OAFC$SERVER_STARTUP.LOG;79
                     30-APR-1997 08:07:27.05
    No reference to NODE2!!!
    
search OAFC$SERVER_STARTUP.LOG NODE1
$       @sys$startup:oafc$startup SYS$SYSDEVICE:[ALLIN1.DATA_SHARE]
NODE1$SERVER73.DAT;1 NODE1$SRV73
            /PROCESS_NAME        = "NODE1$SRV73" -
$       WRITE/ERROR=write_mbx_error write_mbx "SYS$SYSDEVICE:[ALLIN1.
DATA_SHARE]NODE1$SERVER73.DAT;1"
                configuration_file: SYS$SYSDEVICE:[ALLIN1.DATA_SHARE]
NODE1$SERVER73.DAT;1
                process_name:       NODE1$SRV73
SH SYS 
VAX/VMS V5.5-2  on node NODE2  30-APR-1997 12:49:28.77   Uptime  11 04:21:23
  Pid    Process Name    State  Pri      I/O       CPU       Page flts Ph.Mem
20800219 SYMBIONT_0001   HIB      6  1486754   0 00:55:21.42     38565    293
2080221C NODE1$SRV73     HIB      6      332   0 00:00:05.91      1758   1988
20800257 OA$SYMBIONT_008 HIB      6       21   0 00:00:00.77       200    108
20800258 SYMBIONT_0009   HIB      5    14801   0 00:17:32.01      2142   1354
20802CA2 ALLIN1          CUR  1   4      713   0 00:00:07.33      3778    509
20802AF6 OA$FCV          SUSP     3        5   0 00:00:00.07        66    135
20801EF7 TM_SERVER       LEF      3       26   0 00:00:00.65       381    801
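
One detail in the listing above is easy to miss: the SHOW SYSTEM header names NODE2, yet the server process it lists is called NODE1$SRV73. The following is a minimal Python sketch for checking this in captured output, assuming the text has been saved and is examined on another machine. The function name `find_process` is hypothetical, and the column layout (8-digit PID, 15-character name field) is assumed from the rows shown above rather than taken from any VMS documentation.

```python
import re

# A few rows copied from the SHOW SYSTEM output above.
SHOW_SYSTEM = """\
VAX/VMS V5.5-2  on node NODE2  30-APR-1997 12:49:28.77   Uptime  11 04:21:23
  Pid    Process Name    State  Pri      I/O       CPU       Page flts Ph.Mem
20800219 SYMBIONT_0001   HIB      6  1486754   0 00:55:21.42     38565    293
2080221C NODE1$SRV73     HIB      6      332   0 00:00:05.91      1758   1988
20802AF6 OA$FCV          SUSP     3        5   0 00:00:00.07        66    135
"""


def find_process(show_system_text, process_name):
    """Return (node, pid, state) for process_name, or None if it is not listed."""
    lines = show_system_text.splitlines()
    # The first line of the listing names the node the command ran on.
    node_match = re.search(r"on node (\S+)", lines[0])
    node = node_match.group(1) if node_match else None
    for line in lines[2:]:
        # Assumed layout: 8 hex digits, a space, then a 15-character name field.
        m = re.match(r"([0-9A-F]{8}) (.{15}) +(\S+)", line)
        if m and m.group(2).strip() == process_name:
            return node, m.group(1), m.group(3)
    return None


print(find_process(SHOW_SYSTEM, "NODE1$SRV73"))
```

A result whose node differs from the node prefix in the process name is what the listing above shows: a process named for NODE1 that is running on NODE2.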
Starting the server with the STA option gives:
	NODE1::"73=" submitted to A1V30$BATCH for startup
but nothing happens.
	Batch queue A1V30$BATCH, idle, on NODE2::
  /BASE_PRIORITY=4 /JOB_LIMIT=1 /OWNER=[SISTEMAS,SYSTEM] /PROTECTION=(S:E,O:D,
  G:R,W:W)
OAFC$SERVER_STARTUP.LOG;80
                     30-APR-1997 12:08:40.18
Its last lines are:
$ start_server:
$       @sys$startup:oafc$startup SYS$SYSDEVICE:[ALLIN1.DATA_SHARE]
NODE1$SERVER73.DAT;1 NODE1$SRV73
$!
$!      Copyright information and description at end.
$!
$! Default any parameters not passed to us.
$!
$       IF P1 .EQS. "" THEN P1 = "OA$DATA_SHARE:OAFC$SERVER_CONFIG.DAT"
$       IF P2 .EQS. "" THEN P2 = "A1FS"
$!
$! Check that the configuration file exists before trying to use it for the
$! server.
$!
$       if f$search( P1 ) .eqs. "" then goto No_config_file
$!
$! We'll be running under a different UIC, we'll need CMKRNL for that, but the
$! subprocess will also need a number of other privileges, and hence we need to
$! make sure that we've already got them, since we can't pass them to the
$! subprocess otherwise. Set a symbol to the list so it's easy to change here,
$! in the error message, and in the RUN command.
$!
$       Required_privs = -
          "CMKRNL,SYSLCK,SYSNAM,SYSPRV,PRMMBX,TMPMBX,EXQUOTA,NETMBX,DETACH"
$       IF .not. F$PRIVILEGE( Required_privs ) THEN GOTO priv_server
$!
$! Check a process is not already running with the requested process name
$!
$       if f$type( Ctx ) .eqs. "PROCESS_CONTEXT" then -
            Temp = f$context( "PROCESS",Ctx,"CANCEL")
$       Ctx = ""
$!
$! Select processes that have the name the caller chose (or we defaulted to)
$! for the subprocess, are detached, are owned by OAFC$SERVER and are on this
$! node (the latter is the default).
$!
$       Temp = f$context( "PROCESS",Ctx,"PRCNAM",P2,"EQL" )
$       Temp = f$context( "PROCESS",Ctx,"JOBTYPE","DETACHED","EQL" )
$       Temp = f$context( "PROCESS",Ctx,"UIC","[OAFC$SERVER]","EQL" )
$!
$! Pick up the process ID, which clears the context if the process doesn't
$! exist, otherwise we know a process of the specified name already exists.
$!
$       Temp = f$pid( Ctx )
$       if f$type( Ctx ) .eqs. "PROCESS_CONTEXT"
$       then
$           Temp = f$context( "PROCESS",Ctx,"CANCEL")
$           goto proc_name_error
$proc_name_error:
$       WRITE SYS$OUTPUT -
          "A server process already exists with the name NODE1$SRV73"
A server process already exists with the name NODE1$SRV73
$       EXIT 0
%NONAME-W-NOMSG, Message number 00000000
$
$       write sys$output "Server is being started:"
Server is being started:
$       write sys$output "              configuration_file: ",config_file
                configuration_file: SYS$SYSDEVICE:[ALLIN1.DATA_SHARE]
NODE1$SERVER73.DAT;1
$       write sys$output "              process_name:       ",process_name
                process_name:       NODE1$SRV73
$ exit:
$       write sys$output ""
$       write sys$output ""
$       write sys$output "******************************************", -
                                                "******************************"
************************************************************************
$       write sys$output ""
$       write sys$output ""
$
$       exit
  ALLIN1       job terminated at 30-APR-1997 12:08:42.40
  Accounting information:
  Buffered I/O count:              45         Peak working set size:     433
  Direct I/O count:                57         Peak page file size:      3012
  Page faults:                    493         Mounted volumes:             0
  Charged CPU time:           0 00:00:01.20   Elapsed time:     0 00:00:03.42
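
The log above suggests why the STA option appears to do nothing: the startup procedure finds a process that matches its three F$CONTEXT selections and exits without creating a new one. The check can be modelled roughly as below. This is an illustrative Python sketch, not the procedure itself; the `Process` class, the function name, and the sample process list are hypothetical, and the job type and UIC given for NODE1$SRV73 are assumed from the fact that the check matched.

```python
from dataclasses import dataclass


@dataclass
class Process:
    name: str      # process name, e.g. "NODE1$SRV73"
    job_type: str  # e.g. "DETACHED", "BATCH"
    uic: str       # owner UIC, e.g. "[OAFC$SERVER]"


def server_already_running(local_processes, process_name="A1FS"):
    """Mirror the PRCNAM, JOBTYPE and UIC selections: all three must match.

    Only processes on the local node are passed in, because the procedure's
    comments say the selection defaults to the current node.
    """
    return any(
        p.name == process_name
        and p.job_type == "DETACHED"
        and p.uic == "[OAFC$SERVER]"
        for p in local_processes
    )


# Hypothetical snapshot of NODE2, where the batch job ran.
node2 = [
    Process("NODE1$SRV73", "DETACHED", "[OAFC$SERVER]"),
    Process("ALLIN1", "BATCH", "[SISTEMAS,SYSTEM]"),
]

print(server_already_running(node2, "NODE1$SRV73"))
```

Because the batch queue runs on NODE2 and the selection is local, the check is made against NODE2's processes, which is consistent with the "already exists" message in the log.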
Can anybody help us?
Regards,
		Josep
                            
| T.R | Title | User | Personal Name | Date | Lines |
|---|---|---|---|---|---|
| 2653.1 | Stop/Start and then upgrade ! | IOSG::STANDAGE | My hovercraft is full of eels | Wed Apr 30 1997 16:00 | 30 | 
|  |     
    Josep,
    
    It would appear as though the FCS is running, but the Management UI is
    showing it to be Stopped. When you go to SM MFC MFS does it take a long
    time before saying "Stopped" (~30 seconds?). If this is the case then
    your server is in trouble and is not responding to connection requests.
    The quickest solution is to STOP/ID the server process for NODE1::"73="
    (NODE1$SRV73 in your SHOW SYSTEM output) and restart it from the
    SM MFC MFS subsystem.
    
    I don't think having the extra node running ALL-IN-1 affected this.
    
    In the long term, there are a few things you should consider. Firstly,
    ALL-IN-1 V3.0 is *very* old so you should consider upgrading to V3.1 or
    (better still) V3.2. There are also a number of ICF patches for V3.1
    which address FCS problems which you should install if you cannot
    upgrade to V3.2.
    
    If you really have to stay on ALL-IN-1 V3.0, then at least install the
    TLC - TeamLinks Connection Package (to V2.1 and then V2.5), as this
    will also update your servers to newer, better versions.
    
    Finally, be aware of VMS patches which exist to resolve problems that
    appear to be with the servers. There are already many notes in this
    conference about them...
    
    -Kevin.
    
    
    
 | 
| 2653.2 | cluster alias? | VELI::KORKKO | Veli Körkkö @FNO, 879-5512 | Wed Apr 30 1997 17:30 | 6 | 
|  |         Do you have DECnet/OSI there by any chance? Do you have cluster
        alias enabled on NODE2 by any chance? There used to be a case where
        having alias outgoing enabled on the FCS object (or session
        control application) caused this kind of problem.
        
        _veli
 | 
| 2653.3 | I'm losing it, time to retire gracefully | IOSG::STANDAGE | My hovercraft is full of eels | Wed Apr 30 1997 17:57 | 10 | 
|  |     
    Ah..yes!  Veli is indeed correct. I didn't realise that it's from NODE2
    that you can't see the server running on NODE1 - my apologies, I was
    reading your note too quickly.
    
    The alias enabled could potentially be the problem...
    
    Kevin.
    
    
 | 
| 2653.4 | Only DECnet | BARNA::DSMAIL |  | Mon May 05 1997 16:16 | 7 | 
|  |     The customer has only DECnet Phase IV.
    
    Is there anything that could be checked there?
    
    Best Regards,
    
    		Josep
 | 
| 2653.5 |  | IOSG::STANDAGE | My hovercraft is full of eels | Mon May 05 1997 16:27 | 12 | 
|  |     
    Josep,
    
    What does :
    
    NCP> SHOW EXEC CHAR
    
    show ?
    
    Kevin.
    
    
 | 
| 2653.6 |  | AIMTEC::ZANIEWSKI_D | Add Jean to the list of deserters! | Mon May 05 1997 16:43 | 5 | 
|  |         $SHOW SYSTEM is all that you can do.  The server indicating the
        wrong status on the management menu is a known problem in versions
        3.0 and 3.1, and is fixed in 3.2 according to my IPMT.
        
        Dave Zaniewski
 | 
| 2653.7 | I always did like history lessons... | IOSG::STANDAGE | My hovercraft is full of eels | Mon May 05 1997 17:10 | 11 | 
|  |     
    Dave,
    
    Which problem is that ? I know there's a problem when you have lots of
    servers running (Max DASL sessions), but in this case they only have
    one or two servers running.
    
    Just curious,
    Kevin.
    
    
 | 
| 2653.8 |  | AIMTEC::ZANIEWSKI_D | Add Jean to the list of deserters! | Mon May 05 1997 18:46 | 7 | 
|  |         Hi Kevin,
        
        HPXQB2DDE
        C961101-5797
        And Martin Cook references FC_SERVER_ICOS #189.0
        
        Dave Zaniewski
 | 
| 2653.9 | Not sure this is the same problem | IOSG::STANDAGE | My hovercraft is full of eels | Tue May 06 1997 08:14 | 13 | 
|  |     
    Yes, this is the client exceeding the maximum number of DASL
    connections in one session (or to that effect), which is usually
    reproducible only when you have multiple FCS and AIDAs and you elect to
    view the status on each one (I think 12 is the critical number).
    
    So, this might not be the same problem, but I would certainly suggest
    they upgrade first and take it from there...
    
    -Kevin.
    
    
    
 | 
| 2653.10 | A relink could help? | BARNA::DSMAIL |  | Tue May 06 1997 15:08 | 5 | 
|  |     There is no cluster alias.
    Before upgrading, could a relink be done to see if it helps?
    
    Best Regards,
    			Josep
 |