| [Date Prev] [Date Next] | [Thread Prev] [Thread Next] | [Date Index] [Thread Index] |
Re: [snips-users] Dozens of DataAge problems
|
On Wed, Jan 16, 2002 at 08:34:09AM -0500, Tuc wrote:
> > > Hi,
> > >
> > > We just brought our system up, and are seeing alot of :
> > >
> > > Warning - Tue Jan 15 12:40:10 2002 [hostmon]: DEVICE l.ttsg.com
> > > l.ttsg.com VAR DataAge 366 900 Secs LEVEL Warning LOGLEVEL Warning STATE
> > > down
> >
> > The DataAge issue, if memory serves, says that the variable hasn't been
> > updated for the last three (?) interations/deltas (think that's like 30
> > minutes with hostmon).
> >
> I look at the file, and it looks like its been updated, atleast
> the timestamp was. And isn't 366 less than 900, so it shouldn't alarm ?
That might be an error or a mis-nomer (perhaps Vikas can clarify - I've
not looked that closely at this code yet); think the "dataage" failure
may start closer to 0 when it's first triggered? (In any case, I think
this isn't terribly well documented, etc, at this point)
> > Think this will go away if the hostmon-client is
> > started on the machine in-quesiton and the box running hostmon-collector
> > is able to get to the control port or copy down the correct data (either
> > via rcp or scp or whatever).
>
> Its running, its available, its answering requests for other
> programs that query it. The timestamp on the file in the /tmp/hostmon_data
> directory keeps changing...
Hmmm... I saw something very similiar here, too, at some point (and I
forget what I did to make it go away or if it "just did") -- that is,
basically I could telnet in to the port from the hostmon-collector side
and see everything, but hostmon wasn't picking it up yet was still
complaining about the "Data Age."
> > Also... although keepalive-monitors, by default, likes to try to start
> > the hostmon process, I do not believe that it starts the client (we're
> > talking out-of-the-box config, etc).
> >
> Understood.
> >
> > Hope that helps...
> >
> Nope. I put extra debug in, and it didn't help me understand.
>
> Between this, and a few other things, we backed out SNIPS and
> went back to Nocol. Is this a reflection of the production version of
> SNIPS?
>
> Tuc/TTSG Internet Services, Inc.
Well, there's always the premise of never running an even-numbered or
zeroed release in production, but... yeah, I'd certainly agree with you
(amny changes this time (which is overall a good thing), but also a lot
more "weird" things compared to previous releases (perhaps it's just
that we're well-familiar with the "tweaks" for those now).
Russell
--
Russell M. Van Tassell
russell at loosenut com
"Windows 95: n. 32 bit extensions and a graphical shell for a 16 bit patch
to an 8 bit operating system originally coded for a 4 bit microprocessor,
written by a 2 bit company that can't stand 1 bit of competition."
- Gnu-Win32/CygWin32 FAQ (http://www.cygnus.com/misc/gnu-win32)
|