Dear Sys Mgrs,
In brief, I requested information on systems to monitor for significant failures
which could alert me via a pager (original message at the end). In the end, we
opted for the NetWORKS system from Caravelle 613-596 2802. This allows us to
query SNMP agents and to dial our paging service (or send email or pop up a
window, etc) when it detects a condition which we define. (We use SNMP rather
than just "ping" because we need to know if a machine is fully functional
whereas it can be crippled but still respond to pings).
Other systems which were recommended or that we considered were:
1) Network Managment Systems. Tivoli, HP OpenView, Sun's SunNet Manager,
Cabletron's Spectrum. These allow you to query SNMP agents and do up/down
tests. With add-on packages, they can do paging as well. However, they do a
lot more than simply detect problems; thus they require a lot of configuration
and are a lot more expensive. However, ultimately, this is the direction that
we will go. It is simply overkill right now.
2) Public Domain Software. Many people suggested that I wrote my own program.
Others suggested that I check out "watcher" ftp from
ftp.ucar.edu:/pub/watcher.tar.Z or your favourite archive site.
3) Non-standard systems. These are systems which don't support SNMP such as
Control Tower from MSI NetSys Inc (810) 333 8090 or Patrol from BMC 800-841-2031
but do give you lots of other goodies instead.
I'd like to thank the following for responding:
mattb@BCAA.BC.CA (Matt Binnie)
x092306@hyperion.LANL.GOV (Jerry Weber C-8/IS-5)
kmac@baosc.com (Keith McCloskey x8110)
Gene Rackow <rackow@mcs.anl.gov>
"Jan C. Boyer - Boston" <jcb@denver.ssds.com>
feldt@phyast.nhn.uoknor.edu (Andy Feldt)
jay@intermec.com (Jay Schlegel x6878)
brant@dcs-systems.com (Brant Henderson)
Dan Stromberg - OAC-DCS <strombrg@hydra.acs.uci.edu>
john@oncology.uthscsa.edu (John Justin Hough)
Carlos Ojeda <cojeda@hq.nasa.gov>
Stuart Myles (stuart@mop.com)
----- Begin Included Message -----
Dear Systems Managers,
I'd like to find out about event-monitoring systems. Ideally, I'd like to have
a system which would detect significant systems events (e.g. a leased line or
server going down) and notify me (perhaps via pager). And I'm interested in the
cost, flexibility, reliability of these types of things.
The names of systems which spring to mind include Sun's SunNet Manager, Tivoli's
Tivoli Management System, HP's OpenView. I'd be interested in opinions on these
and any other possible systems. (My network is a mixture of Suns, HP's and
PC's).
If any of you have any experience or even just hints of where to look, please
let me know, and I will summarise to the list.
TIA
Stuart Myles
Systems Administrator Phone: 215 995 1457
Cooper Neff Technologies, LP Fax: 215 995 1451
3 Radnor Corporate Centre,
Radnor, PA 19087 Internet: stuart@mop.com
----- End Included Message -----
This archive was generated by hypermail 2.1.2 : Fri Sep 28 2001 - 23:09:00 CDT