Hi,
I'm using a Tyan Tomcat i875P (S5102) Mainboard, running SuSE Linux 10.0 with kernel 2.6.13-15.10-smp (geeko@buildhost) (gcc version 4.0.2 20050901). Attached is my sensors.conf file.
On two different occasions, the server simply shut down. Here are the server log files:
--- 2nd time ---
Jun 1 22:20:01 secserv /usr/sbin/cron[27086]: (root) CMD (/etc/health/healthd.sh)
Jun 1 22:20:01 secserv kernel: ACPI-0463: *** Warning: Critical trip point
Jun 1 22:20:01 secserv kernel: Critical temperature reached (95 C), shutting down.
Jun 1 22:20:01 secserv kernel: klogd 1.4.1, ---------- state change ----------
Jun 1 22:20:01 secserv kernel: ACPI-0212: *** Warning: Device is not power manageable
Jun 1 22:20:01 secserv kernel: ACPI-0629: *** Warning: Unable to turn cooling device [dffd8a00] 'on'
Jun 1 22:20:02 secserv init: Switching to runlevel: 0
Jun 1 22:20:03 secserv snort: Final Flow Statistics
Jun 1 22:20:03 secserv snort: Snort exiting
Jun 1 22:20:03 secserv ntpd[5243]: ntpd exiting on signal 15
Jun 1 22:20:03 secserv sshd[4810]: Received signal 15; terminating.
Jun 1 22:20:05 secserv kernel: Kernel logging (proc) stopped.
Jun 1 22:20:05 secserv kernel: Kernel log daemon terminating.
Jun 1 22:20:06 secserv exiting on signal 15
---
--- 1st time ---
Apr 16 01:56:01 secserv /usr/sbin/cron[4541]: (root) CMD (/etc/health/healthd.sh)
Apr 16 01:56:01 secserv kernel: ACPI-0463: *** Warning: Critical trip point
Apr 16 01:56:01 secserv kernel: Critical temperature reached (80 C), shutting down.
Apr 16 01:56:01 secserv kernel: klogd 1.4.1, ---------- state change ----------
Apr 16 01:56:01 secserv kernel: ACPI-0212: *** Warning: Device is not power manageable
Apr 16 01:56:01 secserv kernel: ACPI-0629: *** Warning: Unable to turn cooling device [dfdbea00] 'on'
Apr 16 01:56:02 secserv init: Switching to runlevel: 0
Apr 16 01:56:02 secserv snort: Final Flow Statistics
Apr 16 01:56:02 secserv snort: Snort exiting
Apr 16 01:56:03 secserv sshd[10074]: Received signal 15; terminating.
Apr 16 01:56:03 secserv ntpd[30004]: ntpd exiting on signal 15
Apr 16 01:56:04 secserv kernel: Kernel logging (proc) stopped.
Apr 16 01:56:04 secserv kernel: Kernel log daemon terminating.
Apr 16 01:56:05 secserv exiting on signal 15
---
The script "healthd.sh" issues the call "sensors" to the command line and if an alarm is raised, I get notified by eMail. The CPU usually has a temperature at around 38°C, so it's rather unlikely that it really reached 80°C resp. 90°C (especially if the system is not under heavy load).
Could this possibly be a bug with lm_sensors and the kernel ACPI functions, or is something wrong with my sensors.conf file?
Any help is greatly appreciated.
Yours,
Paul