0

Through ISA running on SBS 2003 SP2 as a LAN gateway, no matter how many clients in the LAN are running, I'm seeing massive amounts of HTTP timeouts to any external service. The server has two network cards: one is used for LAN/DMZ (different IP ranges; using virtual LANs on one port) and one card for WAN. ISA is not used as a web proxy and I verified the timeouts with various tools, from the usual visual browsers over wget and telnet and in e.g. PHP applications.

I used some scripts and rrdtool to make this graph, which measures loading time of an external resource (I've had already tested seven different external websites, of course with appropriate permissions, to test load time; the all look the same); the unit (seen on the total left) is in seconds; I've set a 30 second max timeout while gathering the data.

(Note: this image is around 270kb in size and 16000 pixel wide!) ISA Timeout RRD graph http://markus.fischer.name/tmp/isa_timeout.png

This graph spans 24 hours; the outage from around 11 to 13:20 was due a re-configuration (which obviously didn't change anything).

I've already verified the following things:

  • traffic from LAN/DMZ over Server to WAN causes timeouts
  • traffic from Server to WAN causes no timeouts
  • traffic from LAN/DMZ to Server causes no timeouts

Hardware, like switches and cables, have already been verified to not cause this.

Update:

I decided to take this up higher and open an M$ support ticket for that issue. I'll append updates as I receive them.

Update 2:

Two weeks have passed, not much progress. I'm actually not going after the ticket myself but we've a company which does that for us. I think that was a smart move, saves me time for other things.

Anyway, the ticket was forgotten by M$ in the first week thus only last week there was progress, leading to a patch for ISA to be deployed, which unfortunately didn't change anything.

Next move was that they requested extensive reporting information which they've received yesterday.

Update 3:

Now it's 10th of August. The problem suddenly disappeared on 6th of August. Right in the middle of the day at around 11:17 the last of the permanently measured timeouts occurred. Since then, no such problems from no network from no external hosts in this type of scenario could be detected.

There could be no single action identified to stay in connection with this sudden disappearance. The night before was a partial outage within the company and at 12:30 we reseted some hardware which didn't full recover after the outage (we only got aware that the problem has gone until the late afternoon this day).

From my support company as well as from M$ itself, besides gathering logs and reports, nothing came up before and after this so far. Since time is money I've to suspend further research into this for now ...

mark
  • 1,516
  • 4
  • 21
  • 33

4 Answers4

1

Are you using ISA as a web proxy or just as a router/NATer? I find that if ISA is the gateway and I'm not using ISA as the web proxy then I do have problems opening sites. I've never pinned the problem down because it was easier to just use it as the web proxy and the problem disappeared.

JR

John Rennie
  • 7,756
  • 1
  • 22
  • 34
0

Have you tried running the SBS 2003 Best Practices Analyzer ? Among other things it will check several NIC related issues and provide solutions. Make sure you check for updates within the Analyzer before running the tests.

JS.
  • 3,901
  • 21
  • 18
0

have you verified your results with a tool like wget? Are you using ISA as a proxy, or a transparent proxy? Sometimes clients have issues if proxy auto-detection is set up in IE. If you turn this off, or hard configure it to use your ISA proxy, then things get a lot better.

I had this same problem at startup at one of my clients, but not quite as reliable or prevalent as you seem to have it.

Chris K
  • 659
  • 4
  • 11
  • Yes, it has been verified with wget before conducting automated loading tests every second. The ISA web proxy functionality is not used. – mark Jul 09 '09 at 08:21
0

The problem went away from one day to another, without anyone doing anything.

I'm not paranoid, but it might have been a PC being part of a bot net or so.

mark
  • 1,516
  • 4
  • 21
  • 33