We have a high traffic website, at peak it has 1000
concurrent users, and in minimum it has 100
users at the same time. In average it has 40,000
to 100,000
visit a day. The problem is sometimes it load very slow(we named this time as disaster time :) ), In in that time when we try to load website with Firefox
, it shows waiting...
(I tried it with many providers around the world)
We monitor the server at disaster times , CPU load
, Memory Usage
are normal. Also slow query log of MySQL
doesn't any query up to 1 sec
. Apache
hasn't any errors. iotop
doesn't show anything that causes this disaster.
It is very interesting that disaster time and peak times don't have any relations. Sometimes disaster happen at 300
concurrent user and another time different. I can't find any relation between them.
How can I trace the packets at disaster time? I want to know this disaster is our Data Center's fault (such as upstream or firewall) or our server fault(such as Apache
configuration, web application or anything else that I don't know).
For additional data just add a comment, then I edit my question to provide the data that you need to answer.