As requested, I am asking the question here.
Is there a way to find a load on my web application server without a load / stress testing?
I have deployed my application in the Apache Tomcat server. As per as I know, each web server have some limitations on serving a request, beyond which, performance degrades. e.g. If a server can handle 800 requests happily, concurrently, the more requests landing on the server will get delayed response, or may be dropped.
I am looking for some alerting mechanism by which I will get to know that the load on my web server has crossed the threshold limit, and accordingly I will able to take the required actions like e.g. adding one more node behind load balancer to distribute the load.