Apache Http Server Tomcat

The Apache Tomcat Project is proud to announce the release of version 7.0.108 of Apache Tomcat. This release contains a number of bug fixes and improvements compared to version 7.0.107, including a fix for a potential file descriptor leak when WebSocket connections are attempted and fail.

Apache Tomcat is an open source web server developed by the Apache Software Foundation. It runs web applications based on servlets and JavaServer Pages (JSP) and provides an HTTP server environment in which Java code can run. Its built-in servlet container is called Catalina, and the catalina startup scripts live in Tomcat's bin directory.

From a Tomcat job perspective, you may be asked a number of questions about upgrades, server maintenance, managing users and new features. Apache Tomcat and Apache HTTP Server were created for different purposes, but people often confuse their functionality because they solve similar problems. Along with the Tomcat interview questions, it is also beneficial to prepare related topics such as default ports and web containers.

Here is a list of the most commonly asked Tomcat interview questions so that you can prepare for your next job interview. Keep in mind that reading alone will not teach you the fundamentals, so make sure you also understand how Tomcat actually works. A strong candidate also knows the installation, virtual hosting functions and security of the web server. Practical experience with Tomcat is an added advantage.

Below is a list of the best Apache Tomcat interview questions and answers

1) What is Apache Tomcat?

Apache Tomcat, also referred to as Tomcat Server, is an open-source Java servlet container developed by the Apache Software Foundation (ASF). Tomcat implements several Java EE specifications, including JavaServer Pages (JSP), Java Servlet, WebSocket and Java EL, and provides a 'pure Java' HTTP web server environment in which Java code can run.

Tomcat is managed and maintained by an open community of developers under the aegis of the Apache Software Foundation.

2) What is Tomcat?

Tomcat is a web server and Java servlet container from the Apache Software Foundation. It can be used standalone or together with another web server. Recent versions serve static content at a speed comparable to that of a native HTTP server. It follows the request-response message exchange pattern to serve web pages.

3) What is a servlet container in Apache Tomcat?

A servlet container is the component of a web server that interacts with Java servlets. It is responsible for managing the life cycle of servlets, mapping a URL to a particular servlet and ensuring that the requester has the correct access rights. Servlet containers also handle requests for servlets, JavaServer Pages (JSP) files and other files that include server-side code.

4) What is Catalina in Apache Tomcat?

Catalina is Tomcat's servlet container. It is a Java engine built into Tomcat that implements the specifications for servlets and JavaServer Pages and provides the environment in which servlets run. Once Jasper has compiled a JSP into a servlet, Catalina handles it from there.

5) What is Tomcat cluster?

A Tomcat cluster is used to run large applications. It consists of multiple Tomcat server instances that share the traffic, which makes it well suited to load balancing.

6) What is Tomcat high availability?

It is a feature added to Tomcat to facilitate scheduled system upgrades without affecting the live environment. This is done by dispatching live traffic from the main server to a temporary server on an entirely different port. The temporary server keeps handling requests until the main server has been upgraded on the main port. It is a very beneficial feature for handling requests on a high traffic web application.

7) Do you have any idea about the history of Tomcat?

Tomcat started as a servlet reference implementation by James Duncan Davidson, a software architect at Sun Microsystems. He later helped make the project open source and played a key role in bringing it from Sun Microsystems to the Apache Software Foundation. A notable side effect of Tomcat's development was the creation of Apache Ant, a software build automation tool.

8) Who is responsible for Tomcat?

The Apache Software Foundation is an organization that looks after many open source projects. Jakarta was the name of its umbrella for Java-based projects, and Tomcat is a web server that serves Java-based web applications. Tomcat originated as part of the Apache Jakarta project and serves as a reference implementation of the servlet and JSP standards.

9) What is the Tomcat default port, and can Tomcat use SSL?

The default port for Tomcat is 8080. It can be changed by editing the server.xml file in the conf folder of the Tomcat installation directory: set the port attribute of the Connector element (for example, Connector port="8080") to the desired value, then restart Tomcat so that the change takes effect.

Tomcat can use SSL, but it needs some configuration first. You have to do the following tasks:

  • Generate a keystore
  • Then add a connector in server.xml
  • Restart Tomcat
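
As a quick check after editing server.xml, the configured ports can be read back with a few lines of Python. This is only a sketch: the sample XML below is a trimmed, hypothetical fragment (a real SSL Connector also needs keystore settings), and the function name is an assumption made for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical, trimmed server.xml used only for this example.
SAMPLE_SERVER_XML = """\
<Server port="8005" shutdown="SHUTDOWN">
  <Service name="Catalina">
    <Connector port="8080" protocol="HTTP/1.1" redirectPort="8443"/>
    <Connector port="8443" protocol="HTTP/1.1" SSLEnabled="true"
               scheme="https" secure="true"/>
  </Service>
</Server>
"""


def connector_ports(server_xml):
    """Return (port, ssl_enabled) for every Connector element found."""
    root = ET.fromstring(server_xml)
    result = []
    for connector in root.iter("Connector"):
        port = int(connector.get("port"))
        ssl = connector.get("SSLEnabled", "false").lower() == "true"
        result.append((port, ssl))
    return result


for port, ssl in connector_ports(SAMPLE_SERVER_XML):
    print(port, "SSL" if ssl else "plain")
```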

10) What does the server.xml configuration file mean?

The server.xml file is the main Tomcat configuration file. It specifies the entire Tomcat setup and configuration that is applied at startup.

11) What does the web.xml configuration file mean?

The web.xml file is defined by the servlet specification. It is the deployment descriptor that contains the information used to deploy the components of a web application.

12) What do you understand by the tomcat-users.xml configuration file?

The tomcat-users.xml configuration file is where all Tomcat users are specified and defined. It is located in the conf folder under the Tomcat server root.

13) What is a connector and how is it used in Tomcat?

The connectors in the Apache Tomcat project are web server plug-ins that connect web servers to Tomcat and other back-ends. The supported web servers are:

  • Apache HTTP Server with a plug-in named mod_jk.
  • Microsoft IIS with a plug-in named ISAPI redirector.
  • iPlanet Web Server with a plug-in named NSAPI redirector.

14) What is Jasper?

Jasper is the name of Tomcat's JSP engine. It parses JSP files and compiles them into Java code as servlets. At run time it detects changes to JSP files and recompiles them. Jasper is thus the JavaServer Pages handler, and it deals internally with the necessary compiling.

15) Tell us something about Coyote?

Coyote is the connector component of Tomcat that supports the HTTP 1.1 protocol as a web server. It allows Catalina to act as a basic web server that serves local files as HTTP documents. It listens for incoming connections on a specific TCP port and forwards the requests to the Tomcat engine. Once a response is produced, Coyote sends it back to the client.

16) What is a host and context in Tomcat?

The host is an element in Tomcat that associates a network name with the server. The context, on the other hand, is an element that represents a web application running within a particular virtual host. A web application is based on a Web Application Archive (WAR) file or a corresponding directory holding the unpacked contents, as described in the servlet specification.

Other fundamentals

Apache Tomcat is also referred to as Tomcat server. It is an open source Java servlet container developed by the Apache Software Foundation (ASF). Its features are:

  • It is an absolute lightweight application.
  • It provides freedom as it is an open source project.
  • More stability to the server.
  • Enhanced security levels.

Some Pros

  • It is easy to install and runs as a single application deployment.
  • Easy deployment of applications in prod.
  • Its integration with eclipse is easy.

Some Cons

  • It is entirely Java based.
  • Making changes in the server configuration is a tricky job.

Summary


Running a cluster of Tomcat servers behind the Web server can be a demanding task if you wish to achieve maximum performance and stability. This article describes best practices for accomplishing that.
By Mladen Turk

Fronting Tomcat

One might ask: why put the Web server in front of Tomcat at all? Thanks to the latest advances in Java Virtual Machine (JVM) technology and the Tomcat core itself, standalone Tomcat is quite comparable in performance to native web servers. Even when delivering static content it is only 10% slower than recent Apache 2 web servers.
The answer is: scalability.

Tomcat can serve many concurrent users by assigning a separate thread of execution to each concurrent client connection. It does that nicely, but there is a problem when the number of concurrent connections rises. The time the operating system spends managing those threads will degrade overall performance. The JVM will spend more time managing and switching threads than doing the real job, serving the requests.

Besides connectivity there is one more significant problem, and it is caused by the applications running on Tomcat. A typical application will process client data, access the database, do some calculations and present the data back to the client. All that can be a time-consuming job that in most cases must be finished inside half a second to give the user the perception of a working application. Simple math shows that for a 10ms application response time you will be able to serve at most 50 concurrent users before your users start complaining. So what to do if you need to support more users? The simplest thing is to buy faster hardware, add more CPUs or add more boxes. Two 2-way boxes are usually cheaper than one 4-way box, so adding more boxes is generally a cheaper solution than buying a mainframe.

The first thing to do to ease the load on Tomcat is to use the Web server for serving static content like images, etc.

Figure 1. Generic configuration

Figure 1 shows the simplest possible configuration scenario. Here the Web server is used to deliver static content while Tomcat only does the real job - serving the application. In most cases this is all that you will need. With a 4-way box and 10ms application time you'll be capable of serving 200 concurrent users, thus giving 3.5 million hits per day, which is by all means a respectable number.

For that kind of load you generally do not need the Web server in front of Tomcat. But here comes the second reason to put the Web server in front, and that is creating a DMZ (demilitarized zone). Putting the Web server on a host inserted as a 'neutral zone' between a company's private network and the internet or some other outside public network gives the applications hosted on Tomcat the capability to access company private data, while securing the access to other private resources.

Figure 2. Secure generic configuration

Besides having a DMZ and secure access to a private network there can be many other factors, like the need for custom authentication for example.

If you need to handle more load you will eventually have to add more Tomcat application servers. The reason can be either that your client load just cannot be handled by a single box or that you need some sort of failover in case one of the nodes breaks.

Figure 3. Load balancing configuration

A configuration containing multiple Tomcat application servers needs a load balancer between the web server and Tomcat. For Apache 1.3, Apache 2.0 and IIS Web servers you can use the Jakarta Tomcat Connector (also known as JK), because it offers both software load balancing and sticky sessions. For the upcoming Apache 2.1/2.2 use the advanced mod_proxy_balancer, a new module designed and integrated within the Apache httpd core.

Calculating Load

When determining the number of Tomcat servers that you will need to satisfy the client load, the first and major task is determining the Average Application Response Time (hereafter AART). As said before, to satisfy the user experience the application has to respond within half a second. The content received by the client browser usually triggers a couple of physical requests to the Web server (e.g. images). A web page usually consists of html and image data, so the client issues a series of requests, and the time in which all this gets processed and delivered is called AART. To get the most out of Tomcat you should limit the number of concurrent requests to 200 per CPU.

So we can come up with a simple formula to calculate the maximum number of concurrent connections a physical box can handle: divide 500 by the AART in milliseconds, cap the result at 200, and multiply by the number of CPUs.
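
The figures quoted in this article (50 concurrent users per CPU at a 10ms AART, 200 on a 4-way box, and the cap of 200 per CPU) are consistent with the rule of thumb sketched below in Python. Treat it as one reading of those numbers rather than a definitive formula; the function name and the parameter names are assumptions made for illustration.

```python
def max_concurrent_requests(aart_ms, cpus, budget_ms=500.0, per_cpu_cap=200):
    """Estimate how many concurrent requests one physical box can handle.

    Assumes every request must finish within budget_ms (half a second)
    and that one CPU should never get more than per_cpu_cap concurrent
    requests.
    """
    if aart_ms <= 0 or cpus <= 0:
        raise ValueError("aart_ms and cpus must be positive")
    per_cpu = min(budget_ms / aart_ms, per_cpu_cap)
    return int(per_cpu * cpus)


print(max_concurrent_requests(10, 1))  # 50
print(max_concurrent_requests(10, 4))  # 200
print(max_concurrent_requests(1, 2))   # 400, capped at 200 per CPU
```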

The other thing that you must take care of is the network throughput between the Web server and the Tomcat instances. This introduces a new variable called Average Application Response Size (hereafter AARS), that is the number of bytes of all content on a web page presented to the user. On a standard 100Mbps network card with 8 bits per byte, the maximum theoretical throughput is 12.5 MBytes per second.

Dividing that throughput by the AARS gives the limit: for a 20KB AARS this will give a theoretical maximum of 625 concurrent requests. You can add more cards or use faster 1Gbps hardware if you need to handle more load.
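
The network limit can be sketched the same way. The function below is an illustration under the assumption of decimal units (1 Mbit = 1000 Kbit), which is what makes 100Mbps equal 12.5 MBytes per second; the function name is hypothetical.

```python
def max_requests_by_network(aars_kbytes, link_mbps=100.0):
    """Upper bound on requests that the network link can carry per second.

    Decimal units are assumed throughout, with 8 bits per byte.
    """
    if aars_kbytes <= 0 or link_mbps <= 0:
        raise ValueError("aars_kbytes and link_mbps must be positive")
    throughput_kbytes = link_mbps * 1000 / 8  # 100 Mbps -> 12500 KB/s
    return int(throughput_kbytes / aars_kbytes)


print(max_requests_by_network(20))        # 625 on a 100Mbps card
print(max_requests_by_network(20, 1000))  # 6250 on 1Gbps hardware
```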


The formulas above will give you a rudimentary estimate of the number of Tomcat boxes and CPUs that you will need to handle the desired number of concurrent client requests. If you have to deploy the configuration without having the actual hardware, the closest you can get is to measure the AART on a test platform and then compare the hardware vendors' Specmarks.

Fronting Tomcat with Apache

If you need to put Apache in front of Tomcat, use Apache2 with the worker MPM. You can use Apache1.3 or Apache2 with the prefork MPM for handling simple configurations like the one shown in Figure 1. If you need to front several Tomcat boxes and implement load balancing, use Apache2 with the worker MPM compiled in.

MPM, or Multi-Processing Module, is an Apache2 core feature responsible for binding to network ports on the machine, accepting requests, and dispatching children to handle the requests. MPMs must be chosen during configuration and compiled into the server. Compilers are capable of optimizing a lot of functions if threads are used, but only if they know that threads are being used. Because some MPMs use threads on Unix and others don't, Apache will always perform better if the MPM is chosen at configuration time and built into Apache.

The worker MPM offers higher scalability than the standard prefork mechanism, where each client connection creates a separate Apache process. It combines the best of two worlds, having a set of child processes each having a set of separate threads. There are sites that are running 10K+ concurrent connections using this technology.


Connecting to Tomcat

In the simplest scenario, when you need to connect to a single Tomcat instance, you can use mod_proxy, which comes as part of every Apache distribution. However, using the mod_jk connector will provide approximately double the performance. There are several reasons for that, and the major one is that mod_jk manages a persistent connection pool to Tomcat, thus avoiding opening and closing connections to Tomcat for each request. The other reason is that mod_jk uses a custom protocol named AJP and by that avoids assembling and disassembling header parameters for each request that are already processed on the Web server. You can find more details about the AJP protocol on the Jakarta Tomcat connectors site.

For those reasons you should use mod_proxy only for low load sites or for testing purposes. From now on I'll focus on mod_jk for fronting Tomcat with Apache, because it offers better performance and scalability.

One of the major design parameters when fronting Tomcat with Apache or any other Web server is to synchronize the maximum number of concurrent connections. Developers often leave the default configuration values in both Apache and Tomcat, and are faced with spurious error messages in their log files. The reason for that is very simple. Tomcat and Apache can each accept only a predefined number of connections. If those two configuration parameters differ, usually with Tomcat having the lower configured number of connections, you will be faced with sporadic connection errors. If the load gets even higher, your users will start receiving HTTP 500 server errors even if your hardware is capable of dealing with the load.

Determining the maximum number of connections to Tomcat in the case of the Apache web server depends on the MPM used.

MPM        Configuration parameter
Prefork    MaxClients
Worker     MaxClients
WinNT      ThreadsPerChild
Netware    MaxThreads

On the Tomcat side the configuration parameter that limits the number of allowed concurrent requests is maxProcessors, with a default value of 20. This number needs to be equal to the MPM configuration parameter.


Load balancing

Load balancing is one of the ways to increase the number of concurrent client connections to the application server. There are two types of load balancers that you can use: the first is a hardware load balancer and the second is a software load balancer. If you are using load balancing hardware instead of mod_jk or a proxy, it must support a compatible passive or active cookie persistence mechanism, and SSL persistence.

Mod_jk has an integrated virtual load balancer worker that can contain any number of physical workers, or particular physical nodes. Each of the nodes can have its own balance factor, also called the worker's quota or lbfactor. The lbfactor is how much we expect this worker to work, or the worker's work quota. This parameter usually depends on the hardware topology itself, and it makes it possible to create a cluster with different hardware node configurations. Each lbfactor is compared to all other lbfactors in the cluster and its relationship gives the actual load. If the lbfactors are equal the workers' load will be equal as well (e.g. 1-1, 2-2, 50-50, etc.). If the first node has lbfactor 2 while the second has lbfactor 1, then the first node will receive two times more requests than the second one. This asymmetric load configuration makes it possible to have nodes with different hardware architecture.
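
The relationship between lbfactors and load can be illustrated with a short Python sketch. It only models the expected long-run share of requests described above, not mod_jk's actual balancing algorithm; the function and node names are assumptions.

```python
from typing import Dict


def load_shares(lbfactors: Dict[str, int]) -> Dict[str, float]:
    """Return each worker's expected fraction of the total request load."""
    total = sum(lbfactors.values())
    if total <= 0:
        raise ValueError("at least one lbfactor must be positive")
    return {name: factor / total for name, factor in lbfactors.items()}


# Equal factors give equal load, whatever their absolute value.
print(load_shares({"node1": 50, "node2": 50}))
# lbfactor 2 against 1: node1 gets twice as many requests as node2.
print(load_shares({"node1": 2, "node2": 1}))
```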

In the simplest load balancer topology with only two nodes in the cluster, the number of concurrent connections on the web server side can be twice as high as on a particular node. But ..

The statement above means that the sum of allowed connections on the particular nodes does not give the total number of connections allowed. Each node has to allow a slightly higher number of connections than its share of the desired total. This number is usually 20% higher, which means that the per-node limits have to add up to about 1.2 times the desired total.

So if you wish to have 100 concurrent connections with two nodes, each node will have to handle a maximum of 60 connections. The 20% margin factor is experimental, and depends on the Apache server used. For prefork MPMs it can rise up to 50%, while for NT or Netware its value is 0%. The reason is that each particular child process manages its own balance statistics, thus giving this 20% error for multiple child process web servers.

The minimum configuration for a three node cluster shown in the upper example will give a 25%-50%-25% distribution of the load, meaning that node2 will get as much load as the other two members together. With MaxClients=200 it will also impose maxProcessors values of 60, 120 and 60 on node1, node2 and node3 respectively.

Using simple math the load should be 50-100-50, but we needed to add the 20% load distribution error. In case this 20% additional load is not sufficient, you will need to set a higher value, up to 50%. Of course the average number of connections for each particular node will still follow the load balancer distribution quota.
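
The per-node limits can be worked out with a small Python sketch. It assumes the limit for a node is its lbfactor share of MaxClients plus the margin, rounded up; the function name and the integer percentage parameter are choices made for this illustration.

```python
def node_limits(max_clients, lbfactors, margin_pct=20):
    """Per-node connection limit: lbfactor share of max_clients plus margin."""
    total = sum(lbfactors.values())
    if total <= 0:
        raise ValueError("at least one lbfactor must be positive")
    limits = {}
    for name, factor in lbfactors.items():
        numerator = max_clients * factor * (100 + margin_pct)
        denominator = total * 100
        # Integer ceiling division avoids floating point rounding surprises.
        limits[name] = -(-numerator // denominator)
    return limits


# Two equal nodes sharing 100 connections: 60 each.
print(node_limits(100, {"node1": 1, "node2": 1}))
# Three nodes with lbfactors 1-2-1 and MaxClients=200: 60, 120, 60.
print(node_limits(200, {"node1": 1, "node2": 2, "node3": 1}))
# The same two-node case with the 50% margin quoted for prefork: 75 each.
print(node_limits(100, {"node1": 1, "node2": 1}, margin_pct=50))
```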


Sticky sessions and failover

One of the major problems with having multiple backend application servers is determining the client-server relationship. Once the client makes a request to a server application that needs to track user actions over a designated time period, some sort of state has to be enforced inside the stateless http protocol. Tomcat issues a session identifier that uniquely distinguishes each user. The problem with that session identifier is that it does not carry any information about the particular Tomcat instance that issued it.

Tomcat in that case adds an extra configurable jvmRoute mark to that session. The jvmRoute can be any name that will uniquely identify the particular Tomcat instance in the cluster. On the other side of the wire, mod_jk will use that jvmRoute as the name of the worker in its load balancer list. This means that the name of the worker and the jvmRoute must be equal.
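
The routing idea can be sketched in Python. The sketch assumes the jvmRoute is appended to the session identifier after a dot, which is how Tomcat marks sessions when jvmRoute is set; the function names, session values and worker names are hypothetical, and this is not mod_jk's own code.

```python
def route_from_session_id(session_id):
    """Return the jvmRoute suffix of a session id, or None if it has none."""
    _, separator, route = session_id.rpartition(".")
    return route if separator and route else None


def pick_sticky_worker(session_id, workers):
    """Return the worker named after the session's jvmRoute, if one exists."""
    route = route_from_session_id(session_id)
    return route if route in workers else None


workers = {"node1", "node2", "node3"}
print(pick_sticky_worker("A1B2C3D4E5.node2", workers))  # node2
print(pick_sticky_worker("A1B2C3D4E5", workers))        # None: no route mark
print(pick_sticky_worker("A1B2C3D4E5.node9", workers))  # None: unknown worker
```

When no worker matches, a balancer has to fall back to its normal selection, which is why the worker name and the jvmRoute have to be identical.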

When having multiple nodes in a cluster you can improve your application availability by implementing failover. Failover means that if the particular elected node cannot fulfill the request, another node will be selected automatically. In the case of three nodes you are actually doubling your application availability. The application response time will be slower during failover, but none of your users will be rejected. Inside the mod_jk configuration there is a special configuration parameter called worker.retries that has a default value of 3, but that needs to be adjusted to the actual number of nodes in the cluster.

If you add more than three workers to the load balancer, adjust the retries parameter to reflect that number. It will ensure that even in the worst case scenario the request gets served if there is a single operable node. Of course, the request will be rejected if there are no free connections available on the Tomcat side, so you should increase the allowed number of connections on each Tomcat instance. In the three node scenario (1-2-1), if one of the nodes goes down, the other two will have to take its load. So if the load is divided equally you will need to set the following Tomcat configuration:

This configuration will ensure that 200 concurrent connections will always be allowable no matter which of the nodes goes down. The reason for doubling the number of processors on node1 and node3 is that they need to handle the additional load in case node2 goes down (load 1-1). Node2 also needs the adjustment because if one of the other two nodes goes down, the load will be 1-2. As you can see the 20% load error is always calculated in.
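
The sizing described above can be reproduced with a short Python sketch. It assumes each node must be able to carry its lbfactor share of MaxClients, plus the 20% margin, in the worst single-node failure; the function name and the resulting values are derived from that assumption, not quoted from a configuration listing.

```python
def failover_limits(max_clients, lbfactors, margin_pct=20):
    """Worst-case connection limit per node when any one other node fails."""
    if len(lbfactors) < 2:
        raise ValueError("failover needs at least two nodes")
    limits = {}
    for name, factor in lbfactors.items():
        worst = 0
        for failed in lbfactors:
            if failed == name:
                continue
            # Sum of lbfactors of the nodes that are still running.
            remaining = sum(f for n, f in lbfactors.items() if n != failed)
            numerator = max_clients * factor * (100 + margin_pct)
            needed = -(-numerator // (remaining * 100))  # ceiling division
            worst = max(worst, needed)
        limits[name] = worst
    return limits


# Three nodes with lbfactors 1-2-1 and MaxClients=200.
print(failover_limits(200, {"node1": 1, "node2": 2, "node3": 1}))
# {'node1': 120, 'node2': 160, 'node3': 120}
```

Under these assumptions node1 and node3 double from 60 to 120, which matches the doubling mentioned in the text, and node2 rises to 160 for the 1-2 case.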

Figure 4. Three node example load balancer
Figure 5. Failover for node2

As shown in the two figures above, setting maxProcessors depends both on the 20% load balancer error and on the expected single node failure. The calculation must include the node with the highest lbfactor as the worst case scenario.


Domain Clustering model

Since JK version 1.2.8 there is a new domain clustering model that offers horizontal scalability and better performance for a Tomcat cluster.

A Tomcat cluster only allows session replication to all nodes in the cluster. Once you work with more than 3-4 nodes there is too much overhead and risk in replicating sessions to all nodes, so we split the nodes into clustered groups. The newly introduced worker attribute domain lets mod_jk know to which other nodes a session gets replicated (all workers with the same value in the domain attribute). A load balancing worker therefore knows on which nodes the session is alive. If a node fails or is being taken down administratively, mod_jk chooses another node that has a replica of the session.

For example, if you have a cluster with four nodes you can make two virtual domains and replicate the sessions only inside the domains. This will lower the replication network traffic by half.

Figure 6. Domain model clustering

For the above example the configuration would look like:
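
A minimal sketch of such a workers.properties for four nodes in two domains is rendered below from Python. The worker names, host addresses, port and domain names are assumptions made for illustration; only the directive names (type, host, port, lbfactor, domain, balance_workers) are taken from mod_jk.

```python
# Hypothetical nodes: name -> (host, AJP port, domain).
NODES = {
    "node1": ("10.0.0.1", 8009, "A"),
    "node2": ("10.0.0.2", 8009, "A"),
    "node3": ("10.0.0.3", 8009, "B"),
    "node4": ("10.0.0.4", 8009, "B"),
}


def render_workers(nodes, balancer="loadbalancer"):
    """Render workers.properties text for a domain-clustered balancer."""
    lines = ["worker.list=" + balancer]
    for name, (host, port, domain) in nodes.items():
        lines += [
            "worker.%s.type=ajp13" % name,
            "worker.%s.host=%s" % (name, host),
            "worker.%s.port=%d" % (name, port),
            "worker.%s.lbfactor=1" % name,
            # Workers sharing a domain replicate sessions among themselves.
            "worker.%s.domain=%s" % (name, domain),
        ]
    lines += [
        "worker.%s.type=lb" % balancer,
        "worker.%s.balance_workers=%s" % (balancer, ",".join(nodes)),
    ]
    return "\n".join(lines)


config = render_workers(NODES)
print(config)
```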

Now assume you have multiple Apaches and Tomcats. The Tomcats are clustered and mod_jk uses sticky sessions. Now you are going to shut down one Tomcat for maintenance. All Apaches will start connections to all Tomcats. You end up with all Tomcats getting connections from all Apache processes, so the number of threads needed inside the Tomcats will explode. If you group the Tomcats into domains as explained above, the connections will normally stay inside the domain and you will need far fewer threads.

Fronting Tomcat with IIS

Just like the Apache Web server for Windows, Microsoft IIS maintains a separate child process and thread pool for serving concurrent client connections. For non-server products like Windows 2000 Professional or Windows XP the number of concurrent connections is limited to 10. This means that you cannot use workstation products for production servers unless the 10 connection limit fulfils your needs. The server range of products does not impose that 10 connection limit, but just like with Apache, 2000 connections is the limit at which thread context switching takes its share and slows down the effective number of concurrent connections. If you need higher load you will need to deploy additional web servers and use Windows Network Load Balancer (WNLB) in front of the Tomcat servers.

Figure 7. WNLB High load configuration

For topologies using Windows Network Load Balancer the same rules are in place as for Apache with the worker MPM. This means that each Tomcat instance will have to handle a 20% higher connection load per node than its real lbfactor. The workers.properties configuration must be identical on each node that constitutes the WNLB, meaning that you will have to configure all four Tomcat nodes.

Apache 2.2 and new mod_proxy

For the new Apache 2.1/2.2, mod_proxy has been rewritten and has a new AJP capable protocol module (mod_proxy_ajp) and an integrated software load balancer (mod_proxy_balancer).

Because it can maintain a constant connection pool to backend servers it can replace the mod_jk functionality.

The above example shows how easy it is to configure a Tomcat cluster with the proxy load balancer. One of the major advantages of using proxy is the integrated caching, and no need to compile an external module.

Mod_proxy_balancer has an integrated manager for dynamic parameter changes. It allows changing session routes or disabling a node for maintenance.

Figure 8. Changing BalancerMember parameters

The future development of mod_proxy will include the option to dynamically discover the particular node topology. It will also allow loadfactors and session routes to be updated dynamically.

About the Author

Mladen Turk is a Developer and Consultant for JBoss Inc in Europe, where he is responsible for native integration. He is a long time committer for the Jakarta Tomcat Connectors, Apache Httpd and Apache Portable Runtime projects.

Links and Resources

Jakarta Tomcat connectors documentation
Apache 2.0 documentation
Apache 2.1 documentation