An on premise PaaS offering would provide better cloud control

Longjump, a PaaS offering to build applications (see our earlier coverage on the PaaS players here) announced an on premise version of its software last week. The company cites lack of cloud control reasons for enterprises to try out their software on premise. By cloud control, I mean gaining visibility and governance supporting compliance (typically legal), and audit requirements. (See overview of Longjump below - image courtesy: Longjump)

longjump-model

Companies who need to know the location of their customer’s data may not really know where their data is when using a cloud provider, unless this was found and put in writing during the initial talks. Data that is used less frequently may also be backed up in data centers that are globally distributed where regulations change per region. Such issues can be compounded when using a cloud storage provider. While the on premise offering negates the benefits derived from a multi tenant environment, however, is a right move from a governance view point in the absense of a legal framework on adequate controls in cloud environments.

Amazon introduces Elastic MapReduce (Hadoop Framework) Service

Today Amazon introduced the Hadoop framework as a service called Elastic MapReduce. This adds a new layer or dimension to the definition of cloud computing: Framework as a Service (FaaS). Using Amazon Elastic MapReduce (EMR) one can process large data sets on clusters of servers. The value proposition of EMR is to allow business, researchers, analysts and developers to easily process large amounts of data without worrying about the infrastructure setup (for example security settings when interacting between EC2 nodes among others), tuning and management of the map reduce framework. EMR is based on the popular open source Apache Hadoop map reduce framework.

If you already own an AWS account then it takes a few steps in AWS Console to get started and running with EMR. See the exhibit below for a high level component overview of how EMR works.

Amazon Elastic Map Reduce

In your implementation of the Hadoop map reduce framework, the core components are the implementations of a mapper by realizing Hadoop’s “Mapper” interface and a reducer by realizing Hadoop’s “Reducer” interface. These two components should reside in Amazon’s Simple Storage Service (S3) as shown in exhibit above. The data set that is meant to be analyzed should be made available (for now) on S3 which is specified as input in AWS Console and the output from the aggregator (or reducer) is specified as output (can be on S3) in the console. After this setup, all that is left is to select the number of instances (EC2 nodes) that you want your map reduce job to be executed and stop the instances when the job is done. EMR also exposes other interfaces such as a Ruby based command line interface and a Web Services API to manage the map reduce jobs.

The software inventory used by EMR is as follows:

1. Debian GNU/Linux v 5.0
2. Hadoop v0.18.3 (with patches)
3. Java v1.6
4. Perl v5.10
5. Ruby v1.8
6. Python v2.5
7. PHP v5.2
8. R v2.7

The introduction of Amazon’s Elastic Map Reduce (EMR) would have an impact on the sales of Cloudera, who offer services such as installation, training and configuration of Hadoop on Amazon EC2 infrastructure.

amazon elastic map reduce pricing

The pricing of EMR is shown in the table above (image courtesy: Amazon). However, note the pricing is based on per instance hour, meaning if you have a job using 100 instances for 20 minutes, you will be charged for 100 instance hours. More details on pricing is available here.

Correlsense attempts to simplify your application management

Per a survey conducted by EMA on monitoring and managing an application, it was found that 54% of application related problems are notified by end users to the support personnel. Obviously, managing a customer facing application in this fashion is not efficient. However, this is a well known and perennial problem because the tools available in the market don’t really help isolate the problem in an easy fashion thereby increasing the incident (problem) isolation/resolution time.

Consider the following enterprise application environment depicted in the figure below. How do you isolate which tier or component in the application is causing the problem when an issue is notified by a customer? Finding a resolution can happen only when the problem causing component/code has been identified precisely. Application Performance Management (APM) tools (i.e. deep dive tools) such as Introscope are pretty well known to pinpoint the root cause of an issue in an application. While such tools are great assets to service management teams, however, implementing deep dive solution for every tier in an enterprise is not feasible from cost/benefit ratio perspective and besides there are limitations related to process and (some political, which is another posting) that limit departments from using one tool as a standard for application management across tiers.

appmgmt-resized.jpg

Business Transaction Monitoring (BTM) tools can help in reducing the incident isolation time for a faster service restoration and faster resolution in combination with deep dive tools. BTM tools have the ability to monitor business transactions across tiers or application silos and provide comprehensive visibility into an application on a single pane.

Correlsense Sharepath is a BTM solution that attempts to make the process of incident and problem management simple, efficient and reducing the time in problem isolation. Oren Elias, the CEO of Correlsense mentions BTM is about monitoring across tiers with probably less deep dive than the APM tools that are good in managing single tiers.

From my conversations with Correlsense team in learning about their product, Sharepath arguably supports the largest application technology stacks (i.e. PHP, JEE, .Net among others) in the BTM market. Support for ERP such as SAP is planned for this year. The value add by providing support for a large variety of application technology stack or packaged software is to reduce the overall problem identification and resolution time thereby reducing the cost and business impact associated with an incident.

Correlsense Sharepath uses a distributed architecture for data collection and reporting. Low overhead agents are deployed on the servers to be monitored. The agents collect the transaction related data and this data is stored in a repository for a later reporting and analysis. Some of the salient features of Correlsense Sharepath that would interest an enterprise are as follows:

1. Minimal time for configuration and deployment of agents with the least overhead on the application.

2. Support for transaction monitoring (includes virtualized environments) across all major application protocols such as HTTP, LDAP etc. (RMI support is currently available).

3. Provides a capability to define rules for displaying the monitored transactions in a bubble chart. The bubble chart is a straight out of box feature and is comprehensive in displaying the transactions performance against the Service Level Agreement (SLA).The SLA is base lined from a historical measurement of transactions such as averages or standard deviations. API support is available for integration with repositories having the SLA information.

4. Supports integration with alerting solutions such as Tivoli, HP OVO and CA Unicenter.

5. Works in conjunction with deep dive tools such as Introscope in finding a problem’s root cause.

HP Transaction viewer and Optier are competitors to Sharepath. However the Correlsense team believes that the easy setup without the involvement of professional services with a simple innovative interface providing a complete transaction view of an enterprise system is an advantage over its competitors.

Open cloud manifesto launched amid much dissension

Open cloud manifesto is a set of guiding principles to increase the cloud adoption through open collaboration and appropriate use of standards among cloud providers. Through open collaboration the cloud providers seek to address the challenges of cloud adoption such as interoperability, security, portability, governance and monitoring/metering. The complete manifesto is available here. The manifesto seems to have been driven by IBM garnering support from cloud vendors such as Cisco, SAP, EMC, HP, Sun, Rackspace’s Mosso, and AT&T among others. However, Microsoft, Google, Salesforce, CCIF and Amazon pulled out of the open cloud manifesto citing reasons on limited openness when drafting the manifesto. Among all the coverage in blogosphere on the topic, I liked the views of Phil Wainewright which is available over here.

Conversation: Ten questions on Quest Software’s Foglight

Foglight is an application management solution that provides a correlated, 360 degree view of your applications from end user to database and from service levels to infrastructure.  Foglight combines transactional data with infrastructure metrics to help identify performance bottlenecks impacting your business and fix them quickly. Mentioned below is a conversation on Foglight via email with Brad Micklea, Director of Product Management for Quest’s Application Management business and Hugh McEvoy, Manager of Product Management for Foglight’s applications offerings and thanks to Krystal Jones AR/PR Specialist, Application Management of Quest Software, for facilitating the email conversation.

1. Question: Where do you classify Foglight solution to fall in, Application Performance Management (APM) or Business Transaction Management (BTM)?

Answer: Currently Foglight’s features fall primarily within the APM space, however, we believe that in the next three to five years any viable APM solution will have to include a strong BTM solution.  We feel we are standing on an unusually strong foundation here as we already have deep transactional analysis abilities within our end user, Java, .NET and database solutions. We are developing transaction analysis and tracing capabilities that will connect these together and join them with our application and services focus to create what we believe will be among the pre-eminent BTM solutions in the market.

2. Question: From a high level architectural perspective, what components make Foglight?

Answer: It depends, but in general there are agents. A few places where there aren’t agents: monitoring of real end-user transactions, some database monitoring (which we can do in both an agented and agent-less way). Additionally, we can integrate with many other, non-Quest agents to pull in performance data without requiring new agents to be installed. In addition to the agents, there is the management server, where data is stored, analyzed, correlated and visualized (via a rich-experience, web front-end).

3. Question: Can you elaborate on why one would need integrations between Foglight and Spotlight, QuestCentral and Performasure among other Quest products?

Answer: Foglight is used for 24×7 performance monitoring & diagnostics. Spotlight & Performance Analysis are more specific diagnostic tools for databases, so there are cases where a Foglight alert, for example, would lead a DBA to open up Performance Analysis to see the cause of the issue. Quest Central is a database administration toolset which is used extensively by production DBAs. Integration between Quest Central and Foglight is centered on the workflow for detecting problems (Foglight) and resolving them (Quest Central). In these cases, integration between the products makes the workflow easier. JProbe is a Java profiler for pre-production use.  PerformaSure overlaps technically with Foglight, but PerformaSure is optimized for pre-production use cases whereas Foglight is optimized for production use cases.

4. Question: What are the top five diagnostic features provided by Foglight? that can help an ops team to diagnose and resolve performance problems quickly.

Answer: This is a really big question and probably deserves a conversation. However the short answer is that it depends who the audience is. For example, Foglight is used by apps administrators, IT Ops, NOC users (if those are different) and business application owners. All of them use it in different ways. It also depends whether the customer puts Foglight in across their entire infrastructure or whether they go for only a part of the solution (e.g. particular application servers, particular databases etc). But for performance diagnostics & resolution, and without being application specific, the most commonly used features are probably:

The ability to see all issues (whether end-user, apps, db, infrastructure) within the context of the application. So rather than being an asset-based solution, Foglight is a service-based solution.

The ability to see detailed information about the health of the system, presented in ways which are specific to the Correlation of performance information across multiple tiers

Flexible, role-based visualization by role (e.g. a business owner might see only KPIs while an apps admin might see everything).

Long-term data retention & analysis i.e. how much slower is the system today than last week/month/year.

The ability to roll up performance information in many different ways to show attainment against a whole variety of SLAs.

5. Question: How can Foglight help pinpoint the root cause of a problem? such as a memory leak caused by a data structure in the application or an expensive SQL due to missing index on a DB table.

Answer: The precise mechanism is different in the two cases you mention. However, the principle is the same: we collect data on both the infrastructure resource consumption (memory, disk, thread pools…) and on the transactions flowing through the system. We then analyze that data. To give a couple of specific examples: in the case of Java we can show you how often a particular Java call is made, who has called it and how long it’s taking. That information, when tracked across time, can show you where problems are. In the case of the database, we specifically look at how long SQL takes to execute and whether adding an index would help. We could give similar, specific, examples across any of the technology stacks that Foglight can manage. Note that our approach here covers areas unaddressed by most BTM vendors (many of whom focus on transactional performance at the expense of resource-based issues) and also unaddressed by solutions such as pure end-user based monitors (whether passive or active).

6. Question: Does Foglight support performance management of an application developed on PHP or Ruby/RoR, Scala/Lift technologies using MySQL DB?

Answer: We have no out-of-the-box collectors for PHP or RoR, however, most of our customers have used our consulting services to build custom agents for technologies which are important to them but are uncommon.  We developed Foglight with an eye to this by creating agent and data management APIs which make building a custom agent easy provided you can get access to the data you want Foglight to measure.

7.  Question: How is Foglight different from Customer Experience Monitoring (CEM)? Example tools in this CEM category include Coradiant, Tealeaf among others. Would you advise companies using Foglight are better off than using CEM tools for performance monitoring? Can you provide some example scenarios supporting this claim?

Answer: Foglight includes, but is not limited to CEM. So we compete with all the products you mention above. There are two main differences (at a high-level) between us and them.

Firstly, those products look at transactional traffic between the end-users and the web servers. So they can say “this is slow at the front end” or “this is slow at the back-end”. For the front-end, our functionality is essentially equivalent: 24×7 performance monitoring plus transaction playback. For the back-end, none of those systems look into the technical stacks in the actual application, so they simply can’t tell you *why* it’s slow. We can be specific about why it’s slow, by looking at the back end. For example: the reason your end-users are experiencing slowdown is because their transactions are exercising an EJB which is experiencing resource contention across its database connection pool.

Secondly, we not only have CEM but also synthetics (the latter for both thin and thick client applications).  In our experience we find that our combination of real and synthetic performance and session analysis capabilities are necessary to understand what performance is, generate performance baselines and also understand whether the users themselves are using the application in the optimal way.

Finally, note that, compared with e.g. HP, Foglight is a completely integrated solution. HP (and others) have amassed a set of acquisitions which aren’t really integrated. So the effort and time required to find and fix problems will be greater with their solutions. We’ve also seen that, for some of those vendors, the need for them to integrate using services what is OOTB with Foglight makes their implementations disproportionately expensive compared to ours.

8. Question: Does Foglight support transaction monitoring across protocols? i.e. an ability to identify Web traffic vs. IM Sessions, VOIP calls among others.

Answer: We currently focus mostly on HTTP(S) but we’re looking at extending that.

9.  Question: What does it take or how do you create the grouping of all transactions and displaying them per business function? Is this done programmatically? If yes, what languages are used for such implementations?

Answer: We provide a rich web 2.0 client for all administration and operation. Additionally, you can choose to exercise APIs to achieve things like service definition. The main language for that is Groovy: we find that it’s a powerful and intuitive scripting language for people who are OO-literate.

10. Question: Where is the SLA information retrieved from? Is it from a CMDB? If yes, does Foglight have an embedded CMDB or do you integrate with other CMDBs? Can you provide some example CMDB’s that you integrate with or support?

Answer: Foglight is not a CMDB, our goal is to integrate with a wide range of CMDBs to give customers the option of choosing the CMDB that works for their organization without the need to compromise on service management as a side effect.  To this end we are building out an integration mechanism targeted at the new CMDBf standard.  We believe this will provide our customers with the greatest degree of freedom in choosing their CMDB.  SLA information can therefore be built out simply within Foglight alone (for organizations looking to create primarily performance and availability-based SLAs); can be built in Foglight to leverage configuration data from a 3rd party CMDB or external store.  In the event that a customer has already invested in an SLA management system that they feel is right for them, Foglight’s data and alerts can also be pushed to that system to add further strength to existing or new SLAs.

Other articles of possible interest:

Top five Java application performance management tools

How to size a JVM

Top six performance tuning tips for a Java enterprise application

Keynote and SOASTA, cloud offerings to test your web site performance

To scale out or not using Gigaspaces

Three tools to monitor your Websphere MQ environment

IBM announces products for the Amazon cloud

IBM announced its partnership with Amazon Web Services this week. Six IBM products (see the list below) will be available in the Amazon cloud in the near future.

  1. IBM DB2
  2. Informix Dynamic Server
  3. WebSphere Portal
  4. Lotus Web Content Management
  5. WebSphere sMash
  6. Novell’s SUSE Linux operating system

The pricing for usage is billed hourly and paid to Amazon; however, the billing is based on the hourly usage of IBM software, Novell operating system and EC2 Service. Though there are plans to offer premium support, the pricing for support is not yet disclosed.

This is an interesting announcement and makes it easy for the enterprise to adopt the AWS cloud if their platform is based on IBM products. The cost of using IBM products might be higher than an open source powered service in Amazon cloud. However, it will be interesting to compare the cost models of these services in AWS and IBM Blue cloud .

Optimized Bandwidth and Encrypted data transfer for the cloud

Hifn’s Express DS 4100, a Network Interface Card (NIC) with optimized bandwidth and encrypted data transfer can possibly be the answer to security issues related to data in the cloud. Some key points of Express DS 4100 are:

1. Offers dual Gigabit Ethernet connections either copper or optical fibre

2. Data is compressed for encryption using IPsec and IPComp.

3. Improves performance by offloading all security and optimization operations, including IPsec and IPComp processing; SRTP; Internet Key Exchange (IKE); IP, TCP and UDP checksum operations.

From the press release: “The DS 4100 enables OEMs to provide products that can send twice the amount of data across the LAN or WAN without their customers having to add extra appliances or pay for more bandwidth,” said Mike Goldgof, Vice President of Marketing at Hifn.  “For customers that need more network capacity, the DS 4100 provides a new cost-effective way to double the throughput while also seamlessly adding security for compliance and peace of mind.”

Other articles of possible interest:

What does Oracle cloud offering mean to the enterprise

VMware and Citrix want to be enterprise cloud enablers

What cloud services can a small IT firm provide?

Will cloud computing be a commodity business ?