This is the first in potentially a series of posts regarding vulnerability scanning in the cloud and some of the related challenges and helpful tips.
I started looking around for any good sources or posts on the topic of vulnerability scanning in the cloud. In this case, an infrastructure as as service (iaas) scenario for private or public cloud. I didn’t find anything.
How it starts..
When you get the email or call that goes something like.. “Hey, what are we scanning in our <vendor name here> cloud”??? .. In a perfect world you just say, “When we worked with everybody to setup and plan our cloud usage we had vulnerability scanning designs built in from the beginning.. We are good.”
Or, you start sweating and realize nobody ever brought you in to any planning or discussions and there are already large cloud deployments that you aren’t scanning.
Or maybe you are a consultant or service provider going in to an environment and setting up vulnerability scanning in a customer cloud. These posts should be helpful for people that are in the planning stages or trying to “catch up” with their cloud groups or customers.
Dynamic nature of the cloud
Most clouds give you the ability to dynamically provision systems and services, and as a result, you dynamically provision IP addresses. Sometimes these IP addresses are from a certain range, and often, especially for Internet facing systems, these IP addresses are from a large pool of addresses shared with other customers.
In these large dynamic ranges, it is common for the IP address you used today, to be used by another customer tomorrow.
This dynamic nature is great for operations, but can cause some challenges on tracking assets.
Asset management is different
Traditional vulnerability management has been very tied to IP addresses and/or DNS names. In cloud scenarios, assets are often temporary, or may not have DNS names. Sometimes your dns names for PaaS type services are provisioned by the cloud provider, with little or no control from your IT group.
Most cloud providers have their own type of unique identifiers for assets. These unique identifiers are what need to be used for asset tracking.. IP addresses, and sometimes DNS names are just stateful metadata for your asset.
Also, cloud has different types of “objects” that can be given IP addresses beyond traditional compute system interfaces. Certain services can be provisioned in cloud from a PaaS solution that are dedicated to your tenancy/account, and they get their own IP address. Are these your asset? Many times you may have some control over the content and data on these services even though you don’t manage most of the underlying solution.
In general, the whole approach for asset management in cloud is that your assets are tracked by the cloud provider, and you use their API’s to query and gather information on your assets.
Your vulnerability analysis and asset analysis needs to become dynamic and based on the data returned from your asset queries. This is definitely not a bad thing. Most big companies struggle with solid asset management because there are always ways to circumvent traditional asset management. (This is why network traffic based asset management is becoming so popular)
Now, with cloud, as long as you are using the API, and know what tenancies you have, you can get a good list of assets… However, this list is short lived… You need to consistently query the API’s to get a good list. Some cloud providers are able to provide a “push” notification or provide “diffs” of what has come online or gone away in X amount of time. I think that is the future best practice of cloud asset management. Real time visibility into what is coming and going.
Capacity is costly..
One major concept and value of cloud is only using and paying for capacity you need.
When it comes to information technology, this “costly capacity” in IaaS essentially comes down to
- Network usage (sending data over the network)
- Storage usage,(disk space, object space, etc)
- Compute usage (CPU)..
Classic Vulnerability scanning can typically be performed 2 different ways,
- Either scanning over the network from a scanning system, or
- By installing a local agent/daemon/service on the host that reports up the vulnerability data.
Both of these approaches use all 3 types of capacity mentioned above in your cloud, but mostly network and CPU usage.
Scanning over the network — Network Usage
Your cloud vendor’s software defined networking can have huge capacity, or it could remind you of early 90’s era home networking.
One of the major considerations for network based scanning is determining where your bottlenecks are going to be.
- Do you have virtual gateways or bandwidth caps?
- Do you have packet rate caps?
- Are you trying to scan across regions or networks that may be geographically disperse with high latency and/or low bandwidth?
Cloud networking doesn’t just “work”… in many cases it is far more sensitive than physical networks. You need to carefully look at the network topology for your cloud implementations and base scanner placement based on your topology and bottleneck locations. Depending on your network security stack, you may even need or want to avoid scanning across those stacks.
Agents
Agent based scanning is starting to be one of the preferred options in some cloud iaas implementations, because you can just hope that every host reports up it’s vulnerability data when it comes online. This is a nice approach if you have good cooperation from your infrastructure groups to allow your agent to be deployed to all systems.
However, agents likely will not be able to go on every type of resource or service with an IP, such as 3rd party virtual appliances. You will still need network scanning to be able to inspect some virtual systems or resource types such as PaaS deployed services.
Even though agents won’t solve all your problems, you probably won’t be hitting packet rate caps or throughput issues, since they mostly just push up their data in one stream on a regular schedule. So agents can allow you to avoid some of the network issues you might hit otherwise.
- How much cpu impact will there be from network scanning or agent scanning? The act of scanning will use some capacity.
- Should you size your cloud capacity to allow for vulnerability management? (yes)
- Dynamic assets.
- API driven asset management
-
Cloud has more “things” as a service than what one solution can handle.
-
Container Services
-
PaaS
-
Functions/Serverless
-
SaaS/Services
How to handle vulnerability management in the cloud?
- Take a look at all the services your cloud provider offers that you are planning to use.
- Create an approach for each type of scenario and thing that will be used.
- Some cloud providers are starting to build in some amount of vulnerability management natively into their platforms. Leverage these native integrations as much as possible.