AWS FSx File Cache Monitoring


AWS FSx File Cache - Overview

Amazon FSx File Cache is a high-performance caching service used to process file data, regardless of where the data is stored. It provides seamless integration with existing storage environments, providing low-latency access to frequently accessed data without the need for data migration. AWS File Cache monitoring tool offered by Applications Manager helps you to attain effective performance by detecting latency issues and resolving potential bottlenecks.

Note: AWS FSx File Cache monitoring is not supported for AWS China account type.

Creating a new AWS FSx File Cache monitor

To learn how to create a new AWS FSx File Cache monitor, refer here.

Monitored Parameters

Go to the Monitors Category View by clicking the Monitors tab. Click on the FSx File Cache instance available under Amazon in the Cloud Apps section. Displayed is the Amazon FSx File Cache bulk configuration view distributed into three tabs:

  • Availability tab gives the availability history for the past 24 hours or 30 days.
  • Performance tab gives the health status and events for the past 24 hours or 30 days.
  • List view tab enables you to perform bulk admin configurations.

By clicking a monitor from the list, you'll be taken to the AWS FSx File Cache dashboard which includes the following tabs:

Overview

Parameter Description
FILE CACHE INFORMATION
DNS Name The Domain Name System (DNS) name for the cache.
Mount Name The name used when mounting the cache.
Lifecycle State The lifecycle status of the cache. Possible Values: AVAILABLE/ CREATING/ DELETING/ UPDATING/ FAILED
Per Unit Storage Throughput Represents the megabytes per second of read or write throughput per 1 tebibyte of storage provisioned. The only supported value is 1000.
DATA STORAGE UTILIZATION
Data Storage Utilization Total amount of storage utilized in the data storage (in %)
METADATA STORAGE UTILIZATION
Metadata Storage Utilization Total amount of storage utilized in the metadata storage (in %)
STORAGE SIZE
Free Data Storage The latest free storage available in the cache between the poll interval (in GiB).
Used Data Storage The latest storage used in the cache between the poll interval (in GiB).
Data Storage Capacity The storage capacity of the cache in gibibytes (GiB).
METADATA STORAGE SIZE
Free Metadata Storage The latest free metadata storage available in the cache between the poll interval (in GiB).
Used Metadata Storage The latest used metadata storage in the cache between the poll interval (in GiB).
Metadata Storage Capacity The storage capacity of the Lustre MDT (Metadata Target) storage volume in gibibytes (GiB).

Frontend IO Performance

Parameter Description
DATA READ THROUGHPUT
Data Read Throughput The total data throughput per minute associated with read operations between the poll interval (in MB/min).
DATA WRITE THROUGHPUT
Data Write Throughput The total data throughput per minute associated with write operations between the poll interval (in MB/min).
DATA READ OPERATIONS THROUGHPUT
Data Read Operations Throughput The total number of data read operations performed per minute in the File System Cache between the poll interval (in operations/min) .
DATA WRITE OPERATIONS THROUGHPUT
Data Write Operations Throughput The total number of data write operations performed per minute in the File System Cache between the poll interval (in operations/min).
METADATA OPERATIONS THROUGHPUT
Metadata Operations Throughput The total number of metadata operations performed per minute in the File System Cache between the poll interval (in operations/min).

Backend IO Performance

Parameter Description
REPOSITORY OPERATIONS IN PROGRESS
Repository Operations In Progress The average number of read and write operations (including metadata operations) on data repositories that are in progress between the poll interval (in operations(s)).
REPOSITORY READ STATISTICS
Repository Read Success The average rate of files being read into the cache from its linked data repository between the poll interval (in file(s)).
Repository Read Failure The average rate of files failing to be read into the cache from its linked data repositories between the poll interval (in file(s)).
REPOSITORY WRITE SUCCESS
Repository Write Success The average rate of files being written from the cache to its linked data repositories between the poll interval (in file(s)).
REPOSITORY METADATA READ STATISTICS
Repository Metadata Read Success The average rate at which the workload is triggering the loading of file and directory metadata into the cache between the poll interval.
Repository Metadata Read Failure The average rate of file and directory metadata failing to be read into the cache from its linked data repositories between the poll interval.
Repository Metadata Read Not Found The average rate at which cache is looking up the data repository for paths that does not exist between the poll interval.

Configuration

Parameter Description
File Cache ID The system-generated unique ID of the cache.
File Cache Type The type of cache.
File Cache Type Version The Lustre version of the cache.
VPC ID The ID of the virtual private cloud (VPC).
Deployment Type The deployment type of the Amazon File Cache resource.
Logging Level The data repository events that are logged by Amazon FSx. Possible Values: DISABLED/ WARN_ONLY/ ERROR_ONLY/ WARN_ERROR
Weekly Maintenance Start Time A recurring weekly time, in the format D:HH:MM.D is the day of the week, for which 1 represents Monday and 7 represents Sunday. HH is the zero-padded hour of the day (0-23), and MM is the zero-padded minute of the hour.
Creation Time The time that the resource was created.