Deep Dive on Amazon Relational Database Service (re:Invent 2017)

Why use Amazon RDS?

Lower TCO
- Get more leverage from your teams
- Focus on the things that differentiate you
Built-in high availability and cross region replication across multiple data centers
Even a small startup can leverage multiple data centers to design highly available apps with over 99.95% availability

T2 Family
- Burstable instances
- Moderate networking performance
- Good for smaller or variable workloads
- Monitor CPU credit metrics in Amazon CloudWatch
- T2.micro is eligible for free tier
M3/M4 Family
- General-purpose instances
- High-performance networking
- Good for running CPU intensive workloads
R3/R4 Family
- Memory-optimized instances
- High-performance networking
- Good for query intensive workloads or high connection counts

GP2 is a great choice, but be aware of burst credits on volumes < 1TB
- Depleting the credits drops IOPS to the volume's baseline; latency and queue depth metrics will spike until credits are replenished
- Monitor BurstBalance to see percent of burst-bucket I/O credits available
- Monitor read/write IOPS to see if average IOPS is greater than the baseline
Think of the GP2 burst rate and the stated PIOPS figure as maximum I/O rates
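The burst-credit warning above is easier to reason about with numbers. The sketch below assumes the published gp2 figures from around the time of the talk: a baseline of 3 IOPS per GiB with a 100 IOPS floor, bursts up to 3,000 IOPS, and a full bucket of 5.4 million I/O credits. None of these constants appear in the notes, so check them against the current EBS documentation before relying on them. The function names are illustrative only.

```python
# Assumed gp2 constants (not stated in the talk notes; verify against EBS docs).
GP2_IOPS_PER_GIB = 3
GP2_MIN_BASELINE_IOPS = 100
GP2_BURST_IOPS = 3000
GP2_FULL_BUCKET_CREDITS = 5_400_000


def gp2_baseline_iops(size_gib):
    """Baseline IOPS a gp2 volume earns continuously, based on its size."""
    return max(GP2_MIN_BASELINE_IOPS, GP2_IOPS_PER_GIB * size_gib)


def seconds_until_depleted(size_gib, burst_balance_pct=100.0,
                           demand_iops=GP2_BURST_IOPS):
    """Seconds of sustained demand before the burst bucket runs dry.

    Returns None when demand does not exceed the baseline, because the
    bucket never drains in that case.
    """
    baseline = gp2_baseline_iops(size_gib)
    demand = min(demand_iops, GP2_BURST_IOPS)  # bursting is capped
    if demand <= baseline:
        return None
    credits = GP2_FULL_BUCKET_CREDITS * burst_balance_pct / 100.0
    # Credits drain at (demand - baseline) per second while bursting.
    return credits / (demand - baseline)


# A 100 GiB volume has a 300 IOPS baseline; a full bucket lasts 2000 s
# (about 33 minutes) at the full burst rate.
print(gp2_baseline_iops(100), seconds_until_depleted(100))
```

Feeding the current BurstBalance percentage into `burst_balance_pct` gives a rough estimate of how long a sustained spike can continue before IOPS fall back to the baseline.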

Scale compute/memory vertically up or down
- Handle higher load to grow over time
- Lower usage to control costs
- New host is attached to existing storage with minimal downtime
Scale up Amazon EBS storage (up to 16TB!)
- EBS-backed Amazon RDS engines now support Elastic Volumes for fast scaling (now including SQL Server)
- No downtime for storage scaling
- Initial scaling operation may take longer, because storage is reconfigured on older instances
- Can re-provision IOPS on the fly
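The scaling operations above map onto a single modify call in the RDS API. The following is a minimal sketch that only builds the request arguments; the helper name `build_scale_request` is hypothetical, while the keys are the parameter names that boto3's `modify_db_instance` accepts. The actual call is left as a comment so the block runs without AWS credentials.

```python
def build_scale_request(instance_id, instance_class=None,
                        allocated_storage_gib=None, iops=None,
                        apply_immediately=False):
    """Build keyword arguments for an RDS modify_db_instance call."""
    request = {"DBInstanceIdentifier": instance_id}
    if instance_class is not None:
        request["DBInstanceClass"] = instance_class      # vertical scaling
    if allocated_storage_gib is not None:
        request["AllocatedStorage"] = allocated_storage_gib  # storage size
    if iops is not None:
        request["Iops"] = iops                           # re-provision IOPS
    if len(request) == 1:
        raise ValueError("nothing to change")
    # False defers the change to the next maintenance window.
    request["ApplyImmediately"] = apply_immediately
    return request


request = build_scale_request("mydb", allocated_storage_gib=500, iops=5000)
print(request)
# With credentials configured, the request could be sent like this:
#   import boto3
#   boto3.client("rds").modify_db_instance(**request)
```

Keeping `apply_immediately` off by default is a deliberate choice here: instance class changes involve a brief outage, so deferring them to the maintenance window is the safer default.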

Each host manages a set of Amazon EBS volumes with a full copy of the data
Instances are monitored by an external observer to maintain consensus over quorum
Failover initiated by automation or through the Amazon RDS API
Redirection to the new primary instance is provided through DNS (watch for TTLs)
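Because failover redirects clients through DNS, a client that keeps using a previously resolved address can stay pointed at the old primary. The sketch below shows one way an application might re-resolve the endpoint on every retry. The function `connect_with_reresolve` and its `connect` callback are hypothetical; only `socket.getaddrinfo` is a real library call. It does not bypass caching in the operating system or in runtimes that cache DNS on their own, so treat it as an illustration rather than a complete fix.

```python
import socket
import time


def connect_with_reresolve(host, port, connect, attempts=5,
                           delay_seconds=1.0, resolve=socket.getaddrinfo):
    """Retry a connection, resolving the hostname again on each attempt.

    `connect(address, port)` is supplied by the caller and should raise
    OSError (or a subclass) when the target is unreachable.
    """
    last_error = None
    for attempt in range(attempts):
        try:
            # Resolve inside the loop so a changed DNS record is picked up.
            infos = resolve(host, port, type=socket.SOCK_STREAM)
            address = infos[0][4][0]
            return connect(address, port)
        except OSError as exc:
            last_error = exc
            if attempt < attempts - 1:
                time.sleep(delay_seconds)
    raise ConnectionError(f"could not reach {host}:{port}") from last_error
```

In real use, `connect` would wrap the database driver's connect function, and `delay_seconds` would be chosen with the endpoint's DNS TTL in mind.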

Two options - automated backups and manual snapshots
Backups leverage Amazon EBS snapshots stored in S3
Transaction logs are stored every 5 minutes in Amazon S3 to support point-in-time recovery (PITR)
No performance penalty for backups
Snapshots can be copied across regions or shared with other accounts

When to use Automated vs Manual backups?

Automated
- Specify backup retention window per instance (7-day default)
- Kept until outside of window (35-day maximum) or instance is deleted
- Support PITR
- Good for disaster recovery
Manual
- Manually created through AWS console, AWS CLI, or Amazon RDS API
- Kept until you delete them
- Restores to saved snapshot
- Use for checkpoint before making large changes, non-production/test environments, final copy before deleting a database
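To make the automated-backup retention window concrete, the sketch below approximates the range of timestamps a point-in-time restore could target. It assumes the window reaches back `retention_days` from now and ends one log-upload interval (5 minutes, per the notes above) in the past. The function names are illustrative, and the real bounds depend on when backups were enabled; RDS reports the actual latest restorable time per instance.

```python
from datetime import datetime, timedelta, timezone


def pitr_window(now, retention_days,
                log_upload_interval=timedelta(minutes=5)):
    """Approximate (earliest, latest) restorable timestamps."""
    earliest = now - timedelta(days=retention_days)
    # Logs ship every few minutes, so the newest changes may not be
    # restorable yet.
    latest = now - log_upload_interval
    return earliest, latest


def can_restore_to(target, now, retention_days):
    """True if `target` falls inside the approximate PITR window."""
    earliest, latest = pitr_window(now, retention_days)
    return earliest <= target <= latest


now = datetime(2017, 12, 1, 12, 0, tzinfo=timezone.utc)
print(can_restore_to(now - timedelta(days=3), now, retention_days=7))
print(can_restore_to(now - timedelta(days=8), now, retention_days=7))
```

A target older than the retention window can only be recovered from a manual snapshot, which is why a checkpoint snapshot before large changes is worth taking.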

Restoring Backups

Restoring creates an entirely new database instance
New volumes are hydrated from Amazon S3
- While the volume is usable immediately, full performance requires the volume to warm up until fully instantiated
- Migrate to a DB instance class with high I/O capacity
- Maximize I/O during restore process

Designed to be secure by default: patches, updates, etc…
Network isolation with VPC
AWS IAM based resource-level permission controls
Encryption at rest using AWS KMS (all engines) or Oracle/Microsoft TDE
- No performance penalty for encrypting data
- Encryption cannot be removed from DB instances
- If the source is encrypted, Read Replicas must be encrypted
- Add encryption to an unencrypted DB instance by encrypting a snapshot copy
Use SSL protection for data in transit
Do not use AWS root credentials to manage RDS resources - create IAM user for everyone, including yourself
Can use AWS Multi-Factor Authentication (MFA) to provide extra level of protection
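The path for adding encryption to an unencrypted instance (snapshot, encrypted copy, restore) can be sketched as a short sequence of RDS API calls. The function below takes an RDS client as a parameter so it can be exercised without AWS access; with boto3 that client would come from `boto3.client("rds")`. The function name and the snapshot naming scheme are assumptions made for illustration, and the sketch omits error handling, cleanup, and the application cutover to the new instance.

```python
def encrypt_via_snapshot_copy(rds, source_instance_id, target_instance_id,
                              kms_key_id):
    """Create an encrypted copy of an unencrypted DB instance.

    `rds` is expected to behave like a boto3 RDS client.
    """
    plain_snapshot = f"{source_instance_id}-plain"          # hypothetical name
    encrypted_snapshot = f"{source_instance_id}-encrypted"  # hypothetical name
    waiter = rds.get_waiter("db_snapshot_available")

    # 1. Snapshot the unencrypted instance.
    rds.create_db_snapshot(DBSnapshotIdentifier=plain_snapshot,
                           DBInstanceIdentifier=source_instance_id)
    waiter.wait(DBSnapshotIdentifier=plain_snapshot)

    # 2. Copy the snapshot; supplying a KMS key encrypts the copy.
    rds.copy_db_snapshot(SourceDBSnapshotIdentifier=plain_snapshot,
                         TargetDBSnapshotIdentifier=encrypted_snapshot,
                         KmsKeyId=kms_key_id)
    waiter.wait(DBSnapshotIdentifier=encrypted_snapshot)

    # 3. Restore a new, encrypted instance from the encrypted copy.
    rds.restore_db_instance_from_db_snapshot(
        DBInstanceIdentifier=target_instance_id,
        DBSnapshotIdentifier=encrypted_snapshot)
    return encrypted_snapshot
```

Since restoring always creates a new instance, the original stays untouched until the application is repointed, which leaves a simple rollback path.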

Amazon CloudWatch Metrics
- CPU/Storage/Memory
- Swap Usage
- I/O (read and write)
- Latency
- Throughput
- Replica lag
Amazon CloudWatch Alarms
Enhanced monitoring for RDS
- Access to over 50 CPU, memory, file system and disk I/O metrics
- Granularity as low as 1-second intervals
Integration with third-party monitoring tools
Amazon RDS Performance Insights
- Measures DB Load - Average Active Sessions (AAS)
- Identifies database bottlenecks (TOP SQL)
- Identifies source of bottlenecks
- Enables problem discovery
- Adjustable time frame (hour, day, week and longer)
Subscribe to SNS notifications on events
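The DB Load metric mentioned under Performance Insights, Average Active Sessions, is simply the mean number of sessions found active across a series of samples. The sketch below illustrates that arithmetic on hand-written sample data; the wait-state labels and function names are made up for the example and do not reflect the service's actual output format.

```python
from collections import Counter


def average_active_sessions(samples):
    """Mean number of active sessions per sample.

    Each sample is a list with one wait-state label per active session.
    """
    if not samples:
        raise ValueError("need at least one sample")
    return sum(len(sample) for sample in samples) / len(samples)


def load_by_wait_state(samples):
    """Break the average load down by what the sessions were doing."""
    if not samples:
        raise ValueError("need at least one sample")
    counts = Counter(state for sample in samples for state in sample)
    return {state: n / len(samples) for state, n in counts.items()}


# Four samples, one per second (illustrative data).
samples = [["CPU", "CPU", "IO"], ["CPU"], [], ["Lock", "IO"]]
print(average_active_sessions(samples))  # 1.5
print(load_by_wait_state(samples))
```

Comparing the total against the instance's vCPU count is a common way to read this metric: sustained load above that number suggests sessions are queuing for resources.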

Any maintenance that causes downtime will be scheduled in your maintenance window
Operating system or Amazon RDS software patches are usually performed without restarting databases
Database engine upgrades require downtime
- Minor version upgrades - automatically or manually applied
- Major version upgrades - manually applied
- Version deprecations - three to six-month notification before scheduled upgrades
- View upcoming maintenance events in your AWS Personal Health Dashboard

Database instance (instance hours)
Database storage (GB-mo)
Backup storage
- No charge for backup storage up to 100% of total database storage
Data transfer (GB)
- Uses AWS regional data-transfer pricing
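The free backup allowance is the billing item most easily misread, so here is a small worked sketch. It assumes backup storage is billed only for the portion exceeding total database storage, as stated above. Every rate in the example is a hypothetical placeholder, not an AWS price, and data transfer is left out for brevity.

```python
def billable_backup_gb(database_storage_gb, backup_storage_gb):
    """Backup storage charged after the free allowance (100% of DB storage)."""
    return max(0, backup_storage_gb - database_storage_gb)


def monthly_estimate(instance_hours, hourly_rate,
                     database_storage_gb, storage_rate_gb_month,
                     backup_storage_gb, backup_rate_gb_month):
    """Rough monthly total from the billing dimensions listed above."""
    instance_cost = instance_hours * hourly_rate
    storage_cost = database_storage_gb * storage_rate_gb_month
    backup_cost = (billable_backup_gb(database_storage_gb, backup_storage_gb)
                   * backup_rate_gb_month)
    return instance_cost + storage_cost + backup_cost


# 100 GB of database storage with 150 GB of backups: only 50 GB is billable.
print(billable_backup_gb(100, 150))
# Placeholder rates chosen for easy arithmetic, not real prices.
print(monthly_estimate(720, 0.25, 100, 0.125, 150, 0.5))
```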

Amazon RDS charges are grouped by region
Instances are grouped by engine
Storage and backup charges are cross-engine
Use AWS Cost Explorer for graphical comparison
Use the AWS Cost & Usage Report for billing details
- Must be enabled for account
- Stored in your Amazon S3 bucket

Saving Money