Skip to content

Latest commit

 

History

History
134 lines (106 loc) · 6.12 KB

File metadata and controls

134 lines (106 loc) · 6.12 KB

AWS Loft – Database Week

Managed Database Basics Hands-on Lab

In this lab you will learn the following:

  • How to create an EC2 instance for use as a SQL client
  • How to create and RDS/MySQL instance
  • How to connect your SQL client instance to you RDS/MySQL instance
  • How to create a database and load a public Landsat dataset into it
  • How to query your database using SQL
  • How to use the RDS console including stopping, starting, monitoring, snapshots and other RDS features.
  1. Create an Amazon EC2 t2.micro instance running the most recent version of Amazon Linux 2. You will use this EC2 instance as a SQL client to access your RDS/MySQL instance. Follow the documentation at: http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/EC2_GetStarted.html

  2. Create and use an EC2 Security Group that allows inbound access as follows:

  • SSH using TCP on port 22 from Anywhere* (0.0.0.0/0)

  • MYSQL/Aurora using TCP on port 3306 from MyIP Pic25

  • *You might get an automated warning that your EC2 instance is “open to the world”, because we’re not limiting the source IP range for SSH. This is expected. In a production system, as a security best practice, you must only provide a limited IP range for allowed SSH access. For this lab however, you can disregard the warning.

  • Make a note of the MyIP address. We’ll need to update the RDS Security Group later to allow inbound access from this IP address.

  1. Access the Linux console on the t2.micro instance See documentation at: http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/AccessingInstances.html

At this point Linux/OSX and Windows users should have successfully connected to their EC2 instance and be at a $ command prompt.

  1. Install MySQL tools
$ sudo yum install mysql
  1. Create an Amazon RDS MySQL db.t2.small instance See documentation at: http://docs.aws.amazon.com/AmazonRDS/latest/UserGuide/CHAP_GettingStarted.CreatingConnecting.MySQL.html
  • Use the MySQL engine and a Dev/Test use case
  • Use the general public license, the most current version of MySQL 5.7, and a db.t2.small instance. Choose No for Multi-AZ deployment and configure 20GB of General Purpose SSD storage
  • Name the instance sql-lab. Be sure to choose MySQL (not Aurora and not MariaDB) and the db.t2.micro instance size
  • Choose a username and a password (and don’t forget them!)
  • Do not create a database (we will do that later). Keep all other default settings, and click on Create database button
  • Wait a few minutes until the RDS instance status is ‘Available’.
  • Once the RDS instance is available, find your instance’s DNS endpoint. We’ll use this endpoint to connect the MySQL client to your RDS instance. i. On the AWS Console, in the AWS Services box, type ‘RDS’ and press enter. Pic26 ii. On the RDS Dashboard, choose DB Instances Pic27 iii. Select your DB Instance Pic28 iv. Scroll down and find your endpoint name. You will need it later!

Note: When using the endpoint name, you usually should not use the port extension (:3306), just the name.

v. Modify your RDS Security Group (using the Modify button) to allow inbound access from your EC2 instance. Select the Security Group and update the source IP with the IP address of your EC2 instance. Be sure to choose “Apply Immediately”

  1. Verify you can access your RDS/MySQL instance from your EC2 instance:
$ mysql –h <mysql endpoint> -u <user name> -p

Example: 
$ mysql -h sql-lab.cxpjiluqq0c9.us-west-2.rds.amazonaws.com -u awsuser -p

Enter password: 

Welcome to the MySQL monitor.  Commands end with ; or \g.
Your MySQL connection id is 9999
Server version: 5.6.27-log MySQL Community Server (GPL)

Type 'help;' or '\h' for help. Type '\c' to clear the current input statement.

mysql> 

To exit from the mysql> prompt, use CTRL-D

Note: if you cannot connect to your RDS/MySQL instance, verify that you have updated the RDS Security Group to allow traffic from your EC2 instance’s IP address.  7. Download and Prepare Landsat scenes See documentation at https://aws.amazon.com/public-data-sets/landsat/ From your EC2 instance, download the Landsat scenes $ wget https://landsat-pds.s3.amazonaws.com/c1/L8/scene_list.gz

  • Unzip the scene list $ gunzip scene_list.gz
  • Trim the list to the last 250,000 scenes $ cp scene_list scene_list.orig $ tail -n 250000 scene_list.orig > scene_list
  1. Create a database and load the Landsat data
  • Log into the mysql> console as you did in step 6.
  • Create a Landsat database mysql> CREATE DATABASE landsat; mysql> USE landsat;
  • Create the scene_list table mysql> CREATE TABLE scene_list (productId VARCHAR(64),entityId VARCHAR(64), acquisitionDate DATETIME,cloudCover DECIMAL(5,2),processingLevel VARCHAR(8),path INT,row INT,min_lat DECIMAL(8,5),min_lon DECIMAL(8,5),max_lat DECIMAL(8,5),max_lon DECIMAL(8,5),download_url VARCHAR(128));
  • Add an index mysql> CREATE INDEX idx_entityId ON scene_list (entityId);
  • Load the landsat data mysql> LOAD DATA LOCAL INFILE 'scene_list' INTO TABLE scene_list FIELDS TERMINATED BY ',';
  1. Run a SQL query
mysql> 
SELECT DISTINCT(a.entityId) AS Id, a.productId, a.acquisitionDate, a.cloudCover
FROM scene_list a
INNER JOIN (
SELECT productid, entityId, acquisitionDate 
FROM scene_list 
WHERE acquisitionDate > ( 
SELECT MAX(acquisitionDate)
FROM scene_list 
WHERE acquisitionDate < CURDATE() - INTERVAL 1 YEAR)	
) b ON a.productId = b.productId AND a.entityId = b.entityId AND a.acquisitionDate = b.acquisitionDate
WHERE cloudCover < 50
ORDER BY acquisitionDate;
  • This generates a list of all the satellite images during the last year which have less than 50% cloud cover
  • Note how long it takes to get an answer
  1. Experiment with RDS features
  • Stop the database and restart it
  • Enable Multi-AZ for failover
  • Take a snapshot
  • Review the various monitoring metrics available