Prerequisites : This lab requires Prerequisites to be completed before you can continue. Please continue if you have already completed the prerequisites.
PART A:Create Glue Crawler for Auto and Property source data.
1. Navigate to the AWS Glue Console
2. On the AWS Glue menu, select Crawlers.
3. Click Add Crawler.
4. Enter C360-workshop-glue-crawler
as the crawler name for initial data load.
5. Optionally, enter the description. This should also be descriptive and easily recognized and Click Next.
6. Choose Data stores, Crawl all folders and Click Next
7. On the Add a data store page, make the following selections:
8. Click Next.
9. On the Add another data store page, select No. and Click Next.
10. On the Choose an IAM role page, make the following selections:
Select Choose an existing IAM role. For IAM role, Glue Role pre-created for you. For example GlueServiceRoleLab
11. Click Next.
12. On the Create a schedule for this crawler page, for Frequency, select Run on demand and Click Next.
13. On the Configure the crawler’s output page, click Add database to create a new database for our Glue Catalogue.
14. Enter c360_workshop_db
as your database name and click create
Add for field “Prefix added to tables (optional)" –> source_
15. Click Next
16. Review the summary page noting the Include path and Database output and Click Finish. The crawler is now ready to run.
17. Tick the crawler name, click Run Crawler button.
Crawler will change status from starting to stopping, wait until crawler comes back to ready state (the process will take a few minutes), you can see that it has created 5 tables.
18. In the AWS Glue navigation pane, click Databases → Tables. You can also click the C360_workshop_db database to browse the tables.