Database clients can connect to Redshift using ODBC and JDBC drivers for Postgres.
- True (Ans)
- False
Cloudera Impala is included on EMR clusters by default.
- True
- False (Ans)
EC2 stands for:
- External Cloud Connectivity
- Elastic Compute Cloud (Ans)
- Energy Conserving CPU
- Enhanced Capacity, Squared
How does DynamoDB display data rows of differing schema together?
- It displays common keys as distinct columns in a table, and the rest in an “Other” column
- It combines all keys from all rows as if they were common to each row (Ans)
- It omits any key absent from any row
- It displays data in multiple frids, each with differing column headers
Which of the following services is a component of the AWS Big Data Stack?
- Redshift (Ans)
- VPC
- Cloud Watch
- Glacier
Amazon offers a code library that allows developers to build their own Kinesis data connectors.
- False
- True (Ans)
Designing Jaspersoft reports is done in a standalone desktop application.
- True
- False (Ans)
Which of the following is not directly supported as a Pipeline data source or destination?
- Microsoft SQL Server on Relational Database Service (RDS) (Ans)
- DynamoDB
- Redshift
- MySQL on Relational Database Service (RDS)
Which of the following Hadoop vendor’s distribution is supported on Elastic MapReduce?
- Cloudera
- Hortonworks
- MapR (Ans)
- Pivotal
Which of the following Hadoop distribution components does Elastic MapReduce omit?
- MapReduce
- Sqoop (Ans)
- Hive
- Pig
What is the name of Apache Pig’s programming language?
- Pig Latin (Ans)
- PQL
- PigML
- Pig.js
Amazon CloudFormation can be used to provision Japsersoft instances.
- False
- True (Ans)
You can create a key pair in the AWS Management Console.
- True (Ans)
- False
Saving a Pipeline causes a validation check to be run on it.
- False
- True (Ans)
Files stored in S3 can be referenced by URL.
- True (Ans)
- False
Kinesis has data connectors for all Amazon components except:
- MySQL Relational Database Service (RDS) (Ans)
- DynamoDB
- Redshift
- EMR
The purpose of a DynamoDB secondary indexes is to:
- Allow for fast searches on attributes beyond hash and range keys (Ans)
- Allow for querying data using SQL
- Allow indexes to be added after table creation
- Make data available on more nodes for distributed access
Redshift uses relational database technology.
- True (Ans)
- False
Which of the following is an effective method for monitoring running Data Pipeline jobs?
- Run the pipeline in Eclipse
- Set breakpoints in Pipeline script file
- Set email alerts for activity nodes
- Ensure no steps in status page stay in WAITING_FOR_RUNNER or WAITING_ON_DEPENDENCIES state too long (Ans)
Which of the following is the largest enabler of the Big Data phenomenon?
- Commercial software
- Relational databases
- Significant reduction in storage costs (Ans)
- Specialized hardware appliances
Which of the following does S3 Browser need to connect to your S3 account?
- Public and private key pair
- Access key ID and secret access key (Ans)
- AWS username and password
- Bucket name and folder name
Impala provides interactive (non-batch) SQL query over data in:
- The Hadoop Distributed File System (HDFS) (Ans)
- Relational Database Service (RDS)
- DynamoDB
- Redshift
- Best AI tools for Software Engineers - November 4, 2024
- Installing Jupyter: Get up and running on your computer - November 2, 2024
- An Introduction of SymOps by SymOps.com - October 30, 2024