GCP - 概述
對應到其他的雲端服務是 :
- Amazon Web Services (AWS) : ****
- Microsoft Azure : ****
參考資料
https://cloud.google.com/dataproc?hl=zh-TW
Q 20. Your company was bidding on a big data project form last few months and they have finally received the project. The project wants you to deploy Apache Spark clusters on Google Cloud. Which service will you use?
A. DataFlow B. DataProc C. BigTable D. Cloud Composer
Correct Answer: B Option B is correct: Cloud Dataproc is a fast, easy-to-use, fully managed cloud service for running Apache Spark and Apache Hadoop clusters in a simpler, more cost-efficient way. Option A is incorrect: Cloud Dataflow is a fully-managed service for transforming and enriching data in stream (real time) and batch (historical) modes with equal reliability and expressiveness. Option C is incorrect: A petabyte-scale, fully managed NoSQL database service for large analytical and operational workloads. It supports the open source industry standard HBase API. Option D is incorrect: Cloud Composer is a fully managed workflow orchestration service that empowers you to author, schedule, and monitor pipelines that span across clouds and on-premises data centers. It is built on the popular Apache Airflow open source project.
Q 21. Your client wants to migrate their 30 TB of Hadoop or Spark cluster from a RHEL 6.5 on-premise servers to Google Cloud Platform. Which of the following service can be used at GCP end?
A. Compute Engine B. App Engine C. Dataproc D. Big Query
Correct Answer: C C is correct: A faster, easier, more cost-effective way to run Apache Spark and Apache Hadoop A is incorrect: Can be used but would require high compute and cost. B is incorrect: App Engine is not an effective way to this purpose D is incorrect: Big query is a data warehouse and not suitable to run spark commands.