Professional Data Engineer Exam QuestionsBrowse all questions from this exam

Professional Data Engineer Exam - Question 93


You're using Bigtable for a real-time application, and you have a heavy load that is a mix of read and writes. You've recently identified an additional use case and need to perform hourly an analytical job to calculate certain statistics across the whole database. You need to ensure both the reliability of your production application as well as the analytical workload.

What should you do?

Show Answer
Correct Answer: B

To ensure both the reliability of your production application and the analytical workload, it's best to add a second cluster to your existing Bigtable instance with multi-cluster routing. This allows for automatic failover to the nearest available cluster if one cluster becomes unavailable, ensuring high availability and reliability for both your production and analytical tasks. Using the live-traffic app profile for regular workloads and the batch-analytics profile for analytics helps isolate workloads and maintain performance.

Discussion

17 comments
Sign in to comment
[Removed]Option: C
Sep 5, 2022

Answer is C When you use a single cluster to run a batch analytics job that performs numerous large reads alongside an application that performs a mix of reads and writes, the large batch job can slow things down for the application's users. With replication, you can use app profiles with single-cluster routing to route batch analytics jobs and application traffic to different clusters, so that batch jobs don't affect your applications' users. https://cloud.google.com/bigtable/docs/replication-overview#use-cases

HarshKothari21
Sep 8, 2022

Agreed :)

[Removed]
Nov 11, 2022

"When you use a single cluster", here we are creating a 2nd cluster, so we'll be using 2 different clusters. We want to redirect analysis jobs to the 2nd cluster, and the other job to the 1st cluster. Thus, I think that D is more adequate

somilaseeja
Dec 7, 2022

Option D didnt say to create a new cluster, rather it said to increase the size of the cluster. There is a difference. Hence c is the correct answer to run the batch processing in a single cluster mode

aewisOption: C
Jul 14, 2023

It was actually illustrated here https://cloud.google.com/bigtable/docs/replication-settings#batch-vs-serve

dish11dishOption: B
Nov 22, 2022

Option B is correct An app profile specifies the routing policy that Bigtable should use for each request. Single-cluster routing routes all requests to 1 cluster in your instance. If that cluster becomes unavailable, you must manually fail over to another cluster. Multi-cluster routing automatically routes requests to the nearest cluster in an instance. If the cluster becomes unavailable, traffic automatically fails over to the nearest cluster that is available. Bigtable considers clusters in a single region to be equidistant, even though they are in different zones. You can configure an app profile to route to any cluster in an instance, or you can specify a cluster group that tells the app profile to route to only some of the clusters in the instance. Cluster group routing sends requests to the nearest available cluster within a cluster group that you specify in the app profile settings. Reference:-https://cloud.google.com/bigtable/docs/app-profiles#routing

juliobsOption: C
Mar 24, 2023

C. This is exactly the example in the documentation. https://cloud.google.com/bigtable/docs/replication-settings#batch-vs-serve

DevShah
Apr 14, 2023

Correct 2 jobs >> 2 cluster 3 jobs >> 3 cluster app profiles with single-cluster routing used to route to specific cluster Job1 >> Cluster 1 Job2 >> Cluster 2 .....

cloudmonOption: B
Nov 6, 2022

It's B https://cloud.google.com/bigtable/docs/replication-overview#app-profiles https://cloud.google.com/bigtable/docs/replication-overview#routing-policies

piotrpiskorskiOption: C
Nov 21, 2022

https://cloud.google.com/bigtable/docs/replication-settings#batch-vs-serve "When you use a single cluster to run a batch analytics job that performs numerous large reads alongside an application that performs a mix of reads and writes, the large batch job can slow things down for the application's users. With replication, you can use app profiles with single-cluster routing to route batch analytics jobs and application traffic to different clusters, so that batch jobs don't affect your applications' users." It is C.

Siant_137Option: C
Dec 3, 2022

Answer is C "When you use a single cluster to run a batch analytics job that performs numerous large reads alongside an application that performs a mix of reads and writes, the large batch job can slow things down for the application's users. With replication, you can use app profiles with single-cluster routing to route batch analytics jobs and application traffic to different clusters, so that batch jobs don't affect your applications' users." https://cloud.google.com/bigtable/docs/replication-overview#batch-vs-serve

zellckOption: C
Dec 6, 2022

C is the answer. https://cloud.google.com/bigtable/docs/replication-settings#batch-vs-serve When you use a single cluster to run a batch analytics job that performs numerous large reads alongside an application that performs a mix of reads and writes, the large batch job can slow things down for the application's users. With replication, you can use app profiles with single-cluster routing to route batch analytics jobs and application traffic to different clusters, so that batch jobs don't affect your applications' users.

slade_wilsonOption: C
Dec 21, 2022

When you use a single cluster to run a batch analytics job that performs numerous large reads alongside an application that performs a mix of reads and writes, the large batch job can slow things down for the application's users. With replication, you can use app profiles with single-cluster routing to route batch analytics jobs and application traffic to different clusters, so that batch jobs don't affect your applications' users. Single cluster routing - You can use single-cluster routing for this use case if you don't want your Bigtable cluster to automatically fail over if a zone or region becomes unavailable. Multi-cluster routing - If you want Bigtable to automatically fail over to one region if your application cannot reach the other region, use multi-cluster routing.

DevShahOption: C
Apr 14, 2023

https://cloud.google.com/bigtable/docs/replication-settings#batch-vs-serve

A4M
May 4, 2023

I see what you say on C but the question states high availability how do you handle that with option C when you have a single region cluster hence answer needs to be with multi-region cluster - To configure your instance for a high availability (HA) use case, create a new app profile that uses multi-cluster routing, or update the default app profile to use multi-cluster routing.

A4M
May 4, 2023

i meant single-cluster routing

zevexWM
Apr 24, 2024

It actually addresses the issue of High availability in that same link if you scroll down a bit more. https://cloud.google.com/bigtable/docs/replication-settings#high-availability

carbinoOption: C
Dec 9, 2023

IIt is C: "Workload isolation: Using separate app profiles lets you use different routing policies for different purposes. For example, consider a situation when you want to prevent a batch read job (workload A) from increasing CPU usage on clusters that handle an application's steady reads and writes (workload B). You can create an app profile for workload B that routes to a cluster group that excludes one cluster. Then you create an app profile for workload A that specifies single-cluster routing to the cluster that workload B doesn't send requests to. You can change the settings for one application or function without affecting other applications that connect to the same data." https://cloud.google.com/bigtable/docs/app-profiles

gudikingOption: C
Nov 18, 2022

C - "With replication, you can use app profiles with single-cluster routing to route batch analytics jobs and application traffic to different clusters, so that batch jobs don't affect your applications' users." - https://cloud.google.com/bigtable/docs/replication-overview#batch-vs-serve

sfsdenisoOption: C
Nov 24, 2022

Answer is C

samdhimalOption: C
Feb 5, 2023

I am going for C?

musumusuOption: B
Feb 15, 2023

Answer B: reason 1: If you don' t have any cost constraint use multi-cluster routing, reason 2: Single cluster is less scalable as we need high scalability i would go with B

opt_subOption: B
Mar 27, 2024

B is correct. Two different job profiles to redirect trafiic to two different cluster. C is incorrect because there is no tpoint in creating app profile for two different workloads in the same cluster. One cluster handles writes and another handle reads.

47767f9Option: B
Jun 30, 2024

B better than C. Multi-cluster routing to handle failovers automatically. Reference: https://cloud.google.com/bigtable/docs/replication-settings#regional-failover