Professional Data Engineer Exam - Question 93

Question

You're using Bigtable for a real-time application, and you have a heavy load that is a mix of read and writes. You've recently identified an additional use case and need to perform hourly an analytical job to calculate certain statistics across the whole database. You need to ensure both the reliability of your production application as well as the analytical workload.

What should you do?

Examice · Accepted Answer

To ensure both the reliability of your production application and the analytical workload, it's best to add a second cluster to your existing Bigtable instance with multi-cluster routing. This allows for automatic failover to the nearest available cluster if one cluster becomes unavailable, ensuring high availability and reliability for both your production and analytical tasks. Using the live-traffic app profile for regular workloads and the batch-analytics profile for analytics helps isolate workloads and maintain performance.

[Removed] · Answer

Answer is C

When you use a single cluster to run a batch analytics job that performs numerous large reads alongside an application that performs a mix of reads and writes, the large batch job can slow things down for the application's users. With replication, you can use app profiles with single-cluster routing to route batch analytics jobs and application traffic to different clusters, so that batch jobs don't affect your applications' users.

https://cloud.google.com/bigtable/docs/replication-overview#use-cases

aewis · Answer

It was actually illustrated here
https://cloud.google.com/bigtable/docs/replication-settings#batch-vs-serve

dish11dish · Answer

Option B is correct

An app profile specifies the routing policy that Bigtable should use for each request.

Single-cluster routing routes all requests to 1 cluster in your instance. If that cluster becomes unavailable, you must manually fail over to another cluster.

Multi-cluster routing automatically routes requests to the nearest cluster in an instance. If the cluster becomes unavailable, traffic automatically fails over to the nearest cluster that is available. Bigtable considers clusters in a single region to be equidistant, even though they are in different zones. You can configure an app profile to route to any cluster in an instance, or you can specify a cluster group that tells the app profile to route to only some of the clusters in the instance.

Cluster group routing sends requests to the nearest available cluster within a cluster group that you specify in the app profile settings.

Reference:-https://cloud.google.com/bigtable/docs/app-profiles#routing

juliobs · Answer

C. This is exactly the example in the documentation.
https://cloud.google.com/bigtable/docs/replication-settings#batch-vs-serve

cloudmon · Answer

It's B
https://cloud.google.com/bigtable/docs/replication-overview#app-profiles
https://cloud.google.com/bigtable/docs/replication-overview#routing-policies

piotrpiskorski · Answer

https://cloud.google.com/bigtable/docs/replication-settings#batch-vs-serve

"When you use a single cluster to run a batch analytics job that performs numerous large reads alongside an application that performs a mix of reads and writes, the large batch job can slow things down for the application's users. With replication, you can use app profiles with single-cluster routing to route batch analytics jobs and application traffic to different clusters, so that batch jobs don't affect your applications' users."

It is C.

Siant_137 · Answer

Answer is C

"When you use a single cluster to run a batch analytics job that performs numerous large reads alongside an application that performs a mix of reads and writes, the large batch job can slow things down for the application's users. With replication, you can use app profiles with single-cluster routing to route batch analytics jobs and application traffic to different clusters, so that batch jobs don't affect your applications' users."

https://cloud.google.com/bigtable/docs/replication-overview#batch-vs-serve

zellck · Answer

C is the answer.

https://cloud.google.com/bigtable/docs/replication-settings#batch-vs-serve
When you use a single cluster to run a batch analytics job that performs numerous large reads alongside an application that performs a mix of reads and writes, the large batch job can slow things down for the application's users. With replication, you can use app profiles with single-cluster routing to route batch analytics jobs and application traffic to different clusters, so that batch jobs don't affect your applications' users.

slade_wilson · Answer

When you use a single cluster to run a batch analytics job that performs numerous large reads alongside an application that performs a mix of reads and writes, the large batch job can slow things down for the application's users. With replication, you can use app profiles with single-cluster routing to route batch analytics jobs and application traffic to different clusters, so that batch jobs don't affect your applications' users.

Single cluster routing - You can use single-cluster routing for this use case if you don't want your Bigtable cluster to automatically fail over if a zone or region becomes unavailable.

Multi-cluster routing - If you want Bigtable to automatically fail over to one region if your application cannot reach the other region, use multi-cluster routing.

DevShah · Answer

https://cloud.google.com/bigtable/docs/replication-settings#batch-vs-serve

carbino · Answer

IIt is C:
"Workload isolation:
Using separate app profiles lets you use different routing policies for different purposes. For example, consider a situation when you want to prevent a batch read job (workload A) from increasing CPU usage on clusters that handle an application's steady reads and writes (workload B). You can create an app profile for workload B that routes to a cluster group that excludes one cluster. Then you create an app profile for workload A that specifies single-cluster routing to the cluster that workload B doesn't send requests to.

You can change the settings for one application or function without affecting other applications that connect to the same data."
https://cloud.google.com/bigtable/docs/app-profiles

gudiking · Answer

C - "With replication, you can use app profiles with single-cluster routing to route batch analytics jobs and application traffic to different clusters, so that batch jobs don't affect your applications' users." - https://cloud.google.com/bigtable/docs/replication-overview#batch-vs-serve

sfsdeniso · Answer

Answer is C

samdhimal · Answer

I am going for C?

musumusu · Answer

Answer B: 
reason 1: If you don' t have any cost constraint use multi-cluster routing, 
reason 2: Single cluster is less scalable as we need high scalability i would go with B

opt_sub · Answer

B is correct. 
Two different job profiles to redirect trafiic to two different cluster. C is incorrect because there is no tpoint in creating app profile for two different workloads in the same cluster. One cluster handles writes and another handle reads.

47767f9 · Answer

B better than C. Multi-cluster routing to  handle failovers automatically. Reference: https://cloud.google.com/bigtable/docs/replication-settings#regional-failover

Professional Data Engineer Exam - Question 93

Discussion