Category: gcp
Harnessing the power of Google dataflow: Processing data from diverse sources
Google Dataflow, a robust data processing service that can seamlessly process data from different sources. In this article, we delve…
Cross-Region Data Replication in Google Dataflow: Practical Scenarios
Ensuring data availability and durability in the cloud era is paramount. Google Dataflow, part of Google Cloud’s suite of data…
Understanding Data Encryption in Google Dataflow
Google Dataflow is designed to ensure data is encrypted both at rest and in transit. Here’s a brief overview of…
GCP : How to Generate a Boto Configuration File using gsutil
The boto file is a configuration file that is used by gsutil and boto library (a Python SDK for AWS…
GCP : Monitoring Google BigQuery Costs for Each SQL Query
Google BigQuery is a powerful tool for analyzing large datasets, but it’s also important to keep track of costs to…
GCP : Connecting Python to Google BigQuery
Google BigQuery is a web service from Google that is used for handling and analyzing big data. It’s part of…
GCP : The user account authentication flow no longer works as of Febuary 1, 2023.
Since the user account authentication flow has changed, you should go ahead with the Cloud SDK installation to authenticate and…
Google Dataflow : Handling Late Data in Google Dataflow
Handling late-arriving data is a common challenge when working with streaming data processing systems like Google Dataflow. Late data refers…
Spark : Advantages of Google’s Serverless Spark
Google’s Serverless Spark has several advantages compared to traditional Spark clusters: Cost-effective: Serverless Spark eliminates the need for dedicated servers…
How to create and delete a BigQuery dataset and table?
Creating a BigQuery dataset and table: Go to the BigQuery web UI in the Cloud Console. Click the project drop-down…