Data Science on the Google Cloud Platform

Data Science on the Google Cloud Platform

English | 2017 | ISBN: 978-1491974568 | 250 Pages | EPUB | 13 MB

Implementing End-to-End Real-time Data Pipelines: from ingest to machine learning
Learn how easy it is to apply sophisticated statistical and machine learning methods to real-world problems when you build on top of the Google Cloud Platform (GCP). With this practical guide, author and GCP Program Manager Valliappa Lakshmanan shows you how to gain insight into a sample business decision by applying different statistical and machine learning methods and tools.
Along the way, you’ll get an extensive tour of the big data and machine learning parts of GCP. You’ll start with statistical methods, move into straightforward classification, and then explore windowing and real-time prediction.

  • Move from basic to increasingly sophisticated methods
  • Understand interactive querying of very large datasets with BigQuery
  • Learn about probabilistic decision making with SparkSQL and Spark
  • Train a TensorFlow model in Python and call it from Java
  • Create a data processing pipeline with Dataflow
  • Compute time-windowed aggregates in real-time
Homepage