The emergence of generative artificial intelligence (GenAI) has significant implications at all levels of the technology stack, not least analytics and data products, which serve to support the development, training and deployment of GenAI models, and also stand to benefit from the advances in automation enabled by GenAI. The intersection of analytics and data and GenAI was a significant focus of the recent Google Cloud Next ’24 event. My colleague David Menninger has already outlined the key announcements from the event as they relate to Google Cloud’s Vertex AI development platform and the Gemini multimodal large language models (LLMs), as well as Gemini-based GenAI assistants. In this perspective, I’ll take a closer look at Google Cloud’s progress as it relates to the company’s various data platforms, as well as the Looker analytics platform.
Google Cloud has a broad portfolio of data and analytics offerings that spans data storage and processing, data management and data governance, as well as analytics and machine learning (ML). The company has sought to
Google BigQuery is a distributed serverless analytic data platform environment for processing and analyzing large volumes of data. It is now being positioned by Google as a unified data platform that provides multiple data engines — including SQL, Spark and Python — to process data in multiple formats, across multiple locations, for multiple use cases (including business intelligence (BI), as well as AI). In addition to being available on Google Cloud, BigQuery is also available on Amazon Web Services as well as Microsoft Azure via BigQuery Omni, while the BigLake storage engine enables enterprises to work with structured and unstructured data in open table formats, including Apache Iceberg, Delta and Hudi. Among the announcements at this year’s Google Cloud Next event was the preview release of Apache Kafka for BigQuery and the incorporation of enhanced search capabilities, courtesy of Google Dataplex’s unified metadata catalog.
Also new is direct access from BigQuery to Google’s Vertex AI development platform for AI, enabling vector search and retrieval augmented generation to ground the output of GenAI models with enterprise data, as well as the ability to fine-tune models in Vertex AI from BigQuery. In addition to using BigQuery to deliver and improve GenAI, enterprises can now also use GenAI to improve the use of BigQuery thanks to the introduction of Gemini in BigQuery and BigQuery data canvas. Gemini in BigQuery provides GenAI-based assistance for BigQuery users, including augmented data preparation, semantic search-based data exploration, the conversion of natural language queries to SQL or Python code, and recommendations to improve query performance. BigQuery data canvas is a natural language interface for data exploration, curation, preparation, analysis and visualization. Google also announced the general availability of the BigQuery Studio collaborative analytics workspace, which provides a single environment for analyzing data in BigQuery using SQL, Python, Spark or natural language queries.
Google Cloud also introduced Gemini in Looker and Gemini in Databases. Gemini in Looker provides an interface for natural language conversational analytics using the Looker analytics environment that is grounded by
Google Cloud is by no means unique in providing digital assistant capabilities based on GenAI for data and analytics. Indeed, I assert that through 2026, analytics and data software providers will prioritize the delivery of automated code and query generation and conversion capabilities based on GenAI. However, the company is ahead of several of its rivals in terms of the breadth of functionality on offer and the range of products for which AI assistants are available or in preview. Enterprises evaluating databases to support new application development projects, both on premises and in the cloud, should consider Google Cloud BigQuery and AlloyDB for PostgreSQL, as well as Google Cloud’s associated analytics and GenAI-based services.
Regards,
Matt Aslett