Home BigQuery Optimize project management with AI-supported document chat and BigQuery

Optimize project management with AI-powered document chat and BigQuery

In numerous industries, efficiency and precision are of paramount importance. Managing large volumes of project data, such as daily reports, logs, descriptions, material plans, and product data sheets, often presents a complex challenge. However, with AI and BigQuery, you can effortlessly interact with your documents and retrieve the information needed in real time to make informed decisions.

This wiki explains how these technologies can lead to significant improvements using an example from the construction industry. In addition, we present a prototype developed by us that uses an innovative document and chat interface with BigQuery as a vector database.

Table of contents

What is BigQuery? 

BigQuery is Google's fully managed, serverless data warehouse that enables extremely fast SQL queries. Thanks to the powerful computing infrastructure of Google, BigQuery is ideal for processing large datasets and offers real-time analyses. This is particularly advantageous for the construction industry, which requires fast access to extensive project data.

The role of vector databases in the construction industry

Vector databases store data as high-dimensional vectors and are ideal for similarity searches. In the software, vector embeddings are used to represent the text of construction documents. This enables an efficient search for the most relevant information.

What is a vector database?

A vector database is a special type of database that stores data as vectors in a multidimensional space. These vectors are mathematical objects that have a direction and a magnitude. In this context, these are typically data points that have been converted into a numerical form by algorithms. 

How do vector databases work?

1.
Converting the data into vectors:

First, the data is converted into vectors, often using embedding models. For example, text is converted into numerical vectors. Each vector consists of a list of numbers (coordinates) that represent the positions of the data points in a multidimensional space.

2.
Vector storage:

These vectors are stored in the vector database. Each vector corresponds to a data point, and all vectors together form a kind of point cloud in multi-dimensional space.

3.
Similarity search:

When a request is made, it is also transformed into a vector. The database then searches for the vector that is most similar to the query vector. Similarity is often determined by calculating the cosine similarity or the Euclidean distance between the vectors.

Multidimensional spaces:

A multidimensional space contains not only the three classical dimensions (length, width, height) but many dimensions, sometimes hundreds or thousands. Each of these dimensions represents a specific characteristic of the data.

  • Coordinates in this space: Each vector in this multidimensional space has a specific number of dimensions. The coordinates of a vector indicate exactly where a data point is located in this space. For example, a vector in a three-dimensional space has the coordinates (2, 3, 5). In a space with 100 dimensions, a vector can accordingly contain 100 such numbers.

Introduction to the context cache:

The context cache is a function that stores frequently accessed data so that it does not have to be reprocessed each time. This saves costs and improves response times.

Advantages of Context Caching:

  • Cost reduction: Caching frequently used data lowers operating costs because fewer tokens need to be processed with each request.
  • Reduced latency: Faster access to cached data improves user experience.
  • Sustainability: Less computing power means less energy consumption and leads to more sustainable operations.

Application examples for document search:

1.
Summary of meeting minutes and daily reports:

Query: "Show all meeting minutes and daily reports for construction phase A regarding the implementation of window details."

Process: The system searches all relevant documents and creates a history of the discussions and decisions regarding the window details, including all proposed solutions and their status.

Answer: "Several solutions for the window details were discussed in the meeting minutes and construction daily reports from March 1st to April 15th. The final decision was made on April 10th and is currently in the implementation phase."

2.
Tracking Approvals and Audits

Question: “What approvals and inspections were granted, and when, for the building foundation of construction phase 1?”

Process: The system tracks all relevant approvals and inspections and creates a chronological summary.

Answer: "The foundation permits for construction section 1 were issued on February 5, and the inspections were carried out on February 12 and 19. Both tests were completed successfully."

3.
Material tracking and inventory report:

Query: "Are there material consumption and inventory reports for Phase B?"

Process: The system collects information from material tracking reports and inventory reports.

Answer: "The material tracking report shows that 20 precast concrete elements and 3 steel staircases were installed in construction section B. The current stock level is 102 precast elements and 17 steel staircases."

4.
Change request documentation:

Query: "Show all change requests and their implementation status for project XY."

Process: The system searches the project documentation and displays all change requests with their current status and the respective construction phase.

Answer: "For project XY, there are three change requests concerning the modification of the roof structure, which have already been completed. Order No. 3 concerns additional safety measures that have been approved but not yet started."

5.
Creation of security reports and compliance documents:

Question: “What security measures were documented in construction phase C in the last quarter?”

Process: The system searches the security and compliance documents to create an overview of the measures taken and their results.

Answer: "In the last quarter, a regular safety inspection was carried out for construction phase C. The measures included the installation of safety nets, weekly safety drills, and daily safety inspections by the safety and health coordinator (SiGeKo)."

6.
Warranty and maintenance document tracking:

Query: "What warranty and maintenance work has been carried out on the heating system?"

Process: The system tracks information about the work performed and its data.

Answer: "The heating system was repaired on January 15th. The warranty work included replacing the thermostat on February 20th and repairing the leak on March 5th. All work was documented and completed."

Conclusion:

AI and BigQuery offer fast, precise, and scalable solutions for data analysis and information retrieval, which can significantly improve project management across various industries. Our prototype demonstrates how these technologies can be used to develop a powerful chat interface that efficiently accesses and processes relevant documents.

Interested? Contact us today to learn more about how your processes can be optimized through the use of AI and BigQuery.

Know more?

Would you like to delve deeper into this topic? Then we would be happy to talk to you personally about the possibilities of the Google Cloud Platform (GCP).

Profile picture Christopher Maier
Christopher Maier
Analytics Consultant

Published by:

Christopher Maier

Google Cloud Platform (Cloud Infrastructure | Cloud Solutions) Consultant

author

How did you like the article?

How helpful was this post?

Click on a star to rate!

Average rating 5 / 5.
Number of ratings: 1

No votes so far! Be the first person to rate this post!

INFORMATION

More information

Cover_Photo_SAC_AI_ML_Features_at_a_glance

SAC AI features explained: Joule, Just Ask, and Smart Predict

This wiki explains how to use Smart Predict to create automated forecasting models...
Visual Woman and robot look at each other at a desk, the robot is sitting

SAP Business AI & AI Units

When it comes to AI in the SAP universe, there are basically two approaches...
Visual Databricks and BDC Wiki

What is Databricks? What is the BDC? The ultimate guide to the perfect combination!

In today's data-driven business world, the ability to efficiently analyze and use large amounts of data is crucial for...
Your guide to successful SAC migration

SAC Migration by Q2 2026: The Guide to Transitioning to the Optimized Story Experience

The time for the conversion of SAP Analytics Cloud (SAC)...
Hands with three stars representing the different technologies: SAP Analytics Cloud, SAP Business Data Cloud, SAP Datasphere.

Feature update for SAP Business Data Cloud, Analytics Cloud, and Datasphere

This wiki article summarizes the most important content of the webinar on the topic:...
Lord of the Rings association with connection to SQL and dbt as fighters.

SQL and dbt: The future of modern data transformation

The article describes data processing in companies. Both...
9.1 Differences between SAP Databricks and native Databricks

SAP Databricks vs. Native Databricks: The detailed comparison for your company

In today's data-driven business world, the ability to efficiently analyze and use large amounts of data is crucial for...