- BigQuery, Google Cloud Platform, AI
- Big Query, Google Cloud
- 4 min reading time
Christopher Maier
In numerous industries, efficiency and precision are of paramount importance. Managing large volumes of project data, such as daily reports, logs, descriptions, material plans, and product data sheets, often presents a complex challenge. However, with AI and BigQuery, you can effortlessly interact with your documents and retrieve the information needed in real time to make informed decisions.
This wiki explains how these technologies can lead to significant improvements using an example from the construction industry. In addition, we present a prototype developed by us that uses an innovative document and chat interface with BigQuery as a vector database.
Table of contents
What is BigQuery?
BigQuery is Google's fully managed, serverless data warehouse that enables extremely fast SQL queries. Thanks to the powerful computing infrastructure of Google, BigQuery is ideal for processing large datasets and offers real-time analyses. This is particularly advantageous for the construction industry, which requires fast access to extensive project data.
The role of vector databases in the construction industry
Vector databases store data as high-dimensional vectors and are ideal for similarity searches. In the software, vector embeddings are used to represent the text of construction documents. This enables an efficient search for the most relevant information.
What is a vector database?
A vector database is a special type of database that stores data as vectors in a multidimensional space. These vectors are mathematical objects that have a direction and a magnitude. In this context, these are typically data points that have been converted into a numerical form by algorithms.
How do vector databases work?
First, the data is converted into vectors, often using embedding models. For example, text is converted into numerical vectors. Each vector consists of a list of numbers (coordinates) that represent the positions of the data points in a multidimensional space.
These vectors are stored in the vector database. Each vector corresponds to a data point, and all vectors together form a kind of point cloud in multi-dimensional space.
When a request is made, it is also transformed into a vector. The database then searches for the vector that is most similar to the query vector. Similarity is often determined by calculating the cosine similarity or the Euclidean distance between the vectors.
Multidimensional spaces:
A multidimensional space contains not only the three classical dimensions (length, width, height) but many dimensions, sometimes hundreds or thousands. Each of these dimensions represents a specific characteristic of the data.
- Coordinates in this space: Each vector in this multidimensional space has a specific number of dimensions. The coordinates of a vector indicate exactly where a data point is located in this space. For example, a vector in a three-dimensional space has the coordinates (2, 3, 5). In a space with 100 dimensions, a vector can accordingly contain 100 such numbers.
Introduction to the context cache:
The context cache is a function that stores frequently accessed data so that it does not have to be reprocessed each time. This saves costs and improves response times.
Advantages of Context Caching:
- Cost reduction: Caching frequently used data lowers operating costs because fewer tokens need to be processed with each request.
- Reduced latency: Faster access to cached data improves user experience.
- Sustainability: Less computing power means less energy consumption and leads to more sustainable operations.
Application examples for document search:
Query: "Show all meeting minutes and daily reports for construction phase A regarding the implementation of window details."
Process: The system searches all relevant documents and creates a history of the discussions and decisions regarding the window details, including all proposed solutions and their status.
Answer: "Several solutions for the window details were discussed in the meeting minutes and construction daily reports from March 1st to April 15th. The final decision was made on April 10th and is currently in the implementation phase."
Question: “What approvals and inspections were granted, and when, for the building foundation of construction phase 1?”
Process: The system tracks all relevant approvals and inspections and creates a chronological summary.
Answer: "The foundation permits for construction section 1 were issued on February 5, and the inspections were carried out on February 12 and 19. Both tests were completed successfully."
Query: "Are there material consumption and inventory reports for Phase B?"
Process: The system collects information from material tracking reports and inventory reports.
Answer: "The material tracking report shows that 20 precast concrete elements and 3 steel staircases were installed in construction section B. The current stock level is 102 precast elements and 17 steel staircases."
Query: "Show all change requests and their implementation status for project XY."
Process: The system searches the project documentation and displays all change requests with their current status and the respective construction phase.
Answer: "For project XY, there are three change requests concerning the modification of the roof structure, which have already been completed. Order No. 3 concerns additional safety measures that have been approved but not yet started."
Question: “What security measures were documented in construction phase C in the last quarter?”
Process: The system searches the security and compliance documents to create an overview of the measures taken and their results.
Answer: "In the last quarter, a regular safety inspection was carried out for construction phase C. The measures included the installation of safety nets, weekly safety drills, and daily safety inspections by the safety and health coordinator (SiGeKo)."
Query: "What warranty and maintenance work has been carried out on the heating system?"
Process: The system tracks information about the work performed and its data.
Answer: "The heating system was repaired on January 15th. The warranty work included replacing the thermostat on February 20th and repairing the leak on March 5th. All work was documented and completed."
Conclusion:
AI and BigQuery offer fast, precise, and scalable solutions for data analysis and information retrieval, which can significantly improve project management across various industries. Our prototype demonstrates how these technologies can be used to develop a powerful chat interface that efficiently accesses and processes relevant documents.
Interested? Contact us today to learn more about how your processes can be optimized through the use of AI and BigQuery.
Know more?
Would you like to delve deeper into this topic? Then we would be happy to talk to you personally about the possibilities of the Google Cloud Platform (GCP).
Published by:
Christopher Maier
Google Cloud Platform (Cloud Infrastructure | Cloud Solutions) Consultant
Christopher Maier
How did you like the article?
How helpful was this post?
Click on a star to rate!
Average rating 5 / 5.
Number of ratings: 1
No votes so far! Be the first person to rate this post!






