Toronto, North America
The Non-degree in Build an End-to-End Data Capture Pipeline using Document AI at Google Cloud is a program for international students taught in English.
Google Cloud is a public, private cloud computing services provider that offers a suite of cloud computing services that runs on the same infrastructure that Google uses internally for its end-user products, such as Google Search, Gmail, file storage, and YouTube. Google Cloud, a subsidiary of Alphabet Inc., was officially launched in 2008. It is renowned for its innovative technologies, including data analytics, machine learning, and artificial intelligence, as well as its global network infrastructure, which ensures low latency and high availability. Google Cloud distinguishes itself with its commitment to sustainability, operating the cleanest cloud in the industry with the largest corporate purchase of renewable energy.
This is a self-paced lab that takes place in the Google Cloud console. In this lab you use Cloud Functions and Pub/Sub to create an end-to-end document processing pipeline using Document AI. The Document AI API is a document understanding solution that takes unstructured data, such as documents and emails, and makes the data easier to understand, analyze, and consume.In this lab, you will create a document processing pipeline that will automatically process documents that are uploaded to Cloud Storage. The pipeline consists of a primary Cloud Function that processes new files that are uploaded to Cloud Storage using a Document AI form processor and then saves form data detected in those files to BigQuery. If the form data includes any address fields the address data is then written to a Pub/Sub topic that in turn triggers a second Cloud Function that uses to Geocoding API to provide geographic coordinate data for the address that is also written to BigQuery.This is a simple pipeline that uses a general form processor that will detect basic form data, such as a labelled field containing address information. Document AI processors that use one of the specialized parsers that are beyond the scope of this lab provide enhanced entity information for specific document types even when those documents do not include labelled fields. For example, a Document AI Invoice parser can provide detailed address and supplier information, from an unlabelled invoice document because it understands the layout of invoices.
Google Cloud provides a range of services including computing, data storage, data analytics, and machine learning. It offers tools like Google Cloud Platform (GCP), which provides a robust environment for developers to build, test, and deploy applications. Additionally, Google Cloud includes G Suite (now known as Google Workspace), a collection of productivity and collaboration tools such as Gmail, Docs, Drive, and Calendar. Google Cloud is designed to provide organizations with the tools they need to drive innovation, improve operational efficiency, and scale their services securely and reliably. Its client base spans various industries, including financial services, healthcare, retail, and education.
Application Fee
$0 USD
$0 USD
Tuition Fee
$49 USD
$49 USD
per year
All students from all countries are eligible to apply to this program.
1
Step 1
Choose programs
2
Step 2
Apply online
3
Step 3
Enroll
Application Fee
$0 USD
Service Fee
$0 USD
Tuition
49
Boost Your Acceptance Rate
Easy Online Application
Thousands of international students use Global Admissions with 4.9 star reviews
Free Service to Partner Universities or upgrade to our Guaranteed Service