-
Notifications
You must be signed in to change notification settings - Fork 46
Description
Is your feature request related to a problem? Please describe.
In our existing bigquery extractor, we collect both bigquery table metadata & its profile (common join & filter) based on audit log. But since the data from audit log are not always having a 1-1 mapping to the table metadata (Sometimes audit log data has out of project context data), we always need to build a map of table stats from the audit log data and later we match the metadata to the map based on table urn.
The approach consumed high memory and can caused OOM error.
Describe the solution you'd like
Since our current compass support upsert-patch asset update, we might could decouple bigquery metadata extraction and bigquery table profile extraction to reduce the memory-footprint of a job.