Skip to content

Commit abe1346

Browse files
committed
feat(datahub-actions): bundle datahub-gc and datahub-documents venvs (#16781)
1 parent 7f3833b commit abe1346

4 files changed

Lines changed: 7 additions & 4 deletions

File tree

docker/datahub-actions/Dockerfile

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -148,7 +148,7 @@ FROM ingestion-base-slim AS bundled-venvs-full
148148
USER 0
149149

150150
# Set up bundled venv configuration for FULL variant (with PySpark)
151-
ARG BUNDLED_VENV_PLUGINS="s3,demo-data,file"
151+
ARG BUNDLED_VENV_PLUGINS="s3,demo-data,file,datahub-gc,datahub-documents"
152152
ARG BUNDLED_VENV_SLIM_MODE="false"
153153
ARG BUNDLED_CLI_VERSION
154154
ENV DATAHUB_BUNDLED_VENV_PATH=/opt/datahub/venvs
@@ -190,7 +190,7 @@ USER 0
190190

191191
# Set up bundled venv configuration for SLIM variant (without PySpark)
192192
# Venv named s3-bundled but uses s3-slim package internally
193-
ARG BUNDLED_VENV_PLUGINS="s3,demo-data,file"
193+
ARG BUNDLED_VENV_PLUGINS="s3,demo-data,file,datahub-gc,datahub-documents"
194194
ARG BUNDLED_VENV_SLIM_MODE="true"
195195
ARG BUNDLED_CLI_VERSION
196196
ENV DATAHUB_BUNDLED_VENV_PATH=/opt/datahub/venvs
@@ -232,7 +232,7 @@ USER 0
232232

233233
# Set up bundled venv configuration for LOCKED variant (without PySpark, network blocked)
234234
# Same as slim but will have network access disabled in final stage
235-
ARG BUNDLED_VENV_PLUGINS="s3,demo-data,file"
235+
ARG BUNDLED_VENV_PLUGINS="s3,demo-data,file,datahub-gc,datahub-documents"
236236
ARG BUNDLED_VENV_SLIM_MODE="true"
237237
ARG BUNDLED_CLI_VERSION
238238
ENV DATAHUB_BUNDLED_VENV_PATH=/opt/datahub/venvs

metadata-ingestion/pyproject.toml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -363,6 +363,8 @@ datahub-documents = [
363363
"urllib3>=1.26,<3.0",
364364
]
365365

366+
datahub-gc = []
367+
366368
datahub-kafka = [
367369
"confluent_kafka[schemaregistry,avro]>=1.9.0,!= 2.8.1,<3.0.0",
368370
"fastavro>=1.2.0,<2.0.0",

metadata-ingestion/setup.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -680,6 +680,7 @@
680680
"setuptools<82",
681681
},
682682
"datahub-debug": {"dnspython==2.7.0", "requests<3.0.0"},
683+
"datahub-gc": set(),
683684
"datahub-documents": unstructured_lib,
684685
"mode": {"requests<3.0.0", "python-liquid<2", "tenacity>=8.0.1,<9.0.0"}
685686
| sqlglot_lib

metadata-ingestion/uv.lock

Lines changed: 1 addition & 1 deletion
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

0 commit comments

Comments
 (0)