Di 2095 datahub upgrade to v13 #70

Open
wants to merge 793 commits into
base: master
ac1ee6c
fix(lineage): logging reduction and fixes (#9878)
RyanHolstien Feb 21, 2024
f13ae77
feat(ingest): add escape hatch methods to SqlParsingAggregator (#9860)
hsheth2 Feb 21, 2024
b15b352
fix(ingest): better logging line attribution (#9876)
hsheth2 Feb 21, 2024
ef3a814
fix(ingest): bump sqlglot + add some debug info to tests (#9867)
hsheth2 Feb 21, 2024
3b88ebb
docs(ingest): document remote for ingest deploy (#9897)
gabe-lyons Feb 21, 2024
fbc291c
build(workflow): test enable workflow for docs change (#9901)
david-leifker Feb 21, 2024
943edc3
feat(ingest/mongodb): improve sorting when downsampling collection sc…
TonyOuyangGit Feb 21, 2024
60e4b2d
feat(ingest/snowflake): speed up snowflake usage query (#9899)
hsheth2 Feb 21, 2024
62f2789
feat(structured-properties): soft delete (#9812)
david-leifker Feb 22, 2024
b01416c
feat(ingest): add timeout to sqlglot parser (#9868)
hsheth2 Feb 23, 2024
639dd14
feat(models): Add subtypes aspect to dataFlow (#9866)
asikowitz Feb 23, 2024
def4b24
fix(cli): add-sibling preserve existing siblings (#9907)
stevenayers Feb 23, 2024
3921588
feat(ingest/redshift): redshift lineage v2 (#9904)
hsheth2 Feb 24, 2024
c1332c6
build(frontend): Add tsc to lint (#9911)
asikowitz Feb 24, 2024
1c5e66d
fix(ui/posts): add padding to post link card (#9883)
gaurav2733 Feb 26, 2024
5921a33
feat(ingestion/trino): Add sibling support in ingestion (#9853)
shubhamjagtap639 Feb 26, 2024
14648f4
fix(tests): Change secret name to avoid conflicts (#9917)
Kunal-kankriya Feb 26, 2024
21ecb71
feat(policies): Make policies searchable by privilege, type, status o…
pedro93 Feb 26, 2024
93acb82
feat(ci): exempt more names from being marked as community (#9918)
anshbansal Feb 26, 2024
a1f2216
feat(ingest/qlik): Qlik cloud connector integration (#9682)
shubhamjagtap639 Feb 26, 2024
02f41b7
feat(ingest): use `uv` for python package installs (#9885)
hsheth2 Feb 26, 2024
ca613ba
feat(ingest/tableau): add non-embedded custom sql to project containe…
hsheth2 Feb 26, 2024
d2d9661
fix(ingest/patch): Adding proper json patch path quoting (#9872)
treff7es Feb 27, 2024
5889d84
fix(ui/token): use correct API to filter for expired tokens on UI (#9…
gaurav2733 Feb 27, 2024
7ee86bb
feat(ingest/slack): add option to ingest slack channels (#9884)
anshbansal Feb 27, 2024
336d054
feat(ingest/csv-enricher): add support for ownership type urn (#9935)
anshbansal Feb 27, 2024
ddf0b7d
feat(patch): refactor cll patch (#9922)
RyanHolstien Feb 27, 2024
aa5ba5b
fix(pr): make prs only pull changes from pr branch instead of head me…
RyanHolstien Feb 27, 2024
1bcc9a6
feat(ingest): loosen sentry requirements (#9936)
hsheth2 Feb 27, 2024
6888bfb
fix(react): include flow diagram in docs build (#9785)
hsheth2 Feb 28, 2024
f561efe
fix(ingest/mysql): Adding support for bit type (#9950)
treff7es Feb 28, 2024
f399a87
feat(ingest/patch): add helper for auto-quoting (#9938)
hsheth2 Feb 28, 2024
92b1cfa
feat(ingest): Support for JSONL in s3 source with max_rows support (#…
Adityamalik123 Feb 28, 2024
1736edf
feat(ingest): fix bugs in SqlParsingAggregator (#9926)
hsheth2 Feb 28, 2024
3b73555
fix(ui/schema): error handling add (#9952)
anshbansal Feb 28, 2024
b45c04f
fix(tests): test_group_upsert smoke test updated (#9888)
Kunal-kankriya Feb 28, 2024
55bc955
feat(ci): skip smoke-test lint if there are no changes (#9945)
hsheth2 Feb 28, 2024
ed10a8d
feat(search): search access controls (#9892)
david-leifker Feb 28, 2024
29274c1
feat(ingest/sql-parser): add alias for mariadb (#9956)
hsheth2 Feb 29, 2024
0c7ca4c
docs(ingest/lookml): update known discrepancy list (#9941)
hsheth2 Feb 29, 2024
9a7c097
chore(vulnerability): Bumped up versions for vulnerability fix (#9929)
pankajmahato-visa Feb 29, 2024
8b6790e
Revert "chore(vulnerability): Bumped up versions for vulnerability fi…
RyanHolstien Feb 29, 2024
3284235
bump(kafka-setup): client version bump (#9962)
david-leifker Feb 29, 2024
4c83f10
feat(ingest): throw codegen error on duplicate class names (#9960)
hsheth2 Mar 1, 2024
05593f4
feat(docker): respect pip mirrors with uv (#9963)
hsheth2 Mar 1, 2024
6eb5f80
Openlineage endpoint and Spark Lineage Beta Plugin (#9870)
treff7es Mar 1, 2024
1b4f31b
fix(ingest/json-schema): adding support descriptions for array (#9757)
AvaniSiddhapuraAPT Mar 1, 2024
70adf73
fix(ingest/redshift): fix bug in lineage v2 table renames (#9967)
hsheth2 Mar 4, 2024
d987707
feat(ingest): speed up to_obj() and validate() (#9969)
hsheth2 Mar 4, 2024
9dd0c37
feat(ingest): fix fspath lint error (#9976)
hsheth2 Mar 5, 2024
67406aa
docs: archive old version before 0.12.0 & fix broken links (#9957)
yoonhyejin Mar 5, 2024
70656b3
fix(ui/editor): arrows change field when editing description (#9949)
gaurav2733 Mar 5, 2024
dde9687
feat(ui/policies): add filter for Active/Inactive/All on policy page …
gaurav2733 Mar 5, 2024
a85db60
feat(ui): add option to add picture link for groups (#9882)
akarsh991 Mar 5, 2024
5bee25f
feat(ingest): add Looks subtype + stop reemitting browsePathV2 (#9978)
hsheth2 Mar 5, 2024
782d33d
fix(ingest/bigquery): escape special characters for table description…
AvaniSiddhapuraAPT Mar 5, 2024
a0efb9d
feat(ui): add loading spin to access management table (#9974)
filipe-caetano-ovo Mar 5, 2024
4d053f1
fix(ingestion/fivetran): Fix fivetran get connector jobs bug (#9975)
shubhamjagtap639 Mar 5, 2024
4189ca9
feat(ingest/dbt): generate CLL for all node types (#9964)
hsheth2 Mar 5, 2024
c0aedd4
chore(search): bump OpenSearch image version to 2.11.0 (#9984)
darnaut Mar 5, 2024
fda5eb8
feat(ingest): enable stateful_ingestion by default for DataHub rest s…
shubhamjagtap639 Mar 5, 2024
f597800
feat(ingestion/cli): Adding check option to validate allow/deny and p…
treff7es Mar 5, 2024
ab12385
fix(ingest): only import PathSpec when necessary (#9989)
hsheth2 Mar 5, 2024
d476f5f
feat(config): add configuration to reprocess UI sourced events (#9988)
RyanHolstien Mar 6, 2024
eed0abe
feat(pluginRegistry): add configuration to reduce runnable frequency …
RyanHolstien Mar 6, 2024
3e31ae4
build(react): Fix typescript errors in test files (#9982)
sumitappt Mar 6, 2024
6d3f087
feat(docs): disable last update timestamps (#9987)
hsheth2 Mar 6, 2024
289a484
feat: add versioned content for 0.12.1 (#9944)
yoonhyejin Mar 6, 2024
a59f3cb
doc: add version 0.13.0 (#9991)
yoonhyejin Mar 6, 2024
9ad0a0c
fix: fix mobile view and subtitles on slack/calendar page (#9822)
yoonhyejin Mar 6, 2024
4fbe814
fix(ingest/redshift): fix stl scan lineage for lineage v2 (#9986)
hsheth2 Mar 6, 2024
68a26b4
fix(ingest/delta-lake): support parsing nested types correctly (#9862)
dushayntAW Mar 6, 2024
1b3bbf8
fix(test): nested domains (#9993)
david-leifker Mar 6, 2024
7de517c
fix(ci): refactor build-and-test command (#9999)
hsheth2 Mar 6, 2024
8047d0b
feat(ingest/snowflake): generate query nodes for snowflake (#9966)
mayurinehate Mar 6, 2024
e963047
fix(ingest/unity): creating group urn in case of group (#9951)
dushayntAW Mar 7, 2024
7d10547
fix(ui/browse): hide data products option in left side bar (#10001)
gaurav2733 Mar 7, 2024
6227754
feat(ingest/redshift): make query generation configurable (#10000)
hsheth2 Mar 7, 2024
c4a4532
fix(opensearch): Rollover usage events at a file size rather than tim…
darnaut Mar 7, 2024
03a677a
chore(java): bump java dependency versions (#10009)
david-leifker Mar 7, 2024
6e8a2eb
ci(react): Update package.json to enable lint check (#10011)
sumitappt Mar 8, 2024
937f58b
fix(ui/ingest): trim leading and trailing whitespaces from the text (…
gaurav2733 Mar 8, 2024
e1c4f61
fix(policy-backfull): fix policy backfill job (#10016)
david-leifker Mar 8, 2024
927775d
feat(opensearch): support for updating ISM policy used for usage even…
darnaut Mar 8, 2024
28b11d4
refactor(react): Provide option to skip importing theme in CustomThem…
asikowitz Mar 8, 2024
4dddd7f
fix(openapi): fix openapi openlineage endpoint (#10019)
david-leifker Mar 8, 2024
b6956f9
feat(ingest): update sqlglot fork (#10022)
hsheth2 Mar 11, 2024
ae3443b
feat(ingest/superset): map awsathena platform name to athena (#10005)
LePuppy Mar 11, 2024
7540e64
fix(ingest/redshift): patch instead of replace redshift custom proper…
ethan-cartwright Mar 11, 2024
bcae7ac
fix(ingest/slack): tweak docs for slack source (#10007)
hsheth2 Mar 11, 2024
92a3ac6
fix(ingest): use contextvar for cooperative timeout (#10021)
hsheth2 Mar 11, 2024
20e15ff
feat(ingest): improve custom package metadata (#9985)
hsheth2 Mar 11, 2024
2fe3583
feat(docs): build website using swc-loader instead of babel (#9977)
hsheth2 Mar 11, 2024
7e2076e
feat(ingest): add query formatting to sql aggregator (#10025)
hsheth2 Mar 11, 2024
5937472
feat(ingest): add DataHubGraph.emit_all method (#10002)
hsheth2 Mar 11, 2024
2265ae9
feat(ingestion): Support for Server-less Redshift (#9998)
skrydal Mar 12, 2024
28f16aa
fix(ingest/teradata): small teradata improvements (#9953)
treff7es Mar 12, 2024
2de0e62
feat(ingest): add classification for sql sources (#10013)
mayurinehate Mar 12, 2024
3fcc372
docs(monitoring): add health check endpoint (#10033)
kopax-polyconseil Mar 12, 2024
4155a62
feat(ingest/dbt): capture both raw and compiled code (#10026)
hsheth2 Mar 12, 2024
4535f2a
fix(ingest/redshift): Temp table lineage fix (#10008)
treff7es Mar 12, 2024
b0163c4
feat(ingest): utilities for query logs (#10036)
hsheth2 Mar 13, 2024
729090b
docs: add missing api sample docs (#9869)
yoonhyejin Mar 13, 2024
239ae31
feat(gms): add aspect name to siblings hook log (#10044)
hsheth2 Mar 14, 2024
77c72da
feat(ingest): add classification to bigquery, redshift (#10031)
mayurinehate Mar 14, 2024
0f2b15c
fix(ui/lineage): show data is too large error when limitation exceeds…
gaurav2733 Mar 14, 2024
d3b0450
feat(ci): exempt more names from community (#10039)
mayurinehate Mar 15, 2024
3f85896
docs: improve versiondropdown design & set docs main to /features (#9…
yoonhyejin Mar 15, 2024
d4d175d
fix(ingest/redshift): tweak lineage v2 queries (#10045)
hsheth2 Mar 15, 2024
ecfab6b
chore(aws-msk-iam-auth): bump dependency version (#10063)
darnaut Mar 15, 2024
84bdf81
feat(lineage): add priority to via node (#10034)
RyanHolstien Mar 16, 2024
ad4da57
docs(acryl-cloud): notes for 0.2.16 (#10069)
anshbansal Mar 18, 2024
3a4bdef
fix(ingest/unity): generate sibling and lineage (#9894)
dushayntAW Mar 18, 2024
104e787
fix(ingest): only auto-enable stateful ingestion if pipeline name is …
hsheth2 Mar 18, 2024
0593056
feat(ingest/s3): set default spark version (#10057)
hsheth2 Mar 18, 2024
656f94b
feat(ingest): better rest emitter error message (#10073)
hsheth2 Mar 18, 2024
1248635
docs(sdk): Update API guide with example for Acryl (#10072)
gabe-lyons Mar 18, 2024
630f2a1
feat(ingest): check for private import path usages (#10059)
hsheth2 Mar 18, 2024
f4d4e79
feat(ingest): add sql formatter utility (#10064)
hsheth2 Mar 19, 2024
11eda4b
feat(ingest): refactor LineageConfig class (#10074)
hsheth2 Mar 19, 2024
1f9ce9d
feat(ingest/dbt): point dbt assertions at dbt nodes (#10055)
hsheth2 Mar 19, 2024
64cb5d1
feat(dbt): show source and compiled code in the UI (#10028)
hsheth2 Mar 19, 2024
11f6ab6
feat(ui/ingest): ingestion form for Okta and AzureAD (#9829)
gaurav2733 Mar 19, 2024
59a26a6
Update domains docs to include nested domains (#9890)
eboneil Mar 19, 2024
9aa099f
fix(ingestion): Handle Redshift string length limit in Serverless mod…
skrydal Mar 19, 2024
49de01b
build(deps): bump follow-redirects from 1.15.4 to 1.15.6 in /docs-web…
dependabot[bot] Mar 19, 2024
a676aa6
build(deps): bump es5-ext from 0.10.62 to 0.10.63 in /docs-website (#…
dependabot[bot] Mar 19, 2024
60a1291
fix(lineage): fix array out of bounds error (#10081)
david-leifker Mar 19, 2024
cc0552a
Add owners, tags, glossary terms to dataset yaml loader (#9859)
eboneil Mar 20, 2024
446fbe5
fix(ingest/slack): add rate limiting to more places in slack source (…
eboneil Mar 20, 2024
43ac405
fix(metadata-ingestion)glue connector failure when Optional field Typ…
siladitya2 Mar 20, 2024
70ab759
feat(redshift): adds flag to skip all external tables (#10040)
sgomezvillamor Mar 20, 2024
c1f6efa
feat(models) : Joins (Datasets) schema, resolvers and UI (#8325)
poorvi767 Mar 20, 2024
5a3df32
feat(properties) Add upsertStructuredProperties graphql endpoint for …
chriscollins3456 Mar 20, 2024
87169ba
Clean up logic for dataset.py yaml loader (#10089)
eboneil Mar 20, 2024
6c3834b
feat(ingest/dbt): add option to skip sources (#10077)
hsheth2 Mar 21, 2024
8c21b17
feat(ingest): support incremental column-level lineage (#10090)
hsheth2 Mar 21, 2024
c480b59
feat(ingest/powerbi): add chart subtypes (#10076)
hsheth2 Mar 21, 2024
7a2d61d
fix(ingest/metabase): Use connect_uri instead of display_uri to query…
diegmonti Mar 21, 2024
e6e5c09
feat(tableau): ability to force extraction of table/column level lina…
alexs-101 Mar 21, 2024
9659d60
feat(ingest/datahub-gc): gc source to cleanup things (#10085)
anshbansal Mar 21, 2024
a6b1701
docs(acryl-cloud): fix year in notes from 2023 to 2024 (#10095)
anshbansal Mar 21, 2024
d552106
feeat(openapi): add batch endpoint to v2 using requestbody (#10100)
RyanHolstien Mar 21, 2024
af06f95
fix(ingest/dbt): fix config validator for skip_sources_in_lineage (#1…
hsheth2 Mar 21, 2024
bd7a6b6
docs: add gtm tag (#10083)
yoonhyejin Mar 22, 2024
1cff5ef
docs: add doc for assertions & data contracts (#10029)
yoonhyejin Mar 22, 2024
a4a556a
test(ingest/mssql): use non-ephemeral mapping port (#10104)
hsheth2 Mar 22, 2024
2248737
fix(ingestion/unity-catalog): patch owners and properties (#10086)
dushayntAW Mar 22, 2024
dd502ae
fix(ingest): added new transformer to cleanup suffix/prefix in owner …
dushayntAW Mar 22, 2024
27612bd
fix(ui/user-group): add non existent entity page for user (#10004)
gaurav2733 Mar 22, 2024
d573acd
fix(resolver): Allow users to add/remove related terms for children g…
pinakipb2 Mar 22, 2024
332e417
Increase role member count in listRoles query to 20 from 10 (#10020)
jayasimhankv Mar 22, 2024
e0cc390
fix(frontend): exclude plugins/frontend/auth/user.props config does n…
Masterchen09 Mar 22, 2024
13f4993
fix(ui): show dataset display name in browse paths v2 (#10054)
Masterchen09 Mar 22, 2024
e484094
fix(metrics): get fieldName for GraphQL Mutation queries (#9972)
trialiya Mar 22, 2024
a4418f1
feat(UI): disable access management ui when no roles are linked to en…
githendrik Mar 22, 2024
8cb65bd
ci(filters): add graphql code to backend trigger (#10113)
david-leifker Mar 22, 2024
7315e6c
test(urn): add test case (#10112)
david-leifker Mar 22, 2024
36e12f7
fix(ui) Add min width to the usage stats component (#10056)
chriscollins3456 Mar 22, 2024
35cf4f8
log(system-update): Update DataHubStartupStep.java (#9971)
david-leifker Mar 23, 2024
93b5907
fix(usage-stats): usage-stats error handling and filter (#10105)
david-leifker Mar 23, 2024
a0d952d
fix(elasticsearch logging): log how long bulk execution took (#10116)
darnaut Mar 23, 2024
f9e64d0
feat(auth): view authorization (#10066)
david-leifker Mar 23, 2024
088e7a8
fix(searchContext): fix search flag immutability (#10117)
david-leifker Mar 23, 2024
9de15a2
fix(ingest/looker): use `external_base_url` for explore url generatio…
k7ragav Mar 24, 2024
7e5610f
feat(ingest/dagster): Dagster source (#10071)
treff7es Mar 25, 2024
c8a3818
fix(forms) Fix a couple of small inconsistencies with forms (#9928)
chriscollins3456 Mar 25, 2024
892a37d
fix: exclude Elasticsearch ignore_throttled warnings from log (#10042)
Masterchen09 Mar 25, 2024
eb2278c
Update build-and-test.yml (#10127)
david-leifker Mar 25, 2024
f726135
fix(mae-consumer): fix aspect retriever injections mae-consumer (#10125)
david-leifker Mar 25, 2024
5195d3a
fix(docs): fix docs build (#10129)
RyanHolstien Mar 25, 2024
fc03a1c
fix(search): respect the search flags term bucket size (#10130)
david-leifker Mar 25, 2024
45f6c2a
fix(ingestProposal): fix/handle no-op ingestion (#10126)
david-leifker Mar 25, 2024
1598070
fix(ci): simplify python release process (#10133)
hsheth2 Mar 26, 2024
2657c8f
feat(lineage): add a parameter to allow limiting the per hop explorat…
RyanHolstien Mar 26, 2024
e4ebf34
feat(ingest/bigquery): Respect dataset and table patterns when ingest…
ANich Mar 26, 2024
a70e775
feat(ingest): emit platform for query entities (#10103)
hsheth2 Mar 26, 2024
e97e682
feat(ingest): loosen pyarrow dep (#10141)
hsheth2 Mar 26, 2024
1febe68
fix(ingest/dbt): respect `convert_column_urns_to_lowercase` in mappin…
hsheth2 Mar 26, 2024
ef0048e
chore(ingestion-base): update base requirements (#10142)
david-leifker Mar 26, 2024
f0bdc24
feat(ingest/dbt): dbt model performance (#9992)
hsheth2 Mar 27, 2024
9928d70
fix(ingest/databricks): support hive metastore schemas with special c…
mayurinehate Mar 27, 2024
95a444a
feat(ui): sort partition keys to the top of the table for better visi…
ngamanda Mar 27, 2024
85c54f5
fix: OBS-729 | Filters: Fix alignment on nested dropdown (#10140)
sumitappt Mar 27, 2024
0361f24
feat(ingest/dynamodb): add support for classification (#10138)
mayurinehate Mar 27, 2024
bcd7240
feat(incidents) incident resolution note more clearly displayed (#10151)
jayacryl Mar 27, 2024
2e8936d
fix(entity-client): fix entity client cache and test (#10149)
david-leifker Mar 27, 2024
654d991
chore(ingest): update doc & log detail (#10139)
HuanjieGuo Mar 27, 2024
07ef677
feat(ingest): loosen airflow plugin dependencies requirements (#10106)
hsheth2 Mar 27, 2024
25d9d66
feat(ingest): fix validators (#10115)
hsheth2 Mar 27, 2024
e043587
feat(ingest/bigquery): improve debug logs (#10101)
hsheth2 Mar 27, 2024
35ec7df
fix(graphQL): Ignore soft-deleted assertions in UI calls (#10148)
pedro93 Mar 27, 2024
93febbb
fix(openapi): fix system-metadata response (#10155)
david-leifker Mar 27, 2024
db668e8
docs: update markprompt project key (#10134)
yoonhyejin Mar 28, 2024
4d69cea
add row type for athena types (#10131)
rae89 Mar 28, 2024
140c0f1
fix(setup): fix postgres setup to create temp table with no data (#10…
trialiya Mar 28, 2024
4e328c3
feat(ingest/looker): update browse paths to align with looker UI (#10…
mayurinehate Mar 28, 2024
32a2de4
feat(ingest/airflow): allow plugin to load on listener exception (#10…
hsheth2 Mar 28, 2024
9f2c5d3
feat(ingestion/bigquery): BigQuery Owner Label to Datahub Ownership (…
shubhamjagtap639 Mar 28, 2024
61c21e1
feat(ingest): bump sqlglot dep (#10144)
hsheth2 Mar 29, 2024
0de20b6
docs(website): tweak eyebrow copy (#10143)
hsheth2 Mar 29, 2024
cb9ee89
docs: upgrade markprompt version (#10159)
yoonhyejin Mar 29, 2024
c1eff59
fix(openapi): fix index out of bounds for sort order (#10168)
RyanHolstien Mar 29, 2024
a72e9a8
fix(search): fix field name in api (#10170)
RyanHolstien Mar 29, 2024
4bda3d8
build(docker): prefix pr on pr sha tags (#10171)
david-leifker Mar 29, 2024
7f7d713
Revert docker helper changes (#10172)
david-leifker Mar 31, 2024
c3257d8
feat(metadata-jobs): improve consumer logging (#10173)
darnaut Mar 31, 2024
6aa6a8f
test(graph): refactor graph test (#10175)
david-leifker Apr 1, 2024
3e39129
fix(ingest/tableau) Fix Tableau lineage ingestion from Clickhouse (#1…
valeral Apr 1, 2024
14bbc0b
<fix>[oracle ingestion]: get database name when using service (#10158)
Nelvin73 Apr 1, 2024
ef637cc
fix(docker): fix versioning for compose file post release (#10176)
RyanHolstien Apr 1, 2024
9a0a53b
fix(restoreIndices): batchSize vs limit (#10178)
david-leifker Apr 1, 2024
bf52807
feat(ui): show classification in test connection (#10156)
hsheth2 Apr 2, 2024
db33c86
fix(ingest): add classification dep for dynamodb (#10162)
hsheth2 Apr 2, 2024
c9b9afc
feat(ingest/dbt): enable model performance and compiled code by defau…
hsheth2 Apr 2, 2024
77c4629
refactor(docker): move to acryldata repo for all images (#9459)
david-leifker Apr 2, 2024
3671860
fix(github): fix docker publish (#10186)
david-leifker Apr 2, 2024
a89e189
feat(lineage): mark nodes as explored (#10180)
RyanHolstien Apr 2, 2024
e0b20e1
feat(ingest/gc): add index truncation logic (#10099)
anshbansal Apr 2, 2024
4bba834
fix(entity-service): fix findFirst when already present (#10187)
david-leifker Apr 2, 2024
2873736
fix(ingestion/salesforce): fixed the issue by escaping the markdown s…
dushayntAW Apr 2, 2024
0a8ec37
hotfix(datahub-upgrade): fix schema for datahub upgrade mcls
david-leifker Apr 16, 2024
efe2520
Merge tag 'v0.13.2' of https://github.com/datahub-project/datahub int…
arosanda May 6, 2024
a1969d2
downgrade spring boot to v2
arosanda May 6, 2024
b1f6809
Revert "downgrade spring boot to v2"
arosanda May 6, 2024
c904d4c
try change logo
arosanda May 9, 2024
c162750
.
arosanda May 9, 2024
3c89757
.
arosanda May 10, 2024
4531a5b
.
arosanda May 10, 2024
50b7b98
.
arosanda May 10, 2024
8809068
another try
arosanda May 20, 2024
c8564a0
.
arosanda Jul 2, 2024
da00fe2
change download kafka url
arosanda Jul 15, 2024
d9a996a
.
arosanda Jul 15, 2024
11f3c4f
.
arosanda Jul 15, 2024
b1ac93a
last try change logo
arosanda Jul 16, 2024
44fd3fb
.
arosanda Jul 19, 2024
cf50c76
.
arosanda Jul 19, 2024
a090a35
remove health status auth
arosanda Jul 26, 2024
2de0774
delete file
arosanda Jul 26, 2024
b447105
delete commented lines
arosanda Jul 26, 2024
b820e83
delete commented lines 2
arosanda Jul 26, 2024
1e105c7
delete commented lines 3
arosanda Jul 26, 2024
3b170d1
Revert "delete commented lines 3"
arosanda Jul 31, 2024
2e85ae4
Revert "delete commented lines 2"
arosanda Jul 31, 2024
882d8ca
Revert "delete commented lines"
arosanda Jul 31, 2024
952a7e0
Revert "delete file"
arosanda Jul 31, 2024
44a5bb0
Revert "remove health status auth"
arosanda Jul 31, 2024
958720b
types-pkg_resources > types-setuptools
arosanda Aug 23, 2024
450eae8
Revert "types-pkg_resources > types-setuptools"
arosanda Aug 23, 2024
The diff you're trying to view is too large. We only load the first 3000 changed files.
9 changes: 7 additions & 2 deletions .dockerignore
@@ -1,6 +1,11 @@
 **/node_modules/
-datahub-frontend/build/
-metadata-ingestion/venv/
+*/build/
+*/*/build/
+**/venv/
+**/.tox/
+**/.mypy_cache/
+**/.pytest_cache/
+**/__pycache__/
 out
 **/*.class
 # Have to copy gradle/wrapper/gradle-wrapper.jar, can't exclude ALL jars
87 changes: 87 additions & 0 deletions .github/actions/ci-optimization/action.yml
@@ -0,0 +1,87 @@
+name: 'Identify CI Optimizations'
+description: 'Determine if code changes are specific to certain modules.'
+
+outputs:
+  frontend-only:
+    description: "Frontend only change"
+    value: ${{ steps.filter.outputs.frontend == 'true' && steps.filter.outputs.ingestion == 'false' && steps.filter.outputs.backend == 'false' }}
+  ingestion-only:
+    description: "Ingestion only change"
+    value: ${{ steps.filter.outputs.frontend == 'false' && steps.filter.outputs.ingestion == 'true' && steps.filter.outputs.backend == 'false' }}
+  backend-only:
+    description: "Backend only change"
+    value: ${{ steps.filter.outputs.frontend == 'false' && steps.filter.outputs.ingestion == 'false' && steps.filter.outputs.backend == 'true' }}
+  backend-change:
+    description: "Backend code has changed"
+    value: ${{ steps.filter.outputs.backend == 'true' }}
+  ingestion-change:
+    description: "Ingestion code has changed"
+    value: ${{ steps.filter.outputs.ingestion == 'true' }}
+  frontend-change:
+    description: "Frontend code has changed"
+    value: ${{ steps.filter.outputs.frontend == 'true' }}
+  docker-change:
+    description: "Docker code has changed"
+    value: ${{ steps.filter.outputs.docker == 'true' }}
+  kafka-setup-change:
+    description: "Kafka setup docker change"
+    value: ${{ steps.filter.outputs.kafka-setup == 'true' }}
+  mysql-setup-change:
+    description: "Mysql setup docker change"
+    value: ${{ steps.filter.outputs.mysql-setup == 'true' }}
+  postgres-setup-change:
+    description: "Postgres setup docker change"
+    value: ${{ steps.filter.outputs.postgres-setup == 'true' }}
+  elasticsearch-setup-change:
+    description: "Elasticsearch setup docker change"
+    value: ${{ steps.filter.outputs.elasticsearch-setup == 'true' }}
+  smoke-test-change:
+    description: "Smoke test change"
+    value: ${{ steps.filter.outputs.smoke-test == 'true' }}
+runs:
+  using: "composite"
+  steps:
+    - uses: dorny/paths-filter@v2
+      id: filter
+      with:
+        filters: |
+          frontend:
+            - "datahub-frontend/**"
+            - "datahub-web-react/**"
+            - "smoke-test/tests/cypress/**"
+            - "docker/datahub-frontend/**"
+          ingestion:
+            - "metadata-ingestion-modules/**"
+            - "metadata-ingestion/**"
+            - "metadata-models/**"
+            - "smoke-test/**"
+            - "docker/datahub-ingestion**"
+          docker:
+            - "docker/**"
+          backend:
+            - ".github/**"
+            - "metadata-models/**"
+            - "datahub-upgrade/**"
+            - "entity-registry/**"
+            - "li-utils/**"
+            - "metadata-auth/**"
+            - "metadata-dao-impl/**"
+            - "metadata-events/**"
+            - "metadata-io/**"
+            - "metadata-jobs/**"
+            - "metadata-service/**"
+            - "metadata-utils/**"
+            - "metadata-operation-context/**"
+            - "datahub-graphql-core/**"
+            - "smoke-test/**"
+            - "docker/**"
+          kafka-setup:
+            - "docker/kafka-setup/**"
+          mysql-setup:
+            - "docker/mysql-setup/**"
+          postgres-setup:
+            - "docker/postgres-setup/**"
+          elasticsearch-setup:
+            - "docker/elasticsearch-setup/**"
+          smoke-test:
+            - "smoke-test/**"
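Review note: the three `*-only` outputs are plain boolean combinations of the `frontend`, `ingestion` and `backend` filter results, and several paths (for example `smoke-test/**` and `metadata-models/**`) sit in more than one group. The sketch below is only an approximation of that logic: it uses a reduced copy of the filter table, hypothetical file names, and Python's `fnmatch`, whose glob semantics are not guaranteed to match those of `dorny/paths-filter`.

```python
from fnmatch import fnmatch

# Reduced, illustrative copy of the filter table from action.yml.
# Assumption: fnmatch is close enough to the action's glob matching
# for these simple "<dir>/**" patterns.
FILTERS = {
    "frontend": ["datahub-web-react/**", "smoke-test/tests/cypress/**"],
    "ingestion": ["metadata-ingestion/**", "metadata-models/**", "smoke-test/**"],
    "backend": ["metadata-models/**", "metadata-service/**", "smoke-test/**"],
}


def matched_groups(changed_files):
    """Return {group: bool}, True when any changed file matches the group."""
    return {
        group: any(
            fnmatch(path, pattern) for path in changed_files for pattern in patterns
        )
        for group, patterns in FILTERS.items()
    }


def only_outputs(groups):
    """Mirror the three '*-only' expressions from the outputs section."""
    frontend = groups["frontend"]
    ingestion = groups["ingestion"]
    backend = groups["backend"]
    return {
        "frontend-only": frontend and not ingestion and not backend,
        "ingestion-only": ingestion and not frontend and not backend,
        "backend-only": backend and not frontend and not ingestion,
    }


if __name__ == "__main__":
    # Hypothetical changed-file lists, for illustration only.
    for files in (
        ["datahub-web-react/src/App.tsx"],
        ["smoke-test/tests/cypress/example.js"],
    ):
        print(files, only_outputs(matched_groups(files)))
```

Under these assumptions, a change confined to `smoke-test/tests/cypress/` matches all three groups, so none of the `*-only` outputs would be true for it.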
13 changes: 11 additions & 2 deletions .github/actions/docker-custom-build-and-push/action.yml
@@ -20,7 +20,7 @@ inputs:
     required: false

   images:
-    # e.g. linkedin/datahub-gms
+    # e.g. acryldata/datahub-gms
     description: "List of Docker images to use as base name for tags"
     required: true
   build-args:
@@ -70,11 +70,20 @@ runs:
         push: false
         cache-from: type=registry,ref=${{ steps.docker_meta.outputs.tags }}
         cache-to: type=inline
+    - name: Single Tag
+      if: ${{ inputs.publish != 'true' }}
+      shell: bash
+      run: |
+        TAGS="""
+        ${{ steps.docker_meta.outputs.tags }}
+        """
+        echo "SINGLE_TAG=$(echo $TAGS | tr '\n' ' ' | awk -F' ' '{ print $1 }')" >> $GITHUB_OUTPUT
+      id: single_tag
     - name: Upload image locally for testing (if not publishing)
       uses: ishworkh/docker-image-artifact-upload@v1
       if: ${{ inputs.publish != 'true' }}
       with:
-        image: ${{ steps.docker_meta.outputs.tags }}
+        image: ${{ steps.single_tag.outputs.SINGLE_TAG }}

     # Code for building multi-platform images and pushing to Docker Hub.
     - name: Set up QEMU
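Review note: the new `Single Tag` step reduces the tag list to its first entry before handing it to the upload step. The shell pipeline (`echo $TAGS | tr ... | awk '{ print $1 }'`) amounts to taking the first whitespace-separated token. A minimal Python sketch of the same idea follows; the tag values are made up for illustration and the function name is hypothetical.

```python
def single_tag(tags: str) -> str:
    """Return the first whitespace-separated token of a multi-line tag list.

    Mirrors the intent of the shell pipeline in the 'Single Tag' step:
    leading/trailing blank lines are ignored and only the first tag is kept.
    """
    tokens = tags.split()
    return tokens[0] if tokens else ""


if __name__ == "__main__":
    # Hypothetical tag list, shaped like a newline-separated tags output.
    example = "\nacryldata/datahub-gms:pr70\nacryldata/datahub-gms:abc1234\n"
    print(single_tag(example))
```

This matters once each ref produces more than one tag, because the upload step takes a single `image` value.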
28 changes: 20 additions & 8 deletions .github/scripts/check_policies.py
@@ -20,7 +20,7 @@
     elif urn == "urn:li:dataHubPolicy:editor-platform-policy":
         editor_platform_policy_privileges = policy["info"]["privileges"]
     elif urn == "urn:li:dataHubPolicy:7":
-        all_user_platform_policy_privilges = policy["info"]["privileges"]
+        all_user_platform_policy_privileges = policy["info"]["privileges"]
     try:
         doc_type = policy["info"]["type"]
         privileges = policy["info"]["privileges"]
@@ -54,10 +54,22 @@
 )
 assert len(diff_policies) == 0, f"Missing privileges for root user are {diff_policies}"

-diff_policies = set(editor_platform_policy_privileges).difference(
-    set(all_user_platform_policy_privilges)
-)
-assert "MANAGE_POLICIES" not in all_user_platform_policy_privilges
-assert (
-    len(diff_policies) == 0
-), f"Missing privileges for all user policies are {diff_policies}"
+# All users privileges checks
+assert "MANAGE_POLICIES" not in all_user_platform_policy_privileges
+assert "MANAGE_USERS_AND_GROUPS" not in all_user_platform_policy_privileges
+assert "MANAGE_SECRETS" not in all_user_platform_policy_privileges
+assert "MANAGE_USER_CREDENTIALS" not in all_user_platform_policy_privileges
+assert "MANAGE_ACCESS_TOKENS" not in all_user_platform_policy_privileges
+assert "EDIT_ENTITY" not in all_user_platform_policy_privileges
+assert "DELETE_ENTITY" not in all_user_platform_policy_privileges
+
+# Editor checks
+assert "MANAGE_POLICIES" not in editor_platform_policy_privileges
+assert "MANAGE_USERS_AND_GROUPS" not in editor_platform_policy_privileges
+assert "MANAGE_SECRETS" not in editor_platform_policy_privileges
+assert "MANAGE_USER_CREDENTIALS" not in editor_platform_policy_privileges
+assert "MANAGE_ACCESS_TOKENS" not in editor_platform_policy_privileges
+# These don't prevent a user from modifying entities they are an asset owner of, i.e. their own profile info
+assert "EDIT_CONTACT_INFO" not in editor_platform_policy_privileges
+assert "EDIT_USER_PROFILE" not in editor_platform_policy_privileges
+assert "EDIT_ENTITY_OWNERS" not in editor_platform_policy_privileges
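Review note: the new checks are a deny-list per policy, where each listed privilege must be absent from the policy's `privileges` array. A table-driven sketch of the same checks is shown below, purely as an illustration and not as a proposed change to the script. The privilege names are copied from the assertions above; the helper name and the sample `granted` lists are hypothetical.

```python
# Deny-lists taken from the assertions in check_policies.py.
ALL_USERS_FORBIDDEN = {
    "MANAGE_POLICIES",
    "MANAGE_USERS_AND_GROUPS",
    "MANAGE_SECRETS",
    "MANAGE_USER_CREDENTIALS",
    "MANAGE_ACCESS_TOKENS",
    "EDIT_ENTITY",
    "DELETE_ENTITY",
}
EDITOR_FORBIDDEN = {
    "MANAGE_POLICIES",
    "MANAGE_USERS_AND_GROUPS",
    "MANAGE_SECRETS",
    "MANAGE_USER_CREDENTIALS",
    "MANAGE_ACCESS_TOKENS",
    "EDIT_CONTACT_INFO",
    "EDIT_USER_PROFILE",
    "EDIT_ENTITY_OWNERS",
}


def forbidden_privileges(granted, forbidden):
    """Return the sorted privileges that are granted but must not be."""
    return sorted(set(granted) & set(forbidden))


if __name__ == "__main__":
    # Hypothetical privilege list standing in for policy["info"]["privileges"].
    granted = ["VIEW_ANALYTICS", "MANAGE_SECRETS"]
    print(forbidden_privileges(granted, ALL_USERS_FORBIDDEN))
```

Reporting the whole offending set in one message would make a failing CI run easier to read than the first failing `assert` alone.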
18 changes: 18 additions & 0 deletions .github/scripts/check_python_package.py
@@ -0,0 +1,18 @@
+import setuptools
+
+folders = ["./smoke-test/tests"]
+
+for folder in folders:
+    print(f"Checking folder {folder}")
+    a = [i for i in setuptools.find_packages(folder) if "cypress" not in i]
+    b = [i for i in setuptools.find_namespace_packages(folder) if "cypress" not in i]
+
+    in_a_not_b = set(a) - set(b)
+    in_b_not_a = set(b) - set(a)
+
+    assert (
+        len(in_a_not_b) == 0
+    ), f"Found packages in {folder} that are not in namespace packages: {in_a_not_b}"
+    assert (
+        len(in_b_not_a) == 0
+    ), f"Found namespace packages in {folder} that are not in packages: {in_b_not_a}"
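Review note: `find_packages` only reports directories that contain an `__init__.py`, while `find_namespace_packages` also reports directories without one, so the script in effect fails when a test directory is missing its `__init__.py`. The standard-library sketch below approximates that condition without `setuptools`. It is a simplification, not a reimplementation of either finder, and the function name is hypothetical.

```python
import os
import tempfile


def dirs_missing_init(root: str) -> list:
    """List sub-directories of root (as dotted names) lacking __init__.py.

    Directories whose name contains "cypress" are pruned, loosely
    mirroring the filter in check_python_package.py.
    """
    missing = []
    for dirpath, dirnames, filenames in os.walk(root):
        # Prune in place so os.walk does not descend into skipped directories.
        dirnames[:] = sorted(d for d in dirnames if "cypress" not in d)
        if dirpath == root:
            continue
        if "__init__.py" not in filenames:
            rel = os.path.relpath(dirpath, root)
            missing.append(rel.replace(os.sep, "."))
    return sorted(missing)


if __name__ == "__main__":
    # Build a throwaway tree: pkg_a is a regular package, pkg_b is not.
    with tempfile.TemporaryDirectory() as root:
        os.makedirs(os.path.join(root, "pkg_a"))
        open(os.path.join(root, "pkg_a", "__init__.py"), "w").close()
        os.makedirs(os.path.join(root, "pkg_b"))
        print(dirs_missing_init(root))
```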
10 changes: 5 additions & 5 deletions .github/scripts/docker_helpers.sh
@@ -12,19 +12,19 @@ export SHORT_SHA=$(get_short_sha)
 echo "SHORT_SHA: $SHORT_SHA"

 function get_tag {
-    echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${MAIN_BRANCH_TAG}\,${SHORT_SHA},g" -e 's,refs/tags/,,g' -e 's,refs/pull/\([0-9]*\).*,pr\1,g')
+    echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${MAIN_BRANCH_TAG},g" -e 's,refs/tags/,,g' -e 's,refs/pull/\([0-9]*\).*,pr\1,g'),${SHORT_SHA}
 }

 function get_tag_slim {
-    echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${MAIN_BRANCH_TAG}-slim\,${SHORT_SHA}-slim,g" -e 's,refs/tags/,,g' -e 's,refs/pull/\([0-9]*\).*,pr\1-slim,g')
+    echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${MAIN_BRANCH_TAG}-slim,g" -e 's,refs/tags/,,g' -e 's,refs/pull/\([0-9]*\).*,pr\1-slim,g'),${SHORT_SHA}-slim
 }

 function get_tag_full {
-    echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${MAIN_BRANCH_TAG}-full\,${SHORT_SHA}-full,g" -e 's,refs/tags/,,g' -e 's,refs/pull/\([0-9]*\).*,pr\1-full,g')
+    echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${MAIN_BRANCH_TAG}-full,g" -e 's,refs/tags/,,g' -e 's,refs/pull/\([0-9]*\).*,pr\1-full,g'),${SHORT_SHA}-full
 }

 function get_python_docker_release_v {
-    echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},0.0.0+docker.${SHORT_SHA},g" -e 's,refs/tags/v\(.*\),\1+docker,g' -e 's,refs/pull/\([0-9]*\).*,0.0.0+docker.pr\1,g')
+    echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},1!0.0.0+docker.${SHORT_SHA},g" -e 's,refs/tags/v\(.*\),1!\1+docker,g' -e 's,refs/pull/\([0-9]*\).*,1!0.0.0+docker.pr\1,g')
 }

 function get_unique_tag {
@@ -37,4 +37,4 @@ function get_unique_tag_slim {

 function get_unique_tag_full {
     echo $(echo ${GITHUB_REF} | sed -e "s,refs/heads/${MAIN_BRANCH},${SHORT_SHA}-full,g" -e 's,refs/tags/,,g' -e 's,refs/pull/\([0-9]*\).*,pr\1-full,g')
-}
+}
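Review note: with this change `get_tag` appends `,${SHORT_SHA}` for every ref type, where previously only the main branch received a SHA tag, and `get_python_docker_release_v` prefixes the version with `1!`, which is a PEP 440 version epoch. The Python sketch below mirrors the new `sed` substitutions to show what each ref would produce. The default values `master` and `head` for `MAIN_BRANCH` and `MAIN_BRANCH_TAG` are assumptions, since their definitions are outside the hunk, and the sample refs and SHA are made up.

```python
import re


def get_tag(github_ref, short_sha, main_branch="master", main_branch_tag="head"):
    """Mirror the new get_tag: map the ref to a tag, then append the short SHA."""
    tag = github_ref.replace(f"refs/heads/{main_branch}", main_branch_tag)
    tag = tag.replace("refs/tags/", "")
    tag = re.sub(r"refs/pull/([0-9]*).*", r"pr\1", tag)
    return f"{tag},{short_sha}"


def get_python_docker_release_v(github_ref, short_sha, main_branch="master"):
    """Mirror the new get_python_docker_release_v, including the '1!' epoch."""
    version = github_ref.replace(
        f"refs/heads/{main_branch}", f"1!0.0.0+docker.{short_sha}"
    )
    version = re.sub(r"refs/tags/v(.*)", r"1!\1+docker", version)
    version = re.sub(r"refs/pull/([0-9]*).*", r"1!0.0.0+docker.pr\1", version)
    return version


if __name__ == "__main__":
    # Hypothetical refs and SHA, for illustration only.
    for ref in ("refs/heads/master", "refs/tags/v0.13.2", "refs/pull/70/merge"):
        print(ref, get_tag(ref, "abc1234"), get_python_docker_release_v(ref, "abc1234"))
```

Because the result now contains a comma for tag and pull-request refs as well, any consumer that expected a single tag from `get_tag` needs to handle a list; the `Single Tag` step in `docker-custom-build-and-push` appears to serve that purpose.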
27 changes: 19 additions & 8 deletions .github/workflows/airflow-plugin.yml
@@ -12,7 +12,7 @@ on:
     branches:
       - "**"
     paths:
-      - ".github/**"
+      - ".github/workflows/airflow-plugin.yml"
       - "metadata-ingestion-modules/airflow-plugin/**"
       - "metadata-ingestion/**"
       - "metadata-models/**"
@@ -32,32 +32,42 @@ jobs:
     strategy:
       matrix:
         include:
+          # Note: this should be kept in sync with tox.ini.
           - python-version: "3.8"
             extra_pip_requirements: "apache-airflow~=2.1.4"
             extra_pip_extras: plugin-v1
           - python-version: "3.8"
             extra_pip_requirements: "apache-airflow~=2.2.4"
             extra_pip_extras: plugin-v1
           - python-version: "3.10"
-            extra_pip_requirements: "apache-airflow~=2.4.0"
+            extra_pip_requirements: "apache-airflow==2.4.3"
+            extra_pip_extras: plugin-v2,test-airflow24
+          - python-version: "3.10"
+            extra_pip_requirements: 'apache-airflow==2.6.3 -c https://raw.githubusercontent.com/apache/airflow/constraints-2.6.3/constraints-3.10.txt'
             extra_pip_extras: plugin-v2
           - python-version: "3.10"
-            extra_pip_requirements: "apache-airflow~=2.6.0"
+            extra_pip_requirements: 'apache-airflow==2.7.3 -c https://raw.githubusercontent.com/apache/airflow/constraints-2.7.3/constraints-3.10.txt'
             extra_pip_extras: plugin-v2
           - python-version: "3.10"
-            extra_pip_requirements: "apache-airflow>=2.7.0"
+            extra_pip_requirements: 'apache-airflow==2.8.1 -c https://raw.githubusercontent.com/apache/airflow/constraints-2.8.1/constraints-3.10.txt'
             extra_pip_extras: plugin-v2
       fail-fast: false
     steps:
-      - uses: actions/checkout@v3
+      - name: Set up JDK 17
+        uses: actions/setup-java@v3
+        with:
+          distribution: "zulu"
+          java-version: 17
+      - uses: gradle/gradle-build-action@v2
+      - uses: acryldata/sane-checkout-action@v3
       - uses: actions/setup-python@v4
         with:
           python-version: ${{ matrix.python-version }}
           cache: "pip"
       - name: Install dependencies
         run: ./metadata-ingestion/scripts/install_deps.sh
       - name: Install airflow package and test (extras ${{ matrix.extra_pip_requirements }})
-        run: ./gradlew -Pextra_pip_requirements='${{ matrix.extra_pip_requirements }}' -Pextra_pip_extras='${{ matrix.extra_pip_extras }}' :metadata-ingestion-modules:airflow-plugin:lint :metadata-ingestion-modules:airflow-plugin:testQuick
+        run: ./gradlew -Pextra_pip_requirements='${{ matrix.extra_pip_requirements }}' -Pextra_pip_extras='${{ matrix.extra_pip_extras }}' :metadata-ingestion-modules:airflow-plugin:build
       - name: pip freeze show list installed
         if: always()
         run: source metadata-ingestion-modules/airflow-plugin/venv/bin/activate && pip freeze
@@ -69,15 +79,16 @@ jobs:
             **/build/reports/tests/test/**
             **/build/test-results/test/**
             **/junit.*.xml
+            !**/binary/**
       - name: Upload coverage to Codecov
         if: always()
         uses: codecov/codecov-action@v3
         with:
           token: ${{ secrets.CODECOV_TOKEN }}
           directory: .
           fail_ci_if_error: false
-          flags: airflow-${{ matrix.python-version }}-${{ matrix.extraPythonRequirement }}
-          name: pytest-airflow
+          flags: airflow,airflow-${{ matrix.extra_pip_extras }}
+          name: pytest-airflow-${{ matrix.python-version }}-${{ matrix.extra_pip_requirements }}
           verbose: true

   event-file: