New view for Data Dict JSON to support chunk load #34787

ajeety4 · 2024-06-18T16:41:33Z

Product Description

Technical Summary

This is one of the sub-tasks for chunk loading. It creates a new Django view that loads case properties in chunks, using skip and limit query params (limit defaults to 500).
If no case type is provided, the view returns all case types with their respective property counts.
When a case type is given, the view returns the properties for that case type, paged by the skip and limit query params.
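
As a rough illustration of the skip/limit behaviour described above, here is a minimal, framework-free sketch. The helper names `parse_skip_limit` and `get_chunk` are hypothetical and are not taken from the PR; the fallback to the first chunk on malformed input is also an assumption, not confirmed behaviour of the view.

```python
DEFAULT_LIMIT = 500  # default chunk size stated in the summary above


def parse_skip_limit(query_params, default_limit=DEFAULT_LIMIT):
    """Read ``skip`` and ``limit`` from a dict of query params (hypothetical helper)."""
    try:
        skip = int(query_params.get("skip", 0))
        limit = int(query_params.get("limit", default_limit))
    except (TypeError, ValueError):
        # Assumption: malformed values fall back to the first chunk.
        return 0, default_limit
    # Assumption: negative skips clamp to 0 and non-positive limits use the default.
    return max(skip, 0), (limit if limit > 0 else default_limit)


def get_chunk(items, skip, limit):
    """Return the slice of ``items`` that a single request would cover."""
    return items[skip:skip + limit]
```

A client would request consecutive chunks by increasing skip by limit until a chunk comes back shorter than limit.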

See Technical Spec for more details.

Once the related changes are complete, the current URLs for fetching case types and case properties will be migrated to this new view and the old view will be removed.

See the JIRA ticket and its linked tickets for the related chunk-loading tasks.
The related tasks mainly update the JavaScript model and template to support chunk loading.

Feature Flag

N/A. Available on all plans with Data Dictionary.

Safety Assurance

Safety story

Creates a new view that is not yet used, so the changes are very safe.
The new view has been tested locally.

Automated test coverage

New test cases added

QA Plan

None

Rollback instructions

This PR can be reverted after deploy with no further considerations

Labels & Review

Risk label is set correctly
The set of people pinged as reviewers is appropriate for the level of risk of the change

sentry-io · 2024-06-18T16:41:52Z

🔍 Existing Issues For Review

Your pull request is modifying functions with the following pre-existing issues:

📄 File: corehq/apps/data_dictionary/util.py

Function	Unhandled Issue
`get_used_props_by_case_type`	ESError: ConnectionTimeout caused by - ReadTimeoutError(HTTPConnectionPool(host='10.202.40.159', port=9200... ... `Event Count:` 24

📄 File: corehq/apps/data_dictionary/views.py

Function	Unhandled Issue
`data_dictionary_json`	ESError: ConnectionTimeout caused by - ReadTimeoutError(HTTPConnectionPool(host='10.202.41.10', port=9200)... ... `Event Count:` 9

---


zandre-eng

Overall looks good, just a few comments regarding the view.

zandre-eng · 2024-06-19T10:03:56Z

corehq/apps/data_dictionary/views.py

+            "properties_count": case_type.properties_count,
+        }
+
+    if case_type_name:


Nit: This function is quite long. Could we move the code related to fetching data for a single case type into its own separate function? This could even be split up into two separate views.

I was in favour of this (moving it into a different view), but decided against it because of the amount of common code. I was keen to see what others would say during review.
I am going to split them.

+1 for splitting function

Yeah, I would also advocate splitting. I would make a few changes:

Split load_fhir_resource_mappings() into two functions: One for case types and one for case properties.

Pull _get_case_data() out. Move into it some of the code that would become duplicated. Maybe something like:
def get_case_type_data(domain, case_type):
    case_type_app_module_count = get_case_type_app_module_count(domain)
    used_props_by_case_type = get_used_props_by_case_type(domain)
    module_count = case_type_app_module_count.get(case_type.name, 0)
    used_props = used_props_by_case_type.get(case_type.name, [])
    data = {
        "name": case_type.name,
        "fhir_resource_type": None,
        "is_deprecated": case_type.is_deprecated,
        "module_count": module_count,
        "is_safe_to_delete": len(used_props) == 0,
        "properties_count": case_type.properties_count,
    }
    if toggles.FHIR_INTEGRATION.enabled(domain):
        resource_type_name_by_case_type = get_fhir_resource_type_map(domain)
        data["fhir_resource_type"] = resource_type_name_by_case_type.get(case_type)
    return data

Split the view at the else: into two views: One if the case type is given, and one to list the case types.

Addressed in 384263b

corehq/apps/data_dictionary/views.py

zandre-eng · 2024-06-19T10:14:49Z

corehq/apps/data_dictionary/views.py

+        queryset = CaseType.objects.filter(domain=domain).annotate(properties_count=Count('property'))
+        if not request.GET.get("load_deprecated_case_types", False) == "true":
+            queryset = queryset.filter(is_deprecated=False)
+        case_types_data = [_get_case_data(case_type) for case_type in queryset]


Do we need the full case type JSON when just retrieving the list of case types and their property counts? I imagine properties like is_safe_to_delete and module_count we only need when loading the full set of data for a case type (since this is when a user will be able to delete the case type).

Great point. I went with the expected response in the tech spec but I agree on this. I can check on this and get rid of things that we do not need.

After thinking about and looking into this more, I think that logically all the case type data, including is_safe_to_delete and module_count, should be part of the Case Type API.
Instead, we should consider not returning these case type details from the case properties endpoint, and just return the case type name, _links, properties_count and the grouped properties.
Does that make sense? @zandre-eng @kaapstorm

I'm okay with what you've done in f91eaf5 -- I'm feeling quite pragmatic about this. The viewmodel is the only thing that will be consuming this API, and as long as it has what it needs before it needs it, I think that works.
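
To make the trimmed case properties response discussed in this thread more concrete, here is a hedged sketch of how such a payload could be assembled. The top-level keys (name, properties_count, _links, groups) come from the discussion and the diff; the function name `build_case_properties_response`, the input format, and the inner shape of each group are assumptions for illustration only.

```python
from itertools import groupby


def build_case_properties_response(case_type_name, properties_count, properties, links):
    """Assemble a response dict (hypothetical helper).

    ``properties`` is a list of (group_name, property_name) tuples that is
    already ordered by group, so consecutive items share a group.
    """
    groups = [
        {
            "name": group_name,
            # Assumption: each property is represented by its name only.
            "properties": [{"name": prop_name} for _, prop_name in group],
        }
        for group_name, group in groupby(properties, key=lambda item: item[0])
    ]
    return {
        "name": case_type_name,
        "properties_count": properties_count,
        "_links": links,
        "groups": groups,
    }
```

Note that `groupby` only merges adjacent items, so the input has to be sorted by group first, which is what ordering the queryset by group would provide.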

zandre-eng · 2024-06-19T10:18:24Z

corehq/apps/data_dictionary/tests/test_views.py

-from corehq import privileges
-from corehq.util.test_utils import flag_enabled
+# TODO Remove this once we migrate to the new view
+urlpatterns.insert(0, url(r"^json_v2/$", data_dictionary_json_v2, name='data_dictionary_json_v2'))


Instead of dynamically inserting these URL patterns for the tests, would it make sense to rather just add these to the URLs file? They already have a different URL path to the current DD JSON endpoint, and we can always switch them out when we remove the old views.

Sure, that can be done. My only reservation was adding new URLs; however, since we are going to remove them shortly, I am fine with adding these to the URLs file.

Addressed in 384263b

zandre-eng · 2024-06-19T10:20:44Z

corehq/apps/data_dictionary/views.py

+                })
+            case_type_data["groups"].append(group_data)
+
+        return JsonResponse(case_type_data)


Should the geo_case_property not also be returned here? After the case property data has been loaded we use the property to identify which ones are used in the Geospatial feature.

Good question. I am not entirely sure, but I think this is probably not required, as Norman has completed the changes on the JS side.

Charl1996 · 2024-06-20T07:35:32Z

corehq/apps/data_dictionary/views.py

+        current_url = request.build_absolute_uri()
+        links = {"self": update_url_query_params(current_url, {"skip": skip, "limit": limit})}
+        if skip:
+            links["previous"] = update_url_query_params(
+                current_url,
+                {"skip": max(skip - limit, 0), "limit": limit}
+            )
+        if case_type_data["properties_count"] > (skip + limit):
+            links["next"] = update_url_query_params(current_url, {"skip": skip + limit, "limit": limit})
+        case_type_data["_links"] = links


This also feels like it could be split out into a separate method, maybe something like update_url_params or so.

Addressed in 1339a9d
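
A minimal sketch of what extracting the link-building logic from the hunk above could look like. `build_pagination_links` is a hypothetical name, and the `update_url_query_params` defined here is a stand-in written with the standard library, not the PR's actual util; the example URL in the usage note is also made up.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def update_url_query_params(url, params):
    """Stand-in for the PR's helper: set or overwrite query params on a URL."""
    parts = urlsplit(url)
    query = dict(parse_qsl(parts.query, keep_blank_values=True))
    query.update({key: str(value) for key, value in params.items()})
    return urlunsplit(parts._replace(query=urlencode(query)))


def build_pagination_links(current_url, skip, limit, total_count):
    """Return self/previous/next links for one chunk (hypothetical helper)."""
    links = {
        "self": update_url_query_params(current_url, {"skip": skip, "limit": limit}),
    }
    if skip:
        # Step back one chunk, never below the start of the list.
        links["previous"] = update_url_query_params(
            current_url, {"skip": max(skip - limit, 0), "limit": limit}
        )
    if total_count > skip + limit:
        # More items remain after this chunk.
        links["next"] = update_url_query_params(
            current_url, {"skip": skip + limit, "limit": limit}
        )
    return links
```

For example, calling `build_pagination_links("https://example.com/json_v2/person/", 500, 500, 1200)` would yield all three links, while the first chunk has no "previous" link and the last chunk has no "next" link.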

mkangia

Just wanted to suggest a slightly different approach to the PR

refactor the current data_dictionary_json to extract out pieces, specifically things that will stay the same
copy exactly the same view into a v2
modify the bits that need to change

This would make it easier to identify

what has stayed the same or what code is not new
what has changed and is also relevant for review

As in the current state, I believe the PR is really hard to read and review.

P.S: You can ignore this if a lot has changed from the previous view.

ajeety4 · 2024-06-24T16:31:50Z

Good suggestion in terms of making the review easier.
However, as you mentioned, there are some changes to the functionality and some refactoring, and given the additional review comments, I am going to keep it as it is.

mkangia · 2024-06-24T17:04:56Z

Hey @ajeety4
I would still ask you to reconsider how this is currently done. Even if you do not take the approach I suggested, I'd recommend going through the commits yourself to see what changes would make this easier to review. That would not only make review easier but also make it possible to clearly understand the approach, reveal changes from how data dictionary worked previously and, if there are any, highlight concerns. As of now, this all looks like a new feature altogether.

Though if others are happy to review the way it is currently, I am not going to block on this.

ajeety4 · 2024-06-25T07:07:01Z

Hi @mkangia,
I agree that this does look like a new feature. Looking at the points highlighted, I feel it is worth cleaning up.
I intend to start from a copy of the old view and make the changes on top of that so they are clear.
@Charl1996 @kaapstorm @zandre-eng, just wanted to check that it is okay to go ahead with the new commits?

kaapstorm

I'm not sure whether all the comments have been resolved, but I'm happy with this.

ajeety4 · 2024-06-27T16:02:51Z

Hello team, as mentioned, I have reworked the commits and force-pushed. The first commit creates a copy of the existing view, and the changes are made on top of it, which makes them easier to follow.
The resulting code is the same, though.
@kaapstorm you might need to rebase.

Apologies for any inconvenience.

mkangia · 2024-06-28T15:53:51Z

Hey @ajeety4

Just checking that this should be reviewed from the first commit again?

ajeety4 · 2024-07-01T08:06:49Z

Hey @mkangia, that is correct!

kaapstorm · 2024-07-01T08:41:14Z

btw, this branch is currently deployed to Staging @ajeety4 @mkangia and is being tested by QA as part of testing data dictionary changes.

Rebased branch: ay+ze+nh/dd_chunked
QA ticket: QA-6727

kaapstorm · 2024-07-01T16:12:40Z

corehq/apps/data_dictionary/views.py

+    properties_queryset = CaseProperty.objects.select_related('group').filter(case_type=case_type)
+    properties_queryset = properties_queryset.order_by('group_id', 'index', 'pk')[skip:skip + limit]
+    properties_queryset = properties_queryset.prefetch_related(
+        Prefetch('allowed_values', queryset=CasePropertyAllowedValue.objects.order_by('allowed_value'))
+    )


Perhaps very subjective, but I find blocks like this to be more readable when they are inside parentheses instead of reassigned over several lines. e.g.

Suggested change

-    properties_queryset = CaseProperty.objects.select_related('group').filter(case_type=case_type)
-    properties_queryset = properties_queryset.order_by('group_id', 'index', 'pk')[skip:skip + limit]
-    properties_queryset = properties_queryset.prefetch_related(
-        Prefetch('allowed_values', queryset=CasePropertyAllowedValue.objects.order_by('allowed_value'))
-    )
+    properties_queryset = (
+        CaseProperty.objects
+        .select_related('group')
+        .filter(case_type=case_type)
+        .order_by('group_id', 'index', 'pk')[skip:skip + limit]
+        .prefetch_related(Prefetch(
+            'allowed_values',
+            queryset=CasePropertyAllowedValue.objects.order_by('allowed_value')
+        ))
+    )

I'm okay with leaving this as it is though.

I too feel this is more readable. Thanks Norman. Addressed in 606be7c

zandre-eng

Changes look good from my side, great work!

fixes current url for case type data dict json

68b2362

ajeety4 force-pushed the ay/data-dict-chunk-load branch from da777c5 to bac90de Compare June 18, 2024 16:52

ajeety4 marked this pull request as ready for review June 18, 2024 16:59

ajeety4 requested review from esoergel, orangejenny and zandre-eng as code owners June 18, 2024 16:59

ajeety4 added the product/feature-flag label (Change will only affect users who have a specific feature flag enabled) Jun 18, 2024

ajeety4 requested review from kaapstorm, mkangia and Charl1996 June 18, 2024 16:59

zandre-eng reviewed Jun 19, 2024

Charl1996 reviewed Jun 20, 2024

mkangia reviewed Jun 20, 2024

kaapstorm reviewed Jun 23, 2024

kaapstorm reviewed Jun 25, 2024

kaapstorm reviewed Jun 25, 2024

kaapstorm reviewed Jun 25, 2024

kaapstorm approved these changes Jun 25, 2024

ajeety4 added 8 commits June 27, 2024 18:41

creates a copy of existing view for better seeing changes

0215ce7

simple split for new v2 view

384263b

update response data for case types view

fded18a

test cases for new case types API

d0df537

nit: rename test_view to test_views

59eb921

updates response for new case properties view

c9bb8c8

refactor grouping logic for readibility and use groups from groupby

b67fec7

adds skip limit for case properties API

6ebc163

ajeety4 added 8 commits June 27, 2024 20:15

util for adding query params to url with test cases

e7d7f3c

adds pagination links

1339a9d

adds saved index to properties

9f3423e

test cases for case properties view

7373e6c

splits load_fhir_resource_mappings into two fuctions

c0cb6bf

nit: corrects TODO plus move test class

eb66360

nit: code placement plus use get for dict

9037e5e

nit: isort on related files

a97f285

ajeety4 force-pushed the ay/data-dict-chunk-load branch from f91eaf5 to a97f285 Compare June 27, 2024 15:58

kaapstorm mentioned this pull request Jun 28, 2024

Load data dictionary case properties in chunks #34827

Merged

3 tasks

kaapstorm approved these changes Jul 1, 2024

kaapstorm reviewed Jul 1, 2024

ajeety4 force-pushed the ay/data-dict-chunk-load branch 2 times, most recently from 2cd5ea9 to a97f285 Compare July 3, 2024 07:00

ajeety4 added 4 commits July 3, 2024 12:33

sends empty default group if no properties exist

69c5ea5

nit:readibility

606be7c

refactor get_used_props_by_case_type to accept case type

876d78e

Merge branch 'master' into ay/data-dict-chunk-load

cd02d6b

kaapstorm approved these changes Jul 3, 2024

zandre-eng approved these changes Jul 3, 2024

zandre-eng added the product/invisible label (Change has no end-user visible impact) Jul 3, 2024

ajeety4 added the product/all-users-all-environments label (Change impacts all users on all environments) and removed the product/feature-flag label (Change will only affect users who have a specific feature flag enabled) Jul 3, 2024

ajeety4 merged commit 8232c9d into master Jul 3, 2024
13 checks passed

ajeety4 deleted the ay/data-dict-chunk-load branch July 3, 2024 12:45
