feat: reformat trip and shape dist validator #1676

cka-y · 2024-02-18T23:45:28Z

Summary:
Resolves #1613 and #1611 by implementing a distance threshold for the trip_distance_exceeds_shape_distance notice.

Expected Behavior:

Introduces a 11.1m threshold for trip_distance_exceeds_shape_distance, triggering an ERROR for distances $\geq 11.1m$.
Creates a new notice, trip_distance_exceeds_shape_distance_below_threshold, with WARNING severity for distances $\lt11.1m$.
This update streamlines the cross-validation of trips against shapes by adhering to the GTFS specification. This approach has resulted in minimal changes to validation outcomes as evidenced here. Any minor discrepancies arise from instances where the feed does not comply with the specification, often leading to ERROR level notices such as decreasing_or_equal_stop_time_distance and decreasing_shape_distance.
- By capitalizing on the expectation that both stop-time and shape distances should incrementally increase, the validation process is optimized. Instead of evaluating all points, we now only assess the last one, which changes our processing time complexity from linear to constant for these elements.

Empirical Performance Comparison:
Considering $n$ as the number of trips, $m$ as the number of stop-times, and $k$ as the number of shapes, the complexity in the worst-case scenario:

For the master branch is $\Omega(k^2nm)$
For the feature branch (feat/1613) is $\Omega(nk)$

Statistical Performance Comparison:
The performance improvements are depicted in the graph below. The datasets analyzed are from our catalog, with zipped file sizes of at least 1MB. Sizes have been normalized for a more meaningful comparison of slopes. The performance slope of the feature branch is significantly lower, decreasing from approximately 37 to approximately 25, indicating enhanced efficiency.

Please make sure these boxes are checked before submitting your pull request - thanks!

Run the unit tests with gradle test to make sure you didn't break anything
Add or update any needed documentation to the repo
Format the title like "feat: [new feature short description]". Title must follow the Conventional Commit Specification(https://www.conventionalcommits.org/en/v1.0.0/).
Linked all relevant issues
Include screenshot(s) showing how this pull request works and fixes the issue(s)

github-actions · 2024-02-19T02:25:56Z

✅ Rule acceptance tests passed.
New Errors: 0 out of 1485 datasets (~0%) are invalid due to code change, which is less than the provided threshold of 1%.
Dropped Errors: 1 out of 1485 datasets (~0%) are invalid due to code change, which is less than the provided threshold of 1%.
New Warnings: 0 out of 1485 datasets (~0%) are invalid due to code change, which is less than the provided threshold of 1%.
Dropped Warnings: 0 out of 1485 datasets (~0%) are invalid due to code change, which is less than the provided threshold of 1%.
1 out of 1486 sources (~0 %) are corrupted.
Corrupted sources:
us-district-of-columbia-dc-circulator-gtfs-486
Commit: 89d1cc1
Download the full acceptance test report here (report will disappear after 90 days).
✅ Rule acceptance tests passed.

github-actions · 2024-02-19T21:57:00Z

❌ Invalid acceptance test.
New Errors: 0 out of 1485 datasets (~0%) are invalid due to code change, which is less than the provided threshold of 1%.
Dropped Errors: 17 out of 1485 datasets (~1%) are invalid due to code change, which is above the provided threshold of 1%.
New Warnings: 25 out of 1485 datasets (~2%) are invalid due to code change, which is above the provided threshold of 1%.
Dropped Warnings: 0 out of 1485 datasets (~0%) are invalid due to code change, which is less than the provided threshold of 1%.
1 out of 1486 sources (~0 %) are corrupted.
Corrupted sources:
fi-etela-pohjanmaa-komia-liikenne-gtfs-1255
Commit: bb18676
Download the full acceptance test report here (report will disappear after 90 days).
❌ Invalid acceptance test.

github-actions · 2024-03-06T02:15:07Z

❌ Invalid acceptance test.
New Errors: 0 out of 1520 datasets (~0%) are invalid due to code change, which is less than the provided threshold of 1%.
Dropped Errors: 36 out of 1520 datasets (~2%) are invalid due to code change, which is above the provided threshold of 1%.
New Warnings: 82 out of 1520 datasets (~5%) are invalid due to code change, which is above the provided threshold of 1%.
Dropped Warnings: 0 out of 1520 datasets (~0%) are invalid due to code change, which is less than the provided threshold of 1%.
0 out of 1520 sources (~0 %) are corrupted.
Commit: 97dc096
Download the full acceptance test report here (report will disappear after 90 days).
❌ Invalid acceptance test.

main/src/main/java/org/mobilitydata/gtfsvalidator/validator/TripAndShapeDistanceValidator.java

emmambd

Notice description + name minor revisions.

main/src/main/java/org/mobilitydata/gtfsvalidator/validator/TripAndShapeDistanceValidator.java

github-actions · 2024-03-08T21:51:48Z

❌ Invalid acceptance test.
New Errors: 0 out of 1520 datasets (~0%) are invalid due to code change, which is less than the provided threshold of 1%.
Dropped Errors: 37 out of 1520 datasets (~2%) are invalid due to code change, which is above the provided threshold of 1%.
New Warnings: 82 out of 1520 datasets (~5%) are invalid due to code change, which is above the provided threshold of 1%.
Dropped Warnings: 0 out of 1520 datasets (~0%) are invalid due to code change, which is less than the provided threshold of 1%.
0 out of 1520 sources (~0 %) are corrupted.
Commit: b9deab7
Download the full acceptance test report here (report will disappear after 90 days).
❌ Invalid acceptance test.

github-actions · 2024-03-11T15:24:45Z

❌ Invalid acceptance test.
New Errors: 0 out of 1520 datasets (~0%) are invalid due to code change, which is less than the provided threshold of 1%.
Dropped Errors: 37 out of 1520 datasets (~2%) are invalid due to code change, which is above the provided threshold of 1%.
New Warnings: 91 out of 1520 datasets (~6%) are invalid due to code change, which is above the provided threshold of 1%.
Dropped Warnings: 0 out of 1520 datasets (~0%) are invalid due to code change, which is less than the provided threshold of 1%.
0 out of 1520 sources (~0 %) are corrupted.
Commit: cb6887b
Download the full acceptance test report here (report will disappear after 90 days).
❌ Invalid acceptance test.

jcpitre

Impressive, again!

github-actions · 2024-03-11T18:16:25Z

❌ Invalid acceptance test.
New Errors: 0 out of 1520 datasets (~0%) are invalid due to code change, which is less than the provided threshold of 1%.
Dropped Errors: 37 out of 1520 datasets (~2%) are invalid due to code change, which is above the provided threshold of 1%.
New Warnings: 91 out of 1520 datasets (~6%) are invalid due to code change, which is above the provided threshold of 1%.
Dropped Warnings: 0 out of 1520 datasets (~0%) are invalid due to code change, which is less than the provided threshold of 1%.
0 out of 1520 sources (~0 %) are corrupted.
Commit: ff5278a
Download the full acceptance test report here (report will disappear after 90 days).
❌ Invalid acceptance test.

MobilityData deleted a comment from github-actions bot Feb 19, 2024

emmambd mentioned this pull request Feb 21, 2024

Implement threshold for triggering trip_distance_exceeds_shape_distance #1613

Closed

emmambd linked an issue Feb 21, 2024 that may be closed by this pull request

Performance issues with trip_distance_exceeds_shape_distance #1611

Closed

cka-y force-pushed the feat/1613 branch from 1b99140 to 3d8d72b Compare March 5, 2024 23:08

cka-y added 7 commits March 5, 2024 18:10

feat: reformat trip and shape dist validator

e5f100e

fix: max error

acf506a

fix: max error

ac64c10

feat: added threshold of 1.11m

8f9ac7c

fix: validator error

faf3fef

fix: updated threshold

ede62be

fix: JF

8b5e21e

cka-y force-pushed the feat/1613 branch from 585a787 to 8b5e21e Compare March 6, 2024 01:31

cka-y marked this pull request as ready for review March 6, 2024 13:27

emmambd self-requested a review March 6, 2024 13:58

jcpitre reviewed Mar 6, 2024

View reviewed changes

main/src/main/java/org/mobilitydata/gtfsvalidator/validator/TripAndShapeDistanceValidator.java Outdated Show resolved Hide resolved

jcpitre reviewed Mar 6, 2024

View reviewed changes

main/src/main/java/org/mobilitydata/gtfsvalidator/validator/TripAndShapeDistanceValidator.java Show resolved Hide resolved

jcpitre reviewed Mar 6, 2024

View reviewed changes

main/src/main/java/org/mobilitydata/gtfsvalidator/validator/TripAndShapeDistanceValidator.java Show resolved Hide resolved

emmambd mentioned this pull request Mar 6, 2024

Analysis improvement: new Github action to trigger comparison between PR acceptance test and past release stats #1692

Open

emmambd reviewed Mar 8, 2024

View reviewed changes

main/src/main/java/org/mobilitydata/gtfsvalidator/validator/TripAndShapeDistanceValidator.java Outdated Show resolved Hide resolved

fix: PR comments

7d00e10

fix: pr comments

4bba9de

jcpitre approved these changes Mar 11, 2024

View reviewed changes

Merge branch 'master' into feat/1613

f252a94

cka-y merged commit 2f9fccd into master Mar 11, 2024
332 of 333 checks passed

cka-y deleted the feat/1613 branch March 11, 2024 18:29

emmambd mentioned this pull request Mar 13, 2024

Limit the number of shapes trip_distance_exceeds_shape_distance supports #1589

Closed

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: reformat trip and shape dist validator #1676

feat: reformat trip and shape dist validator #1676

cka-y commented Feb 18, 2024 •

edited

Loading

github-actions bot commented Feb 19, 2024

github-actions bot commented Feb 19, 2024

github-actions bot commented Mar 6, 2024

emmambd left a comment

github-actions bot commented Mar 8, 2024

github-actions bot commented Mar 11, 2024

jcpitre left a comment

github-actions bot commented Mar 11, 2024

feat: reformat trip and shape dist validator #1676

feat: reformat trip and shape dist validator #1676

Conversation

cka-y commented Feb 18, 2024 • edited Loading

github-actions bot commented Feb 19, 2024

github-actions bot commented Feb 19, 2024

github-actions bot commented Mar 6, 2024

emmambd left a comment

Choose a reason for hiding this comment

github-actions bot commented Mar 8, 2024

github-actions bot commented Mar 11, 2024

jcpitre left a comment

Choose a reason for hiding this comment

github-actions bot commented Mar 11, 2024

cka-y commented Feb 18, 2024 •

edited

Loading