refactor: generate 1wg-charters files via celery #7428

jennifer-richards · 2024-05-16T04:41:24Z

The main goal of this PR is to update the 1wg-charters.txt and 1wg-charters-by-acronym.txt files via celery tasks instead of cron jobs. The cron job worked by using wget to retrieve a datatracker URL that generated the files. This was done once per hour and cached for an hour each time using the slowpages cache.

As a result, the slowpages cache is effectively always holding on to the data generated by the last run of bin/hourly and the view itself is always served from that cache except when the cron job regenerates it.

This seems needlessly baroque, so I've moved the work of generating the charters files entirely out of the views and into a task. The views then simply read the file the task updates and return it in their response.
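A rough sketch of that split, for readers who want to see its shape. This is not the PR's code: the function names `write_charters_file` and `read_charters_file` are hypothetical, and the write-to-temp-then-rename step is an assumption added so a reader never sees a half-written file.

```python
import os
import tempfile
from pathlib import Path


def write_charters_file(path: Path, content: str) -> None:
    """Task side (hypothetical helper): write the rendered text, then swap it into place."""
    path.parent.mkdir(parents=True, exist_ok=True)
    # Write to a temp file in the same directory so os.replace is a same-filesystem rename.
    fd, tmp_name = tempfile.mkstemp(dir=path.parent, prefix=path.name + ".")
    try:
        with os.fdopen(fd, "w", encoding="utf-8", newline="\n") as tmp:
            tmp.write(content)
        os.replace(tmp_name, path)  # assumption: atomic swap, not stated in the PR
    except BaseException:
        os.unlink(tmp_name)
        raise


def read_charters_file(path: Path) -> bytes:
    """View side (hypothetical helper): return the stored bytes verbatim, no re-rendering."""
    return path.read_bytes()
```

In this sketch the view does no data gathering at all; it only returns whatever bytes the task last wrote.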

The new method has several benefits. It avoids collecting the group data twice (the views gathered the same data twice, the task does it only once). It avoids the small chance that the charter data is an extra hour out of date depending on system load (the cron job might make its request slightly less than an hour after the previous value was cached, and so be served that stale value). It moves the purportedly expensive data gathering entirely out of the view.

Important: I did a careful comparison of the output of the old and new code and there are a couple of differences. Most notably, the groups in 1wg-charters.txt are sorted alphabetically by acronym within each area. Before, they were (to my eye) random within the area. This seems like an improvement to me, but it is a change.

Second, there were some \r\n line endings in the old output - I assume they were bleeding in from the charters. These are converted to \n by the new rendering path. This seems like an absolute win to me, but again it's a change.
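To illustrate the ordering change described above, here is a minimal sketch of grouping by area and sorting by acronym within each area. The function name and the sample `(area, acronym)` pairs are illustrative only, and sorting the areas themselves by name is an assumption; the PR only states the order within an area.

```python
from itertools import groupby
from operator import itemgetter


def by_area_then_acronym(groups):
    """Return {area: [acronyms sorted alphabetically]} from (area, acronym) pairs."""
    # Sort by area first, then by acronym, so groupby sees each area as one run.
    ordered = sorted(groups, key=itemgetter(0, 1))
    return {
        area: [acronym for _, acronym in members]
        for area, members in groupby(ordered, key=itemgetter(0))
    }


# Illustrative sample data, not taken from the real files.
sample = [("sec", "tls"), ("art", "httpbis"), ("sec", "acme"), ("art", "core")]
result = by_area_then_acronym(sample)
```

With a deterministic order like this, successive runs produce stable output, which makes diffs between hourly files meaningful.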

Also important: We reviewed the \r (aka CR) changes and found they were caused by CRLFs in some old draft titles, not in the charters. Added a clean_whitespace filter to the title in the template to fix this. That replaces any run of ASCII whitespace / control characters (i.e., 0x00-0x1F) with a single space, then strips leading and trailing space. This ends up fixing spacing and inappropriate line breaks in a couple dozen draft titles.
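For reference, the behavior described for the filter can be sketched as below. This is a sketch of the described behavior, not necessarily the datatracker's implementation. Including the space character (0x20) in the character class is an assumption, made so that mixed runs such as "\r\n  " collapse to one space.

```python
import re

# Runs of ASCII control characters (0x00-0x1F) and spaces (0x20, assumed).
_WHITESPACE_RUN = re.compile(r"[\x00-\x20]+")


def clean_whitespace(text: str) -> str:
    """Collapse each run of control characters/spaces to one space, then trim the ends."""
    return _WHITESPACE_RUN.sub(" ", text).strip(" ")


cleaned = clean_whitespace("A Draft Title\r\n   with a stray line break ")
```

Here `cleaned` is `"A Draft Title with a stray line break"`.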

jennifer-richards · 2024-05-16T04:41:36Z

~~Draft right now because I need to update tests~~

codecov · 2024-05-16T15:25:46Z

Codecov Report

Attention: Patch coverage is 92.85714%, with 6 lines in your changes missing coverage. Please review.

Project coverage is 88.92%. Comparing base (187c2c5) to head (a08a02f).
Report is 199 commits behind head on main.

❗ Current head a08a02f differs from pull request most recent head 11b4be5

Please upload reports for the commit 11b4be5 to get more accurate results.

| Files | Patch % | Lines |
|---|---|---|
| ietf/group/utils.py | 87.75% | 6 Missing ⚠️ |

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #7428      +/-   ##
==========================================
- Coverage   88.98%   88.92%   -0.07%     
==========================================
  Files         291      294       +3     
  Lines       40717    41112     +395     
==========================================
+ Hits        36233    36557     +324     
- Misses       4484     4555      +71


rjsparks · 2024-05-16T19:24:51Z

Acknowledging that there will be ordering and whitespace changes. That might disrupt some people doing diffs for a day, but the tradeoff seems the right one.

jennifer-richards added 5 commits May 16, 2024 00:40

refactor: move helpers to utils.py

129c514

feat: task to generate 1wg-charters files

85f5350

refactor: use 1wg-charter files in views

a2f3509

chore: create periodic task + slight renaming

b5ccfb3

chore: remove wgets from bin/hourly

e326948

jennifer-richards added 3 commits May 16, 2024 11:13

test: refactor tests for new task/views

25bb5da

fix: fix bug uncovered by tests

d9bac3b

chore: remove unused imports

a08a02f

jennifer-richards marked this pull request as ready for review May 16, 2024 14:51

jennifer-richards requested a review from rjsparks May 16, 2024 14:54

jennifer-richards added 2 commits May 16, 2024 16:09

fix: clean whitespace in draft titles

b353ce6

fix: return verbatim bytes for charter views

2c1caec

jennifer-richards added 2 commits May 16, 2024 16:37

Merge remote-tracking branch 'refs/remotes/upstream/main' into 1wg-ch…

0571244

…arters # Conflicts: # bin/hourly # ietf/utils/management/commands/periodic_tasks.py

chore: remove now-empty /bin/hourly 🎉

11b4be5

rjsparks approved these changes May 16, 2024

rjsparks merged commit a5f44df into ietf-tools:main May 16, 2024
7 checks passed

jennifer-richards deleted the 1wg-charters branch May 16, 2024 20:08

github-actions bot locked as resolved and limited conversation to collaborators May 22, 2024
