Ensure channel version database exists when adding to community library by Jakoma02 · Pull Request #5233 · learningequality/studio

Jakoma02 · 2025-07-31T09:31:46Z

Summary

This PR ensures that if an older channel that does not have a versioned database file yet is added to the community library, the versioned database file is created.

References

Solves #5191.

~~This PR depends on changes from #5228, and must be merged after it.~~ (Done)

Reviewer guidance

~~After merging #5228, this PR should first be rebased onto the merged changes and only then reviewed and merged.~~ (Done)

AlexVelezLl · 2025-07-31T23:30:00Z

contentcuration/kolibri_public/utils/export_channel_to_kolibri_public.py

+        mapper.run()
+
+
+def _possibly_migrate_unversioned_database(


I haven't read it in depth yet 😅. But, a general comment is that we should make this copy when we create the submission, not after it's approved and mapped to the public models.

Mainly for two reasons:

If the user has published more recent versions between submission creation and submission approval, then it will no longer be true that the current channel database is the database for that channel version.

In the future, we'll need to create a way to preview the channel version related to the submission, and for that, we'll need to ensure that the channel-versioned database exists, and this preview would happen before approving the submission.

If there are arguments to have this copy at export time instead, Im happy to hear it too 😄

My idea for doing this at export time was to deal with content databases at a place that was already dealing with content databases, without complicating the viewset logic with something I thought it did not need to care about. I was trying to solve 1 by checking whether the database contains the channel metadata with the given version, but 2 alone is a good reason for actually doing this at submission creation time.

I am thinking that I could create a create_versioned_database_if_needed method inside contentcuration/utils/publish.sh and use it from the submission viewset -- or is there a better place for it?

Also, I think that the using_temp_migrated_database helper is fairly useful and makes the export_channel_to_kolibri_public implementation (arguably) more readable, but my motivation for creating it was to avoid reimplementing its logic in _possibly_migrate_unversioned_database, and it is no longer valid. Should I scratch this, or should I keep this change anyway since it is already done?

For sure! I agree on not complicating the viewset logic, I think this function can perfectly live in the publish.py module.

Should I scratch this, or should I keep this change anyway since it is already done?

I also think it is more readable now, and we can re-use this if we ever need it, so Im fine with keeping this change!

Implemented in 3f83c80.

Jakoma02 · 2025-08-06T17:44:03Z

I have rebased this PR onto current community-channels right now.

AlexVelezLl

Thanks @Jakoma02! Code changes looks mostly correct, I just found a little bug on how we are copying the versioned database, and noticed that we should probably have this process as an async task. Apart from that, code changes looks good, and tests provide a lot of confidence.

AlexVelezLl · 2025-08-13T14:57:10Z

contentcuration/contentcuration/utils/publish.py

+            )
+
+        with storage.open(unversioned_db_storage_path, "rb") as unversioned_db_file:
+            with storage.open(versioned_db_storage_path, "wb") as versioned_db_file:


I am getting this error when try to create a submission that doesn't have a versioned channel database:

File "/home/alexvelezll/.pyenv/versions/studio-py3.10/lib/python3.10/site-packages/django_s3_storage/storage.py", line 318, in _open raise ValueError("S3 files can only be opened in read-only mode") ValueError: S3 files can only be opened in read-only mode

So, it seems like a better way to go here is just to save the same database in the new path just like we do in the publish_channel method:

with storage.open(unversioned_db_storage_path, "rb") as unversioned_db_file: storage.save(versioned_db_storage_path, unversioned_db_file)

You are right, this slipped through. It should be fixed in 054c89d, and I did more thorough manual testing this time.

AlexVelezLl · 2025-08-13T15:10:24Z

contentcuration/contentcuration/models.py

+            # When creating a new submission, ensure the channel has a versioned database
+            # (it might not have if the channel was published before versioned databases
+            # were introduced).
+            ensure_versioned_database_exists(self.channel)


Just realized that this should probably happen in an async task since downloading the databases may take some time, and we should not keep the connection open for that long. So could you please create a new task in contentcuration/tasks.py that just calls the ensure_versioned_database_exists method (so we dont have all this logic in the tasks module) and then enqueue it here? Apologies I did not catch this earlier.

Done in 611b641.

AlexVelezLl

Thanks @Jakoma02, code changes looks good! Just a nitpick comment. We will also need to rebase this PR to solve the conflicts. After that, this is good to go!

AlexVelezLl · 2025-09-09T13:53:33Z

contentcuration/kolibri_public/utils/export_channel_to_kolibri_public.py

-                mapper.run()
+        db_storage_path = versioned_db_storage_path
+
+    with using_temp_migrated_database(db_storage_path):


I'd rename it to something like using_temp_migrated_content_database to explicity declare that this is using the content database context.

Done in b115edb

Jakoma02 · 2025-09-19T18:03:43Z

I rebased the PR on top of current unstable. The rebasing was not completely trivial, it would be great if you could at least briefly double-check the new commits (I have some respect for rewriting git history like this, because if I make a mistake here and it is discovered later, it will be hard to tell that this was a rebasing error and it might make the history really hard to understand).

AlexVelezLl

The new task is working correctly, code changes look good, and tests gives a lot of reassurance. Went through all the PR commits and didnt spot anything weird. Thanks a lot @Jakoma02!

AlexVelezLl linked an issue Jul 31, 2025 that may be closed by this pull request

ESoCC: Ensure that channel version's database exists when a community library submission is created #5191

Closed

1 task

AlexVelezLl reviewed Jul 31, 2025

View reviewed changes

Jakoma02 force-pushed the ensure-channel-version-database-exists branch from 30b24d0 to 3f83c80 Compare August 6, 2025 17:40

Jakoma02 requested a review from AlexVelezLl August 6, 2025 17:46

marcellamaki added this to the Studio: Easy Sharing of Community Channels milestone Aug 12, 2025

AlexVelezLl self-assigned this Aug 13, 2025

AlexVelezLl requested changes Aug 13, 2025

View reviewed changes

rtibbles changed the base branch from community-channels to unstable August 28, 2025 00:00

Jakoma02 requested a review from AlexVelezLl September 2, 2025 21:22

AlexVelezLl reviewed Sep 9, 2025

View reviewed changes

Jakoma02 added 8 commits September 19, 2025 19:11

Ensure channel version database exists

69e536b

Avoid using storage path method

8961832

Ensure versioned database exists on submission create

5086384

Fix failing tests and rebasing mistakes

794cafe

Fix ensure_versioned_database_exists to not open a file for writing

2f6e2af

Use async task for ensure_versioned_database_exists

98ffe0c

Fix publishing tests

70e86da

Clarify helper naming

15d40ff

Jakoma02 force-pushed the ensure-channel-version-database-exists branch from b115edb to 15d40ff Compare September 19, 2025 17:55

Jakoma02 requested a review from AlexVelezLl September 19, 2025 18:03

AlexVelezLl approved these changes Sep 22, 2025

View reviewed changes

AlexVelezLl merged commit fc27318 into learningequality:unstable Sep 22, 2025
13 checks passed

Uh oh!

Conversation

Jakoma02 commented Jul 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

References

Reviewer guidance

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Jakoma02 commented Aug 6, 2025

Uh oh!

AlexVelezLl left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

AlexVelezLl left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Jakoma02 commented Sep 19, 2025

Uh oh!

AlexVelezLl left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Jakoma02 commented Jul 31, 2025 •

edited

Loading