[...] The other remaining point I noticed when reviewing this is that it only talks about dataset repos on HF. But the guidance (or at least most of it?) applies just as much to model repos.
Presumably we don't want to have near-duplicates of this for dataset and model repo guidance. Would the idea be to create a separate model repo guidance that by and large refers to this page asking the reader to replace the concept of "dataset" with "model"? Or would it be better to have a single page that in the (hopefully very few) places where it matters distinguishes between dataset repo and model repo type? (One pending project candidate that needs this guidance in fact needs it for a model repo.)
Originally posted by @hlapp in #65 (comment)
It could probably be refactored into a general HF upload guide, especially since the page already references the model vs. dataset vs. space distinction. I think model repos are generally more standardized, so it should be simple enough to mostly point to the docs. The key points to note for models are that different checkpoints each get their own repository, and that those repositories are then grouped into a collection.
This last point will take a bit more consideration for implementation, so we have deferred this change to a new PR based on this issue.
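To make the checkpoint point concrete, here is a minimal sketch of one way it could work, assuming the `huggingface_hub` Python client. The naming scheme (`<namespace>/<base_name>-<checkpoint>`), the collection title, and the helper names `plan_checkpoint_repos` and `publish` are hypothetical and only for illustration; the eventual guide may settle on something different.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PublishPlan:
    """Repos to create and the collection that will group them."""
    collection_title: str
    repo_ids: list[str]


def plan_checkpoint_repos(namespace, base_name, checkpoints):
    """Build one repo id per checkpoint (hypothetical naming scheme)."""
    repo_ids = [f"{namespace}/{base_name}-{ckpt}" for ckpt in checkpoints]
    return PublishPlan(
        collection_title=f"{base_name} checkpoints",
        repo_ids=repo_ids,
    )


def publish(plan, namespace, repo_type="model"):
    """Create each repo, then group them in a collection.

    Assumes huggingface_hub is installed and a token with write access
    is configured. Imported lazily so planning works without it.
    """
    from huggingface_hub import (
        add_collection_item,
        create_collection,
        create_repo,
    )

    for repo_id in plan.repo_ids:
        create_repo(repo_id, repo_type=repo_type, exist_ok=True)

    collection = create_collection(
        title=plan.collection_title,
        namespace=namespace,
        exists_ok=True,
    )
    for repo_id in plan.repo_ids:
        add_collection_item(
            collection.slug,
            item_id=repo_id,
            item_type=repo_type,
            exists_ok=True,
        )
    return collection.slug
```

Because `repo_type` is a parameter, the same sketch would cover dataset repos by passing `repo_type="dataset"`, which fits the idea of a single guide that only distinguishes repo types where it matters.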