move gsutil copy to condtional to avoid breakages#909
Conversation
|
Isn't it better to error out when a user requests profiling but the profiling fails? User can set profiler='' (default, no profiling) to avoid profiling instead of this check? This error points to a real bug in maxtext - we should either ensure gsutil is installed or perhaps can use a different gcs API (upload_blob) |
|
Hi @gobbleturk the profiling works fine as is. This line just copies the generated |
Don't we need to move the profile to GCS? How are we able to use the profile otherwise? |
|
@gobbleturk yes, on clusters using gcs, it is needed indeed. Hence, the nightly maxtext container made through this script installs Another way could be to enforce the |
Ya this makes sense. For TPUs the profile will be sent to |
gobbleturk
left a comment
There was a problem hiding this comment.
Will approve for now to unblock nsys profiling use cases, as we work on longer term fix
Created #911 to track longer term fix |
gsutil is not in the requirements list, hence not installed by default. The copy command is breaking for the containers which don't have it by default. Moved it under a conditonal