DOC: Add Google Colab data loading section #63354
base: main
#63354 pre-commit.ci autofix |
afeld
left a comment
Hey, thanks for this contribution. See my comments in #63343, as most of them apply here as well. I'll approve whichever is ready first. Appreciate it!
| import pandas as pd | ||
| df = pd.read_csv("/content/drive/MyDrive/data.csv") | ||
| Loading data from a URL |
I think we can cut this section, as it isn't really specific to Colab.
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ||
|
|
||
| Google Colab is a hosted Jupyter notebook environment. Since it runs remotely, | ||
| files must be explicitly uploaded or mounted before they can be read by pandas. |
This is useful to clarify for beginners, thanks.
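To make the upload and mount steps concrete, here is a minimal sketch. It assumes the `google.colab` package, which is only importable inside a Colab runtime, and the helper names `in_colab`, `read_uploaded_csv`, and `read_drive_csv` are hypothetical, not part of the PR. The file name `data.csv` and the Drive path are taken from the diff above.

```python
import io


def in_colab() -> bool:
    """Return True when the google.colab package can be imported."""
    try:
        import google.colab  # noqa: F401  (only present in a Colab runtime)
    except ImportError:
        return False
    return True


def read_uploaded_csv(filename: str = "data.csv"):
    """Upload a file through the browser dialog and read it with pandas."""
    # Imported lazily so this module still loads outside Colab.
    import pandas as pd
    from google.colab import files

    uploaded = files.upload()  # dict mapping file name -> file contents (bytes)
    return pd.read_csv(io.BytesIO(uploaded[filename]))


def read_drive_csv(path: str = "/content/drive/MyDrive/data.csv"):
    """Mount Google Drive, then read a CSV stored there."""
    import pandas as pd
    from google.colab import drive

    drive.mount("/content/drive")  # prompts for authorization in the notebook
    return pd.read_csv(path)
```

Outside Colab, `in_colab()` returns `False` and the two readers should not be called; inside a notebook, `df = read_uploaded_csv()` would open the upload dialog.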
| uploaded = files.upload() | ||
| import pandas as pd | ||
| df = pd.read_csv("data.csv") |
| .. _fsimpl1: https://filesystem-spec.readthedocs.io/en/latest/api.html#built-in-implementations | ||
| .. _fsimpl2: https://filesystem-spec.readthedocs.io/en/latest/api.html#other-known-implementations | ||
|
|
||
| Loading data in Google Colab notebooks |
Being more consistent with the other headings, suggested change:
| - Loading data in Google Colab notebooks |
| + Google Colab |
Can we also move this section to be right above Google BigQuery?
| Loading data in Google Colab notebooks | ||
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ||
|
|
||
| Google Colab is a hosted Jupyter notebook environment. Since it runs remotely, |
Let's link to it.
| url = ( | ||
| "https://raw.githubusercontent.com/pandas-dev/pandas/main/" | ||
| "doc/data/air_quality_no2.csv" |
Is this split enforced by pre-commit? I know you're trying to keep the lines short, but especially given this section is oriented to beginners, I think this split may confuse them.
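For context on the split in question: Python joins adjacent string literals into one string, so the parenthesized form in the diff is equivalent to a single-line URL. A quick check, with illustrative variable names that are not from the PR:

```python
# Two adjacent literals inside parentheses are concatenated by the parser.
url_split = (
    "https://raw.githubusercontent.com/pandas-dev/pandas/main/"
    "doc/data/air_quality_no2.csv"
)

# The same URL written on one line.
url_single = "https://raw.githubusercontent.com/pandas-dev/pandas/main/doc/data/air_quality_no2.csv"

print(url_split == url_single)
```

Both forms pass the same value to `pd.read_csv`, so the choice is about readability for beginners versus line length.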
Adds documentation describing how to load data in Google Colab, including
file uploads, Google Drive mounting, and URL-based workflows.
Fixes #62708