[C++] Parse time32 from string and infer in CSV reader

When reading a CSV with read_csv_arrow() with date types and time types, the dates are read as datetimes rather than dates and times are read as characters rather than time.

The first problem can be fixed by supplying date32() to schema(), though better inference would be nice. However, supplying time32() to schema() causes an error.

Here is a sample dataset, also attached.

date,time,reading
 2021-01-01,00:00:00,67.8
 2021-01-01,00:00:00,72.4
 2021-01-01,00:00:00,63.1
 2021-01-01,00:05:00,67.8

Reading with readr::read_csv() results in a tibble with three columns: date, time, dbl, as expected.
 
```r

samp_readr <- readr::read_csv('sampledata.csv')
samp_readr
```

```r

# A tibble: 4 x 3
  date       time   reading
  <date>     <time>   <dbl>
1 2021-01-01 00'00"    67.8
2 2021-01-01 00'00"    72.4
3 2021-01-01 00'00"    63.1
4 2021-01-01 05'00"    67.8
```

Reading with arrow::read_csv_arrow() without providing schema() results in a tibble with three columns: dttm, chr, dbl.

```r

samp_arrow_plain <- arrow::read_csv_arrow('sampledata.csv')
samp_arrow_plain
```
```r

# A tibble: 4 x 3
  date                time     reading
  <dttm>              <chr>      <dbl>
1 2020-12-31 19:00:00 00:00:00    67.8
2 2020-12-31 19:00:00 00:00:00    72.4
3 2020-12-31 19:00:00 00:00:00    63.1
4 2020-12-31 19:00:00 00:05:00    67.8
```

Reading with arrow::read_csv_arrow() and providing date=date32() via schema() to col_types results in a tibble with three columns: date, chr, dbl.

```r

samp_arrow_date <- arrow::read_csv_arrow('sampledata.csv', col_types=schema(date=date32()))
samp_arrow_date
```
```r

# A tibble: 4 x 3
  date       time     reading
  <date>     <chr>      <dbl>
1 2021-01-01 00:00:00    67.8
2 2021-01-01 00:00:00    72.4
3 2021-01-01 00:00:00    63.1
4 2021-01-01 00:05:00    67.8
```

Reading with arrow::read_csv_arrow() and providing time=time32() via schema() to col_types generates an error.

```r

samp_arrow_time <- arrow::read_csv_arrow('sampledata.csv', col_types=schema(time=time32()))
```
```r

Error in csv___TableReader__Read(self) : 
  NotImplemented: CSV conversion to time32[ms] is not supported
```

The same error occurs when using compact string notation.

```r

samp_arrow_string <- arrow::read_csv_arrow('sampledata.csv', col_types='DTc', col_names=c('date', 'time', 'reading'), skip=1)
```
```r

Error in csv___TableReader__Read(self) : 
  NotImplemented: CSV conversion to time32[ms] is not supported
```

This is something in the internals, so far beyond me to figure out a fix, but I saw it in action and wanted to report it.

**Environment**: Ubuntu 18.04, R 4.0.3
**Reporter**: [Jared Lander](https://issues.apache.org/jira/browse/ARROW-11243)
**Assignee**: [Antoine Pitrou](https://issues.apache.org/jira/browse/ARROW-11243) / @pitrou
#### Related issues:
- [[C++] Unable to read date64 or date32 in specific format from CSV](https://github.com/apache/arrow/issues/28303) (relates to)
#### Original Issue Attachments:
- [sampletimedata.csv](https://issues.apache.org/jira/secure/attachment/13018721/sampletimedata.csv)
#### PRs and other links:
- [GitHub Pull Request #10782](https://github.com/apache/arrow/pull/10782)

<sub>**Note**: *This issue was originally created as [ARROW-11243](https://issues.apache.org/jira/browse/ARROW-11243). Please see the [migration documentation](https://github.com/apache/arrow/issues/14542) for further details.*</sub>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[C++] Parse time32 from string and infer in CSV reader #27146

Related issues:

Original Issue Attachments:

PRs and other links:

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

[C++] Parse time32 from string and infer in CSV reader #27146

Description

Related issues:

Original Issue Attachments:

PRs and other links:

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions