Skip to content

[R] strptime should return NA (not error) with format mismatch  #31114

Description

@asfimport

base::strptime() returns NA when the value passed to the format argument does not match the string to be parsed. The arrow binding currently errors in the same scenario.

strptime("2022-02-11", format = "%Y-%m-%d")
#> [1] "2022-02-11 GMT"
strptime("2022-02-11", format = "%Y %m-%d")
#> [1] NA
suppressMessages(library(lubridate))
suppressMessages(library(arrow))
suppressMessages(library(dplyr))

df <- tibble(x = "2022-02-11")

df %>% 
  mutate(z = strptime(x, format = "%Y-%m %d"))
#> # A tibble: 1 × 2
#>   x          z     
#>   <chr>      <dttm>
#> 1 2022-02-11 NA

df %>% 
  record_batch() %>% 
  mutate(z = strptime(x, format = "%Y-%m %d")) %>% 
  collect()
#> Error: Invalid: Failed to parse string: '2022-02-11' as a scalar of type timestamp[ms]

Reporter: Dragoș Moldovan-Grünfeld / @dragosmg
Assignee: Dragoș Moldovan-Grünfeld / @dragosmg
Watchers: Rok Mihevc / @rok

Related issues:

PRs and other links:

Note: This issue was originally created as ARROW-15659. Please see the migration documentation for further details.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions