
[jira] [Created] (ARROW-3693) [R] Invalid buffer for null characters with null data

Javier Luraschi created ARROW-3693:

             Summary: [R] Invalid buffer for null characters with null data
                 Key: ARROW-3693
                 URL: https://issues.apache.org/jira/browse/ARROW-3693
             Project: Apache Arrow
          Issue Type: Improvement
          Components: R
            Reporter: Javier Luraschi
         Attachments: Screen Shot 2018-11-02 at 10.27.11 PM.png

I'm hitting this with data coming from Spark when retrieving this data frame:


<database>   <name>    <temp>
default      a         TRUE
             b         TRUE

error: Failed to fetch data: invalid data in buffer 2
The problem is that it is possible to have a NULL character (data) array with all offsets set to 0. Notice that in the example above Spark returns two batches, so effectively we are trying to parse:

<database>   <name>    <temp>
             b         TRUE

Here the data array is NULL and the offsets are all 0 for the <database> column.
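
To make the failing case concrete, here is a minimal sketch in plain Python (not the actual R/C++ code in StringArray_to_Vector, and not using any Arrow library). It assumes the usual string array layout of a validity buffer, an offsets buffer and a data buffer, and that "buffer 2" in the error refers to the data buffer. The function names string_buffers_are_valid and decode_strings are hypothetical, made up for this illustration. The idea is that a NULL data buffer should only be rejected when the offsets actually reference bytes in it.

```python
from typing import List, Optional, Sequence


def string_buffers_are_valid(offsets: Sequence[int], data: Optional[bytes]) -> bool:
    """Hypothetical check: decide whether offsets and data are consistent."""
    # Offsets must exist, start at a non-negative value and never decrease.
    if len(offsets) == 0 or offsets[0] < 0:
        return False
    if any(b < a for a, b in zip(offsets, offsets[1:])):
        return False
    referenced = offsets[-1]  # number of data bytes the offsets point into
    if data is None:
        # A missing data buffer is acceptable only if no bytes are referenced,
        # which is the all-null batch described in this issue.
        return referenced == 0
    return referenced <= len(data)


def decode_strings(
    validity: Sequence[bool], offsets: Sequence[int], data: Optional[bytes]
) -> List[Optional[str]]:
    """Hypothetical decoder: turn the three buffers into a list of strings."""
    if len(offsets) != len(validity) + 1 or not string_buffers_are_valid(offsets, data):
        raise ValueError("invalid string buffers")
    raw = data or b""  # treat a NULL data buffer as empty
    out: List[Optional[str]] = []
    for i, is_valid in enumerate(validity):
        if is_valid:
            out.append(raw[offsets[i]:offsets[i + 1]].decode("utf-8"))
        else:
            out.append(None)  # null slot, no bytes are read
    return out


# Second batch from the example: one row, <database> is null,
# offsets are all 0 and the data buffer is NULL.
print(decode_strings([False], [0, 0], None))
```

With this rule the second batch decodes to a single missing value instead of raising an error, while a NULL data buffer combined with non-zero offsets is still treated as invalid.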

Here is a snapshot while debugging StringArray_to_Vector:

(See attachment: Screen Shot 2018-11-02 at 10.27.11 PM.png)
