
[jira] [Created] (HBASE-20844) Duplicate rows returned while hbase snapshot reads

ShivaKumar SS created HBASE-20844:

             Summary: Duplicate rows returned while hbase snapshot reads
                 Key: HBASE-20844
                 URL: https://issues.apache.org/jira/browse/HBASE-20844
             Project: HBase
          Issue Type: Bug
          Components: mapreduce, spark
    Affects Versions: 1.3.1
         Environment: Cluster details

Java   1.7
HBase  1.3.1
Spark  1.6.1
            Reporter: ShivaKumar SS

We take a snapshot from code and read its data using both MapReduce and Spark. Both approaches return duplicate records.

On the API side, {{org.apache.hadoop.hbase.mapreduce.TableSnapshotInputFormat}} is used.

The snapshot was taken while a region of the table was in the middle of a split.

We suspect the duplicates occur because data is returned for both the parent region and its daughter regions.
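
To illustrate the suspicion, here is a minimal, self-contained sketch. It does not use any HBase classes: {{Region}}, {{readRows}}, {{withoutSplitParents}} and {{sampleRegions}} are hypothetical names made up for this example, and the key ranges are invented. It only models the idea that if a reader creates one input per region listed in the snapshot, and the snapshot lists a split parent alongside its daughters, every row is seen twice. Whether this matches the actual code path in TableSnapshotInputFormat is an assumption that still needs to be confirmed.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class SplitParentSketch {

    /** Hypothetical stand-in for a region entry in a snapshot; NOT an HBase class. */
    public static final class Region {
        final String name;
        final String startKey;      // inclusive
        final String endKey;        // exclusive
        final boolean splitParent;  // true if this region has already been split

        Region(String name, String startKey, String endKey, boolean splitParent) {
            this.name = name;
            this.startKey = startKey;
            this.endKey = endKey;
            this.splitParent = splitParent;
        }

        boolean contains(String rowKey) {
            return rowKey.compareTo(startKey) >= 0 && rowKey.compareTo(endKey) < 0;
        }
    }

    /** Emits a row once for every region whose key range contains it. */
    public static List<String> readRows(List<Region> regions, List<String> rowKeys) {
        List<String> out = new ArrayList<>();
        for (Region region : regions) {
            for (String key : rowKeys) {
                if (region.contains(key)) {
                    out.add(key);
                }
            }
        }
        return out;
    }

    /** Drops regions flagged as split parents, keeping only regions that are still live. */
    public static List<Region> withoutSplitParents(List<Region> regions) {
        List<Region> live = new ArrayList<>();
        for (Region region : regions) {
            if (!region.splitParent) {
                live.add(region);
            }
        }
        return live;
    }

    /** Invented example: parent [a, z) split into daughters [a, m) and [m, z). */
    public static List<Region> sampleRegions() {
        return Arrays.asList(
            new Region("parent", "a", "z", true),
            new Region("daughterA", "a", "m", false),
            new Region("daughterB", "m", "z", false));
    }

    public static void main(String[] args) {
        List<String> rows = Arrays.asList("b", "n");
        // Parent and daughters are all read: each row appears twice.
        System.out.println(readRows(sampleRegions(), rows));                      // [b, n, b, n]
        // Split parent skipped: each row appears once.
        System.out.println(readRows(withoutSplitParents(sampleRegions()), rows)); // [b, n]
    }
}
```

If the hypothesis holds, skipping split parents when building the list of regions to read would be one possible direction for a fix.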

This message was sent by Atlassian JIRA