osdir.com


[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Issue with segments not loading/taking a long time


Hi Samarth,

have you had a change to check the segment balancing status of your cluster?
Do you see any significant imbalance between historicals?

Jihoon

On Thu, Jul 19, 2018 at 12:28 PM Samarth Jain <samarth.jain@xxxxxxxxx>
wrote:

> I am working on upgrading our internal cluster to 0.12.1 release and seeing
> that a few data sources fail to load. Looking at coordinator logs, I am
> seeing messages like this for the datasource:
>
> @400000005b50dbc637061cec 2018-07-19T18:43:08,923 INFO
> [Coordinator-Exec--0] io.druid.server.coordinator.CuratorLoadQueuePeon -
> Asking server peon[/druid-test--001/loadQueue/127.0.0.1:7103] to drop
> segment[*datasource*
>
> _2015-09-03T00:00:00.000Z_2015-09-04T00:00:00.000Z_2018-04-23T21:24:04.910Z]
>
>
>
> @400000005b50dbc637391f84 2018-07-19T18:43:08,926 WARN
> [Coordinator-Exec--0] io.druid.server.coordinator.rules.LoadRule - No
> available [_default_tier] servers or node capacity to assign primary
>
> segment[*datasource*-08-10T00:00:00.000Z_2015-08-11T00:00:00.000Z_2018-04-23T21:24:04.910Z]!
> Expected Replicants[1]
>
>
> The datasource failed to load for a long time and then eventually was
> loaded successfully. Has anyone else seen this? I see a few fixes around
> segment loading and coordination in 0.12.2 (which I am hoping will be out
> soon) but I am not sure if they address this issue.
>