[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: State in the Scala DataStream API

Hi Juan,

The state will be purged if you return None instead of a Some.
However, this only happens when the function is called for a specific key, i.e., state won't be automatically removed after some time.
If this is your use case, you have to implement a ProcessFunction and use timers to manually clean up the state.

Best, Fabian

2018-08-08 19:02 GMT+02:00 Juan Gentile <j.gentile@xxxxxxxxxx>:


I'm looking at the following page of the documentation


particularly at this piece of code:

val stream: DataStream[(String, Int)] = ...

val counts: DataStream[(String, Int)] = stream
  .mapWithState((in: (String, Int), count: Option[Int]) =>
    count match {
      case Some(c) => ( (in._1, c), Some(c + in._2) )
      case None => ( (in._1, 0), Some(in._2) )

How is the state clear/purge in this case for keys that no longer appear?

Thank you,