Window function for Spark 1.6
I have a dataframe:
+---+-----+-------------------+-----+-------+
| id| name|           end time|value|comment|
+---+-----+-------------------+-----+-------+
|  1|node1|2017-03-24 08:30:00|    5| blabla|
|  2|node1|2017-03-24 09:00:00|    3| blabla|
|  3|node1|2017-03-24 09:30:00|    8| blabla|
|  4|node2|2017-03-24 10:00:00|    5| blabla|
|  5|node2|2017-03-24 10:30:00|    3| blabla|
|  6|node2|2017-03-24 11:00:00|    1| blabla|
|  7|node2|2017-03-24 11:30:00|    3| blabla|
|  8|node2|2017-03-24 12:00:00|    5| blabla|
+---+-----+-------------------+-----+-------+
I need to find the nodes whose value stayed below 6 for a 2-hour period. How can I do that in Spark 1.6? Thanks in advance!
Edit: in Spark 2.x you can use window aggregate functions:
df.groupby( col("name"), window(col("end time"), "2 hour", "30 minute") ) .agg(max("value").as("2_hour_max_value")) .filter(col("2_hour_max_value") < 6) .select("name") .distinct()