I have a PySpark dataframe that keeps track of changes that occur in a product's price and status over months. This means that a new row is created only when a change occur