Best Time to Buy and Sell Stock with Cooldown
Say you have an array for which the ith element is the price of a given stock on day i.
Design an algorithm to find the maximum profit. You may complete as many transactions as you like (ie, buy one and sell one share of the stock multiple times) with the following restrictions:
You may not engage in multiple transactions at the same time (ie, you must sell the stock before you buy again). After you sell your stock, you cannot buy stock on next day. (ie, cooldown 1 day) Example:
prices = [1, 2, 3, 0, 2]
maxProfit = 3
transactions = [buy, sell, cooldown, buy, sell]
solution:
public class Solution {
public int maxProfit(int[] prices) {
int sell = 0, buy = Integer.MIN_VALUE, pre_sell = 0, pre_buy = 0;
for(int i = 0 ; i< prices.length; i++){
pre_buy = buy;
buy = Math.max(pre_sell - prices[i],pre_buy);
pre_sell = sell;
sell = Math.max(pre_buy + prices[i], pre_sell);
}
return sell;
}
}
@dietpepsi said in Share my thinking process:
The series of problems are typical dp. The key for dp is to find the variables to represent the states and deduce the transition function.
Of course one may come up with a O(1) space solution directly, but I think it is better to be generous when you think and be greedy when you implement.
The natural states for this problem is the 3 possible transactions :
buy
,sell
,rest
. Hererest
means no transaction on that day (aka cooldown).Then the transaction sequences can end with any of these three states.
For each of them we make an array,
buy[n]
,sell[n]
andrest[n]
.
buy[i]
means before dayi
what is the maxProfit for any sequence end withbuy
.
sell[i]
means before dayi
what is the maxProfit for any sequence end withsell
.
rest[i]
means before dayi
what is the maxProfit for any sequence end withrest
.Then we want to deduce the transition functions for
buy
sell
andrest
. By definition we have:buy[i] = max(rest[i-1]-price, buy[i-1]) sell[i] = max(buy[i-1]+price, sell[i-1]) rest[i] = max(sell[i-1], buy[i-1], rest[i-1])
Where
price
is the price of dayi
. All of these are very straightforward. They simply represents :(1) We have to `rest` before we `buy` and (2) we have to `buy` before we `sell`
One tricky point is how do you make sure you
sell
before youbuy
, since from the equations it seems that[buy, rest, buy]
is entirely possible.Well, the answer lies within the fact that
buy[i] <= rest[i]
which meansrest[i] = max(sell[i-1], rest[i-1])
. That made sure[buy, rest, buy]
is never occurred.A further observation is that and
rest[i] <= sell[i]
is also true thereforerest[i] = sell[i-1]
Substitute this in to
buy[i]
we now have 2 functions instead of 3:buy[i] = max(sell[i-2]-price, buy[i-1]) sell[i] = max(buy[i-1]+price, sell[i-1])
This is better than 3, but
we can do even better
Since states of day
i
relies only oni-1
andi-2
we can reduce the O(n) space to O(1). And here we are at our final solution:Java
public int maxProfit(int[] prices) { int sell = 0, prev_sell = 0, buy = Integer.MIN_VALUE, prev_buy; for (int price : prices) { prev_buy = buy; buy = Math.max(prev_sell - price, prev_buy); prev_sell = sell; sell = Math.max(prev_buy + price, prev_sell); } return sell; }
C++
int maxProfit(vector<int> &prices) { int buy(INT_MIN), sell(0), prev_sell(0), prev_buy; for (int price : prices) { prev_buy = buy; buy = max(prev_sell - price, buy); prev_sell = sell; sell = max(prev_buy + price, sell); } return sell; }
For this problem it is ok to use
INT_MIN
as initial value, but in general we would like to avoid this. We can do the same as the following python:Python
def maxProfit(self, prices): if len(prices) < 2: return 0 sell, buy, prev_sell, prev_buy = 0, -prices[0], 0, 0 for price in prices: prev_buy = buy buy = max(prev_sell - price, prev_buy) prev_sell = sell sell = max(prev_buy + price, prev_sell) return sell