I am trying to schedule a dataflow that appends each week's data to a historical dataset containing the previous weeks' history. The auto-updating dataset, "projects_prod_daily", runs every day as a replace-data update. The historical dataset, "projects_append_weekly", contains history that we appended manually in the past. I also created a connector to the USGS Earthquake data, called "recursive_trigger_weekly", which runs on Saturday mornings.
In a MySQL dataflow, I created the following queries to achieve the desired result:
Transform 1 (saved as pull_updated_status_data):
SELECT
  *,
  DATE_FORMAT(CURRENT_DATE, '%m/%d/%Y') AS `Date`
FROM projects_prod_daily
Transform 2:
SELECT
  *,
  STR_TO_DATE(`Date`, '%m/%d/%Y') AS `Date`
FROM projects_append_weekly
UNION ALL
SELECT
  *
FROM pull_updated_status_data
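One caveat with Transform 2 as written: `*` on "projects_append_weekly" already includes the existing `Date` column, so the first SELECT returns one more column than the second, and MySQL rejects a UNION ALL whose halves have different column counts. A minimal sketch of a column-aligned version follows. The columns `project_id` and `status` are hypothetical placeholders (not the real schema), and it assumes `Date` is stored as an mm/dd/yyyy string in both inputs:

```sql
-- Sketch only: project_id and status are placeholder column names.
-- Replace them with the actual columns shared by both datasets.

-- Historical rows: convert the stored mm/dd/yyyy string to a DATE.
SELECT
  `project_id`,
  `status`,
  STR_TO_DATE(`Date`, '%m/%d/%Y') AS `Date`
FROM projects_append_weekly

UNION ALL

-- This week's rows: same columns, same order, same DATE conversion,
-- so both halves of the UNION ALL line up in count and type.
SELECT
  `project_id`,
  `status`,
  STR_TO_DATE(`Date`, '%m/%d/%Y') AS `Date`
FROM pull_updated_status_data
```

Listing the columns explicitly on both sides keeps the two halves in step even if one of the input datasets later gains a column.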
My question is: if I originally created "projects_append_weekly" in a separate dataflow, but use the queries above to produce an output dataset with the same name, will each run keep appending the latest week's updates to it automatically?