I have a complex data flow (A) that takes about 7 minutes to run and a simple data flow (B) that takes about 2 minutes. The outputs of both are properly indexed.
I created a new data flow that inner-joins two of the output datasets, one from each data flow (A and B). The join is on a single field. The new data flow has now been running for over 25 minutes and still has not finished. What should I investigate to understand the potential problem? Are there more efficient ways to create a new dataset from the outputs of two different data flows?
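One thing that might be worth ruling out is whether the join field is unique on at least one side. If the same value repeats in both outputs, an inner join returns every pairing of those rows, so the result can be far larger than either input. Below is a minimal sketch of that check in Python. It assumes the join-field values from each output can be exported as plain lists; the function name and the sample values are hypothetical and do not come from my actual data.

```python
from collections import Counter


def estimate_inner_join_rows(left_keys, right_keys):
    """Return how many rows an inner join on a single field would produce."""
    left_counts = Counter(left_keys)
    right_counts = Counter(right_keys)
    # Each shared key contributes (occurrences on left) x (occurrences on right) rows.
    return sum(
        count * right_counts[key]
        for key, count in left_counts.items()
        if key in right_counts
    )


# Hypothetical sample values standing in for the join field of each output.
a_keys = [1, 1, 2, 3]
b_keys = [1, 1, 1, 3, 4]
estimated_rows = estimate_inner_join_rows(a_keys, b_keys)
print(estimated_rows)
```

If the estimate is much larger than the row count of either input, duplicated join keys are a likely reason for the slow run.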