At the risk of my issue being very data specific, I want to ask for clues as to why I am seeing a large performance hit from what looks like a simple change in the SQL in a data flow.
I started with the following and got acceptable performance of 15 minutes or so:
select A.* from A inner join B on A.field1 = B.field2 and A.field3 = B.field4
However, when I added a column from the table on the right side of the join...
select A.*, B.field5 from A inner join B on A.field1 = B.field2 and A.field3 = B.field4
Suddenly, my performance soared to 14 hours.
What can explain this kind of performance hit? Is there a work-around? I tried a left join instead. I tried playing around with the indexing. Nothing worked to bring the performance back in line with the original SQL.