Data Profiling Task running out of memory
I am running the Data Profiling Task on a table with 3 million rows and receiving an out-of-memory exception. I pull the data onto a developer machine to do the profiling, so the tables being profiled are on the local machine running SSIS, and the machine has 8 GB of RAM. Is there a way to regulate memory usage in the Data Profiling Task to mitigate this issue?
May 4th, 2012 10:56am
Hi,
Can you share with me:
1) How many columns do you have?
2) Are you profiling the Candidate Key, and how many keyColumns are you profiling?
Cheers
May 4th, 2012 12:51pm
It is about 100 columns and we are profiling for candidate keys.
The default profiling settings ran in 6.30 seconds, but when I added the column pattern profiling checkbox, the timeout occurred.
May 8th, 2012 11:33am
There is nothing in the Data Profiling Task that would help you to consume less memory.
The problem is your situation: only 8 GB of RAM is available, which is too little for profiling 100 columns.
You have two options for proceeding:
1) Add more memory, or do the profiling on a more capable machine, and/or
2) Do the profiling in small increments, e.g. the first 20 columns, then the next 20, until all 100 are profiled.
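The incremental approach can be planned with a few lines of Python. This is only a sketch of how to split the column list into runs of 20; it does not call SSIS, and the column names (`col_001` and so on) and the helper `column_batches` are hypothetical placeholders for the real table.

```python
def column_batches(columns, batch_size=20):
    """Yield consecutive slices of `columns`, each at most `batch_size` long."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    for start in range(0, len(columns), batch_size):
        yield columns[start:start + batch_size]


# Hypothetical column names standing in for the real 100-column table.
columns = [f"col_{i:03d}" for i in range(1, 101)]
batches = list(column_batches(columns))

# Each batch would become one separate profiling run.
for number, batch in enumerate(batches, start=1):
    print(f"run {number}: {batch[0]} .. {batch[-1]} ({len(batch)} columns)")
```

Each printed run lists the first and last column of one batch, which can then be entered as the column selection for a separate execution of the task.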
Arthur
May 8th, 2012 12:17pm
How would I connect the columns back together to get a comprehensive view of the table? Aren't some of the profiling activities dependent on the columns' relationships to each other?
Regarding option #1, can you point me to any benchmarking references for data set size?
May 8th, 2012 4:17pm
Not all 100 columns can be candidate keys. Narrow the list down to only the most likely ones to make the data profiling leaner, then run it.
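One way to narrow the list before running the task is to screen a small sample of rows for columns that are nearly unique. The sketch below is an assumption about how that screening could be done, not part of the Data Profiling Task: the function `likely_key_columns`, the threshold, and the sample rows are all hypothetical. It looks only at single columns, so composite keys would be missed, and uniqueness in a sample does not prove uniqueness in the full table.

```python
def likely_key_columns(sample_rows, min_distinct_ratio=0.99):
    """Return names of columns whose sampled values are almost all distinct."""
    if not sample_rows:
        return []
    keep = []
    for name in sample_rows[0]:
        values = [row[name] for row in sample_rows]
        if any(value is None for value in values):
            continue  # a column containing NULLs cannot be a key on its own
        if len(set(values)) / len(values) >= min_distinct_ratio:
            keep.append(name)
    return keep


# Hypothetical sample rows; in practice these would be read from the table.
sample = [
    {"order_id": 1, "customer": "a", "status": "open"},
    {"order_id": 2, "customer": "b", "status": "open"},
    {"order_id": 3, "customer": "a", "status": "closed"},
    {"order_id": 4, "customer": "c", "status": "open"},
]
print(likely_key_columns(sample))
```

Columns that survive the screening are the ones worth passing to the candidate key profile; columns with repeated values in the sample can be dropped from that profile safely.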
Adding memory is just that: buying a bigger RAM chip.
Arthur
May 9th, 2012 10:30am
Thanks, arthurz. For option #1 I meant any benchmarking in terms of data set size, not how to purchase more RAM. But this looks good enough for a start, thanks.
May 14th, 2012 1:47pm


