Handling traditional Chinese script in SSIS
Dear Experts,
I have my source data as flat file in traditional Chinese script and the delimiter is
| symbol, apart from this some other | symbols are falling in side the data which is in traditional Chinese script. I have created flat file connection, in preview mode the data is falling correctly. Where else run time the
| symbol in side the Chinese script data is breaking the data and pushing it in to the next column which is irrelevant.
Note : Some data are in double byte
Expecting best solution
March 17th, 2011 2:16pm
Does the flat file contain qualifiers around the columnvalues? The text qualifier string is
most often a quotation mark (") which
takes care of chars in the text that are also used as column delimiters.
Please mark the post as answered if it answers your question | My SSIS Blog:
http://microsoft-ssis.blogspot.com
Free Windows Admin Tool Kit Click here and download it now
March 17th, 2011 2:49pm
Dear Experts,
I have my source data as flat file in traditional Chinese script and the delimiter is
| symbol, apart from this some other | symbols are falling in side the data which is in traditional Chinese script. I have created flat file connection, in preview mode the data is falling correctly. Where else run time the
| symbol in side the Chinese script data is breaking the data and pushing it in to the next column which is irrelevant.
Expecting best solution
best solution is to choose another delimiter rather than |
something never used in your column contents. maybe ^^ or ¬¬ or other charsIf you think my suggestion is useful, please rate it as helpful.
If it has helped you to resolve the problem, please Mark it as Answer.
http://twitter.com/7Kn1ghts
March 17th, 2011 2:53pm
for your kind in formation the data is in double byte.
Free Windows Admin Tool Kit Click here and download it now
March 17th, 2011 3:03pm
The delimiter is | simple which is predefined by my client
March 17th, 2011 3:04pm
The delimiter is | simple which is predefined by my client
How big is your source flat file from client?
Quick way is write up a small programme or use text tools to replace the first | into different delimiter.If you think my suggestion is useful, please rate it as helpful.
If it has helped you to resolve the problem, please Mark it as Answer.
http://twitter.com/7Kn1ghts
Free Windows Admin Tool Kit Click here and download it now
March 17th, 2011 3:14pm
I think this delimiter is repeating in data only in one or 2 columns. In SSIS there is no direct way to handle this condition.
Work Around: Check the "Delimited File Reader Source Sample" @ codeplex. check this custom component .. may this help you.
else
1> check the no of "|" pipes in your single row in DFT ...
2> use a conditional split for filtering the data with expected no of "|" pipes in your data --- valid data
3> and default (means in-valid) data to some flat file.
Then analyze your invalid data and either write a script task for handling your invalid data or use another DFT and make use of derived column for concatenation.
Let us TRY this |
My Blog :: http://quest4gen.blogspot.com/
March 17th, 2011 3:21pm
its pretty huge and I should not change the delimiter. pls give me some other idea with out replacing the delimiter.
As discussed above in flat file connection manager and preview of my source i an view the data properly but it is spllecting only in run time.
Free Windows Admin Tool Kit Click here and download it now
March 17th, 2011 3:25pm
The delimiter is | simple which is predefined by my client
Either you change the delimiter or you add a qualifier... otherwise SSIS doesn't know where the column endsPlease mark the post as answered if it answers your question | My SSIS Blog:
http://microsoft-ssis.blogspot.com
March 17th, 2011 3:27pm
its pretty huge and I should not change the delimiter. pls give me some other idea with out replacing the delimiter.
As discussed above in flat file connection manager and preview of my source i an view the data properly but it is spllecting only in run time.
You wont have any other solutions without change the delimiter, unfortunately.If you think my suggestion is useful, please rate it as helpful.
If it has helped you to resolve the problem, please Mark it as Answer.
http://twitter.com/7Kn1ghts
Free Windows Admin Tool Kit Click here and download it now
March 17th, 2011 3:31pm
Since the data are in Chinese which is unknown script for me. I am not sure how many | symbol will get mixed in the data and the data will fall in daily basis routine
and the size is also expected very huge.
March 17th, 2011 3:34pm
Since the data are in Chinese which is unknown script for me. I am not sure how many | symbol will get mixed in the data and the data will fall in daily basis routine
and the size is also expected very huge.
Then ask client to use a different delimiter say ||| or |¬| ,or something else, will be the best solution.
If you think my suggestion is useful, please rate it as helpful.
If it has helped you to resolve the problem, please Mark it as Answer.
http://twitter.com/7Kn1ghts
Free Windows Admin Tool Kit Click here and download it now
March 17th, 2011 3:39pm
dude, I need a solution with out changing the delimiterkarthikeyan
March 17th, 2011 3:53pm
its pretty huge and I should not change the delimiter. pls give me some other idea with out replacing the delimiter.
As discussed above in flat file connection manager and preview of my source i an view the data properly but it is spllecting only in run time.
The preview mode is only showing the first 100 or 200 rows... that why it's a preview. So there will be no problems in the preview mode if there aren't any extra | chars in the first 100 rows. Try to open your file in a Excel worksheet. Than you
will see the problems.
Please mark the post as answered if it answers your question | My SSIS Blog:
http://microsoft-ssis.blogspot.com
Free Windows Admin Tool Kit Click here and download it now
March 17th, 2011 3:53pm
:-) for testing sake I have only 3 rows in my flat file.karthikeyan
March 17th, 2011 3:55pm
dude, I need a solution with out changing the delimiter
karthikeyan
Sorry dude, there isn't! Your file is incorrect and will never ever work without changing something!
See the basic rules of csv files:
http://en.wikipedia.org/wiki/Comma-separated_values#Basic_rules
Solution 1: change the delimiter
Solution 2: add qualifiers to ignore the false/extra delimitors
Solution 3: change the file type to fixed width
Please mark the post as answered if it answers your question | My SSIS Blog:
http://microsoft-ssis.blogspot.com
Free Windows Admin Tool Kit Click here and download it now
March 17th, 2011 4:00pm
dude, I need a solution with out changing the delimiter
karthikeyan
have you checked my previous post??
Let us TRY this |
My Blog :: http://quest4gen.blogspot.com/
March 17th, 2011 4:15pm
best solution how i have implemented is as fell..........
1) converted the connection manager to fixed with
2) created dummy col between each col
3)mapped the | values to the Dummy col
4)leave the Dummy col with out mapping in the Destination
this is the working fine nowkarthikeyan
Free Windows Admin Tool Kit Click here and download it now
May 10th, 2011 6:37am