<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Informatica Issue</title>
	<atom:link href="http://kirtandesai.com/write/index.php/2007/12/11/informatica-issue-2/feed/" rel="self" type="application/rss+xml" />
	<link>http://kirtandesai.com/write/2007/12/11/informatica-issue-2/</link>
	<description></description>
	<lastBuildDate>Mon, 29 Mar 2010 07:14:52 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Shivam</title>
		<link>http://kirtandesai.com/write/2007/12/11/informatica-issue-2/comment-page-1/#comment-31257</link>
		<dc:creator>Shivam</dc:creator>
		<pubDate>Mon, 29 Mar 2010 07:14:52 +0000</pubDate>
		<guid isPermaLink="false">http://kirtandesai.com/write/2007/12/11/informatica-issue-2/#comment-31257</guid>
		<description>Hi Thanks,
I agree your words but what the are the other possibilities ,so that change control process
is also a factor keeping in mind

Thanks for your paitence in guoding me ,I appreciate it

Shivam</description>
		<content:encoded><![CDATA[<p>Hi Thanks,<br />
I agree your words but what the are the other possibilities ,so that change control process<br />
is also a factor keeping in mind</p>
<p>Thanks for your paitence in guoding me ,I appreciate it</p>
<p>Shivam</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Administrator</title>
		<link>http://kirtandesai.com/write/2007/12/11/informatica-issue-2/comment-page-1/#comment-31227</link>
		<dc:creator>Administrator</dc:creator>
		<pubDate>Mon, 29 Mar 2010 03:06:59 +0000</pubDate>
		<guid isPermaLink="false">http://kirtandesai.com/write/2007/12/11/informatica-issue-2/#comment-31227</guid>
		<description>seems fine. try it out. it should work. I personally don&#039;t like the idea of such automation. I am all for automation but change control processes are also important as a product/project matures.</description>
		<content:encoded><![CDATA[<p>seems fine. try it out. it should work. I personally don&#8217;t like the idea of such automation. I am all for automation but change control processes are also important as a product/project matures.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Shivam</title>
		<link>http://kirtandesai.com/write/2007/12/11/informatica-issue-2/comment-page-1/#comment-31224</link>
		<dc:creator>Shivam</dc:creator>
		<pubDate>Mon, 29 Mar 2010 02:53:29 +0000</pubDate>
		<guid isPermaLink="false">http://kirtandesai.com/write/2007/12/11/informatica-issue-2/#comment-31224</guid>
		<description>Hi
I may be wrong on dynamic part ,just elimating columns or entities
May i know the steps to map or proceess for the desired output</description>
		<content:encoded><![CDATA[<p>Hi<br />
I may be wrong on dynamic part ,just elimating columns or entities<br />
May i know the steps to map or proceess for the desired output</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Administrator</title>
		<link>http://kirtandesai.com/write/2007/12/11/informatica-issue-2/comment-page-1/#comment-31215</link>
		<dc:creator>Administrator</dc:creator>
		<pubDate>Mon, 29 Mar 2010 01:54:54 +0000</pubDate>
		<guid isPermaLink="false">http://kirtandesai.com/write/2007/12/11/informatica-issue-2/#comment-31215</guid>
		<description>@Shivam
How often are you expecting these changes? And why do you have to make this dynamic? What&#039;s going to happen to all the stored procs, reports, feeds that are written based on the structure? are you going to make all of that dynamic too?</description>
		<content:encoded><![CDATA[<p>@Shivam<br />
How often are you expecting these changes? And why do you have to make this dynamic? What&#8217;s going to happen to all the stored procs, reports, feeds that are written based on the structure? are you going to make all of that dynamic too?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Administrator</title>
		<link>http://kirtandesai.com/write/2007/12/11/informatica-issue-2/comment-page-1/#comment-31176</link>
		<dc:creator>Administrator</dc:creator>
		<pubDate>Sun, 28 Mar 2010 15:23:29 +0000</pubDate>
		<guid isPermaLink="false">http://kirtandesai.com/write/2007/12/11/informatica-issue-2/#comment-31176</guid>
		<description>@Nitin
I would need more details about your mapping. but a few things you can check for are: table indexes, table locks, connections etc.
you can search for &quot;performance tuning&quot;  on this site and see if it gives you any ideas. 
let me know if you have more questions.</description>
		<content:encoded><![CDATA[<p>@Nitin<br />
I would need more details about your mapping. but a few things you can check for are: table indexes, table locks, connections etc.<br />
you can search for &#8220;performance tuning&#8221;  on this site and see if it gives you any ideas.<br />
let me know if you have more questions.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Shivam</title>
		<link>http://kirtandesai.com/write/2007/12/11/informatica-issue-2/comment-page-1/#comment-31172</link>
		<dc:creator>Shivam</dc:creator>
		<pubDate>Sun, 28 Mar 2010 05:12:59 +0000</pubDate>
		<guid isPermaLink="false">http://kirtandesai.com/write/2007/12/11/informatica-issue-2/#comment-31172</guid>
		<description>Hi kirtan this what they are expecting;--



One point to be noted that we are not looking for “runtime” manipulation of metadata for target schema or the process flow.

 

Based on customer specific requirements for target schema variation (elimination of columns or entities), we should be able to generate from scratch or tweak master metadata for target schema map or process flow to generate desired output.   This is still considered as “design” time, just that on-site person is not using GUI tools to design the process flow rather our infrastructure will generate it.

 

Based on your evaluation, does Talend and Pentaho support this?

 

I will need to answer this from Imformatica perspective.

 _______________________________________________________________________________________________________________________________________________________

Our Analysis:------------------



steps:---

1. Create table control_load
2. Add 3 columns like tab_name,column_name,flag
3.Develop map to generate the parameter file where flag = &#039;Y&#039; with parameter
 as INSERT statement( for those columns or table you dont want make the flag as &#039;N&#039;) 
4. Develop the actual map, after source add SQL transformation and call the parameter 
for Inserting to target tableflow make dymamic for SQL transformation
5.In work

two mappings one workflow,  call sessions in sequential mode
it covers both scenarios
based on flag you can do that
it all depends on how you&#039;ll set the flag


Kirtan, make changes to the steps if i m wrong
Shivam</description>
		<content:encoded><![CDATA[<p>Hi kirtan this what they are expecting;&#8211;</p>
<p>One point to be noted that we are not looking for “runtime” manipulation of metadata for target schema or the process flow.</p>
<p>Based on customer specific requirements for target schema variation (elimination of columns or entities), we should be able to generate from scratch or tweak master metadata for target schema map or process flow to generate desired output.   This is still considered as “design” time, just that on-site person is not using GUI tools to design the process flow rather our infrastructure will generate it.</p>
<p>Based on your evaluation, does Talend and Pentaho support this?</p>
<p>I will need to answer this from Imformatica perspective.</p>
<p> _______________________________________________________________________________________________________________________________________________________</p>
<p>Our Analysis:&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;</p>
<p>steps:&#8212;</p>
<p>1. Create table control_load<br />
2. Add 3 columns like tab_name,column_name,flag<br />
3.Develop map to generate the parameter file where flag = &#8216;Y&#8217; with parameter<br />
 as INSERT statement( for those columns or table you dont want make the flag as &#8216;N&#8217;)<br />
4. Develop the actual map, after source add SQL transformation and call the parameter<br />
for Inserting to target tableflow make dymamic for SQL transformation<br />
5.In work</p>
<p>two mappings one workflow,  call sessions in sequential mode<br />
it covers both scenarios<br />
based on flag you can do that<br />
it all depends on how you&#8217;ll set the flag</p>
<p>Kirtan, make changes to the steps if i m wrong<br />
Shivam</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nitin</title>
		<link>http://kirtandesai.com/write/2007/12/11/informatica-issue-2/comment-page-1/#comment-31156</link>
		<dc:creator>Nitin</dc:creator>
		<pubDate>Fri, 26 Mar 2010 16:49:10 +0000</pubDate>
		<guid isPermaLink="false">http://kirtandesai.com/write/2007/12/11/informatica-issue-2/#comment-31156</guid>
		<description>Hi,

I am running a workflow, in this workgflow there are 2 session. one session is for loading data from file into staging table and 2nd session for loading data from staging into fact table. 
first session is running fine , but when i run the 2nd session which has 65k records to load, it keep running for 1-1.5 hrs withour any error message, only message i found in logs is &quot;CONNECTING to DATABASE&quot; i checked every level in db i.e. checked the temp tablespace it has enough space.. and there&#039;s no lock on the target table. 
Need your support on this issue.

thanks in advance,
Nitin</description>
		<content:encoded><![CDATA[<p>Hi,</p>
<p>I am running a workflow, in this workgflow there are 2 session. one session is for loading data from file into staging table and 2nd session for loading data from staging into fact table.<br />
first session is running fine , but when i run the 2nd session which has 65k records to load, it keep running for 1-1.5 hrs withour any error message, only message i found in logs is &#8220;CONNECTING to DATABASE&#8221; i checked every level in db i.e. checked the temp tablespace it has enough space.. and there&#8217;s no lock on the target table.<br />
Need your support on this issue.</p>
<p>thanks in advance,<br />
Nitin</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sangram Takmoge</title>
		<link>http://kirtandesai.com/write/2007/12/11/informatica-issue-2/comment-page-1/#comment-18584</link>
		<dc:creator>Sangram Takmoge</dc:creator>
		<pubDate>Sun, 06 Dec 2009 09:33:39 +0000</pubDate>
		<guid isPermaLink="false">http://kirtandesai.com/write/2007/12/11/informatica-issue-2/#comment-18584</guid>
		<description>Hi Mahendar, Can you please let me know how would the &#039;rowval&#039; variable get updated?</description>
		<content:encoded><![CDATA[<p>Hi Mahendar, Can you please let me know how would the &#8216;rowval&#8217; variable get updated?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mahendar</title>
		<link>http://kirtandesai.com/write/2007/12/11/informatica-issue-2/comment-page-1/#comment-14510</link>
		<dc:creator>Mahendar</dc:creator>
		<pubDate>Mon, 05 Oct 2009 06:45:08 +0000</pubDate>
		<guid isPermaLink="false">http://kirtandesai.com/write/2007/12/11/informatica-issue-2/#comment-14510</guid>
		<description>Hello,
There are couple of steps to load the data.
First, create a variable name &quot;rowval&quot; and assign 0 to that initially. 
Step 1: configure the source qualifier(default)
step 2: sort the the record by some unique identifier if it have.
step 3: create sequence generator transformation take the output to expression and similarly output of sorting transformation to the expression transformation. 
step 4: configure filter transformation with condition sequence_col &gt; rowval and sequence_col &lt;= (rowval+10)
step 5: connect the output from the filter to the target</description>
		<content:encoded><![CDATA[<p>Hello,<br />
There are couple of steps to load the data.<br />
First, create a variable name &#8220;rowval&#8221; and assign 0 to that initially.<br />
Step 1: configure the source qualifier(default)<br />
step 2: sort the the record by some unique identifier if it have.<br />
step 3: create sequence generator transformation take the output to expression and similarly output of sorting transformation to the expression transformation.<br />
step 4: configure filter transformation with condition sequence_col &gt; rowval and sequence_col &lt;= (rowval+10)<br />
step 5: connect the output from the filter to the target</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Administrator</title>
		<link>http://kirtandesai.com/write/2007/12/11/informatica-issue-2/comment-page-1/#comment-8464</link>
		<dc:creator>Administrator</dc:creator>
		<pubDate>Wed, 05 Aug 2009 22:54:37 +0000</pubDate>
		<guid isPermaLink="false">http://kirtandesai.com/write/2007/12/11/informatica-issue-2/#comment-8464</guid>
		<description>i am assuming that you have an id on each row that uniquely identifies each row. [call it ID ]
sort the rows by ID.

in your mapping create a lookup to the table that you are going to load with the contents of this file. 

in a filter transformation, discard the rows that are already in the DB/table. 

AFTER that rank rest of the rows. let all the rows with rank &lt;=10 pass through. and discard rest. OR discard where rank &gt; 10.

-------------
one of the biggest drawback of informatica in my opinion is that it does not and can not allow looping construct...

now the solution above would work fine if you are dealing with 100 rows (as you mentioned). however , if you are dealing with 200 million rows it can be a nightmare.

i would rather do something like the following if my source file was 200m and if i had a choice.

say your source file looks like:

ID    NAME
1     A
2     B
3     C
4     D
5     E
...
...
26    Z

I would load it into a database table [ in staging area/schema/space sort of] called Table T.

So table T will look just like the file.

now, from table T to your main table where you really want to load this data, you can either loop through the data or run any type of SQL on it by providing range etc... your options are limitless from this point onwards...and the time/money/resource saved by not doing it using pure informatica code can be used for something else.

in ETL, remember this... 
if you can do something using SQL and nothing else, use only SQL and nothing else
if you cannot do it in SQL , use PL/SQL or T-SQL or similar.
If you cannot do it in PL/SQL or T-SQL or similar, use Informatica
If you cannot do it in Informatica, use .NET/Java/Perl etc
If you cannot do it in .NET/Java/Perl etc, stop coding . go back to your analysis and find the biggest mistake/misunderstanding of your life.</description>
		<content:encoded><![CDATA[<p>i am assuming that you have an id on each row that uniquely identifies each row. [call it ID ]<br />
sort the rows by ID.</p>
<p>in your mapping create a lookup to the table that you are going to load with the contents of this file. </p>
<p>in a filter transformation, discard the rows that are already in the DB/table. </p>
<p>AFTER that rank rest of the rows. let all the rows with rank < =10 pass through. and discard rest. OR discard where rank > 10.</p>
<p>&#8212;&#8212;&#8212;&#8212;-<br />
one of the biggest drawback of informatica in my opinion is that it does not and can not allow looping construct&#8230;</p>
<p>now the solution above would work fine if you are dealing with 100 rows (as you mentioned). however , if you are dealing with 200 million rows it can be a nightmare.</p>
<p>i would rather do something like the following if my source file was 200m and if i had a choice.</p>
<p>say your source file looks like:</p>
<p>ID    NAME<br />
1     A<br />
2     B<br />
3     C<br />
4     D<br />
5     E<br />
&#8230;<br />
&#8230;<br />
26    Z</p>
<p>I would load it into a database table [ in staging area/schema/space sort of] called Table T.</p>
<p>So table T will look just like the file.</p>
<p>now, from table T to your main table where you really want to load this data, you can either loop through the data or run any type of SQL on it by providing range etc&#8230; your options are limitless from this point onwards&#8230;and the time/money/resource saved by not doing it using pure informatica code can be used for something else.</p>
<p>in ETL, remember this&#8230;<br />
if you can do something using SQL and nothing else, use only SQL and nothing else<br />
if you cannot do it in SQL , use PL/SQL or T-SQL or similar.<br />
If you cannot do it in PL/SQL or T-SQL or similar, use Informatica<br />
If you cannot do it in Informatica, use .NET/Java/Perl etc<br />
If you cannot do it in .NET/Java/Perl etc, stop coding . go back to your analysis and find the biggest mistake/misunderstanding of your life.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
