300+ REAL TIME INFORMATICA Questions & Answers 2023

300+ TOP INFORMATICA Interview Questions and Answers

INFORMATICA Interview Questions for freshers experienced :-

1. What is Informatica? Why do we need it?

Informatica is a software development firm which offers some data integration solution for ETL, data virtualization, master data management, data quality, data replica, ultra messaging etc.

Some of the popular Informatica products are:

Informatica PowerCenter
Informatica PowerConnect
Informatica Power Mart
Informatica Power Exchange
Informatica Power Analysis
Informatica Power Quality

We need Informatica while working with data systems that contain data to perform certain operations along with a set of rules. Informatica facilitates operations line cleaning and modifying data from structured and unstructured data systems.

2. What is the format of Informatica objects in a repository? What are the databases that Informatica can connect to Windows?

Informatica objects can be written in XML format.

Following is the list of databases that Informatica can connect to:

SQL Server
Oracle
MS Access
MS Excel
DB2
Sybase
Teradata

3. What is Informatica PowerCenter?

Informatica PowerCenter is an ETL/Data Integration Tool which is used to connect and retrieve data from different sources and data processing. PowerCenter processes a high volume of data and supports data retrieval from ERP sources such as SAP, PeopleSoft, etc.

You can connect PowerCenter to database management systems like SQL and Oracle to integrate data into the third system.

4. Which are the different editions of Informatica PowerCenter that are available?

Different editions of Informatica PowerCenter are:

Standard Edition
Advance Edition
Premium Edition

The current version of PowerCenter available is v10 with a high-performance increase.

5. How many repositories can be created in informatica?

There can be any number of repositories in informatica but eventually it depends on number of ports.

6. What is the benefit of partitioning a session?

Partitioning a session means solo implementation sequences within the session. It’s main purpose is to improve server’s operation and efficiency. Other transformations including extractions and other outputs of single partitions are carried out in parallel.

7. How are indexes created after completing the load process?

For the purpose of creating indexes after the load process, command tasks at session level can be used. Index creating scripts can be brought in line with the session’s workflow or the post session implementation sequence. Moreover this type of index creation cannot be controlled after the load process at transformation level.

8. Explain sessions. Explain how batches are used to combine executions?

A teaching set that needs to be implemented to convert data from a source to a target is called a session. Session can be carried out using the session’s manager or pmcmd command. Batch execution can be used to combine sessions executions either in serial manner or in a parallel. Batches can have different sessions carrying forward in a parallel or serial manner.

9. How many number of sessions can one group in batches?

One can group any number of sessions but it would be easier for migration if the number of sessions are lesser in a batch.

10. Explain the difference between mapping parameter and mapping variable?

When values change during the session’s execution it’s called a mapping variable. Upon completion the Informatica server stores the end value of a variable and is reused when session restarts. Moreover those values that do not change during the sessions execution are called mapping parameters. Mapping procedure explains mapping parameters and their usage. Values are allocated to these parameters before starting the session.

11.What is complex mapping?

Following are the features of complex mapping.

Difficult requirements
Many numbers of transformations
Complex business logic

12. How can one identify whether mapping is correct or not without connecting session?

One can find whether the session is correct or not without connecting the session is with the help of debugging option.

13. Can one use mapping parameter or variables created in one mapping into any other reusable transformation?

Yes, One can do because reusable transformation does not contain any mapplet or mapping.

14. Explain the use of aggregator cache file?

Aggregator transformations are handled in chunks of instructions during each run. It stores transitional values which are found in local buffer memory. Aggregators provides extra cache files for storing the transformation values if extra memory is required.

15. Briefly describe lookup transformation?

Lookup transformations are those transformations which have admission right to RDBMS based data set. The server makes the access faster by using the lookup tables to look at explicit table data or the database. Concluding data is achieved by matching the look up condition for all look up ports delivered during transformations.

16. What does role playing dimension mean?

The dimensions that are utilized for playing diversified roles while remaining in the same database domain are called role playing dimensions.

17. How can repository reports be accessed without SQL or other transformations?

Repositoryreports are established by metadata reporter. There is no need of SQL or other transformation since it is a web app.

18. What are the types of metadata that stores in repository?

The types of metadata includes Source definition, Target definition, Mappings, Mapplet, Transformations.

19. Explain the code page compatibility?

When data moves from one code page to another provided that both code pages have the same character sets then data loss cannot occur. All the characteristics of source page must be available in the target page. Moreover if all the characters of source page are not present in the target page then it would be a subset and data loss will definitely occur during transformation due the fact the two code pages are not compatible.

20. How can you validate all mappings in the repository simultaneously?

All the mappings cannot be validated simultaneously because each time only one mapping can be validated.

21. Briefly explain the Aggregator transformation?

It allows one to do aggregate calculations such as sums, averages etc. It is unlike expression transformation in which one can do calculations in groups.

22. Describe Expression transformation?

Values can be calculated in single row before writing on the target in this form of transformation. It can be used to perform non aggregate calculations. Conditional statements can also be tested before output results go to target tables.

23. What do you mean by filter transformation?

It is a medium of filtering rows in a mapping. Data needs to be transformed through filter transformation and then filter condition is applied. Filter transformation contains all ports of input/output, and the rows which meet the condition can only pass through that filter.

24. What is Joiner transformation?

Joiner transformation combines two affiliated heterogeneous sources living in different locations while a source qualifier transformation can combine data emerging from a common source.

25. What is Lookup transformation?

It is used for looking up data in a relational table through mapping. Lookup definition from any relational database is imported from a source which has tendency of connecting client and server. One can use multiple lookup transformation in a mapping.

26. How Union Transformation is used?

Ans: It is a diverse input group transformation which can be used to combine data from different sources. It works like UNION All statement in SQL that is used to combine result set of two SELECT statements.

27. What do you mean Incremental Aggregation?

Option for incremental aggregation is enabled whenever a session is created for a mapping aggregate. Power center performs incremental aggregation through the mapping and historical cache data to perform new aggregation calculations incrementally.

28. What is the difference between a connected look up and unconnected look up?

When the inputs are taken directly from other transformations in the pipeline it is called connected lookup. While unconnected lookup doesn’t take inputs directly from other transformations, but it can be used in any transformations and can be raised as a function using LKP expression. So it can be said that an unconnected lookup can be called multiple times in mapping.

29. What is a mapplet?

A recyclable object that is using mapplet designer is called a mapplet. It permits one to reuse the transformation logic in multitude mappings moreover it also contains set of transformations.

30.Briefly define reusable transformation?

Reusable transformation is used numerous times in mapping. It is different from other mappings which use the transformation since it is stored as a metadata. The transformations will be nullified in the mappings whenever any change in the reusable transformation is made.

31. What does update strategy mean, and what are the different option of it?

Row by row processing is done by informatica. Every row is inserted in the target table because it is marked as default. Update strategy is used whenever the row has to be updated or inserted based on some sequence. Moreover the condition must be specified in update strategy for the processed row to be marked as updated or inserted.

32. What is the scenario which compels informatica server to reject files?

This happens when it faces DD_Reject in update strategy transformation. Moreover it disrupts the database constraint filed in the rows was condensed.

33. What is surrogate key?

Surrogate key is a replacement for the natural prime key. It is a unique identification for each row in the table. It is very beneficial because the natural primary key can change which eventually makes update more difficult. They are always used in form of a digit or integer.

34.What are the prerequisite tasks to achieve the session partition?

In order to perform session partition one need to configure the session to partition source data and then installing the Informatica server machine in multifold CPU’s.

35. Which files are created during the session rums by informatics server?

During session runs, the files created are namely Errors log, Bad file, Workflow low and session log.

36. Briefly define a session task?

It is a chunk of instruction the guides Power center server about how and when to transfer data from sources to targets.

37,What does command task mean?

This specific task permits one or more than one shell commands in UNIX or DOS in windows to run during the workflow.

38. What is standalone command task?

This task can be used anywhere in the workflow to run the shell commands.

39. What is meant by pre and post session shell command?

Command task can be called as the pre or post session shell command for a session task. One can run it as pre session command r post session success command or post session failure command.

40.What is predefined event?

It is a file-watch event. It waits for a specific file to arrive at a specific location.

41. How can you define user defied event?

User defined event can be described as a flow of tasks in the workflow. Events can be created and then raised as need arises.

42. What is a work flow?

Ans: Work flow is a bunch of instructions that communicates server about how to implement tasks.

43. What are the different tools in workflow manager?

Following are the different tools in workflow manager namely

Task Designer
Task Developer
Workflow Designer

44. Tell me any other tools for scheduling purpose other than workflow manager pmcmd?

The tool for scheduling purpose other than workflow manager can be a third party tool like ‘CONTROL M’.

45. What is OLAP (On-Line Analytical Processing?

A method by which multi-dimensional analysis occurs.

46. What are the different types of OLAP? Give an example?

ROLAP eg.BO, MOLAP eg.Cognos, HOLAP, DOLAP

47. What do you mean by worklet?

When the workflow tasks are grouped in a set, it is called as worklet. Workflow tasks includes timer, decision, command, event wait, mail, session, link, assignment, control etc.

48. What is the use of target designer?

Target Definition is created with the help of target designer.

49. Where can we find the throughput option in informatica?

Throughput option can be found in informatica in workflow monitor. In workflow monitor, right click on session, then click on get run properties and under source/target statistics we can find throughput option.

50. What is target load order?

Ans: Target load order is specified on the basis of source qualifiers in a mapping. If there are multifold source qualifiers linked to different targets then one can entitle order in which informatica server loads data into targets.

51. What do you mean by Enterprise Data Warehousing?

When the organization data is created at a single point of access it is called as enterprise data warehousing. Data can be provided with a global view to the server via a single source store. One can do periodic analysis on that same source. It gives better results but however the time required is high.

52. What the difference is between a database, a data warehouse and a data mart?

Database includes a set of sensibly affiliated data which is normally small in size as compared to data warehouse. While in data warehouse there are assortments of all sorts of data and data is taken out only according to the customer’s needs. On the other hand datamart is also a set of data which is designed to cater the needs of different domains. For instance an organization having different chunk of data for its different departments i.e. sales, finance, marketing etc.

53. What is meant by a domain?

When all related relationships and nodes are covered by a sole organizational point, its called domain. Through this data management can be improved.

54. What is the difference between a repository server and a powerhouse?

Repository server controls the complete repository which includes tables, charts, and various procedures etc. Its main function is to assure the repository integrity and consistency. While a powerhouse server governs the implementation of various processes among the factors of server’s database repository.

55. Explain the features of the Connected and Unconnected lookup.

The features of Connected Lookup can be explained as follows:

There is a direct source of input from the pipeline for connected lookup.
It has active participation in data flow and both dynamic as well as static cache is used as the case is.
It caches all lookup columns and returns the default values as the output when the lookup condition does not match.
More than one column values can be returned to the output port.
Multiple output values are passed as well as output ports are linked to another transformation.
Connected lookup supports user-defined default values.

The features of unconnected lookup can be explained as follows:

Unconnected lookup uses static cache and its source of input is the result received from the output of LKP expression.
It caches only the lookup output ports and returns the value as NULL when the lookup condition does not match.
Only one column is returned from each port.
Only one output value is passed to another transformation.
User-defined default values are not supported by unconnected lookup.

56. During the running session, output files are created by the INFORMATICA server. Enlist a few of them.

Mentioned below are the few output files:

Cache files: These files are created at the time of memory cache creation. For circumstances like Lookup transformation, Aggregator transformation, etc index and data cache files are created by the INFORMATICA server.
Session detail file: As the name defines, this file contains load statistics like table name, rows rejected or written for each target in mapping and can be viewed in the monitor window.
Performance detail file: This file is a part of the session property sheet and contains session performance information in order to determine improvement areas.
INFORMATICA server log: The server creates a log for all status and error messages and can be seen in the INFORMATICA home directory.
Session log file: For each session, the server creates a session log file depending on the set tracing level. The information that can be seen in log files about sessions can be:
1. Session initialization process,
2. SQL commands creation for reader and writer threads,
3. List of errors encountered and
4. Load summary
Post-session email: This helps in communicating the information about the session (session completed/session failed) to the desired recipients automatically.
Reject file: This file contains information about the data that has not been used/written to targets.
Control file: In case, when the session uses the external loader, control file consists of loading instructions and data format about the target file.
Indicator file: This file basically contains a number that highlights the rows marked for INSERT/UPDATE/DELETE or REJECT.
Output file: The output file is created based on the file properties.

57. How to differentiate between the Active and Passive transformations?

To understand the difference between Active and Passive transformations, let us see its features which will explain the differences in a better way.

The action performed by Active transformations includes:

Changing the number of rows as per the requirement, that passes through the transformations. For Example, Filter transformation that deletes the row that does not meet the condition.
Changing the transaction boundary by setting the rollback and commit points. For Example, Transaction control transformation.
Changing the row type for INSERT/ UPDATE/DELETE or REJECT.

The action performed by Passive transformations includes:

The number of rows passing through the transformation is never changed.
The transaction boundary is set.
Row type is set.

58. Enlist the various Types of Transformations.

The various types of transformations are as follows:

Aggregator transformation
Expression transformation
Normalizer transformation
Rank transformation
Filter transformation
Joiner transformation
Lookup transformation
Stored procedure transformation
Sorter transformation
Update strategy transformation
XML source qualifier transformation
Router transformation
Sequence Generator transformation

59. What is Dynamic Cache?

INFORMATICA lookups can be categorized either as cached or uncached. In the case of Dynamic cache, rows can be inserted or deleted at the time of passing the rows and is synchronized with the target. The cache memory is refreshed every time after delete/update operations within the session.

60. What is decode in Static cache?

Static cache is the one that is neither updated nor refreshed in the session run. It is the default cache and returns the value only when the return condition is true. In other cases, it returns Null value. Insert or Update cache cannot be performed in this case.

61. Mention a few advantages of Router transformation over Filter transformation.

Router transformation and Filter transformation are the same because both of them use a condition to test and filter the data.

However, the advantages of Router over filter transformation can be understood by the below-mentioned points.

Router Transformation:

It allows more than one test condition.
Provide the ability to test the same input data on multiple numbers of conditions.
In the case of mapping, input data is processed only once by the server and hence performance is improved.
Less complex and more efficient.
The records that fail the test condition are never blocked instead are passed on to the default group.

62. Enlist some properties of sessions.

A session is available in the workflow manager and is configured by creating a session task. Within a mapping program, there can be multiple sessions and it can be either reusable or non-reusable.

Some of the properties of the session are as follows:

As per the requirement, session tasks can be run either concurrently or sequentially.
A session can be configured to analyze the performance.
To create or run a session task, it is required to have general information about Session name, schedule and integration service.
Other important property of session includes Session log file, the test load, error handling, commit interval, target properties, etc.

63. Enlist the tasks for which Source qualifier transformation is used.

Source qualifier is considered as an active transformation that reads the rows that are involved in integration service within the running session. It determines the way in which the data is fetched from the source and is automatically added while adding a source to mapping.

The list of different tasks where source qualifier is used is as follows:

Rows filtering
Data sorting
Custom query creation
Joining tables from the same source
Selecting distinct values

64. Mention a few Power Centre client applications with their basic purpose?

Tasks like session and workflow creation, monitoring workflow progress, designing mapplets, etc are performed by Powercentre client applications.

Enlisted below is the list of Power center client applications with their purpose:

Repository Manager: It is an administrative tool and its basic purpose is to manage repository folders, objects, groups, etc.
Administration Console: Here the service tasks like start/stop, backup/restore, upgrade/delete, etc are performed.
Power center designer: The designer consists of various designing tools that serve various purposes. These designing tools are:

Source Analyzer
Target designer
Transformation Developer
Mapplet Designer
Mapping Manager

Workflow Manager: Its basic purpose is to define a set of instructions/workflow that is required to execute mappings designed in the designer. To help develop a workflow, there are 3 tools available, namely Task developer, Workflow designer, Worklet Designer.

Workflow Monitor: As the name suggests, the Workflow monitor, monitors the workflow or tasks. The list of windows available are:

Navigator Window
Output window
Time window
Properties window
Task view
Gantt chart view

65. If You Want To Create Indexes After The Load Process Which Transformation You Choose?

Its usually not done in the mapping (transformation) level. Its done in session level. Create a command task which will execute a shell script (if Unix) or any other scripts which contains the create index command. Use this command task in the workflow after the session or else, You can create it with a post session command.

INFORMATICA Questions and Answers Pdf Download

300+ [LATEST] Informatica Interview Questions and Answers

Q1. What Are The Differences Between Informatica Power Center Versions 6.2 And 7.1, Also Between Versions 6.2 And 5.1?

The main difference between informatica 5.1 and 6.1 is that in 6.1 they introduce a new thing called repository server and in place of server manager(5.1), they introduce workflow manager and workflow monitor.

Q2. Is A Fact Table Normalized Or De-normalized?

A fact table is always DENORMALISED table. It consists of data from dimension table (Primary Key’s) and Fact table has foreign keys and measures.

Q3. What Is A Source Qualifier?

When you add a relational or a flat file source definition to a mapping, you need to connect it to a Source Qualifier trformation. The Source Qualifier represents the rows that the Informatica Server reads when it executes a session.

Q4. How Many Types Of Dimensions Are Available In Informatica?

The types of dimensions available are:

Junk dimension
Degenerative Dimension
Conformed Dimension

Q5. At The Max How Many Trformations Can Be Us In A Mapping?

In a mapping we can use any number of trformations depending on the project, and the included trformations in the particular related trformations.

Q6. What Is The Difference Between Filter And Lookup Trformation?

Filter trformation is an Active trformation and Lookup is a Passive trformation.
Filter trformation is used to Filter rows based on condition and Lookup is used to look up data in a flat file or a relational table, view, or synonym.

Q7. How Can We Use pmcmd Command In A Workflow Or To Run A Session

By using command in the command task there is a option pression. we can write appropriate command of pmcmd to run workflow.

Q8. What Is Difference Between Informatica 7.1 And Abinitio?

There is a lot of difference between informatica and AbInitio:

In AbInitio we are using 3 parallelism but Informatica using 1 parallelism.
In Ab Initio no scheduling option we can schedule manually or pl/sql script but informatica contains 4 scheduling options.
In Ab Inition contains co-operating system but informatica is not.
Ramp time is very quickly in Ab Initio compare than Informatica.
Ab Initio is user friendly than Informatica.

Q9. When We Create A Target As Flat File And Source As Oracle. How Can I Specify First Rows As Column Names In Flat Files?

Use a pre sql statement., but this is a hard coding method. If you change the column names or put in extra columns in the flat file, you will have to change the insert statement.

You can also achieve this by changing the setting in the Informatica Repository manager to display the columns heading. The only disadvantage of this is that it will be applied on all the files that will be generated by this server.

Q10. How To Load Time Dimension?

We can use SCD Type 1/2/3 to load any Dimensions based on the requirement. We can also use procedure to populate Time Dimension

Q11. What Is Difference Between Maplet And Reusable Trformation?

Maplet :

one or more trformations.
set of trformations that are reusable.

Reusable trformation :

only one trformation
Single trformation which is reusable.

Q12. How To Import Oracle Sequence Into Informatica?

Create one procedure and declare the sequence inside the procedure, finally call the procedure in informatica with the help of stored procedure trformation.

Q13. How To Read Rejected Data Or Bad Data From Bad File And Reload It To Target?

Correction the rejected data and send to target relational tables using load order utility. Find out the rejected data by using column indicator and row indicator.

Q14. Compare Data Warehousing Top-down Approach With Bottom-up Approach.

Top down
ODS–>ETL–>Datawarehouse–>Datamart–>OLAP

Bottom up
ODS–>ETL–>Datamart–>Datawarehouse–>OLAP

Q15. What Is The Default Source Option For Update Strategy Trformation?

Default option for update strategy trformation is dd_insert or we can put ‘0’ in session level data driven.

Q16. Why We Use Stored Procedure Trformation?

A Stored Procedure trformation is an important tool for populating and maintaining databases. Database administrators create stored procedures to automate time-consuming tasks that are too complicated for standard SQL statements.

Q17. What Are Main Advantages And Purpose Of Using Normalizer Trformation In Informatica?

Narmalizer Trformation is used mainly with COBOL sources where most of the time data is stored in de-normalized format. Also, Normalizer trformation can be used to create multiple rows from a single row of data.

Normalizer Trformation read the data from COBOL Sources.
It support Horizontal Pivot .It is a processing of single input into a multiple output

Q18. Which All Databases Powercenter Server On Windows Can Connect To?

PowerCenter Server on Windows can connect to following databases:

IBM DB2
Informix
Microsoft Access
Microsoft Excel
Microsoft SQL Server
Oracle
Sybase
Teradata

Q19. Which All Databases Powercenter Server On Unix Can Connect To?

PowerCenter Server on UNIX can connect to following databases:

IBM DB2
Informix
Oracle
Sybase
Teradata

Q20. Define Informatica Repository?

The informatica repository is at the center of the informatica suite. You create a set of metadata tables within the repository database that the informatica application and tools access. The informatica client and server access the repository to save and retrieve metadata.

Q21. Which Is Better Among Connected Lookup And Unconnected Lookup Trformations In Informatica Or Any Other Etl Tool?

If you are having defined source you can use connected, source is not well defined or from different database you can go for unconnected.

Connected and unconnected lookup depends on scenarios and performance If you are looking for a single value for look up and the value is like 1 in 1000 then you should go for unconnected lookup. Performance wise its better as we are not frequently using the trformation. If multiple columns are returned as lookup value then one should go for connected lookup.

Q22. What Is The Difference Between Informatics 7x And 8x And What Is Latest Version?

Java Trformation available in the 8x version and it is not available in 7x version.

Q23. Differences Between Normalizer And Normalizer Trformation.

Normalizer : It is a trformation mainly using for cobol sources. It change the rows into columns and columns into rows.

Normalization : To remove the redundancy and inconsistency.

Normalizer Trformation : can be used to obtain multiple columns from a single row.

Q24. What Type Of Repositories Can Be Created Using Informatica Repository Manager?

Informatica PowerCenter includeds following type of repositories :

Standalone Repository : A repository that functions individually and this is unrelated to any other repositories.

Global Repository : This is a centralized repository in a domain. This repository can contain shared objects across the repositories in a domain. The objects are shared through global shortcuts.

Local Repository : Local repository is within a domain and it’s not a global repository. Local repository can connect to a global repository using global shortcuts and can use objects in it’s shared folders.

Versioned Repository : This can either be local or global repository but it allows version control for the repository. A versioned repository can store multiple copies, or versions of an object. This features allows to efficiently develop, test and deploy metadata in the production environment.

Q25. How To Get The First 100 Rows From The Flat File Into The Target?

create one procedure and declare the sequence inside the procedure, finally call the procedure in informatica with the help of stored procedure trformation.

Q26. What Is Difference Between Partitioning Of Relational Target And Partitioning Of File Targets?

Partition’s can be done on both relational and flat files.
Informatica supports following partitions

Database partitioning
RoundRobin
Pass-through
Hash-Key partitioning
Key Range partitioning

All these are applicable for relational targets. For flat file only database partitioning is not applicable.

Informatica supports Navy partitioning. you can just specify the name of the target file and create the partitions, rest will be taken care by informatica session.

Q27. Discuss The Advantages & Disadvantages Of Star & Snowflake Schema?

In a STAR schema there is no relation between any two dimension tables, whereas in a SNOWFLAKE schema there is a possible relation between the dimension tables.

In star schema there is no relationship between two relational tables. All dimensions are de-normalized and query performance is degrades. In this snow flake schema dimensions are normalized. In this SF schema table space is increased. Maintenance cost is high. Query performance is increased.

Q28. How Do You Handle Decimal Places While Importing A Flatfile Into Informatica?

while importing flat file definition just specify the scale for a numeric data type in the mapping, the flat file source supports only number datatype (no decimal and integer). In the SQ associated with that source will have a data type as decimal for that number port of the source.

source ->number datatype port ->SQ -> decimal datatype. Integer is not supported. hence decimal is taken care.

Import the field as string and then use expression to convert it, so that we can avoid truncation if decimal places in source itself.

Q29. How To Join Two Tables Without Using The Joiner Trformation?

Its possible to join the two or more tables by using source qualifier. But provided the tables should have relationship.

When you drag and drop the tables you will getting the source qualifier for each table. Delete all the source qualifiers. Add a common source qualifier for all. Right click on the source qualifier you will find EDIT click on it. Click on the properties tab, you will find sql query in that you can write your sqls.

You can also do it using Session — mapping—source there you have an option called User Defined Join there you can write your SQL.

Q30. What Are The Methods For Creating Reusable Trformations?

There two methods for creating reusable trformations:

Using trformation developer tool.
Converting a non reusable trformation into a reusable trformation in mapping.

Q31. Difference Between Summary Filter And Details Filter?

Summary Filter – we can apply records group by that contain common values.

Detail Filter – we can apply to each and every record in a database.

Q32. What Can We Do To Improve The Performance Of Informatica Aggregator Trformation?

Aggregator performance improves dramatically if records are sorted before passing to the aggregator and “sorted input” option under aggregator properties is checked. The record set should be sorted on those columns that are used in Group By operation.

It is often a good idea to sort the record set in database level e.g. inside a source qualifier trformation, unless there is a chance that already sorted records from source qualifier can again become unsorted before reaching aggregator.

Q33. Doubts Regarding Rank Trformation: Can We Do Ranking Using Two Ports? Can We Rank All The Rows Coming From Source, How?

When ETL load the data from source we can declare the rank of the incoming data to pass a rank trformation. We can’t declare two rank on a single source data. We can do rank the row by declaring the rank Trformation and declaring the rank port.

Q34. What Is The Router Trformation?

A Router trformation is similar to a Filter trformation because both trformations allow you to use a condition to test data. However, a Filter trformation tests data for one condition and drops the rows of data that do not meet the condition. A Router trformation tests data for one or more conditions and gives you the option to route rows of data that do not meet any of the conditions to a default output group.

If you need to test the same input data based on multiple conditions, use a Router Trformation in a mapping instead of creating multiple Filter trformations to perform the same task.

Q35. What Is A Code Page?

A code page contains encoding to specify characters in a set of one or more languages. The code page is selected based on source of the data. For example if source contains Japanese text then the code page should be selected to support Japanese text.

When a code page is chosen, the program or application for which the code page is set, refers to a specific set of data that describes the characters the application recognizes. This influences the way that application stores, receives, and sends character data.

Q36. How Can You Access The Remote Source Into Your Session?

Relational source : To access relational source which is situated in a remote place , you need to configure database connection to the datasource.

FileSource : To access the remote source file you must configure the FTP connection to the host machine before you create the session.

Heterogeneous : When you are mapping contains more than one source type, the server manager creates a heterogeneous session that displays source options for all types.

Q37. How Do I Import Vsam Files From Source To Target. Do I Need A Special Plugin

As far my knowledge by using power exchange tool convert VSAM file to oracle tables then do mapping as usual to the target table.

Q38. How Can You Complete Unrecoverable Sessions?

Under certain circumstances, when a session does not complete, you need to truncate the target tables and run the session from the beginning. Run the session from the beginning when the Informatica Server cannot run recovery or when running recovery might result in inconsistent data.

If there is no recovery mode on in session and workflow failed in mid of execution then

Don’t truncate table immediately.
If there is large volume of data is performing by the load and more than 25% data has loaded then-if same workflow has multiple session then check particular session which caused to be failed and fire the delete command only to delete particular session data which has loaded and copy the session into new workflow and run only that session or dependent others.

Q39. How Can We Partition A Session In Informatica?

The Informatica PowerCenter Partitioning option optimizes parallel processing on multiprocessor hardware by providing a thread-based architecture and built-in data partitioning.

GUI-based tools reduce the development effort necessary to create data partitions and streamline ongoing troubleshooting and performance tuning tasks, while ensuring data integrity throughout the execution process. As the amount of data within an organization expands and real-time demand for information grows, the PowerCenter Partitioning option enables hardware and applications to provide outstanding performance and jointly scale to handle large volumes of data and users.

Q40. Explain Use Of Update Strategy Trformation?

To flag source records as INSERT, DELETE, UPDATE or REJECT for target database. Default flag is Insert. This is must for Incremental Data Loading.

This is the important trformation,is used to maintain the history data or just most recent changes into the target table.

We can set or flag the records by using these two levels.

Within a session :
When you configure the session,you can instruct the informatica server to either treat all the records in the same way.
Within a mapping :
within a mapping we use update strategy trformation to flag the records like insert,update,delete or reject.

Q41. How Do We Estimate The Number Of Partitions That A Mapping Really Requires? Is It Dependent On The Machine Configuration?

It depends upon the informatica version we are using. suppose if we are using informatica 6 it supports only 32 partitions where as informatica 7 supports 64 partitions.

Q42. What Is The Use Of Incremental Aggregation? Explain Me In Brief With An Example.

Its a session option, when the informatica server performs incremental aggr. it passes new source data through the mapping and uses historical cache data to perform new aggregation calculations incrementally for performance we will use it.

When using incremental aggregation, you apply captured changes in the source to aggregate calculations in a session. If the source changes incrementally and you can capture changes, you can configure the session to process those changes. This allows the Integration Service to update the target incrementally, rather than forcing it to process the entire source and recalculate the same data each time you run the session.

Q43. If You Want To Create Indexes After The Load Process Which Trformation You Choose?

Its usually not done in the mapping (trformation) level. Its done in session level. Create a command task which will execute a shell script (if Unix) or any other scripts which contains the create index command. Use this command task in the workflow after the session or else, You can create it with a post session command.

Q44. Explain About Informatica Server Architecture?

Informatica server, load managers, data trfer manager, reader, temp server and writer are the components of informatica server. first load manager sends a request to the reader if the reader is ready to read the data from source and dump into the temp server and data trfer manager manages the load and it send the request to writer as per first in first out process and writer takes the data from temp server and loads it into the target.

Q45. How The Informatica Server Increases The Session Performance Through Partitioning The Source?

For a relational sources informatica server creates multiple connections for each partition of a single source and extracts separate range of data for each connection.

Informatica server reads multiple partitions of a single source concurrently. Similarly for loading also informatica server creates multiple connections to the target and loads partitions of data concurrently.

For XML and file sources, informatica server reads multiple files concurrently. For loading the data informatica server creates a separate file for each partition (of a source file). You can choose to merge the targets.

Q46. How Do You Configure Mapping In Informatica?

Import src from database.
Check if target table already exists in database.
If it exists make sure u delete data from it and import into designer.
Or else create it with create target wizard.
Now you can drag needed trformations into the workspace.
Use them according to your purpose.

For improved performance follow these tips:-

Use sorter before aggregator
If filter is used keep it as near to the source as possible.
If possible use an extra expression tr before target to make corrections in future.
Enable sorted input option if sorter is used before agg tr.
If more filters are needed use router tr.
you can use source filter option of SQ if filter tr is immediately after source.
In case of router if not needed do not connect default group to any target.

Q47. What Is The Default Join That Source Qualifier Provides?

Inner equi join.

Q48. Can You Start A Batches With In A Batch?

You cannot. If you want to start batch that resides in a batch, create a new independent batch and copy the necessary sessions into the new batch.

Q49. What Are The Joiner Caches?

Specifies the directory used to cache master records and the index to these records. By default, the cached files are created in a directory specified by the server variable $PMCacheDir. If you override the directory, make sure the directory exists and contains enough disk space for the cache files. The directory can be a mapped or mounted drive. There are 2-types of cache in the joiner:

Data cache
Index Cache

Q50. What Is Hash Table Informatica?

In hash partitioning, the Informatica Server uses a hash function to group rows of data among partitions. The Informatica Server groups the data based on a partition key.Use hash partitioning when you want the Informatica Server to distribute rows to the partitions by group. For example, you need to sort items by item ID, but you do not know how many items have a particular ID number.