Teradata Interview Preparation Guide
Prepare comprehensively for your Teradata interview with our extensive list of 20 questions. These questions are specifically selected to challenge and enhance your knowledge in Teradata. Perfect for all proficiency levels, they are key to your interview success. Get the free PDF download to access all 20 questions and excel in your Teradata interview. This comprehensive guide is essential for effective study and confidence building.20 Teradata Questions and Answers:
1 :: How teradata makes sure that there are no duplicate rows being inserted when its a SET table?
Teradata will redirect the new inserted row as per its PI to the target AMP (on the basis of its row hash value), and if it find same row hash value in that
AMP (hash synonyms) then it start comparing the whole row, and find out if duplicate.
If its a duplicate it silently skips it without throwing any error.
AMP (hash synonyms) then it start comparing the whole row, and find out if duplicate.
If its a duplicate it silently skips it without throwing any error.
2 :: What is the difference between Global temporary tables and Volatile temporary tables?
Global Temporary tables (GTT) -
1. When they are created, its definition goes into Data Dictionary.
2. When materialized data goes in temp space.
3. thats why, data is active upto the session ends, and definition will remain there upto its not dropped using Drop table statement.
If dropped from some other session then its should be Drop table all;
4. you can collect stats on GTT.
Volatile Temporary tables (VTT) -
1. Table Definition is stored in System cache
2. Data is stored in spool space.
3. thats why, data and table definition both are active only upto session ends.
4. No collect stats for VTT.
1. When they are created, its definition goes into Data Dictionary.
2. When materialized data goes in temp space.
3. thats why, data is active upto the session ends, and definition will remain there upto its not dropped using Drop table statement.
If dropped from some other session then its should be Drop table all;
4. you can collect stats on GTT.
Volatile Temporary tables (VTT) -
1. Table Definition is stored in System cache
2. Data is stored in spool space.
3. thats why, data and table definition both are active only upto session ends.
4. No collect stats for VTT.
3 :: What is an optimization and performance tuning and how does it really work in practical projects?
Performance tuning and optimization of a query involves collecting statistics on join columns, avoiding cross product join, selection of appropriate primary
index (to avoid skewness in storage) and using secondary index.
Avoiding NUSI is advisable.
index (to avoid skewness in storage) and using secondary index.
Avoiding NUSI is advisable.
4 :: What is explain and how does it work?
The EXPLAIN facility is a teradata extension that provides you with an "ENGLISH" translation of the steps choosen by the optimizer to execute an SQL
statement.It may be used oin any valid teradata database with a preface called "EXPLAIN".
The following is an example:-
EXPLAIN select last_name,first_name FROM employees;
The EXPLAIN parses the SQL statement but does not execute it.
This provides the designer with an "execution stratergy".
The execution stratergy provides what an optimizer does but not why it choses them.
The EXPLAIN facility is used to analyze all joins and complex queries.
statement.It may be used oin any valid teradata database with a preface called "EXPLAIN".
The following is an example:-
EXPLAIN select last_name,first_name FROM employees;
The EXPLAIN parses the SQL statement but does not execute it.
This provides the designer with an "execution stratergy".
The execution stratergy provides what an optimizer does but not why it choses them.
The EXPLAIN facility is used to analyze all joins and complex queries.
5 :: What are the enhanced features in Teradata V2R5 and V2R6?
V2R6 included the feature of replica in it.in which copy of data base are available on another system.meam V2R6 provide the additional data protaction as
comprison to V2R5 while if data from one system has been vanishes.
After creating tables dynamically in the Teardata, where is the GRANT table option usually done ? When tables are newly created, what is the default role and
comprison to V2R5 while if data from one system has been vanishes.
After creating tables dynamically in the Teardata, where is the GRANT table option usually done ? When tables are newly created, what is the default role and
6 :: What the default privileges which get assigned ?
The GRANT option for any particular table depends on the privilages of the user. If it is an admin user you can grant privilages at any point of time.
The deafult roles associated with the newly created tables depend on he schema in which they are created.
The deafult roles associated with the newly created tables depend on he schema in which they are created.
7 :: What is cliques? What is Vdisk and how it will communicate with physical data storage at the time of data retrieval through AMP?
A clique is a set of Teradata nodes that share a common set of disk arrays. Cabling a subset of nodes to the same disk arrays creates a clique.
Each AMP vproc must have access to an array controller, which in turn accesses the physical disks. AMP vprocs are associated with one or more ranks
(or mirrored pairs) of data. The total disk space associated with an AMP is called a vdisk. A vdisk may have up to three ranks. Hence Vdisk will communicate
with physical storage through array controllers.
Each AMP vproc must have access to an array controller, which in turn accesses the physical disks. AMP vprocs are associated with one or more ranks
(or mirrored pairs) of data. The total disk space associated with an AMP is called a vdisk. A vdisk may have up to three ranks. Hence Vdisk will communicate
with physical storage through array controllers.
8 :: What is basic teradata query language?
BTEQ(Basic teradata query)
It allows us to write SQL statements along with BTEQ commands. We can use BTEQ for importing,exporting and reporting purposes.
The commands start with a (.) dot and can be terminated by using (;), it is not mandatory to use (;). SQL statements doesnt start with a dot , but (;) is
compulsary to terminate the SQL statement.
BTEQ will assume any thing written with out a dot as a sql statement and requires a (;) to terminate it.
It allows us to write SQL statements along with BTEQ commands. We can use BTEQ for importing,exporting and reporting purposes.
The commands start with a (.) dot and can be terminated by using (;), it is not mandatory to use (;). SQL statements doesnt start with a dot , but (;) is
compulsary to terminate the SQL statement.
BTEQ will assume any thing written with out a dot as a sql statement and requires a (;) to terminate it.
9 :: How many codd s rules are satisfied by teradata database?
There are 12 codd's rules applied to the teradata database
10 :: What is the difference between Multiload and Fastload interms of Performance?
If you want to load, empty table then you use the fastload, so it will very usefull than the mutiload..because fastload performs the loading of the data in
2phase..and it noneed a work table for loading the data.., so it is faster as well as it follows the below steps to load the data in the table
Phase1-It moves all the records to all the AMP first without any hashing
Phase2-After giving endloading command, Amp will hashes the record and send it to the appropriate AMPS .
Multiload:
It does the loading in the 5 phases
Phase1:It will get the import file and checks the script
Phase2:It reads the record from the base table and store in the work table
Phase3:In this Application phase it locks the table header
Phase4:In the DML opreation will done in the tables
Phase 5: In this table locks will be released and work tables will be dropped.
2phase..and it noneed a work table for loading the data.., so it is faster as well as it follows the below steps to load the data in the table
Phase1-It moves all the records to all the AMP first without any hashing
Phase2-After giving endloading command, Amp will hashes the record and send it to the appropriate AMPS .
Multiload:
It does the loading in the 5 phases
Phase1:It will get the import file and checks the script
Phase2:It reads the record from the base table and store in the work table
Phase3:In this Application phase it locks the table header
Phase4:In the DML opreation will done in the tables
Phase 5: In this table locks will be released and work tables will be dropped.