DataStage Interview Questions

DataStage Interview Questions

  1. Types of Parallel Processing?
  2. How to kill the job in data stage?
  3. How do you populate source files?
  4. What is iconv and icon functions?
  5. Did you work in UNIX environment?
  6. How can ETL excel file to Data mart?
  7. How to find errors in job sequence?
  8. Why do you use SQL LOADER or OCI STAGE?
  9. How do you do Usage analysis in datastage?
  10. What is data set? and what is file set?
  11. Difference between Hash file and Sequential File?
  12. Do you know about INTEGRITY/QUALITY stage?
  13. What are Static Hash files and Dynamic Hash files?
  14. How I can convert Server Jobs into Parallel Jobs?
  15. How to find the number of rows in a sequential file?
  16. What is job control? How can it used explain with steps?
  17. What are Stage Variables, Derivations and Constants?
  18. How will you call external function or subroutine from datastage?
  19. What are the different types of lookups available in DataStage?
  20. What is the purpose of exception activity in data stage 7.5?
  21. How do you track performance statistics and enhance it?
  22. What does separation option in static hash-file mean?
  23. How can we implement Lookup in DataStage Server jobs?
  24. How can we improve the performance of DataStage jobs?
  25. How to implement slowly changing dimentions in Datastage?
  26. What is the difference between Datastage and Datastage TX?
  27. What are the Repository Tables in DataStage and what are they?
  28. What are the command line functions that import and export the DS jobs?
  29. Whats difference between operational data stage (ODS) & data warehouse?
  30. How to drop the index before loading data in target and how to rebuild it in data stage?
  31. What is the difference between Symetrically parallel processing, massively parallel processing?
  32. What is data file & descriptor file in dataset stage? Plese expain briefly
  33. How to run the job in command prompt in unix?
  34. Did you Parameterize the job or hard-coded the values in the jobs?
  35. What is hashing algorithm and explain breafly how it works?
  36. Can we use shared container as lookup in datastage server jobs?
  37. What is the exact difference between Join, Merge and Lookup Stage?
  38. How do you remove duplicates without using remove duplicate stage?
  39. What are OConv () and Iconv () functions and where are they used?
  40. How we can call the routine in datastage job? Explain with steps?
  41. If data is partitioned in your job on key 1 and then you aggregate on key 2, what issues could arise?
  42. How to handle Date convertions in Datastage? Convert a mm/dd/yyyy format to yyyy-dd-mm?
  43. How we use NLS function in Datastage? What are advantages of NLS function? Where we can use that one? Explain briefly?
  44. What are the different types of lookups in datastage?
  45. What is difference between Merge stage and Join stage?
  46. If a DataStage job aborts after say 1000 records, how to continue the job from 1000th record after fixing the error?
  47. How do we do the automation of dsjobs?
  48. How to implement type2 slowly changing dimenstion in datastage? Give me with example?
  49. How can you do incremental load in datastage?
  50. How can we join one Oracle source and Sequential file?
  51. What is the transaction size and array size in OCI stage? How these can be used?
  52. What are the environment variables in datastage? Give some examples?