CSV Files in Data Stage

Ratings:
(4)
Views:0
Banner-Img
  • Share this blog:

CSV format :- (comma Separated Values)

TO read the xls file in Sequential file, convert xls ta. CSV  format

How to convert .xls to .csv format :-

  • Go to excel 7 open emp. Xls 7Save as 7 other format 7 Select CSV (comma delimited) 7 emp .CSV

  Screenshot_6     Properties 7 file = emp .csv 7columns 7 load 7 while import 7 select file type 7.CSV7import 7 ok   Tara data with ODBC 

  • Start Tara data Service control 7 Minimize it
  • Go to start 7 Control panel 7 Administrative Tools 7 service7Click on Tara data data base Initiator 7 start 7 close it

Default values  User id = td user Password = td user Server = 127.0.0.1 Creating DSN for Tara data  Start 7Control panel 7Administrative Tools 7 Data base (ODBC)7System DSN 7 Add 7 Tara data 7 finish 7 Name  = chaladi2, IP address = 127.0.0.1   user name = td user name password = td user 7Ok 7 DSN created 7 chaladil ODBC Enterprise    —     Data set   Properties 7   Read Method = Table, Data source = chaladil,  User = td user, Password =td user 7Columns 7 load 7 Table Definition 7 import7Plug in Meta  Data definition 7 click on Tara data 7 ok7Tara data director  = 127.0.0.1,User name = td user, Password  = td user7Next7Data base   = financial7Select only Table 7 Next 7 Select table 7 customer 7 import 7Table Definition 7 plug in 7 Tara data 7 customer  7ok 7 ok   Now, go to properties Screenshot_7   Now, view Data Dynamic RDMS  This is the only stage that supports n inputs and n outputs Oracle can read multiple tables using join but not independently. Using Dynamic RDBMS , we can read multiple tables independently.   Properties 7 click on DBMS Type 7 Select oracle 7 connection name (oracle) user (scott) Password (tiger) 7click on output 7 name  = emp  columns 7 click on columns7Load 7change Data to time stamp 7 click on output 7 name = Dept7Click on columns 7 load 7 change data to times stamp

  • Now compile and Run
  • EMP table will be displayed in new.ds
  • DEPT table will be displayed in new1.ds

Transformer

It is a processing stage which has 1 input and n outputs It is the Heart of Data stage. It is an All – in –one stage(Can perform the process of column generator, copy operator)    Difference between Basic Transformation and Parallel Transformer: Basic Transformation

  1. Comes from Server Environment
  2. Effects the performance
  3. Jobs with Basic Transformation can run up to SMP Machines (Platform dependent)
  4. Basic Transformation can call the routines only in
  5. Basic language
  6. SHELLSCRIPT

   Parallel Transformation

  1. Comes from parallel Environment
  2. Does not Effects the performance, but effects the compilation
  3. Jobs can run any Machines (Platform  Independent)
  4. Parallel Transformation can call the routines in any language like Java,

C++,VC++,VB,>net Shall Script, pearl Script

  • Transformation is used to call a routine in to data stage

Calculating Net Salary by Adding  Screenshot_9   Properties 7Read Method = table Table = EMP

  • Click on stage
  • Data connection = load Data connection
  • When you load, it will automatically update the user, password, and re mode Server
  • Click on ColumnsàLoad àokàChange the date to Times stamp

Data set

  • Properties 7Browse the file7Give a file name with .ds extension

Transformer

Screenshot_10  

  • Drag all the fields from in to out in the upper part
  • In the bottom part, out

Add a new column Net Sa Integer 5 7click on ok

  • It will be added to the out of upper part

  Screenshot_11 Double click 7Right click 7Select input Column 7 Select Sal 7 in.  Sal7Right click , Select ‘+’ 7 in.sal + 7 Right function 7 Null  Handling

  • Null to Zero in .al + Null to Zero (% input %) 7 Right click 7Input column 7Select Can 7 Sal + Null to zero (in. COMM)7Ok7Now Compile and run
These core tutorials will help you to learn 
the fundamentals of Tableau.
For an in-depth understanding and practical experience, 
explore Online DataStage Training.

Stage Variable:-  (Temporary Variable) Stage variable  is a temporary variable which he it’s the value until the process complete,But does send the value to the output. If Net Sa > 2000 then Re home = Net Sa – 200 Else Take home = Net Sal +200 Transformer  At the bottom. Out, Now create an New column name ‘Take home’ Column Name                 SQL Type             length Take home                           Integer                    8  

  • Now click ok
  • It will be added to top out

  Screenshot_12

  • Now click on show / Hide Stage variable symbol
  • Stage variable dialogue box opens

Double click 7 write the logic here  in . Sal + Null Zero  (in.Comn) 7Now go to out 7 Drag the stage variable to out at Net Sal Screenshot_14   Double click 7 Right click 7If then else 7if Right click 7Stage variables 7If Net Val > 2000 Then Net Val – 2000  else  Net Val + 200 7 ok7Compile and RUN

  1. Create a table with fields

  SID, S Name, M1, M2, M3, M4   Stage variables  

In. M1 + in. M2 + in. M3 + in .M4 tot
Tot/5 Avg
If Avg >= 60, then “Pass” else “fail” res
If res = “pass” then “A” else “B” gra

  If  tot > 401 and tot < 500 then “A” else if Tot > 302 and tot < 400 then “B” else “c”

For indepth understanding of DataStage click on

About Author
Authorlogo
Name
TekSlate
Author Bio

TekSlate is the best online training provider in delivering world-class IT skills to individuals and corporates from all parts of the globe. We are proven experts in accumulating every need of an IT skills upgrade aspirant and have delivered excellent services. We aim to bring you all the essentials to learn and master new technologies in the market with our articles, blogs, and videos. Build your career success with us, enhancing most in-demand skills in the market.