Course Outline

Advanced Building Blocks for Transformations

  • Handling complex data types
  • Managing fields, metadata, and dynamic structures
  • Identifying and applying reusable transformation patterns

Parameters, Variables, and Job-Oriented Design

  • Understanding runtime variables and their scoping
  • Parameterizing transformations for flexibility
  • Structuring parent-child job relationships

Database Integration and Lookup Strategies

  • Mastering advanced lookup steps
  • Implementing effective caching strategies
  • Designing efficient join operations

Integrating Files, APIs, and External Systems

  • Processing JSON and XML data formats
  • Invoking REST and SOAP services
  • Managing streaming and batch loads

Techniques for Error Handling and Data Quality

  • Capturing and routing errors appropriately
  • Applying data validation patterns
  • Conducting auditing and logging

Essentials for Performance Tuning

  • Optimizing the design of individual steps
  • Addressing memory usage and threading configurations
  • Identifying and resolving bottlenecks

Introduction to Repository-Based Development

  • Leveraging the Pentaho repository
  • Managing versions effectively
  • Adopting team collaboration practices

Practices for Deployment and Migration

  • Moving jobs across different environments
  • Managing configurations
  • Establishing operational best practices

Summary and Future Steps

Requirements

  • A foundational understanding of ETL principles
  • Prior experience using Pentaho Data Integration
  • Basic familiarity with data warehousing concepts

Target Audience

  • ETL developers
  • Data engineers
  • Technical professionals looking to expand their PDI expertise

Duration

  • 21 hours
