Course Outline
Advanced Building Blocks for Transformations
- Handling complex data types
- Managing fields, metadata, and dynamic structures
- Identifying and applying reusable transformation patterns
Parameters, Variables, and Job-Oriented Design
- Understanding runtime variables and their scoping
- Parameterizing transformations for flexibility
- Structuring parent-child job relationships
Database Integration and Lookup Strategies
- Mastering advanced lookup steps
- Implementing effective caching strategies
- Designing efficient join operations
Integrating Files, APIs, and External Systems
- Processing JSON and XML data formats
- Invoking REST and SOAP services
- Managing streaming and batch loads
Techniques for Error Handling and Data Quality
- Capturing and routing errors appropriately
- Applying data validation patterns
- Conducting auditing and logging
Essentials for Performance Tuning
- Optimizing the design of individual steps
- Addressing memory usage and threading configurations
- Identifying and resolving bottlenecks
Introduction to Repository-Based Development
- Leveraging the Pentaho repository
- Managing versions effectively
- Adopting team collaboration practices
Practices for Deployment and Migration
- Moving jobs across different environments
- Managing configurations
- Establishing operational best practices
Summary and Future Steps
Requirements
- A foundational understanding of ETL principles
- Prior experience using Pentaho Data Integration
- Basic familiarity with data warehousing concepts
Target Audience
- ETL developers
- Data engineers
- Technical professionals looking to expand their PDI expertise
Testimonials (3)
It was very practical.
Alfonso Ramos - Banco de Mexico
Course - Fundamentos de Integración de Datos Pentaho
Very useful because it helps me understand what we can do with the data in our context. It will also help me
Nicolas NEMORIN - Adecco Groupe France
Course - KNIME Analytics Platform for BI
It's a hands-on session.