Get in Touch

Course Outline

Hunyuan Multimodal Foundations and Lab Configuration

  • Grasping Hunyuan's multimodal capabilities for image, 3D, and video scenarios
  • Identifying relevant business scenarios for creative, product, and content teams
  • Setting up the lab environment, sample assets, and model access
  • Executing initial generation tasks and reviewing results

Prompt Design and Workflow Patterns

  • Crafting prompts for consistent multimodal outcomes
  • Utilizing text prompts, reference images, and basic input configurations
  • Selecting appropriate workflows for image, video, or 3D generation
  • Refining prompts based on output quality and business objectives

Image Generation and Review Labs

  • Producing marketing, product, and concept images from prompts
  • Fine-tuning visual style, composition, and content consistency
  • Evaluating outputs for utility, quality, and brand alignment
  • Organizing image outputs for approval and downstream usage

Video Generation Labs

  • Generating short video clips from prompts and prepared inputs
  • Managing style, scene intent, and output variation
  • Assessing videos for clarity, continuity, and practical application
  • Preparing video outputs for demonstrations or content workflows

3D Asset Creation Labs

  • Generating basic 3D assets from text or image inputs
  • Verifying geometry, texture quality, and asset usability
  • Exporting assets for visualization, prototyping, or content pipelines
  • Determining when 3D generation is suitable compared to image or video workflows

Integration, Governance, and Next Steps

  • Distributing generated assets via simple apps, services, or APIs
  • Linking multimodal outputs to product, content, and review processes
  • Implementing practical checks for quality, brand safety, copyright, and responsible usage
  • Planning pilot use cases and next steps for internal adoption

Requirements

  • Foundational knowledge of AI and generative AI concepts
  • Experience utilizing web applications, APIs, or standard developer tools
  • Basic proficiency in Python or scripting

Audience

  • Developers creating AI-enhanced product features
  • Technical product managers and solution architects
  • Innovation, media, and digital teams handling image, video, or 3D content
 14 Hours

Number of participants


Price per participant

Upcoming Courses

Related Categories