Vidu is the latest AI video tool from China, and it’s awesome

The AI video generation landscape continues to evolve with Vidu 1.5, a new platform from Chinese company Shengshu Technology that introduces new features to compete with established players like Runway and Kling.

Key Innovation: Vidu 1.5 introduces ‘multi-entity consistency,’ a groundbreaking feature that enables users to combine unrelated images into cohesive videos.

  • The technology allows creators to merge separate elements, such as placing a person from one photo into a car from another photo to create fluid video content
  • Users can incorporate different clothing or objects from separate images into a single character’s video sequence
  • The feature has demonstrated practical applications, such as improving skateboarding motion by incorporating multiple perspective images

Technical Capabilities: Vidu 1.5 offers enhanced control and quality options that set it apart from existing platforms.

  • The platform supports 1080p video generation, surpassing the typical 720p limitation of competitors
  • Creators can customize motion degree, resolution, duration, and other parameters
  • An ‘Advanced Character Control’ feature provides precise management of camera movements and cinematic techniques
  • Motion speed can be set to auto, low, medium, or high for more authentic outputs

Competitive Position: While showing promise in certain areas, Vidu 1.5 still faces some challenges in the broader AI video market.

  • The platform’s text-to-video capabilities currently lag behind competitors like Runway, Kling, and MiniMax
  • Traditional text-to-video and image-to-video modes are available, offering both photorealistic and illustration options
  • The platform is positioning itself as a challenger to OpenAI’s upcoming Sora release

Industry Perspective: Leadership at Shengshu Technology sees AI video generation as a democratizing force in content creation.

  • CEO Jiayu Tang emphasizes the technology’s potential to make high-quality content production accessible to everyone
  • The platform builds on recent research showing how AI video models interpret physics from images rather than understanding physical principles

Future Implications: Multi-entity consistency is a significant advancement that could reshape AI video generation capabilities. It may also push other platforms to develop similar features to remain competitive.

I put Vidu 1.5 to the test — a new major player in the AI video space
