Synthesia: The Future of AI Video at Work (CNBC)

Synthesia
3 Jul 2024 · 10:07

TLDR: Synthesia, a British AI startup, has launched new features allowing users to create AI avatars using webcams and phones. The company, backed by Nvidia, aims to make video content creation accessible and affordable for large firms. CEO Victor Riparbelli discusses the technology's potential, ethical considerations, and the future of personalized AI video content.

Takeaways

  • Synthesia is a British AI startup offering AI video generation services.
  • They have introduced new features like AI avatar creation through webcams and phones.
  • The company is backed by Nvidia and aims to target large firms as future clients.
  • AI video generation addresses the high cost and time-consuming nature of traditional video production.
  • Users can now create avatars with their phones or webcams at a significantly lower cost.
  • The avatars can speak in multiple languages, offering a global reach.
  • Synthesia has overcome early technological barriers and is now focused on market expansion.
  • They are already working with almost 60% of the Fortune 100 companies.
  • While they don't disclose profit margins, they emphasize building a solid business over just fundraising.
  • Synthesia has an ethical framework that prohibits non-consensual use of avatars or content.
  • Their partnership with Nvidia is symbiotic, leveraging Nvidia's AI research and technology.

Q & A

  • What is Synthesia and what does it specialize in?

    -Synthesia is a British AI startup that specializes in creating AI avatars using webcams and phones, with the aim of making video content creation faster and more efficient.

  • Who is Victor Riparbelli and what is his role at Synthesia?

    -Victor Riparbelli is the CEO and co-founder of Synthesia.

  • What is the main challenge that Synthesia aims to address?

    -Synthesia aims to address the challenge of video content production being expensive and time-consuming by using AI to generate videos without the need for filming.

  • How does Synthesia's technology differ from traditional video production?

    -Synthesia's technology allows for the creation of video content without filming by generating it with AI, which is faster and more efficient than traditional methods that involve studios and professional videographers.

  • What is the cost difference between producing an avatar with Synthesia and traditional methods?

    -Traditional avatar production could cost around $1,000 per avatar, whereas Synthesia's method can be done for free or with an enterprise plan, making it significantly more cost-effective.

  • How quickly can an avatar be created using Synthesia's technology?

    -An avatar can be ready in less than 24 hours using Synthesia's technology.

  • What languages can Synthesia's avatars speak?

    -Synthesia's avatars can speak not only the language they were recorded in but also 29 other languages.

  • What are the barriers to growth that Synthesia has faced?

    -Synthesia faced technological barriers in its early years and cultural barriers around the acceptance of AI videos and avatars. The challenges now are market competition and the financing environment.

  • How does Synthesia's technology fit into the current market?

    -Synthesia's technology is positioned to grow in a market that values efficient communication and knowledge sharing, with a focus on go-to-market strategy and customer satisfaction.

  • What is Synthesia's stance on ethical concerns regarding AI avatars and deepfakes?

    -Synthesia has an ethical framework that prohibits non-consensual content or avatars. Creating an avatar requires consent through a KYC-style process to ensure the individual is in control of their own avatar.

  • What is the partnership between Synthesia and Nvidia, and what does it entail?

    -Nvidia, which has one of the best AI research labs in the world, partners with Synthesia to accelerate AI adoption. Synthesia uses Nvidia's products to train their models, and this partnership enhances both companies' capabilities.

  • How does Synthesia plan to further monetize its product?

    -Synthesia plans to monetize its product by transforming video into an interactive medium, enabling personalized videos and conversations with avatars, thus opening an entirely new market.

Outlines

00:00

AI Video Content Creation with Synthesia

Synthesia, a UK-based AI startup backed by Nvidia, has introduced new features such as AI avatar creation using webcams and phones. The company aims to make video content creation more accessible and efficient for large enterprises. In an interview, CEO Victor Riparbelli explains that while people prefer video content, traditional video production is costly and time-consuming. AI offers a solution by generating videos without filming, thus speeding up communication. Victor highlights the cost reduction from approximately $1,000 per avatar to virtually free with Synthesia, and the ability to produce content in multiple languages. He also discusses the cultural acceptance of AI and the company's growth, emphasizing the transition from technical to market challenges. Synthesia already works with almost 60% of Fortune 100 companies, indicating a promising future.

05:02

Overcoming AI Adoption Barriers and Ethical Considerations

In the second part of the interview, Victor addresses the challenges of AI adoption, referring to the 'Holy Grail problem' where companies struggle to implement AI systems that meet all their needs. He mentions 'AI letdown' as a common issue, where AI technologies almost meet expectations but fall short. Victor emphasizes Synthesia's ability to demonstrate business value quickly. The discussion also touches on deepfakes and ethical concerns, with Victor stating that Synthesia has a strict policy against non-consensual use of content. He explains the company's Know Your Customer (KYC) process to ensure consent for avatar creation. The conversation also includes the ethical dilemmas surrounding voice imitation and the potential legal challenges it presents. Lastly, Victor discusses Nvidia's investment in Synthesia, highlighting the mutual benefits of their partnership, including access to Nvidia's AI research and expertise.

Keywords

AI avatars

AI avatars refer to digital representations of a person that can be controlled by AI algorithms. In the context of the video, Synthesia uses AI to create avatars that can mimic human speech and movements. This technology allows for the creation of personalized video content without the need for physical presence, as highlighted by Victor when he mentions the ability to create avatars using webcams and phones.

Enterprise

Enterprise refers to large businesses or organizations that often require efficient communication methods. Victor discusses how Synthesia's AI technology addresses the enterprise need for video content creation, which is traditionally expensive and time-consuming. The company aims to make video production more accessible and cost-effective for these large firms.

AI-generated content

AI-generated content is any media, such as video or text, created using artificial intelligence. The video's main theme revolves around Synthesia's ability to generate video content through AI, which is a game-changer for enterprises looking to communicate more effectively without the high costs associated with traditional video production.

Production cost

Production cost refers to the expenses incurred in creating a product or service. Victor explains that traditional avatar production could cost around $1,000 per avatar, but with Synthesia's AI technology, the cost is significantly reduced, making it almost free. This cost reduction is a major selling point for enterprises looking to adopt AI for content creation.

Cultural barriers

Cultural barriers are societal or cultural challenges that can impede the adoption of new technologies. Victor mentions that one of the initial barriers to AI video adoption was cultural acceptance. People had to become comfortable with the idea of AI-generated videos and avatars, which has been improving with increased AI integration in various aspects of life.

Go-to-market exercise

A go-to-market exercise is a strategic process for launching a product or service. Victor discusses how Synthesia, after overcoming technical and cultural barriers, is now focused on its go-to-market strategy, which involves offering the best product at the best price to attract and grow its customer base.

Deepfakes

Deepfakes are AI-manipulated media, often used to create fake videos or audio that appear real. The interviewer raises the issue of deepfakes in relation to Synthesia's technology, questioning the ethical implications of creating realistic, yet fake, representations of people. Victor addresses this by emphasizing the importance of consent and ethical use of the technology.

Ethical framework

An ethical framework is a set of principles that guide decision-making and behavior. Victor mentions that Synthesia established an ethical framework to ensure responsible use of their technology, with a clear red line against non-consensual content or avatar creation.

Nvidia

Nvidia is a company known for its graphics processing units (GPUs) and AI research. The video discusses their partnership with Synthesia, where they provide not only hardware but also expertise in AI model training. This partnership is beneficial for both companies, as it helps Synthesia accelerate their AI technology development.

AI adoption

AI adoption refers to the process of integrating artificial intelligence into various sectors or businesses. The discussion in the video highlights the growing adoption of AI across industries, with Synthesia being at the forefront of making AI-generated video content creation accessible to enterprises.

Highlights

Synthesia introduces new features to create AI avatars using webcams and phones.

The company is backed by Nvidia and aims to target large firms as clients.

AI avatars can generate video content without filming, making communication more efficient.

Traditional video production is expensive in both time and money.

AI technology allows for the creation of video content more quickly and cost-effectively.

Synthesia's avatars can speak in 29 languages besides the original, in the user's own voice.

The cost of producing an AI avatar has significantly decreased compared to traditional methods.

Synthesia has overcome early technological barriers to develop its AI video technology.

Cultural acceptance of AI videos and avatars has improved, especially with the rise of ChatGPT.

The company now focuses on market strategy to deliver the best product at the best price.

Synthesia works with almost 60% of the Fortune 100 companies.

The company places importance on building an actual business rather than just raising funds.

AI technology is poised to transform video from a linear medium into an interactive experience.

The future of AI video may include personalized content and conversations with avatars.

Financing in AI has become more rational, focusing on businesses with good margins and growth.

Synthesia can demonstrate business value quickly, which is key to adoption in the enterprise space.

Companies adopting AI face the 'Holy Grail problem' of expecting an all-in-one AI system that meets all their needs.

Synthesia has an ethical framework that prohibits non-consensual use of content or avatars.

Creating an avatar with Synthesia requires a KYC-style process to ensure user consent and identity verification.

Nvidia's partnership with Synthesia is symbiotic, providing valuable AI research and expertise.