What is Google Vids and how will it change video creation?
Cover Photo Major News from Google Vids, Google's MaxDiffusion and JetStream, Meta's Llama 3, and Intel's Gaudi 3

Google Unveils AI-Powered Video Creation Tool, Google Vids, at Cloud Next Conference

Google has introduced a groundbreaking AI-fueled video creation tool called Google Vids. Set to become part of the Google Workspace productivity suite upon release, Google Vids aims to revolutionize the way users create and collaborate on videos. Aparna Pappu, VP & GM at Google Workspace, unveiled the tool, describing it as “your video editing, writing and production assistant, all in one.” Google Vids transforms existing assets, such as marketing copy or images stored in Google Drive, into compelling videos. One of the key features of Google Vids is its collaborative nature, allowing team members to work on a video story simultaneously in real-time, directly in the browser. This eliminates the need for emailing files back and forth, while maintaining the same access controls and security provided by Google Workspace. Google Vids is currently in limited testing, with plans to roll out to additional testers in Google Labs in June. Eventually, the tool will be available for customers with Gemini for Workspace subscriptions.

Google Releases Open Source Tools to Support AI Model Development at Cloud Next Conference

Google has taken a unique approach by introducing a range of open source tools designed to support generative AI projects and infrastructure. One of the key releases is MaxDiffusion, a collection of reference implementations of various diffusion models that run on XLA devices, which optimize and speed up specific types of AI workloads. Google has also launched JetStream, a new engine for running text-generating AI models, which currently supports TPUs and promises up to 3 times higher “performance per dollar” for models like Google’s Gemma 7B and Meta’s Llama 2. Additionally, Google has expanded its MaxText collection of text-generating AI models targeting TPUs and Nvidia GPUs in the cloud. The collection now includes Gemma 7B, OpenAI’s GPT-3, Llama 2, and models from AI startup Mistral, all of which can be customized and fine-tuned by developers.

Meta Confirms Imminent Release of Llama 3 Open Source Large Language Model

Meta has confirmed that it will release Llama 3, the next generation of its open source large language model, within the next month. The announcement was made during an event in London on Tuesday by Nick Clegg, Meta’s president of global affairs, and Chris Cox, the company’s Chief Product Officer. The tech giant plans to roll out a suite of next-generation foundation models throughout the year, with the aim of powering multiple products across Meta using Llama 3. This move comes as Meta strives to catch up with competitors like OpenAI, which took the industry by surprise with the launch of ChatGPT over a year ago. Llama 3 is expected to address the limitations of its predecessors by providing more accurate answers and fielding a wider range of questions, including controversial topics. Joelle Pineau, vice president of AI Research at Meta, stated that the goal is to make a Llama-powered Meta AI the most useful assistant in the world. 

Intel Launches Gaudi 3 AI Accelerator Chip to Compete with Nvidia and AMD

Intel has unveiled its next-generation AI processing chip, the Gaudi 3 AI accelerator, designed to streamline AI development and make it faster, more straightforward, and scalable. The chip promises to deliver four times more computing power, double the network bandwidth, and 1.5 times the HBM memory bandwidth compared to its predecessor. Jeni Barovian, Intel’s vice president and general manager for its data center AI solutions strategy, emphasized the significance of the Gaudi 3 launch, stating that the chip will deliver the performance, scalability, and efficiency required to build future AI systems. While the Gaudi 3 was previewed by Intel CEO Patrick Gelsinger five months ago, it will be generally available in the third quarter of 2024. Eitan Medina, the COO of Intel’s Habana Labs, described the Gaudi 3 AI Accelerator as having a “heterogeneous computer architecture” with advanced specifications. Building solutions based on Gaudi 3 will be similar to that of Gaudi 2, with Intel doubling the network bandwidth from each accelerator, allowing customers to build clusters of any size depending on their workload needs.

Frequently asked questions

Google Vids is an AI-powered video creation tool that will be integrated into Google Workspace. It helps users create and collaborate on videos by transforming existing assets like marketing copy and images from Google Drive into professional videos. The tool enables real-time collaboration between team members directly in the browser, eliminating the need for file sharing. Google Vids will be available to Gemini for Workspace subscribers after initial testing phases, with broader access starting in June through Google Labs.
Llama 3, Meta’s next-generation open source large language model, is designed to provide more accurate answers and handle a broader range of questions, including controversial topics. It represents a significant upgrade from previous versions and will power multiple Meta products. The model aims to make Meta AI the most useful assistant globally, helping Meta compete more effectively with other AI leaders like OpenAI.
Intel’s Gaudi 3 AI accelerator offers four times more computing power, double the network bandwidth, and 1.5 times the HBM memory bandwidth compared to its predecessor. It features a heterogeneous computer architecture and is designed to make AI development faster, more straightforward, and scalable. The chip will be available in Q3 2024 and allows customers to build clusters of any size based on their workload requirements.
Google has released MaxDiffusion, a collection of diffusion model implementations for XLA devices, and JetStream, a new engine for running text-generating AI models. They’ve also expanded MaxText to include support for various models like Gemma 7B, GPT-3, and Llama 2. These tools are designed to optimize AI workloads and provide better performance per dollar, particularly when using TPUs.
Google Vids will follow a phased rollout approach. It is currently in limited testing, with plans to expand testing through Google Labs in June. The tool will eventually be available to all customers with Gemini for Workspace subscriptions, though an exact public release date hasn’t been announced.
Llama 3 is part of Meta’s broader strategy to roll out next-generation foundation models throughout the year. It represents Meta’s effort to catch up with competitors in the AI space and will be integrated across multiple Meta products. The company aims to use Llama 3 to enhance their AI capabilities and provide more sophisticated AI interactions for users.
Intel’s Gaudi 3 competes through its significant performance improvements over previous generations, offering four times more computing power and enhanced network bandwidth. Its scalable architecture allows customers to build custom-sized clusters, while maintaining compatibility with existing Gaudi 2 solutions. These features, combined with its efficiency and performance capabilities, position it as a strong alternative to competing AI accelerator chips.
Picture of Gor Gasparyan

Gor Gasparyan

Optimizing digital experiences for growth-stage & enterprise brands through research-driven design, automation, and AI