What is Google’s new Ask Photos feature and how does it work?

Google Unveils AI-Driven ‘Ask Photos’ Feature for Enhanced Image Search

Google is testing its new Ask Photos feature, powered by Gemini AI, which lets users search their photo libraries in natural language. Previewed at Google I/O 2024, the feature lets users ask specific questions about their images without tagging or organizing them beforehand. Initial demonstrations showed it retrieving specific details from photos, such as a license plate. Ask Photos is currently in testing with select users and is meant to make image organization and retrieval considerably easier, part of Google’s broader push to integrate AI across its services.

Apple Unveils High-Performance AI Models Surpassing Competitors

Apple has released a new set of open-source AI models through its DataComp for Language Models project. The models, with 7 billion and 1.4 billion parameters, have outperformed comparable models from competitors such as Mistral and Hugging Face. Trained with the project’s standardized framework, the larger model achieved 63.7% accuracy on the MMLU benchmark, a notable improvement over previous state-of-the-art models. The initiative emphasizes the importance of effective dataset design and aims to improve AI training strategies, although the models are not intended for direct use on Apple devices.
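For readers unfamiliar with how a figure like 63.7% is obtained: MMLU is a multiple-choice benchmark, and the score is the share of questions answered correctly. The sketch below illustrates that calculation only. The function name and the four sample answers are made up for illustration and are not taken from Apple’s evaluation code or from the benchmark itself.

```python
def accuracy(predictions: list[str], answers: list[str]) -> float:
    """Return the fraction of predictions that match the reference answers."""
    if len(predictions) != len(answers):
        raise ValueError("predictions and answers must have the same length")
    if not answers:
        raise ValueError("need at least one question to score")
    correct = sum(p == a for p, a in zip(predictions, answers))
    return correct / len(answers)


# Hypothetical data: four multiple-choice questions with options A-D.
answers = ["A", "C", "B", "D"]
predictions = ["A", "C", "D", "D"]  # the third answer is wrong

print(f"{accuracy(predictions, answers):.1%}")  # prints "75.0%"
```

On this scale, 63.7% would correspond to roughly 637 correct answers out of every 1,000 questions.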

Mistral AI and NVIDIA Launch Multilingual 12B NeMo Model

Mistral AI, in collaboration with NVIDIA, has unveiled Mistral NeMo, a model with 12 billion parameters and a context window of 128,000 tokens. The model performs strongly in reasoning, world knowledge, and coding accuracy, and is designed as a drop-in replacement for the existing Mistral 7B. Both pre-trained and instruction-tuned checkpoints are available under the Apache 2.0 license to encourage research and adoption. NeMo also introduces a new tokenizer, Tekken, which compresses text more efficiently across multiple languages. The release aims to broaden access to advanced AI capabilities across a wide range of applications.
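To make "compression efficiency" concrete: the fewer tokens a tokenizer needs for a given text, the more text fits into a fixed context window. The sketch below compares two toy tokenizers by characters per token. Both tokenizers and the sample sentence are illustrative assumptions; neither is Tekken, and the numbers say nothing about the real model.

```python
from typing import Callable

CONTEXT_WINDOW = 128_000  # tokens, the figure quoted for the model


def chars_per_token(text: str, tokenize: Callable[[str], list[str]]) -> float:
    """Average characters covered by one token (higher means better compression)."""
    tokens = tokenize(text)
    return len(text) / len(tokens)


def char_tokenizer(text: str) -> list[str]:
    # Toy baseline: one token per character.
    return list(text)


def word_tokenizer(text: str) -> list[str]:
    # Toy alternative: one token per whitespace-separated word.
    return text.split()


sample = "Mistral AI and NVIDIA released a multilingual model"
for name, tokenize in [("char", char_tokenizer), ("word", word_tokenizer)]:
    ratio = chars_per_token(sample, tokenize)
    capacity = int(ratio * CONTEXT_WINDOW)
    print(f"{name}: {ratio:.2f} chars/token, about {capacity:,} chars per window")
```

Real tokenizers sit between these two extremes, and their ratio varies by language, which is why a tokenizer tuned for multilingual text can stretch the same window further.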

Google Enhances Paris Olympics Coverage with AI Innovations

Google has been named the official search AI partner for Team USA, marking a historic collaboration with the U.S. Olympic and Paralympic Committee. The company will integrate AI into the U.S. broadcast of the Paris Olympics, enabling commentators to use AI for real-time explanations of competitions. NBCUniversal plans to modernize its coverage to engage younger audiences, featuring personalized AI-generated recaps narrated by AI versions of prominent commentators. Comedian Leslie Jones will also use Google’s Gemini AI in her commentary during the Games.

OpenAI Explores Development of New AI Chips with Broadcom

OpenAI is in discussions with Broadcom and other chip designers to develop its own artificial intelligence chips, aiming to address the ongoing shortage of costly graphics processing units essential for AI models such as ChatGPT and DALL-E 3. The company is hiring former Google engineers who previously worked on AI chips and plans to create an AI server chip. OpenAI is also engaging with industry and government stakeholders to improve access to the necessary infrastructure, while CEO Sam Altman seeks billions in funding to establish semiconductor manufacturing partnerships with major chipmakers.

Frequently asked questions

Google’s Ask Photos is a new AI-powered feature that allows users to search their photo libraries using natural language queries. Powered by Gemini AI, it enables users to find specific images by asking questions without needing to manually tag or organize photos first. The feature can help locate particular details in photos, such as license plates, making photo organization and retrieval much more intuitive and efficient.
Apple’s new AI models, part of the DataComp for Language Models project, outperform competitors like Mistral and Hugging Face. Their 7 billion parameter model achieved 63.7% accuracy on the MMLU benchmark, setting a new standard in natural language processing. These open-source models demonstrate superior performance through standardized training frameworks, though they’re not designed for direct implementation on Apple devices.
Google’s role as the official search AI partner for Team USA represents a groundbreaking integration of AI technology in Olympic coverage. The partnership will enable real-time AI-powered explanations during competitions and personalized content generation, including AI-generated recaps narrated by virtual commentators, specifically designed to attract younger viewers to Olympic coverage.
Mistral NeMo is a 12-billion-parameter AI model developed by Mistral AI and NVIDIA. It features a large context window of 128,000 tokens and performs strongly in reasoning, world knowledge, and coding accuracy. The model includes a new tokenizer called Tekken for improved multilingual compression and is available as open source under the Apache 2.0 license.
OpenAI is tackling the AI chip shortage by exploring partnerships with Broadcom and other chip designers to develop its own AI processors. The company is hiring former Google engineers with AI chip experience and seeking billions in funding to establish semiconductor manufacturing partnerships. This initiative aims to reduce dependency on existing GPU supplies and create custom solutions for its AI models.
Google’s Ask Photos provides a more intuitive and natural way to search through photo libraries compared to traditional methods. Instead of relying on manual tags or dates, users can simply ask questions about their photos in natural language. This AI-powered approach saves time, reduces the need for manual organization, and can identify specific details within images that might be difficult to find otherwise.
AI technology will revolutionize Olympic broadcasting through several innovations, including real-time AI explanations of competitions, personalized content generation, and AI-powered commentary. Google’s involvement will enable NBCUniversal to create customized recaps and enhance commentary, including AI-assisted content from personalities like Leslie Jones, making the coverage more engaging and accessible to diverse audiences.

Gor Gasparyan

Optimizing digital experiences for growth-stage & enterprise brands through research-driven design, automation, and AI