Google Steals the AI Spotlight!
Plus: Meta's Multimodal Models, Text-to-Animation Model by Stability AI, Anthropic's 100k context brain, and more.
Hey there π
Google kicked off its annual I/O Event and has been in the AI headlines since then. The Big G didn't just step into the AI ring, it belly-flopped right in, splashing us with a tidal wave of AI goodies. It's clear they're not just jogging, but sprinting in this AI marathon. We're here with the low-down on all the techie treats they tossed out. Strap in and get ready for some mind-blowing innovations:
PaLM 2: Powering 25+ Google products with a next-gen language model sporting multilingual and coding prowess.
Bard: Available in 180 nations, 40 languages, boasting image I/O, coding upgrades, and app integration.
Search Generative Experience: Upgraded search delivering more info and context to your queries.
Search Labs: Play with novel Google products like generative search, Code Tips, and Add to Sheets.
Workspace Labs: New AI-aided features for Docs and Gmail, and a text-to-image tool for Slides.
Project Tailwind: An AI-first notebook, powered by your notes and sources.
MusicLM: Generate music with text prompts; the AI maestro is here.
Vertex AI: Meet Codey (text-to-code), Imagen (text-to-image), and Chirp (speech-to-text); your AI tool troika.
Gemini: Multimodal model with API integrations, still in training.
Immersive View in Maps: Multi-dimensional route visualization through AI and computer vision.
Magic Editor in Google Photos: AI-assisted precise image edits, as if by magic!
But wait, there's more! π
Meta's new multimodal model is creating buzz, Humane's wearable promises screen-free computing, and OpenAI is set to roll out GPT-4 Web Browsing and 7+ ChatGPT Plugins. Buckle up, we're just getting started! Dive in for the rest of this week's exciting AI news!
This issue covers:
Latest Developments π
News from the Industry π§βπ«
Tools of the Trade βοΈ
Hot Takes π₯
AI Meme of the Week π€‘
Latest Developments π
Our Pick π
ImageBind by Meta AI: Multimodal AI model that can learn from six modalities, text, image/video, audio, depth, thermal and inertial measurement units to gain a holistic understanding of data.
LeMUR by Assembly AI: Leveraging LLMs to transcribe up to 10 hours worth of audio content (~150K tokens) with a single line of code.
Stable Animation SDK by Stability AI: A powerful text-to-animation tool to create stunning animations using Stable Diffusion models in three ways.
MultiModal-GPT: A vision and language model that can have continuous dialogue with humans by following instructions, and utilizes training data to improve performance.
FrugalGPT: Cost-saving strategies for using LLMs like GPT-4 and can match or improve performance with up to 98% cost reduction.
CodeExecutor: Model that leverages code execution pre-training and curriculum learning to enhance semantic comprehension with promising results.
TidyBot: A personalized robot that learns user preferences for household cleanup using LLMs, achieving high accuracy in object placement and adaptability.
InternChat: An interactive visual framework that integrates chatbots with non-verbal instructions like pointing movements to perform vision-centric tasks.
WikiWeb2M: A dataset for multimodal webpage understanding tasks that retains the full set of images, text, and structure data available in a webpage.
News from the Industry π§βπ«
Our Pick π
Humane is creating an AI-powered wearable device that lets users access computing power without being tethered to a smartphone or other device.
HuggignFace has released Transformers Agent which provides a natural language API on top of transformers with curated tools and an agent that interprets and uses them.
Meta is introducing the AI Sandbox for advertisers and expanding its Meta Advantage suite of ad automations tools to improve campaign results.
Anthropic has Claude's context window to 100K tokens, allowing for analyzing large volumes of materials and answering complex questions.
Amazon is building AI tools for merchants to boost advertising revenue by using generative AI for generating photos, videos and product descriptions on the platform.
Palantir's stock surged 15% after the company reported strong earnings, and said the demand for its upcoming AI platform is "without precedent".
Google Cloud is partnering with enterprise companies such as Box, Canva, Dialpad, Jasper, Salesforce, and UKG to integrate generative AI capabilities.
IBM has launched its enterprise AI and data platform, Watsonx, designed to create competitive advantage, scale AI, and advance trustworthy AI.
DeepMind co-founder, Mustafa Suleyman, warns that advanced AI will result in a large number of white-collar job losses, leaving many workers "very unhappy".
Google is partnering with Adobe to integrate Adobe's Firefly for generating media content and its free graphic design tool, Adobe Express, in Bard.
Scale AI has launched two major platforms: Scale Donovan, an AI copilot for defense, and Scale EGP, a full-stack enterprise AI, emphasizing the need of integrating AI into military and economic strategies.
Tools of the Trade βοΈ
Our Pick π
Nyric: AI world-generation platform that allows users to build their dream worlds with just text prompts.
ChatGPT Microphone: Add voice-to-text and shortcut snippets to ChatGPT for more efficiency.
YOYA.ai: Build personalized generative AI apps without code, including instant custom chatbot creation, website page training, and easy integration.
Intuo.ai: A suite of privacy-centric AI features, including generative AI, open-source chatbots, advanced web searching capabilities, and more.
ArchitectGPT: Create stunning visuals of home or property, with an intuitive drag-and-drop interface and a variety of design themes.
Promptitude: Manage, test, and improve GPT prompts with a simple API call.
Dart: project management tool powered by GPT-4 that automates tasks and optimizes workflows for teams.
Pace AI: Create various project-related materials such as requirements, user stories, roadmaps, product vision statements, and meeting agendas.
Kadoa: AI-powered web scraper that autogenerates scrapers for sources, adapts to website changes, extracts data accurately, and provides data in any format.
HelpGent: Personalized asynchronous communication with audiences through video, voice, text, and screen recording for increased engagement.
Learnt.ai: Generate various resources such as lesson plans, homework tasks, icebreakers, and assessment questions in seconds.
Seidesk: Create customized knowledge base, FAQ pages, and a help center for customers and collaborators.
Guidde: Create video documentation with AI 11x faster, with features such as Magic Capture, voiceover, and smart sharing.
Feathery: Create high quality web forms 10x faster with AI.
Receipt AI: AI-powered receipt management tool to simplify accounting, saves time by automatically extracting information from receipt images.
Jam GPT: AI debugging assistant that provides an integrated and secure solution for automated bug diagnosis, code fix suggestions, and adaptive AI learning.
Conversly.ai: Language learning app that offers conversation practice with various characters powered by ChatGPT.
ChatCAD: Generate 3D models with AI using just text prompts.
Raycast Pro: Productivity tool that harnesses AI to enable users to write smarter, code faster, summarize text, and perform various tasks.
Hot Takes π₯
AI Meme of the Week π€‘
Thatβs all for this week!
Will see you next Saturday with more such content. Donβt forget to subscribe and give your feedback below.
BONUS π
Share this newsletter with three other friends and stand a chance to win my book GPT-3: The Ultimate Guide to build NLP Products with OpenAI API. Winners will be selected on a monthly basis.