It was yet another thrilling week in the AI field with advancements that further extend the limits of what can be achieved with AI.
Here are 10 AI breakthroughs that you can’t afford to miss 🧵👇
Will Siri be powered by ChatGPT? 🗣️
Apple is nearing a deal with OpenAI to bring ChatGPT to iPhones. The deal would see ChatGPT features integrated into the upcoming iOS 18 operating system and powering AI in more apps. Rumors are flying that Siri might start using ChatGPT’s brain to answer your questions. Apple has also engaged in discussions with Google about potentially licensing its Gemini chatbot, although no agreement has been reached yet.
OpenAI releases its new flagship model: GPT-4O ⭐️
OpenAI has released a new GPT-4 Omni model that brings GPT-4-level of intelligence to all but 4x faster. It is also available to free users. It can process text, voice, and images end-to-end and can improve on its capabilities. GPT-4O is also available via API. It is 2x faster, 50% cheaper than GPT-4 Turbo, and comes with 5x higher rate limits.
OpenAI’s New Voice Assistant
OpenAI has released a new Voice assistant that is much more capable and faster than the existing Voice Mode in ChatGPT. The new Voice Assistant is powered by GPT-4O, which can natively handle text, audio, and voice. Since GPT-4O powers the Voice Assistant single-handedly, it is near real-time without awkward pauses. It can see while giving voice assistance. You can turn on your camera and show it what you’re doing, and it will help you step-by-step to complete your work.
AI on Spotlight at Google’s I/O Event 2024 💡
Google wrapped up its I/O 2024 this week and AI was the center theme. A heap of AI announcements and releases were made. The major ones are:
Gemini 1.5 Pro now has a staggering 2 Million context window. It is available via waitlist to developers using the API and to Google Cloud customers.
Google has introduced Gemini 1.5 Flash, a lighter-weight model that delivers fast performance with less cost. It is also multimodal and has a context window of 1 Million tokens. It is available in the Google AI Studio and via API for mere $0.35 per million tokens.
Google is prototyping Project Astra, a new AI agent that can give voice assistance over video calls and also memorize the information you have shown to it. Powered by Gemini 1.5 Pro, it benefits from native video processing and a huge context.
Google has announced its AI video generation model called Veo. It generates high-quality 1080p videos in different styles, from images and text prompts. You can even extend the clips to over 1 minute.
Google has released PaliGemma model which is its first open VLM. It is planning to release Gemma 2 models in the next few weeks. One of the models will be of 27B parameters, which will outperform models 2x its size.
OpenAI’s Co-founder & Chief Scientist Leaves 🧳
Ilya Sutsveker, OpenAI’s chief scientist and one of its co-founders, has left the company. OpenAI CEO Sam Altman announced the news on X, sharing that Sutskever is moving on to a project that holds personal significance for him. OpenAI’s Director of Research Jakub Pachocki, who has been with OpenAI since 2017, will take over Sutskever’s role.
ARM Plans to Launch its AI chip by 2025 🎯
UK-based semiconductor company Arm is planning to launch its own AI chips by 2025. This project, under the umbrella of SoftBank Group, aims to leverage Arm’s dominant position in smartphone processor architecture to enter the AI chip market. With a projected increase in the AI chip market value and a clear need for more advanced processing capabilities, Arm could challenge Nvidia and impact the industry significantly.
Eye Tracking in iPad and iPhone 👀
Apple is releasing new accessibility features in the upcoming OS updates to enhance the experiences of people with diverse needs. Eye Tracking - allows users with disabilities to control their iPad and iPhone using only their eyes. Vocal Shortcuts - users can create custom phrases for Siri to understand and execute shortcuts. New CarPlay features - allow deaf or hard-of-hearing people to turn on alerts to be notified of car horns and sirens. Vehicle Motion Cues - helps reduce motion sickness by displaying animated dots on the screen that correspond to changes in vehicle movement.
OpenAI Releases ChatGPT Desktop App for MacOS 🖥️
OpenAI has finally released its ChatGPT desktop app for MacOS and the experience of using ChatGPT is seamless and more intuitive. You can now take ChatGPT’s help with just a shortcut (Option + Space), be it any website or app. This lets you ask about anything you’re reading or doing without breaking your flow. You can also use the Voice Mode in the app where you can speak to ChatGPT and it’ll respond with voice too. However, it is not the new Voice Assistant that OpenAI showcased at last week’s event.
Connected Apps Feature in ChatGPT & a Lot More 🔥
ChatGPT now lets you upload files directly from Google Drive and Microsoft OneDrive. Not just this, ChatGPT will now have advanced data analysis capabilities where it can analyze multiple files at once and create interactive tables and charts. You can download, edit, and even interact with these tables to ask follow-up questions and make them presentation-ready. This feature will soon be rolled out to ChatGPT Plus, Team, and Enterprise users.
Live-Data from Reddit to ChatGPT 📈
Imagine having a conversation with ChatGPT that brings in the vibrant real-time discussions from Reddit! OpenAI has partnered with Reddit to use its data and provide the company with AI-powered features for Reddit users. OpenAI will use Reddit’s Data API for real-time insights, ensuring up-to-date and relevant content is available for users, particularly on trending topics. Reddit will leverage OpenAI’s platform to create new AI features for users and moderators, improving community management and interaction.
Which of the above AI development you are most excited about and why?
Tell us in the comments below ⬇️
That’s all for today 👋
Stay tuned for another week of innovation and discovery as AI continues to evolve at a staggering pace. Don’t miss out on the developments – join us next week for more insights into the AI revolution!
Click on the subscribe button and be part of the future, today!
📣 Spread the Word: Think your friends and colleagues should be in the know? Click the ‘Share’ button and let them join this exciting adventure into the world of AI. Sharing knowledge is the first step towards innovation!
🔗 Stay Connected: Follow us for AI updates, sneak peeks, and more. Your journey into the future of AI starts here!
Shubham Saboo - Twitter | LinkedIn ⎸ Unwind AI - Twitter | LinkedIn