It was another thrilling week in AI, with advancements that push the limits of what the technology can achieve.
Here are 10 AI breakthroughs that you can’t afford to miss 🧵👇
OpenAI Might be Testing a New AI Model in Stealth 🤯
A new chatbot called “gpt2-chatbot” has appeared on LMSYS’s Chatbot Arena and sparked quite a conversation on X. We tried it on a few reasoning tasks, and it is very impressive: it not only got the answers right but also produced cleanly structured output that walked through every step. Its capabilities seem similar to, or even better than, those of GPT-4 Turbo and Claude 3.
GitHub Challenges Devin with an Enhanced Copilot 🧑‍💻
GitHub has launched GitHub Copilot Workspace, which lets developers go from idea to code to software. It targets the biggest pain point: figuring out how to approach the problem. Developers can brainstorm, plan, build, test, and run code, all in natural language, within this environment. The experience uses different Copilot-powered agents from start to finish, integrating all stages of software development. While the workspace is powered by AI, developers retain complete control over their projects, and every part of the solution Copilot Workspace proposes is fully editable.
Make ChatGPT Know More About You for Better Chats 👯
In February, OpenAI started testing a new “Memory” feature in ChatGPT, which remembers things discussed in conversations to make future interactions more helpful. At the time, it was rolled out to a small portion of ChatGPT free and Plus users. OpenAI has now made it available to all ChatGPT Plus users. With Memory, your conversations with ChatGPT become progressively more personalized.
China Releases Competitors to OpenAI Sora & GPT-4 💪
China has been advancing its AI technology, fiercely competing with the best in the Valley. On one front, SenseTime has released RiRiXin SenseNova 5.0, an LLM that uses a hybrid architecture: cloud computing for powerful processing and edge computing for quick responses. According to SenseTime, it beats the latest version of GPT-4 Turbo across all benchmarks, with a 10% overall margin. On another front, Shengshu Technology has introduced Vidu, a powerful new text-to-video AI model that competes with OpenAI’s Sora. The model can create high-definition videos from simple text prompts.
Benchmark AI Models with Live Data from Chatbot Arena 📈
Establishing a reliable benchmark for LLMs that stays current and in line with human preferences is a challenge, and static benchmarks like MMLU are not enough to evaluate these models. To address this, LMSYS has introduced a new benchmark called Arena-Hard, which uses live data from the Chatbot Arena platform and a unique evaluation pipeline. At just $25 per model evaluation, it offers a more affordable evaluation method than existing benchmarks, thanks to efficient data collection and processing techniques.
GPT-4 is the Dumbest Model 🤦‍♀️
In a conversation with Stanford’s budding entrepreneurs, Sam Altman shared his insights on the future of AI and OpenAI’s approach to developing the technology responsibly. He said:
This is probably the best time to start an AI company
ChatGPT is not phenomenal, it is mildly embarrassing at best. GPT-4 is the dumbest model any of you will ever have to use again.
Whether we burn 500 million a year or 5 billion or 50 billion a year, I don’t care. As long as we can create way more value for society than that and pay the bills, we’re making AGI.
GPT-5 is gonna be a lot smarter than GPT-4, GPT-6 is gonna be a lot smarter than GPT-5, and we are not near the top of this curve.
and a lot more!
OpenAI Might be Releasing a Search Engine+ChatGPT 🌐
OpenAI appears to be getting ready to launch its own search engine or a similar AI product for finding information. Clues such as the domain name “search.chatgpt.com” were found in its recent files, and a launch date of May 9th is rumored.
ChatRTX Now Understands Images and Voice Commands 😎
Nvidia’s ChatRTX, the AI-powered chatbot app that lets you interact with your own data on your computer using natural language, has received some notable updates. It now understands images and voice commands, and it adds support for new AI models: Google’s Gemma and ChatGLM3 (a bilingual English and Chinese model). Until now, it was using Mistral 7B.
Claude 3 Goes Mobile and Expands for Businesses 🤳
Anthropic has released a Team plan that makes its Claude 3 models available to businesses, catering to the growing demand for AI solutions in enterprises. The company has also released the Claude iOS app, now available on the App Store. For $30 per user per month, the Team plan offers advanced features and controls designed specifically for businesses, while the iOS app brings the power of Claude 3 directly to your fingertips.
Llama 3 on Par with Gemini Pro and GPT-4 🧠
Llama 3 models are certainly impressive, but their limited context window of 8K tokens makes them unsuitable for tasks involving longer texts. Two separate teams have tackled this challenge head-on, significantly expanding the context window of Llama 3 models. First, Gradient AI has increased the context window of Llama 3 8B to a staggering 1 million tokens, making it the second LLM after Gemini Pro to have such a huge capacity. Second, Abacus AI has released 128K long-context support for Llama 3 70B, putting it head-to-head with GPT-4 on context length. Both models are available on Hugging Face.
Which of the above AI developments are you most excited about, and why?
Tell us in the comments below ⬇️
That’s all for today 👋
Stay tuned for another week of innovation and discovery as AI continues to evolve at a staggering pace. Don’t miss out on the developments – join us next week for more insights into the AI revolution!
Shubham Saboo - Twitter | LinkedIn ⎸ Unwind AI - Twitter | LinkedIn