LLM for Generating Synthetic Data 💡
PLUS: OpenAI hires former head of NSA, Robot learns from human actions
Today’s top AI Highlights:
Nvidia releases new suite of models to generate synthetic data for LLM training
Stanford researchers develop system for humanoid robots to shadow human movements
This company ditched the complex AI agent and found success
Retired US Army General and cybersecurity expert joins OpenAI Board
OpenAI might change into a for-profit company
& so much more!
Read time: 3 mins
Latest Developments 🌍
Cybersecurity Expert Joins OpenAI’s Board 🔐
Former head of the National Security Agency, retired US Army General, and a cybersecurity expert, Paul M. Nakasone has joined OpenAI’s Board. As their first priority, Nakasone has been appointed as a member of OpenAI’s Safety and Security Committee.
This Committee, formed by OpenAI two weeks ago, is responsible for making recommendations on critical safety and security decisions for OpenAI projects and operations. It will recommend processes and safeguards in 90 days to the Board.
While Nakasone’s insights can help build safer and more responsible systems, the timing suggests that OpenAI is more concerned with the optics, aiming to show that they are taking this issue seriously when there is growing criticism on how the company is prioritizing commercializing their products.
Further, OpenAI CEO Sam Altman has reportedly told some shareholders that the company is considering changing its governance structure to a for-profit business that the company’s nonprofit board doesn’t control.
Nvidia’s Toolkit for Generating Synthetic Data for LLMs 🧰
Nvidia has released an open-source toolkit to generate synthetic data for training LLMs. This new family of models, Nemotron-4 340B, offers a free and scalable way to address a major challenge in AI development: the lack of high-quality, readily available training data. Nemotron-4 340B models not only generate synthetic data but also assess its quality to build robust and reliable LLMs.
Key Highlights:
A Complete Pipeline - The toolkit includes three models – a base model, an instruct model, and a reward model – that work together to create synthetic data and then evaluate its accuracy, coherence, and other key characteristics.
Optimized for Efficiency - Nemotron-4 340B works seamlessly with Nvidia’s NeMo framework for model training and the TensorRT-LLM library for optimized inference to quickly train and deploy their models.
Customization and Flexibility - The base model can be further customized with proprietary data and the HelpSteer2 dataset by Nvidia, a collection of text and code examples to help developers fine-tune their models for specific domains and applications.
Robots Learn from Human Demos in Real-Time 👬
Training humanoid robots to perform complex tasks has been challenging as they can’t replicate human-like movements and due to lack of efficient data collection methods. Now, Stanford researchers have developed a new system called HumanPlus that allows robots to learn from human demonstrations, overcoming these limitations.
HumanPlus uses a two-pronged approach: it enables real-time shadowing of human motions by robots, and then uses this data to train a system for robots to perform the tasks autonomously. This system can help develop robots that can perform a wide range of tasks in real-world environments.
Key Highlights:
Real-Time Shadowing - HumanPlus uses a single RGB camera to track human motion and translate it into robot movements. It allows human operators to “teleoperate” the robot. This approach eliminates the need for expensive motion capture systems and enables training in more diverse settings.
Learning from Human Demonstrations - HumanPlus learns from a 40-hour human motion dataset, allowing the robot to mimic a wide range of skills. The system uses a transformer-based architecture to learn from both the robot’s own sensor readings and the visual input from its cameras, improving its ability to generalize to new tasks.
High Success Rates - The research team tested HumanPlus on tasks like wearing a shoe, unloading objects, folding clothes, and interacting with other robots. The robot achieved 60-100% success rates with just 40 demonstrations, demonstrating the system’s effectiveness in real-world scenarios.
Agents Aren’t All You Need 🤖
AI automation is changing the way businesses operate, increasing efficiency and reducing costs. However, building reliable AI agents for complex workflows, like compliance processes, is challenging. Many companies have faced difficulties with the complexity of dynamic planning and the inherent unpredictability of these systems.
Parcha, a company automating workflows, started its journey towards building fully autonomous agents, only to learn that a simpler approach could deliver greater value. Their experience is important for anyone working with AI, especially those building systems for real-world applications.
Key Highlights:
Focus on reliability and accuracy over autonomy - Parcha realized that building reliable agentic behavior with LLMs would require significant resources and time. They decided to focus on creating tools that were reliable and accurate for specific tasks, rather than striving for complete autonomy.
Modular approach for scalability and efficiency - They developed a modular approach where AI tools are built as decoupled, reusable blocks. This allows for independent evaluation and optimization of each component. It also enables them to parallelize tasks for efficiency gains.
Use LLMs strategically for specific tasks - Parcha uses different LLMs strategically based on task complexity. For straightforward extraction tasks, they use inexpensive models, while reserving more robust models for complex reasoning and judgment tasks.
Transparency and auditability are key - They provide a clear and human-readable explanation of the thought process behind each decision. This transparency is crucial for audit logs and compliance reporting so that customers can trust and verify the results.
“To put this into perspective, if an AI agent carries out a workflow consisting of 10 tasks autonomously but has a 10% error rate per task, the compounded error rate over the whole workflow is 65%.”
😍 Enjoying so far, share it with your friends!
Tools of the Trade ⚒️
Impel: AI tool for Mac that automates tasks by learning your workflow and acting instantly when it can help. It can summarize content, manage tasks, handle authentication, and provide answers to questions about any content, while ensuring your data privacy with on-device processing.
Jace: Your AI coworker that can handle any task for you with a single instruction. Jace can control and perform actions in a browser just like a human would. It can book trips, pay invoices, set up job posts, and more.
Thinkbuddy: A MacOS app that provides unlimited access to leading AI models like GPT-4o, Claude, and Gemini, all with a maximum $25 monthly fee. It offers features such as voice input, screen capture, customizable shortcuts, and model switching to enhance productivity and streamline workflow.
Awesome LLM Apps: Build awesome LLM apps using RAG for interacting with data sources like GitHub, Gmail, PDFs, and YouTube videos through simple texts. These apps will let you retrieve information, engage in chat, and extract insights directly from content on these platforms.
Hot Takes 🔥
The former head of the NSA may be a great guy. But you don’t put the former head of the NSA on your board (as OpenAI just did) because he’s nice. You put him there to signal that you’re open to doing business with the IC and DoD. ~
Matthew GreenI suspect we have a better idea now of what Ilya saw. ~
Harrison Kinsley
Meme of the Day 🤡
Google AI overview fiasco, explained
That’s all for today! See you tomorrow with more such AI-filled content.
Real-time AI Updates 🚨
⚡️ Follow me on Twitter @Saboo_Shubham for lightning-fast AI updates and never miss what’s trending!
PS: I curate this AI newsletter every day for FREE, your support is what keeps me going. If you find value in what you read, share it with your friends by clicking the share button below!