Revolutionizing Language Models with InstructLab
IBM Research, in partnership with Red Hat, has introduced InstructLab, a pioneering open-source project that simplifies the collaborative customization of large language models (LLMs). This project aims to streamline the incorporation of community contributions into base models, reducing the time and effort involved significantly.
Innovative Approach
- InstructLab combines human-curated data with high-quality examples generated by an LLM.
- This approach reduces the data creation costs and enhances the base model without requiring complete retraining.
- IBM Research leveraged InstructLab to create synthetic data for advancing its open-source Granite models for language and code.
Recent Developments
- Researchers utilized InstructLab to improve an IBM 20B Granite code model for modernizing software on IBM Z mainframes.
- This process showcased speed and effectiveness, leading to a strategic collaboration between IBM and Red Hat.
- The Watsonx Code Assistant for Z, IBM’s mainframe modernization solution, underwent enhancements using InstructLab capabilities.
Functionalities of InstructLab
- InstructLab offers a command-line interface (CLI) for adding alignment data to the target model through a GitHub workflow.
- The backend relies on IBM Research’s LAB method for synthetic data generation and phased training to create high-quality data for specific tasks.
- Community members can experiment with models, submit improvements on GitHub, and contribute to fine-tuning the base model.
Community Collaboration and Open Source
- InstructLab promotes community engagement by enabling users to enhance IBM models and contribute to the project via GitHub.
- IBM’s AI supercomputer, Vela, supports updating InstructLab models regularly, with plans to include other public models as the project expands.
- The project operates under the Apache 2.0 license, ensuring transparency and collaboration in model customization.