October 14, 2024

•

5 min read

The The AI Scientist Due Dilligence Report

Pablo Sanchéz

AI Researcher, Co-founder at Intor AI

Executive Summary

The AI Scientist by Sakana AI represents the first approach to automating the scientific research process. By leveraging state-of-the-art Large Language Models (LLMs), the AI Scientist can generate research ideas, run experiments, and even produce and review scientific papers autonomously. This technology could potentially disrupt traditional research methodologies, offering significant advantages in terms of cost, speed, and scalability. However, it also presents notable risks and limitations that must be carefully considered.

Story of the company

Sakana AI was founded in 2023 by a team of top researchers including Llion Jones, who previously worked at Google Research and is co-author of one of the most influential papers in AI in the last decade, namely “Attention is All you Need” (https://arxiv.org/abs/1706.03762), the paper that set the foundations for current LLMs. The company's name, "Sakana," means "fish" in Japanese, symbolizing their aim to navigate the vast ocean of knowledge and push the boundaries of artificial intelligence in scientific discovery. The company's flagship project, The AI Scientist, targets one of the most ambitious goals in AI research: creating an autonomous system capable of conducting scientific research end-to-end.

The company's decision to open-source key components of their work shows their commitment to transparency and to democratize scientific discovery and accelerate progress across multiple disciplines, starting with machine learning.

Funding and Investors

Sakana AI has recently secured a Series A funding round, raising $100 million [1][3]. Key investors include [2]:

NEA (New Enterprise Associates), who co-led the round
Khosla Ventures and Lux Capital (existing investors)
NVIDIA, joining as a new strategic investor

Additionally, Sakana AI has received support from the Japanese government, granting the company access to national data center clusters for research purposes.

Team

Sakana AI has brought together a talented and diverse team with expertise in many different fields. Our analysis of the team composition reveals several key strengths that position the company as one of the leaders in AI-driven scientific discovery.

Geographic Distribution and Global Perspective: While the majority of the team is based in Japan, primarily in and around Tokyo, Sakana AI has also attracted talent from international locations, including the United Kingdom and the United States. This mix provides a blend of local expertise and global perspectives, potentially fostering innovation through diverse cultural and academic experiences.

Educational Background: The team of around 30 people has strong academic credentials from world-renowned institutions such as the University of Tokyo, Kyoto University, and prestigious international institutions like Stanford University and the University of Oxford.

The fields of study of the members are particularly well-aligned with the company's mission including a blend of Computer Science, Statistics, and Physics. This team composition shows several key strengths:

Technical Expertise: The concentration of team members with backgrounds in Computer Science and AI aligns perfectly with the technical demands of the AI Scientist project.
Balance of Research and Application: The mix of research and engineering roles indicates a team capable of both theoretical innovation and practical implementation, key for bridging the gap between AI advancements and real-world scientific discovery.
Global Perspective: While primarily based in Japan, the international team members bring diverse experiences, potentially fostering a more global outlook and approach to AI development.
Business Orientation: The presence of business development roles suggests a focus on practical applications and market viability of their research, which is essential for long-term success and impact.

Technology Overview

The AI Scientist developed by Sakana AI represents a novel integration of several novel AI technologies:

Large Language Models (LLMs)

At the core of The AI Scientist are large language models, which power various stages of the scientific process. These models can generate human-like text, write code, and perform complex tasks that some consider akin to reasoning.

The AI Scientist leverages these capabilities to:

Generate novel research ideas
Plan and design experiments
Write and execute code for experiments
Analyze and interpret results
Write full scientific papers
Conduct peer reviews

Chain-of-Thought and Self-Reflection

The AI Scientist incorporates prompting techniques such as chain-of-thought reasoning and self-reflection. These methods enhance the LLM's decision-making capabilities, allowing for more coherent and logical idea generation, experimental planning, and paper writing.

Automated Code Generation

A key component of The AI Scientist is its ability to write and modify code autonomously. This is achieved through the integration of Aider, a state-of-the-art LLM-based coding assistant. Aider enables The AI Scientist to:

Implement proposed ideas by modifying existing codebases
Debug and refactor code as needed
Adapt experiments based on intermediate results

Automated Paper Review System

Sakana AI has developed an LLM-based reviewing process that achieves near-human-level performance in evaluating scientific papers. This system allows for:

Objective assessment of generated papers
Selection of the best ideas for "publication"
Iterative improvement of the research quality

Open-Ended Discovery Loop

The AI Scientist is designed to operate in an open-ended loop, where:

New ideas are generated based on previous discoveries
Experiments are conducted and papers are written
Papers are reviewed and added to an archive of scientific findings
The process repeats, building upon the growing knowledge base

This iterative approach mimics the human scientific community's process of continuous discovery and improvement.

Strengths

Full Automation: The AI Scientist automates the entire research pipeline, drastically reducing the time and cost typically associated with scientific research. This can accelerate innovation and democratize access to high-quality research.
Cost Efficiency: Producing a scientific paper for as little as $15 is a game-changer, especially for early-stage research projects that may not have substantial funding. The low operational cost could make this technology highly scalable.
Broad Applicability: While currently focused on machine learning, the technology is versatile and can be applied to other scientific domains that rely on programming for the experimentation.
Innovative Peer-Review Mechanism: The AI’s ability to review its own papers and provide iterative improvements is unique. It mimics the human scientific community, creating a feedback loop that refines the quality of research over time.

Weaknesses

Reasoning Limitations: The reasoning capabilities of large language models have been a subject of ongoing debate in the AI community. While these models have demonstrated remarkable abilities in understanding context, making logical inferences, and solving complex problems, they also exhibit surprising failures in seemingly simple reasoning tasks (https://arxiv.org/abs/2302.04023). This duality highlights the complexity of assessing the true cognitive capabilities of LLMs. Many professionals in the field, myself included, argue that what appears to be reasoning might actually be sophisticated pattern matching based on the vast amount of data these models are trained on, rather than genuine understanding or logical reasoning. The AI Scientist has shown the potential of LLMs to contribute to scientific research, however, we may not yet fully comprehend the extent of their reasoning abilities.
Experimental Challenges: While The AI Scientist excels at computational experiments, it cannot currently directly interact with physical laboratory equipment. This limits its application in experimental sciences that require hands-on manipulation of materials or biological samples. Future integration with automated lab equipment could potentially address this limitation for physical sciences.
Review Challenges: The AI sometimes fails to correctly evaluate results, which can compromise the integrity of its findings. Additionally, this review process, while automated, is not yet on par with human peer review, often missing subtle but critical details.
Repetitive Idea Generation: The AI tends to generate similar ideas across different runs, indicating a lack of true novelty. This could limit the diversity and impact of its research contributions.
Bias and Error Risks: There are notable concerns about the potential to introduce biases or errors, especially when it hallucinates details or makes incorrect assumptions about data and experimental conditions.

‍

Codebase

Sakana AI has open-sourced The AI Scientist codebase, available at https://github.com/SakanaAI/AI-Scientist. The analysis of the codebase reveals both strengths and areas for improvement:

Strengths

Containerization: The use of Docker for experiments is a notable strength, ensuring consistency and reproducibility across different environments.
Open-source nature: The public availability of the codebase promotes transparency and allows for community contributions, potentially accelerating the development and improvement of The AI Scientist.

Areas for Improvement

Lack of Type Annotations: The absence of type hints makes the code less self-documenting and more prone to type-related errors.
Suboptimal Folder Structure: The codebase could benefit from a more organized and intuitive folder structure, adhering to established best practices for project organization.
Absence of Testing: The lack of a comprehensive testing suite makes it challenging to ensure code reliability and makes future modifications riskier.
Hardcoded Values: The presence of hardcoded values throughout the codebase reduces flexibility and makes the system less adaptable to different scenarios or configurations.
Functional vs. Object-Oriented Paradigm: While functional programming is not inherently bad, the exclusive use of this paradigm in a complex system like The AI Scientist may limit code reusability and make the system harder to extend.

Recommendations

To improve the codebase quality and maintainability, Sakana AI could consider:

Implementing type annotations to enhance code readability and catch potential errors early.
Restructuring the project folders following established Python project conventions.
Developing a comprehensive test suite, including unit and integration tests.
Replacing hardcoded values with configuration files or environment variables.
Evaluating the potential benefits of incorporating object-oriented design principles where appropriate.

These improvements could significantly enhance the codebase's robustness, maintainability, and potential for community contributions. Despite these areas for improvement, the open-source nature of the project remains a significant strength, allowing for collaborative improvement of The AI Scientist's implementation.

Competitors

While Sakana AI's The AI Scientist represents a significant leap in automated scientific discovery, several companies and research groups are working on related technologies:

DeepMind: Known for its work on AI systems that can assist in scientific research, such as AlphaFold (https://deepmind.google/technologies/alphafold/) for protein structure prediction.
OpenAI: Develops large language models and has shown interest in AI-assisted scientific discovery [4].
IBM Research: Has a history of working on AI for scientific discovery, including projects like IBM RXN for chemistry (https://rxn.app.accelerate.science/rxn/sign-in).

While these competitors are working on AI-assisted scientific discovery, Sakana AI's The AI Scientist stands out for its end-to-end automation of the entire scientific process, from idea generation to paper writing and peer review.

Opportunities

Improved version of the AI Scientist will present numerous opportunities for Sakana AI and the broader scientific community:

Democratization of research: By significantly reducing the cost and time required for scientific discovery, The AI Scientist could enable more individuals and organizations to contribute to scientific progress.
Accelerated scientific progress: The ability to generate and test hypotheses at scale could lead to faster breakthroughs in various fields, particularly in machine learning and AI.
Cross-disciplinary applications: While currently focused on machine learning, The AI Scientist's framework could potentially be adapted for use in other scientific disciplines such as biology, physics, or chemistry.
AI-human collaboration: The system could serve as a powerful tool for human researchers, augmenting their capabilities and helping to explore vast solution spaces more efficiently.
Commercial applications: The technology could be licensed to research institutions, pharmaceutical companies, or other organizations seeking to accelerate their R&D processes.

Risks

Despite its potential, The AI Scientist and similar technologies face several risks and challenges:

Ethical concerns: Automated scientific discovery raises questions about the role of human creativity and intuition in research, as well as potential biases in AI-generated hypotheses.
Quality control: Ensuring the reliability and reproducibility of AI-generated research will be crucial to maintain scientific integrity.
Overreliance on AI: There's a risk that overreliance on automated systems could lead to a decline in human scientific skills and critical thinking.
Intellectual property issues: The automated generation of ideas and papers may complicate traditional notions of authorship and patent rights.
Job displacement: Widespread adoption of AI scientists could potentially impact employment in the research sector.
Misinformation risk: If not properly controlled, AI-generated scientific content could contribute to the spread of misinformation or low-quality research.
Security concerns: The powerful capabilities of The AI Scientist could potentially be misused for generating harmful content or conducting unethical experiments.
Regulatory challenges: As AI takes on a more significant role in scientific discovery, new regulations may be needed to govern its use and ensure responsible development.
Resource intensity: The computational resources required to run advanced AI models may limit accessibility and raise environmental concerns.
Overwhelming the Review System: If not managed carefully, the widespread adoption of AI-generated research could overwhelm existing peer-review processes, reducing the quality and rigor of published research.

Conclusion

Sakana AI's The AI Scientist represents a groundbreaking advancement in automated scientific discovery. The system's ability to generate novel ideas, conduct experiments, and produce publishable papers autonomously positions it as a potential game-changer in the scientific community. The near-human-level performance in paper reviewing and the cost-effectiveness of paper production ($15 per paper) are particularly impressive features. The open-source nature of the project and the team's academic backgrounds lend credibility to the technology. However, the technology is still in its early stages, and its full potential and limitations remain to be seen. The ethical and quality control challenges associated with AI-driven research will need to be carefully addressed as the technology develops.

Recommendations:

For investors: Sakana AI presents an exciting investment opportunity in the rapidly growing field of AI-driven scientific discovery. The potential for cross-disciplinary applications and the democratization of research could lead to significant returns. However, technical due diligence should focus on the scalability of the technology, its performance in peer-reviewed settings, and the company's strategy for addressing ethical and quality control concerns.

For AI researchers and enthusiasts: The open-source nature of The AI Scientist provides an excellent opportunity for collaboration and further development. Researchers should consider contributing to the project or using it as a foundation for related work in automated scientific discovery.

For scientific institutions: Institutions could consider pilot programs to explore how this technology can be used as a complementary tool to augment existing research processes.

For Sakana AI: The company should focus on:◦ Expanding the application of The AI Scientist to other scientific domains◦ Developing robust quality control mechanisms◦ Engaging with ethicists and policymakers to address potential concerns◦ Exploring commercial partnerships with research institutions and R&D-intensive companies

In conclusion, Sakana AI's The AI Scientist represents a significant leap towards fully automated scientific discovery. However, it is still a prototype. While challenges remain, the potential impact on accelerating scientific progress and democratizing research is substantial. As the technology matures, it could play a crucial role in addressing some of the world's most pressing scientific challenges.

Citations

[1] https://app.dealroom.co/companies/sakana_ai[

2] https://www.nea.com/blog/our-investment-in-sakana-ai-pioneering-japans-ai-future

‍[3] https://500.co/theglobalvc/sakana-ai-investing-in-global-pioneers-in-ai

[4] https://www.reuters.com/technology/sam-altmans-ouster-openai-was-precipitated-by-letter-board-about-ai-breakthrough-2023-11-22/

Relevant Links

https://sakana.ai/ai-scientist/

https://arxiv.org/abs/2408.06292

https://github.com/SakanaAI/AI-Scientist