The Bright Future of Materials Science with AI: Self-Driving Laboratories and Closed-Loop Discovery
Dilshod Nematov1,2*, Iskandar Raufov1, Saidjaafar Murodzoda1, Anushervon Ashurov1, Sakhidod Sattorzoda1
1Quantum Electronics Laboratory, S.U. Umarov Physical-Technical Institute of National Academy of Sciences of Tajikistan, Dushanbe, Tajikistan
2School of Optoelectronic Engineering & CQUPT-BUL Innovation Institute, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
*Correspondence to: Dilshod Nematov, PhD, Quantum Electronics Laboratory, S.U. Umarov Physical-Technical Institute of National Academy of Sciences of Tajikistan, Dushanbe, 734042, Tajikistan; Email: dilnem@mail.ru
DOI: 10.53964/jmn.2025002
Abstract
Next-generation autonomous laboratories that combine machine learning (ML), robotic synthesis, and in situ characterization is rapidly emerging as one of the key directions in modern materials science. By integrating active and deep learning algorithms with multi-level datasets ranging from quantum mechanical simulations to high-throughput flow-based experiments, these platforms establish closed-loop frameworks that guide complex nanofabrication processes in real time. Such AI-driven systems lower material and energy consumption, enable the precise design of nanostructures with tailored optical, electronic, and mechanical properties, and create research opportunities that were previously difficult to realize. This review provides a comprehensive overview of self-driving laboratories (SDLs), including their architectures, algorithmic strategies, and practical demonstrations such as the optimization of perovskite nanostructures, the development of nanoparticles for targeted drug delivery, and the synthesis of quantum dots with controlled emission. It also discusses existing methodological limitations, the need for standardized data practices, and challenges related to the integration of in situ techniques, while highlighting the prospects for creating fully digital pipelines for material production, as evidenced by recent autonomous laboratory demonstrations that progressed from initial hypotheses to functional prototypes within remarkably short timeframes. Furthermore, the self-driving lab paradigm is beginning to transition from academic laboratories into industrial research and development. This emerging trend suggests that SDL techniques will soon accelerate discovery and development in sectors such as pharmaceutical research and chemical manufacturing.
Keywords: artificial intelligence; machine learning, self-driving labs, autonomous material discovery, nanomaterials, in situ characterization, inverse design, bayesian optimization, reinforcement learning, digital twins; explainable ai
1 INTRODUCTION
Autonomous laboratories represent a significant development in modern materials science. These systems integrate machine learning (ML), robotic synthesis, and in situ characterization to create closed-loop research environments. By combining active learning algorithms with multilevel datasets from quantum-mechanical simulations to high-throughput flow experiments, they establish real-time theory-experiment frameworks capable of guiding complex nanofabrication processes[1-9]. The field addresses fundamental challenges in traditional materials discovery[10-16]. Conventional approaches often involve slow, sequential experimentation with high resource consumption and limited exploration of complex parameter spaces. The interpretation of high-dimensional data from advanced characterization techniques presents additional difficulties[17-22]. Data fragmentation across proprietary formats and incompatible systems further hinders scientific progress and reproducibility[23].
One crucial challenge is the lack of standardized data formats in materials research, a hurdle that the community is now actively addressing. Initiatives like the Materials Genome Initiative (MGI) have underscored the importance of common data infrastructures and sharing protocols to accelerate discovery[23]. In practice, this has led to the development of frameworks such as NIST’s Materials Data Curation System (MDCS), which provides tools for capturing and sharing materials data in structured, XML-based formats[22]. Using templated schemas, platforms like MDCS enable researchers to upload experimental and simulation data in a consistent way, facilitating interoperability between different Self-driving laboratory (SDLs). For example, an experiment’s results stored via a standard schema can be directly parsed and used by another lab’s Artificial intelligence (AI) model, reducing duplication and errors. Adopting FAIR data principles (Findable, Accessible, Interoperable, Reusable) and community-agreed ontologies for material properties and processes is increasingly seen as essential. These efforts aim to ensure that as SDLs generate vast datasets, the data can be easily exchanged, validated, and built upon across laboratories and industrial partners.
AI-driven platforms significantly reduce material and energy requirements while enabling precise engineering of nanostructures with tailored functionalities. They facilitate research directions previously considered impractical, particularly for metastable phases[19] and reactions with narrow processing windows. These systems automate the entire research cycle from hypothesis generation and experimental design to execution and analysis, enabling autonomous selection of subsequent experiments without human intervention[1,8,10].
This review examines the current state of self-driving laboratories through several key aspects. We analyze architectural principles and algorithmic strategies, including Bayesian optimization (BO)[8,16], reinforcement learning (RL), and generative models for inverse design[20,22]. Particular attention is given to automated methods for processing scientific literature[12,21]. We evaluate demonstrated capabilities across diverse material systems, covering perovskite nanostructures for optoelectronics[24-33], drug delivery nanoparticles[34], and quantum dots with controlled emission profiles[7,15]. The assessment includes performance benchmarking and comparative analysis of different platforms.
Critical examination addresses persistent challenges such as data standardization issues[23], limited data availability in unexplored chemical domains, model interpretability needs for high-stakes applications[24-35], and cybersecurity concerns[25]. As SDL platforms become more networked and data-driven, cybersecurity emerges as a critical concern. Autonomous labs often involve remote access to robotic equipment and cloud-connected databases, which could be vulnerable if not properly secured. A breach or malicious attack on an SDL might lead to faulty experiments or theft of valuable proprietary data. Ensuring robust cyber-physical security measures, such as encrypted communication, authentication protocols for remote experiment control, and real-time intrusion detection is therefore paramount. For example, IBM’s RoboRXN team highlighted the need for secure handling of proprietary chemical data in cloud-based labs, introducing features to allow AI model training on sensitive datasets without exposing them[35]. Similarly, SDL developers are beginning to adopt industry cybersecurity standards (like ISO 27001 for information security) to protect intellectual property and experiment integrity. In addition, data governance policies are needed to control who can access or modify an experiment’s parameters in a self-driving lab. By addressing these data protection issues proactively, researchers can safeguard the reliability of SDL outcomes and build trust for wider deployment in corporate and defense settings.
The discussion extends to future development pathways including accessible virtual laboratories[29], multi-agent architectures[3], and integration with industrial-scale manufacturing[26]. Recent advances through platforms like A-Lab[11,12], AlphaFlow[8], and Rainbow[30] demonstrate the transition from proof-of-concept demonstrations to functional research systems. These developments highlight the potential for rapid progression from initial hypotheses to practical implementations. The following sections provide detailed analysis of these advancements, current limitations, and future directions for autonomous materials discovery.
2 REVIEW METHODOLOGY
Our literature review methodology employed systematic search strategies across major scientific databases. We utilized Scopus and Web of Science as primary sources for peer-reviewed publications. The arXiv preprint server provided access to emerging research developments. Our search incorporated multiple key terms including "", "autonomous materials discovery", "closed-loop synthesis", "BO", "materials informatics", and "inverse design". The analysis focused on publications from 2020 to 2025 to capture current advancements. We prioritized high-impact studies and pioneering experimental demonstrations in our selection process. Articles underwent screening based on their relevance to architectural frameworks, algorithmic approaches, and instrumental implementations of autonomous laboratories. The comparative evaluation in Table 1 derives from examining core technologies, application domains, and performance metrics from leading studies in this field. This approach ensured comprehensive coverage of both established and emerging research directions while maintaining focus on practically implemented systems and their experimental validation. No AI-assisted tools were used in the preparation, analysis, or writing of this review manuscript.
3 ADVANCES IN AUTONOMOUS LABORATORY ARCHITECTURES
Over the past decade, AI has shifted from a supporting instrument for data analysis to a primary engine of progress in materials science and nanotechnology when paired with autonomous laboratories[1-4], a transition reflected in SDLs that assemble robotic synthesis modules, versatile in situ characterization systems, and ML algorithms spanning generative models, BO, and RL[5-7]. A key advantage of these platforms is the closed-loop workflow that enables automated hypothesis generation, experimental design, synthesis, data acquisition and analysis, and the selection of the next experiment without human intervention[8-10], which shortens research cycles while improving reproducibility and overall data quality. A representative case is the A-Lab platform at Lawrence Berkeley National Laboratory, which achieved the first fully autonomous solid-state synthesis of inorganic compounds[11] by combining large-scale literature mining with outputs from density functional theory to plan synthesis steps, operate robotic dispensers and furnaces, and analyze powder diffraction data in real time using ML models[12-14]; this integration allows discovery of previously unknown phases within days rather than weeks or months and has already been applied to the synthesis of complex oxides with promising thermoelectric performance[12].
Progress in SDLs increasingly relies on the incorporation of flow chemistry, since flow reactors offer continuous control over composition and conditions, expand operational flexibility, and markedly increase the density of experimental data[7,8,14]. Work by Delgado-Licona and colleagues[15] showed that dynamic flow conditions in autonomous quantum dot synthesis expanded the volume of usable data by more than an order of magnitude and enabled rapid identification of conditions that deliver targeted size distributions and spectral properties, while the AlphaFlow platform integrates microfluidic channels with BO and active learning to support multistep reactions with precise parameter control at each stage[16].
Advances in SDLs also depend on the deep integration of in situ and in operando methods, since ML control of X-ray diffraction (XRD), Raman spectroscopy, and absorption spectroscopy can target the most informative spectral or angular regions, thereby reducing total experiment time and instrument load[17,18], and since the real-time theory-experiment loop implemented by Liang and co-authors[19] updates reaction parameters directly from synthesis outcomes without human intervention, a capability that is particularly important for metastable nanophases and reactions with narrow parameter ranges. Interest in generative models for inverse design continues to grow because researchers can specify target properties such as band gap or absorption coefficient and then evaluate structures proposed by the model as likely to meet those targets, as demonstrated by the Generative Toolkit for Scientific Discovery (GT4SD), an extensible open-source library for organic materials design with experimental validation[20], by a comprehensive perspective on foundational modeling that spans property prediction to molecular generation[21], and by MatterGen, a diffusion-based generative framework for inorganic materials that led to synthesis of a generated structure with properties within 20 percent of the target[22].
Despite rapid progress, several obstacles continue to limit broader adoption, with the absence of widely accepted standards for data visualization and exchange ranking among the most consequential because laboratories often maintain incompatible formats that complicate the integration of simulators, robotic systems, and AI modules; for example, XRD data stored in a proprietary format may require manual conversion before any ML agent in another laboratory can read it, which introduces delays and potential errors. The MGI workshop report[23] therefore calls for standardized data formats and APIs and for digital infrastructure that ensures interoperability across platforms. A further limitation arises from the scarcity of large training datasets for emerging material systems, which pushes ML into data-poor regimes and constrains predictive power in unexplored chemical spaces, while explainability remains a concern because decisions without transparent rationales face barriers in domains demanding strict validation such as pharmaceuticals or nuclear energy[24]. Cybersecurity risks add another layer of difficulty because SDLs are networked and thus exposed to disruption and data breaches, including threats to intellectual property[25-28], and these issues are intertwined since the lack of data standards makes explainability and robust cybersecurity more difficult to achieve.
To address these challenges, researchers are turning to new paradigms that aim to make SDLs more scalable, interoperable, and economically feasible. One promising direction is the emergence of so-called “frugal twins,” simplified digital replicas of laboratory environments that allow researchers to pre-optimize synthesis pathways and experimental workflows in a virtual setting before committing resources to costly robotic experiments. These lightweight digital twins, which can be implemented in cloud infrastructures, have been shown to accelerate design-make-test cycles while simultaneously lowering entry barriers for institutions with limited access to advanced robotics, thus making autonomous experimentation more democratized[2,4,26]. However, because frugal twins rely on simplified digital models, the predictive reliability and robustness of such surrogates still require further validation before broad adoption[2,4]. Importantly, the concept of frugal twins extends beyond mere cost reduction, where they create opportunities for running large sets of “what-if” experiments in silico, using ML surrogates and coarse-grained models to filter out suboptimal conditions before physical testing begins. In recent demonstrations, frugal twins were coupled to cloud laboratories, enabling dozens of geographically dispersed research groups to replicate identical experiments with minimal variation (Figure 1), effectively building a distributed ecosystem of SDLs working on harmonized protocols[29].
|
Figure 1. Interconnected Challenges Slowing the Advancement of Platforms[29].
Such integration of digital twins and real-time experimentation not only facilitates reproducibility but also strengthens the long-term goal of open science, where data and methods can be seamlessly shared and verified across laboratories worldwide. The ability to replicate conditions virtually also helps address issues of safety and sustainability: potentially hazardous chemical routes can first be evaluated computationally, reducing unnecessary exposure to toxic reagents, while simultaneously optimizing the use of scarce or expensive precursors. These features position frugal twins as indispensable in the democratization and scaling of autonomous materials discovery, bridging the gap between theory-driven modeling and real laboratory practice.
The issue of scalability is also being addressed through the design of multi-robot self-driving laboratories, among which the Rainbow platform represents a representative example. Rainbow is distinguished by its modular and parallelized architecture, integrating multiple robotic arms, liquid handling modules, and photonic characterization systems, all coordinated by a central ML controller[5-7,10]. Unlike earlier single-arm SDLs that performed sequential optimization, Rainbow can execute combinatorial parameter sweeps in parallel, thereby exploring much larger regions of chemical space in the same timeframe. Its implementation of real-time photoluminescence (PL) monitoring allows the system to detect and respond to subtle changes in perovskite nanocrystal emission profiles, thereby guiding the optimization process with both speed and accuracy. Benchmarking studies revealed that Rainbow identified optimal perovskite nanocrystal compositions in a fraction of the time needed by conventional SDLs, with significantly reduced experimental noise[30,10]. This demonstrates Rainbow’s high-throughput discovery potential and serves as a proof-of-concept for the next generation of autonomous laboratories where parallelization, adaptability, and modularity are standard features (Figure 2).
|
Figure 2. Schematic representation of Rainbow’s components and automated functionalities[30]. (A) Rainbow consists of a characterization robot (Robot 1), a pipetting robot (Robot 2), a robotic arm (Robot 3), a labware refreshment robot (Robot 4), and an AI agent. The integrated hardware enables automated (i) multi-step synthesis, (ii) sampling, (iii) characterization, and (iv) labware restocking. (B) Photograph of Rainbow’s hardware, highlighting the pipetting robot (red), characterization robot (magenta), robotic arm (cyan), and labware refreshment robot (green).
A close examination of Rainbow’s performance metrics shows that error bars in measured PL quantum yields narrowed with successive closed-loop iterations, indicating an intrinsic capacity for self-correction and noise suppression. The figure illustrates a substantial reduction in variance across parallel reactors as the system converges toward optimal synthesis conditions, underscoring the robustness of multi-robotic workflows for reproducible nanomaterial discovery. In addition, Rainbow’s architecture points to the feasibility of scaling SDLs toward industrially relevant throughputs, where hundreds of compositions could be synthesized and characterized per day, feeding into ML models that progressively refine their predictive accuracy. This represents a concrete step toward bridging laboratory-scale demonstrations and pilot-scale manufacturing, a longstanding challenge in translational materials research.
Another critical advancement is the integration of natural language processing and algorithmic planning frameworks with autonomous laboratories. A recent study introduced a platform that combines automated literature analysis with the A-star path-finding algorithm (A*) path-finding algorithm for the synthesis of gold nanorods. The system was capable of scanning thousands of synthesis protocols in the literature, extracting relevant parameters, and building a knowledge base that guided robotic synthesis experiments[12,20,21,31]. By employing A* as the optimization engine, the platform systematically refined experimental parameters such as surfactant concentration, temperature profiles, and seed-to-precursor ratios. Notably, the system achieved highly reproducible nanorod synthesis with fine-tuned aspect ratios (Figure 3), resulting in sharp Ultraviolet–visible (spectroscopy) (UV–vis) absorption bands with linewidths narrowed below 3nm after a few optimization cycles[31]. This example illustrates an ongoing shift and SDLs are no longer isolated automation platforms but increasingly incorporate text-processing algorithms as knowledge modules that bridge scientific literature and laboratory practice. In doing so, they reduce the information bottleneck between human knowledge and robotic action, effectively converting text-based knowledge into actionable experimental instructions. The implications of this development are significant, as it opens the possibility of automated “literature-to-lab” pipelines where AI continuously digests the scientific corpus, identifies promising synthesis strategies, and validates them experimentally with minimal human intervention[20,21]. This ability to learn from both structured datasets and unstructured textual information has the potential to become a cornerstone of next-generation SDLs.
|
Figure 3. Literature mining and characterization of Au nanobipyramids (NBPs)[31]. (A) Workflow of the literature mining module for nanoparticle synthesis; (B) UV–vis spectra of Au NBPs synthesized with optimal parameters; (C) Transmission electron microscopy images of Au NBPs (scale bar=200nm); (D) Longitudinal length statistics; (E) Radial length statistics.
In parallel, structural characterization is evolving through approaches such as ScatterLab, which leverages total scattering and pair distribution function (PDF) data as direct input for inverse design loops[17-19]. Traditionally, structure determination in materials science has been limited by the need for predefined structural models, but ScatterLab bypasses this constraint by using ML to map experimental scattering data directly onto atomic configurations. In recent demonstrations, the system autonomously guided the synthesis of gold nanoparticles with desired structural motifs, selecting reaction pathways that maximized agreement between in situ scattering signals and target PDF features. The graphical representation (Figure 4) illustrates how iterative refinement of synthesis parameters rapidly converged toward the correct structural arrangement, even in the absence of human-supplied crystallographic models[32,17-19]. This capability is particularly significant for nanostructures and metastable phases where structural disorder obscures conventional diffraction patterns. By integrating scattering data into closed-loop experimentation, ScatterLab demonstrates the potential of SDLs to not only optimize macroscopic properties such as band gaps or catalytic activity but also directly engineer atomic-level arrangements in real time. The interpretation of the figure reveals that as the feedback loop progresses, misfit values between experimental and simulated PDFs decline sharply, providing a quantitative metric of convergence and improving transparency in discovery efficiency.
|
Figure 4. Workflow of ScatterLab Integrating Robotic Synthesis, Scattering Experiments, and BO[32].
Complementary to these technological advances, multi-agent architectures such as MAPPS (Materials Agent unifying Planning, Physics, and Scientists) highlight the growing importance of modular AI ecosystems in autonomous experimentation[3,16,24]. MAPPS unifies planning agents, physics-based simulators, and human-in-the-loop feedback modules into a coherent framework, thereby achieving resilience against errors and improving novelty in material proposals. Benchmarking against earlier Large language model (LLM)-based platforms showed that MAPPS improved stability of closed-loop experiments by a factor of five, while also expanding the diversity of candidate materials considered. Unlike single-agent models that often gravitate toward local optima, MAPPS encourages exploration of broader design spaces, mitigating the risks of premature convergence. The system has been demonstrated in case studies involving battery electrolytes and optoelectronic polymers, where the combination of simulation-driven priors and experimental validation led to the identification of previously untested but synthetically accessible candidates. Such multi-agent frameworks could represent a future pathway for SDL design, where a network of specialized AI agents collaborates with robotic hardware and human scientists to deliver balanced, interpretable, and reproducible discoveries.
To synthesize these developments, comparative analyses have begun to map the rapidly diversifying SDL ecosystem. A tabular summary of recent platforms contrasts Rainbow, Generative pre-trained transformer (GPT)+A*, ScatterLab, AlphaFlow, and frugal twins in terms of architectural principles, targeted material systems, and demonstrated performance (Table 1). From this comparison, two trends emerge clearly: first, SDLs are moving toward ever-greater levels of integration, combining robotic modules, real-time characterization, and cognitive engines in a single workflow; second, reproducibility and interpretability are becoming as important as speed and throughput, reflecting the community’s recognition that scientific trust depends on transparent and verifiable decision-making processes. The analysis presented in this review directly reflects these emerging trends. The discussion of Rainbow’s architecture highlights the potential of parallelization, the GPT+A* schematics illustrate the fusion of literature and laboratory, ScatterLab’s convergence trajectories exemplify atomic-level structural control, and frugal twin frameworks emphasize the democratization and accessibility of autonomous experimentation. Taken together, these cases demonstrate that the evolution of SDLs is no longer determined by hardware or algorithms in isolation but by the holistic integration of digital infrastructure, robotic capability, AI cognition, and human oversight. This synthesis points toward a future in which autonomous discovery becomes not only faster but also more collaborative, interpretable, and impactful for the broader scientific enterprise.
Table 1. Comparative Summary of Recent SDL Platforms for Nanomaterials
Platform |
Year |
Core Technology |
Application Domain |
Source |
Rainbow |
2025 |
Multi-robot parallelized workflow with real-time PL monitoring |
Perovskite nanocrystals, optoelectronics |
[30] |
GPT + A* |
2025 |
GPT literature mining + A* path-finding optimization |
Gold nanorods, nanophotonics |
[31] |
ScatterLab |
2025 |
Total scattering + PDF-driven inverse design |
Nanoparticles, metastable phases |
[32] |
AlphaFlow |
2023 |
Autonomous multi-step flow synthesis with BO |
Nanocrystals, inorganic materials |
[8] |
Frugal Twin |
2025 |
Cloud-based digital twins, cost-efficient virtual labs |
Generalizable across nanomaterials |
[29] |
In summary, the evolution of SDLs is advancing along two primary tracks: first, toward greater hardware integration and parallelization to drastically increase experimental throughput, and second, toward deeper cognitive capabilities that incorporate prior knowledge and reason across complex constraints. Platforms like Rainbow and multi-agent architectures such as MAPPS exemplify these trends, highlighting a shift from isolated automation to holistic, AI-driven discovery systems. However, truly autonomous discovery will remain elusive until the field satisfactorily addresses the intertwined fundamental challenges of data standardization, model explainability, and robust cybersecurity. Overcoming these hurdles is essential to transition from proof-of-concept demonstrations to reliable, scalable, and democratized SDLs that can fully bridge the gap between laboratory-scale innovation and industrial-scale manufacturing.
The development of self-driving laboratories represents an important step in the evolution of materials science, transitioning from traditional, sequential experimentation to autonomous, AI-driven discovery. This review has outlined the rapid evolution of SDLs, highlighting key architectural innovations such as the multi-robot parallelization exemplified by Rainbow[30], the integration of flow chemistry and BO in platforms like AlphaFlow[8], and the cognitive capabilities introduced by LLM-based literature mining and algorithmic planning[12,21,31]. Despite significant progress, the path toward fully autonomous discovery faces several interconnected challenges. The absence of standardized data formats and protocols continues to hinder interoperability and reproducibility[23]. The "black box" nature of complex AI models requires advances in explainable AI (XAI) to build trust and facilitate adoption in high-stakes applications such as pharmaceuticals[24].
To fully realize AI’s potential in materials discovery, scientists must be able to trust and understand the recommendations made by machine-learning models. This has led to a growing emphasis on XAI within SDL platforms. Researchers are employing a variety of XAI techniques to peer inside the “black box” of complex models[25]. Broadly, these approaches fall into two categories: making models inherently interpretable (for example, using simpler surrogate models or physically meaningful descriptors), and post-hoc explanations that analyze an AI’s outputs after training. Common post-hoc methods include feature importance scores, which identify which input parameters (composition, temperature, processing time) most influenced an outcome, and example-based explanations, where the model highlights past experiments or data points that are analogous to the current case[25]. For instance, tools like SHAP or LIME can attribute a prediction (say, a predicted optimal material composition) to specific input factors, helping researchers see why the AI suggests a certain experiment. In one recent study, a deep learning model (ElemNet) for predicting formation energies was analyzed with XAI methods – revealing that the model’s decisions aligned with known chemical trends like element reactivity and electronegativity[25]. Such insights confirmed that the AI was learning physically meaningful relationships, not random correlations, thereby building trust in its suggestions[25]. Embracing XAI in SDLs not only facilitates model validation and error diagnosis, but also can lead to scientific discoveries – for example, by uncovering which material features or processing conditions most strongly drive performance, researchers might design new hypotheses or simplified models.
Looking forward, the convergence of several technological trends points toward a transformative future for materials research. The development of "frugal twins" and cloud-based laboratories promises to democratize access to autonomous experimentation[29], while multi-agent architectures like MAPPS enhance robustness, exploration, and collaboration between AI and human scientists[3]. The ultimate goal remains the creation of end-to-end digital pipelines that seamlessly integrate basic discovery, prototyping, and scale-up manufacturing[26].
In the next five to ten years, SDLs are expected to develop toward fully functional digital twins of laboratories and production lines, capable of modeling and optimizing not only synthesis processes but also industrial manufacturing[26]. This integration would combine basic discovery, prototyping, and large-scale production into a single automated pipeline. Current work already demonstrates adaptation of SDLs for quantum electronics, nanophotonics, and nanomedicine, including autonomous optimization of defects in diamond nitrogen-vacancy centers, design of negative refractive index metamaterials, and development of nanoparticles for targeted drug delivery with programmable pharmacokinetics[33-40]. These developments bring us closer to autonomous platforms that can generate ideas, design experiments, interpret results, and even prepare draft patent documentation with minimal human supervision.
The widespread adoption of self-driving laboratories promises not only to accelerate materials discovery but also to change established research practice, fostering a more data-driven, reproducible, and collaborative research ecosystem. By enabling efficient exploration of vast chemical spaces, these systems could help address pressing global challenges related to energy, healthcare, and sustainability through rapid development of advanced functional materials. Ultimately, overcoming the challenges of data standardization, explainability, and cybersecurity will be crucial to fully realizing this transformative potential and bridging the gap between laboratory-scale innovation and industrial-scale manufacturing. Encouragingly, initial steps toward industrial adoption of SDLs have already begun to emerge. For example, Pfizer has partnered with an SDL technology company (Telescope Innovations) to develop AI-driven, self-driving lab workflows aimed at accelerating pharmaceutical research[41]. Meanwhile, IBM’s cloud-based RoboRXN platform is being integrated with autonomous chemistry systems (such as Atinary’s SDLabs) to optimize complex chemical syntheses[42], and major industry players like Syngenta have started incorporating these AI-driven platforms into their compound discovery pipelines[43]. These early collaborations demonstrate that SDL technologies are gradually moving beyond academic prototypes and into real-world industrial R&D settings, where they can substantially shorten development timelines and improve R&D productivity.
4 CONCLUSION
The emergence of self-driving laboratories represents a true paradigm shift for materials science, moving the field from its traditional, linear experimentation to a new era of autonomous and AI-driven discovery. This review has detailed the key architectural innovations enabling this change, from multi-robot platforms and the fusion of flow chemistry with BO, to the integration of cognitive capabilities drawn from scientific literature. Yet, for all this remarkable progress, the journey toward fully autonomous discovery is not without its challenges. Significant hurdles remain in the lack of universal data standards, the "black box" nature of sophisticated AI models requiring greater explainability, and the critical cybersecurity concerns that arise as laboratory systems become increasingly networked. Looking forward, the convergence of technologies like cost-effective digital twins and collaborative multi-agent systems points toward a more democratic and resilient future for materials research. The overarching ambition is the creation of seamless digital pipelines that integrate discovery, prototyping, and production.
Ultimately, these technical challenges demand more than just engineering solutions; they call for a coordinated, community-wide response. Success will require a concerted effort across disciplines, uniting materials scientists, data engineers, and policymakers to establish shared data repositories, robust validation protocols, and tailored cybersecurity frameworks. Equally important is the mission to educate and train a new generation of researchers skilled in navigating this AI-augmented landscape. We therefore urge the formation of broad, cross-disciplinary alliances spanning academia, industry, and government. Through such collaborative action, we can accelerate the maturation of self-driving laboratories from promising prototypes into reliable, widespread tools. The true measure of success in this field will be not only the speed of discovery, but also our ability to ensure those discoveries are reproducible, explainable, secure, and scalable. The next era of materials innovation depends on this collective push.
Acknowledgements
This research was funded by the Interstate Fund for Humanitarian Cooperation of the CIS Member States, grant number 25-113, and the International Science and Technology Center (ISTC), grant number TJ-2726.
Conflicts of Interest
The authors declared no conflict of interest.
Data Availability
No new data were created or analyzed in this study. Data sharing is not applicable to this article.
Copyright Permissions
Copyright © 2025 The Author(s). Published by Innovation Forever Publishing Group Limited. This open-access article is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, sharing, adaptation, distribution
Author Contribution
Conceptualization was handled by Nematov D and Raufov I; literature search and data curation were performed by Nematov D; the original draft was prepared by Nematov D and Raufov I; and writing – review and editing were completed by Nematov D and Raufov I. Both Nematov D and Raufov I had read and agreed to the final version of the manuscript.
Abbreviation List
A*, A-star path-finding algorithm
AI, Artificial intelligence
BO, Bayesian optimization
GPT, Generative pre-trained transformer
LLM, Large language model
MAPPS, Materials Agent unifying Planning, Physics, and Scientists
MGI, Materials Genome Initiative
ML, Machine learning
NBP, Nanobipyramid
PDF, Pair distribution function
PL, Photoluminescence
RL, Reinforcement learning
SDL, Self-driving laboratory
UV–vis, Ultraviolet–visible (spectroscopy)
XAI, Explainable artificial intelligence
XRD, X-ray diffraction
References
[1] Tom G, Schmid SP, Baird SG et al. Self-Driving Laboratories for Chemistry and Materials Science. Chem Rev, 2024; 124: 9633-9732.[DOI]
[2] Bayley PM, Hao H, Greer JMR et al. Navigating self-driving laboratories. Digit Discov, 2024; 3: 628-645.
[3] Hysmith H, Gani JD, Rondinelli JM. Human-in-the-loop approaches to self-driving laboratories. Digit Discov, 2024; 3: 835-849.
[4] Volk AA, Abolhasani M. Performance metrics for self-driving laboratories. Nat Commun, 2024; 15: 4234.[DOI]
[5] Szymanski NJ, Zeng Y, Huo H et al. Autonomous synthesis of inorganic materials. Nature, 2023; 615: 604-610.
[6] Nature News. Self-driving lab produces new materials. Nature, 2023; 615: 578.
[7] Delgado-Licona F, Valentine AJS, Elkin KD et al. Dynamic flow chemistry in autonomous nanocrystal synthesis. Nat Chem Eng, 2025; 1: 134-142.
[8] Volk AA, Motta SE, Abolhasani M. AlphaFlow: Autonomous multi-step flow synthesis. Nat Commun, 2023; 14: 5048.[DOI]
[9] Liang Y, Khan AR, Apley DW et al. Real-time theory–experiment closed-loop in materials discovery. Sci Adv, 2025; 11: eadk5123.[DOI]
[10] Szymanski NJ, Rendy B, Fei Y et al. Adaptive XRD control in autonomous labs. npj Comput Mater, 2023; 9: 155.[DOI]
[11] Manica M, da Silva JC, Guidici R et al. Generative models for organic materials design. npj Comput Mater, 2023; 9: 102.
[12] Nematov D, Burhonzoda A, Khusenov M et al. Molecular dynamics simulations of the DNA radiation damage and conformation behavior on a zirconium dioxide surface. Egypt J Chem, 2019; 62: 149-161.[DOI]
[13] Nematov D. Bandgap tuning and analysis of the electronic structure of the Cu2NiXS4 (X = Sn, Ge, Si) system: mBJ accuracy with DFT expense. Chem Inorg Mater, 2023; 1: 100001.[DOI]
[14] Nguyen TD. Materials Genome Initiative (MGI) Biennial Principal Investigator Workshop. In: Proc MGI Bienn PI Workshop. Washington, USA, 30-31 July 2024. NSF Award 2422384. Available at:[Web]
[15] Delgado-Licona F, Alsaiari A, Dickerson H et al. Flow-driven data intensification to accelerate autonomous inorganic materials discovery. Nat Chem Eng, 2025; 2: 436-446.[DOI]
[16] Tobias AV, Wahab A. Autonomous ‘self-driving’laboratories: a review of technology and policy implications. Roy Soc Open Sci, 2025; 12: 250646.[DOI]
[17] Qi Y, Hu D, Zheng M et al. Deep learning assisted Raman spectroscopy for rapid identification of 2D materials. Appl Mater Today, 2024; 41: 102499.[DOI]
[18] Nematov D. Titanium Dioxide and Photocatalysis: A Detailed Overview of the Synthesis, Applications, Challenges, Advances and Prospects for Sustainable Development. J Mod Green Energy, 2024; 3: 6.[DOI]
[19] Choudhary K, DeCost B, Tavazza F. AI-enabled discovery of metastable phases via closed-loop experimentation. npj Comput Mater, 2024; 10: 92.
[20] Manica M, Born J, Cadow J et al. Accelerating material design with the generative toolkit for scientific discovery. npj Comput Mater, 2023; 9: 69.[DOI]
[21] Pyzer-Knapp EO, Manica M, Staar P et al. Foundation models for materials discovery–current state and future directions. npj Comput Mater, 2025; 11: 61.[DOI]
[22] Xu Z, Hu X, Guo C et al. Creep behaviour investigation of additively manufactured IN738LC superalloy based on Materials Genome approach. Mat Sci Eng R, 2025; 163: 100914.[DOI]
[23] Green ML, Wolverton C, Rondinelli JM. Materials Genome Initiative: Update and outlook. Apl Mater, 2023; 11: 110401.
[24] Rudin C. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nat Mach Intell, 2019; 1: 206-215.[DOI]
[25] Wang K, Gupta V, Lee CS et al. XElemNet: towards explainable AI for deep neural networks in materials science. Sci Rep, 2024; 14: 25178.[DOI]
[26] Kalidindi SR, Buzzy M, Boyce BL et al. Digital twins for materials. Front Mater, 2022; 9: 818535.[DOI]
[27] Foy C, Awschalom DD, de Leon NP. Autonomous defect engineering in diamond NV centers. ACS Nano, 2024; 18: 4567-4579.
[28] Moon G, Choi JR, Lee C et al. Machine learning-based design of meta-plasmonic biosensors with negative index metamaterials. Biosens Bioelectron, 2020; 164: 112335.[DOI]
[29] Canty RB, Bennett JA, Brown KA et al. Science acceleration and accessibility with self-driving labs. Nat Commun, 2025; 16: 3856.[DOI]
[30] Xu J, Moran CHJ, Ghorai A et al. Autonomous multi-robot synthesis and optimization of metal halide perovskite nanocrystals. Nat Commun, 2025; 16: 7841.[DOI]
[31] Gao F, Li H, Chen Z, et al. A chemical autonomous robotic platform for end-to-end synthesis of nanoparticles. Nat Commun, 2025; 16: 7558.[DOI]
[32] Anker AS, Jensen JH, Gonzalez-Duque M et al. Autonomous nanoparticle synthesis by design. arXiv preprint, 2025; arXiv:2505.13571.
[33] Liao J, Shenhav L, Urban JA, et al. Microdiversity of the vaginal microbiome is associated with preterm birth. Nat Commun, 2023; 14: 4997.[DOI]
[34] Zhang H, Chan WCW, Farokhzad OC. Autonomous synthesis of nanoparticles for targeted drug delivery. Small, 2024; 20: 2308123.
[35] IBM. IBM expands AI-driven materials discovery capabilities and signs new industry collaborations. IBM UK Newsroom, 2021. Available at:[Web]
[36] Barati Sedeh H, George R C, Lai F et al. Toward the meta-atom library: experimental validation of machine learning-based Mie-tronics. Adv Photonics, 2025; 7: 036004-036004.[DOI]
[37] Han XQ, Wang XD, Xu MY et al. AI-driven inverse design of materials: Past, present, and future. Chin Phys Lett, 2025; 42: 027403.[DOI]
[38] Nematov DD, Burhonzoda AS, Kholmurodov KT et al. A detailed comparative analysis of the structural stability and electron-phonon properties of ZrO2: Mechanisms of water adsorption on t-ZrO2 (101) and t-YSZ (101) surfaces. Nanomaterials, 2023; 13: 2657.[DOI]
[39] Zhang K, Liu Y, Li H et al. Deep learning-based inverse design of metamaterial lattice band structures. Extreme Mech Lett, 2024; 67: 102013.[DOI]
[40] Wu L, Shen Y, Li J et al. Machine learning-based design of meta-plasmonic biosensors with negative refractive index. Photonics Nanostruct Fundam Appl, 2020; 39: 100857.
[41] Telescope Innovations. Telescope Innovations and Pfizer sign master collaborative research agreement. Press Release, 2023. Available at:[Web]
[42] Atinary Technologies. Maximizing yield and conversion in chemical synthesis with IBM Research Europe. Use Cases, 2023. Available at:[Web]
[43] Pyzer-Knapp EO, Pitera JW, Staar PW et al. Accelerating materials discovery using artificial intelligence, high performance computing and robotics. npj Comput Mater, 2022; 8: 84.







Copyright ©