The author of this text is a back-end developer at CytoSMART, exploring how to combine three biochemical datasets in a graph database. The goal is to find connections between different measurements in different experiments and conditions. To achieve this, the author designs a graph structure for biomedical research, which includes two node types (compounds and experiments) and one relationship type (is attribute of). The author then populates the database with NCI60 measurements, a dataset containing 50k+ chemicals tested on ~60 cell lines. The author introduces cell lines to connect to the experiment, using a script to select the best match for each compound. The author also picks an NSC synonym for each compound, connecting it to one and only one synonym. After filtering and selecting the data, the final script combines all three datasets, resulting in 4m conditions and 4.6k experiments.