10 Multiple experiments

10.1 Gimme more plates

The reading and processing that seahtrue uses, together with the nested tibble output, has another benefit. It allows for collecting and organizing multiple experiments into one nested tibble. We can do that by loading multiple xlsx files into our working directory, making a file list of those excel file names and mapping the run_seahtrue function against the file list.

For now we will first have a look at how the output looks like. We load the three experiment output from github.

You can see we have one experiment per row of the tibble . They are labeled with the plate_id, file_path, and date in the first three columns. Since it is in one tibble, we can now pluck for example all raw_data from the complete tibble:

You see we now have a tibble of almost 2 million rows and 22 columns, nicely and tidyly loaded in our webR/wasm R environment!! How cool is that! Our favorite point-and-click software Excel wouldn’t even be able to handle this amount of rows.

Now we can plot for example the distribution of emission values for each experiment. We use the ggridges package for this.

With only a couple of lines of code we go from raw_data to real insights in your data. For example,

The three experiments seem to have similar distirbutions between groups, but the third experiment V01744.... looks a bit different than the first two
From the plot, it can be seen that in the group with highest cell density 300.000 the O2 levels are still in a range that the wells do not become hypoxic. The lowest O2 levels are at around 100 mmHg.
For the 50.000 group in the first two experiments there seems to be some signal in the right tail of the distribution which is not obviously in the third experiment 50.000 group or in the other groups.

Let’s explore this right tail of the 50.000 group in more detail.

So what are these high O2_mmHg values. We can find out by looking at the data for the individuals wells. By filtering for the first plate we can see the O2_mmHg in more detail. By using a short unique string as input for the str_detect we only have to type a couple of characters, which saves us some time. Now we make a simple scatter plot.

Indeed, there are two wells that have high O2_mmHg. Because of the many colors it is difficult to see which wells they are. Now do some ggplot trickery to label the two wells with higher O2.

Exercise 1

Make the same scatter plot for the other two plates for the 50.000 group, and observe if there are wells with abnormally high O2.

Solution to Exercise 1