10 Multiple experiments
10.1 Gimme more plates
The reading and processing that seahtrue
uses, together with the nested tibble output, has another benefit. It allows for collecting and organizing multiple experiments into one nested tibble. We can do that by loading multiple xlsx files into our working directory, making a file list of those excel file names and mapping the run_seahtrue
function against the file list.
For now we will first have a look at how the output looks like. We load the three experiment output from github.
You can see we have one experiment per row of the tibble . They are labeled with the plate_id
, file_path
, and date
in the first three columns. Since it is in one tibble, we can now pluck
for example all raw_data
from the complete tibble:
You see we now have a tibble of almost 2 million rows and 22 columns, nicely and tidyly loaded in our webR/wasm R environment!! How cool is that! Our favorite point-and-click software Excel wouldn’t even be able to handle this amount of rows.
Now we can plot for example the distribution of emission values for each experiment. We use the ggridges
package for this.
With only a couple of lines of code we go from raw_data
to real insights in your data. For example,
- The three experiments seem to have similar distirbutions between groups, but the third experiment
V01744....
looks a bit different than the first two - From the plot, it can be seen that in the group with highest cell density
300.000
the O2 levels are still in a range that the wells do not become hypoxic. The lowest O2 levels are at around 100 mmHg. - For the
50.000
group in the first two experiments there seems to be some signal in the right tail of the distribution which is not obviously in the third experiment50.000
group or in the other groups.
Let’s explore this right tail of the 50.000
group in more detail.
So what are these high O2_mmHg values. We can find out by looking at the data for the individuals wells. By filtering for the first plate we can see the O2_mmHg in more detail. By using a short unique string as input for the str_detect
we only have to type a couple of characters, which saves us some time. Now we make a simple scatter plot.
Indeed, there are two wells that have high O2_mmHg
. Because of the many colors it is difficult to see which wells they are. Now do some ggplot trickery to label the two wells with higher O2.
Make the same scatter plot for the other two plates for the 50.000
group, and observe if there are wells with abnormally high O2.