Find metabolites with high variability across total pool samples
Source:R/Processing.R
pool_estimation.Rd
Find metabolites with high variability across total pool samples
Usage
pool_estimation(
data,
metadata_sample = NULL,
metadata_info = NULL,
cutoff_cv = 30,
save_plot = "svg",
save_table = "csv",
print_plot = TRUE,
path = NULL
)
Arguments
- data
DF which contains unique sample identifiers as row names and metabolite numerical values in columns with metabolite identifiers as column names. Use NA for metabolites that were not detected. Can be either a full dataset or a dataset with only the pool samples.
- metadata_sample
Optional: DF which contains information about the samples when a full dataset is inserted as Input_data. Column "Conditions" with information about the sample conditions (e.g. "N" and "T" or "Normal" and "Tumor"), has to exist.Default = NULL
- metadata_info
Optional: NULL or Named vector including the Conditions and PoolSample information (Name of the Conditions column and Name of the pooled samples in the Conditions in the Input_SettingsFile) : c(Conditions="ColumnNameConditions, PoolSamples=NamePoolCondition. If no Conditions is added in the Input_metadata_info, it is assumed that the conditions column is named 'Conditions' in the Input_SettingsFile. ). Default = NULL
- cutoff_cv
Optional: Filtering cutoff for high variance metabolites using the Coefficient of Variation. Default = 30
- save_plot
Optional: Select the file type of output plots. Options are svg, png, pdf or NULL. Default = svg
- save_table
Optional: File types for the analysis results are: "csv", "xlsx", "txt", ot NULL default: "csv"
- print_plot
Optional: If TRUE prints an overview of resulting plots. Default = TRUE
- path
Optional: Path to the folder the results should be saved at. default: NULL
Value
List with two elements: DF (including input and output table) and Plot (including all plots generated)
Examples
Intra <- intracell_raw %>%tibble::column_to_rownames("Code")
Res <- pool_estimation(data=Intra[ ,-c(1:3)],
metadata_sample=Intra[ , c(1:3)],
metadata_info = c(PoolSamples = "Pool", Conditions="Conditions"))
#> Warning: ggrepel: 36 unlabeled data points (too many overlaps). Consider increasing max.overlaps