README

December 2024

Step 1: Develop the R Workflow
Step 2: Create the Python Runner
Step 3: Create a Docker image
Step 4: Create the project.toml file
Step 5: Deploy to the MD platform

This repository provides instructions and an example for setting up a new custom R workflow integrated into the Mass Dynamics ecosystem.

The example in this repository demonstrates how to create a workflow that modifies an input intensity dataset and returns a new transformed intensity dataset.

Step 1: Develop the R Workflow

Create an R workflow, which can either be a package or a simple function. The example in this repository is implemented as an R package, with the workflow located at ./R/transformIntensities.R.

This package depends on the limma and tidyr R packages.

The main R function should accept an intensity table and a metadata table as inputs. Additional parameters of your choice can also be included. The input and output tables must adhere to the standard Mass Dynamics format for an INTENSITY dataset.

Optionally, develop a runner function that invokes the main workflow function and returns the output as a named list. An example of this function is provided in ./src/md_custom_r/process.R.

The names used in the output list depend on the type of dataset produced.

Currently, only Intensity datasets are supported. For an intensity dataset, the runner must return a list containing intensity, metadata and, optionally, runtime_metadata.

These types can be found in the MD Dataset Package.
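As a rough illustration of the output contract described above, the sketch below checks that a runner result carries the required entries. It assumes the R named list is viewed as a Python dict once it reaches the Python side; the function name check_intensity_output and the placeholder values are hypothetical and are not part of the MD Dataset Package.

```python
# Entry names taken from the text above; everything else is illustrative.
REQUIRED_ENTRIES = ("intensity", "metadata")
OPTIONAL_ENTRIES = ("runtime_metadata",)


def check_intensity_output(result: dict) -> list:
    """Return the required entries missing from a runner result (empty list means OK)."""
    return [name for name in REQUIRED_ENTRIES if name not in result]


# Placeholder values stand in for the real tables.
complete = {"intensity": [], "metadata": [], "runtime_metadata": {}}
incomplete = {"intensity": []}

print(check_intensity_output(complete))
print(check_intensity_output(incomplete))
```

A check like this can be run in a test before building the Docker image, so that a misnamed entry is caught early rather than at deployment time.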

Step 2: Create the Python Runner

Write a Python runner, as shown in ./src/md_custom_r/process_r.py. This script uses the Mass Dynamics md_dataset Python package to prepare the R input and execute the workflow in Prefect.

In this file, we prepare the data frames and arguments that are passed to the R runner function from Step 1.

The Python function in this file must use the @md_r Python decorator, passing the r_file path and the r_function name of the R runner from Step 1.

The arguments to the function are also important. The second argument is the custom set of params, which can be used by the R function and is also used to render the form on the MD platform.
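The overall shape of such a runner can be sketched as follows. This is not the md_dataset API: md_r_stub is a local stand-in written only for this illustration, and the names TransformParams, scale, input_tables, prepare_r_input and the R function name "process" are all assumptions. For the real decorator signature and argument types, refer to ./src/md_custom_r/process_r.py and the MD Dataset Package.

```python
from dataclasses import dataclass
from functools import wraps


def md_r_stub(r_file: str, r_function: str):
    """Local stand-in for @md_r (NOT the real md_dataset decorator)."""
    def decorate(prepare):
        @wraps(prepare)
        def wrapper(*args, **kwargs):
            r_args = prepare(*args, **kwargs)
            # A real decorator would pass r_args on to the R function.
            # This stub only reports what would be called, and with what.
            return {"r_file": r_file, "r_function": r_function, "r_args": r_args}
        return wrapper
    return decorate


@dataclass
class TransformParams:
    """Hypothetical custom params (second argument of the runner)."""
    scale: float = 1.0


@md_r_stub(r_file="./src/md_custom_r/process.R", r_function="process")
def prepare_r_input(input_tables: dict, params: TransformParams) -> dict:
    # Prepare the tables and extra arguments for the R runner from Step 1.
    return {
        "intensity": input_tables["intensity"],
        "metadata": input_tables["metadata"],
        "scale": params.scale,
    }


call = prepare_r_input({"intensity": [1.0], "metadata": ["s1"]}, TransformParams(scale=2.0))
print(call["r_function"], call["r_args"]["scale"])
```

Keeping the custom params in a single typed object, as assumed here, makes it easier to see which values are user-facing and which are prepared internally.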

Step 3: Create a Docker image

See the example Dockerfile in this repository.

The Dockerfiles for the base Docker images can be found in the MD Dataset Package.

When using local Docker images built using the MD Dataset Package in scripts/local-docker-images, you can build the provided example with the following command:

docker build --build-arg BASE_IMAGE=md_dataset_package-linux-r-base:latest -t md_custom-r-linux:latest -f Dockerfile --platform="linux/amd64" .
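If the build needs to be scripted, for example to swap the base image in CI, the same command can be assembled programmatically. The sketch below is one possible way to do this; the helper name docker_build_argv is hypothetical, and the image names are the ones from the command above.

```python
import shlex


def docker_build_argv(base_image: str, tag: str, dockerfile: str = "Dockerfile",
                      platform: str = "linux/amd64", context: str = ".") -> list:
    """Assemble the docker build command as an argument list (no shell quoting needed)."""
    return [
        "docker", "build",
        "--build-arg", f"BASE_IMAGE={base_image}",
        "-t", tag,
        "-f", dockerfile,
        f"--platform={platform}",
        context,
    ]


argv = docker_build_argv(
    base_image="md_dataset_package-linux-r-base:latest",
    tag="md_custom-r-linux:latest",
)
# Print the command for inspection; to actually run the build, use
# subprocess.run(argv, check=True) from the standard library.
print(shlex.join(argv))
```

Passing the command as a list avoids shell quoting problems when the base image or tag comes from an environment variable.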

Step 4: Create the project.toml file

This file, named pyproject.toml in this repository, provides details about the package, including its version, dependencies, and authors. For reference, see the example pyproject.toml.

Unless a specific version is explicitly needed, update the md_dataset package dependency to the latest available version.

Step 5: Deploy to the MD platform

Register the new workflow with the Mass Dynamics platform. This is done with the md-dataset-deploy script, which comes from the MD Dataset Package.

This project includes an example of running this script using Helm, along with all the environment variables required. This can be useful if you want to automate the deployment, for example using CI/CD.

To use Helm, it needs to be installed on a VM that has access to the Kubernetes cluster, has appropriate authorisation, and holds the code to be installed.

See the ./infra directory and ./scripts/deploy for an example of how this could work.

Note: the example above does not cover IAM or Kubernetes Service Account setup.
