CARNIVAL#

CARNIVAL (CAusal Reasoning for Network identification using Integer VALue programming) is a method for identifying upstream regulatory signalling pathways from downstream gene expression (GEX). Applications of CARNIVAL include identifying a drug's modes of action and the processes deregulated in disease (even if the molecular targets remain unknown) by deciphering alterations of the main signalling pathways as well as alternative pathways and off-target effects.

Figure 1: Liu A., Trairatphisan P., Gjerga E. et al. From expression footprints to causal pathways: contextualizing large signaling networks with CARNIVAL. npj Systems Biology and Applications, volume 5, article number 40 (2019).

The aim of the CARNIVAL pipeline is to identify a subset of interactions from a prior knowledge network that represents potentially regulated pathways linking known or potential targets of perturbation to active transcription factors derived from GEX data. The pipeline includes a number of improved functionalities compared to the original version and consists of the following steps:

1. Transcription factor (TF) activities and pathway scores can be inferred from gene expression with our in-house tools (Dorothea, CollecTRI).
2. The TF activities and a signed, directed protein-protein interaction network, with or without the targets of perturbation and the pathway scores, are then used to construct an optimization problem with CORNETO.
3. CORNETO solves the optimization problem with any of the supported solvers (CPLEX, GUROBI, SCIPY, etc.), identifying the sub-network topology with minimised fitting error and model size.
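
To make the trade-off between fitting error and model size mentioned above more concrete, the following plain-Python sketch scores one candidate solution. The helpers `fitting_error` and `model_size`, the weight `beta`, the vertex names and the exact form of the error (absolute TF score times the mismatch between measured and predicted sign) are assumptions chosen for illustration. They are not part of the CORNETO API, and the formulation used internally may differ.

```python
def sign(x):
    """Return -1, 0 or 1 depending on the sign of x."""
    return (x > 0) - (x < 0)


def fitting_error(measured, predicted):
    """Sum of |score| * |sign(score) - predicted state| over measured TFs."""
    return sum(
        abs(score) * abs(sign(score) - predicted.get(tf, 0))
        for tf, score in measured.items()
    )


def model_size(edge_states):
    """Number of edges carrying a non-zero signal."""
    return sum(1 for state in edge_states.values() if state != 0)


# Hypothetical TF scores and one hypothetical candidate sub-network.
measured = {"tfA": -2.0, "tfB": 1.0}
predicted = {"tfA": -1, "tfB": 0}  # tfB is left unexplained
edge_states = {("r", "x"): 1, ("x", "tfA"): -1, ("r", "tfB"): 0}

beta = 1e-3  # weight of the size term (assumed value)
error = fitting_error(measured, predicted)
size = model_size(edge_states)
objective = error + beta * size
print(error, size, objective)
```

In this sketch, explaining `tfB` would remove the error of 1.0 at the cost of one more active edge, which a small `beta` makes worthwhile.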

The original version of CARNIVAL was implemented in R and CPLEX. The new re-implementation of CARNIVAL in CORNETO supports a wide variety of solvers thanks to its support for both CVXPY and PICOS. It is also more flexible: the problem is defined symbolically and can be modified through the CORNETO API after the CARNIVAL problem has been created. This lets users adapt the problem or use CORNETO as a building block for other optimization problems.

import numpy as np
import corneto as cn
import pandas as pd

cn.info()

Installed version:	v1.0.0.dev5 (latest stable: v1.0.0-alpha)
Available backends:	CVXPY v1.6.5, PICOS v2.6.1
Default backend (corneto.opt):	CVXPY
Installed solvers:	CVXOPT, GLPK, GLPK_MI, HIGHS, SCIP, SCIPY
Graphviz version:	v0.20.3
Installed path:	/home/runner/work/corneto/corneto/corneto
Repository:	https://github.com/saezlab/corneto

Creating a toy example#

from corneto.graph import Graph

G = Graph.from_tuples(
    [
        ("rec1", 1, "a"),
        ("rec1", -1, "b"),
        ("rec1", 1, "f"),
        ("rec1", -1, "c"),
        ("rec2", 1, "b"),
        ("rec2", 1, "tf2"),
        ("b", 1, "g"),
        ("g", -1, "d"),
        ("rec2", -1, "d"),
        ("a", 1, "c"),
        ("a", -1, "d"),
        ("c", 1, "d"),
        ("c", -1, "e"),
        ("c", 1, "tf3"),
        ("e", 1, "a"),
        ("d", -1, "c"),
        ("e", 1, "tf1"),
        ("a", -1, "tf1"),
        ("d", 1, "tf2"),
        ("c", -1, "tf2"),
        ("tf1", 1, "tf2"),
        ("tf1", -1, "rec2"),
        ("tf2", 1, "rec1"),
        ("tf1", 1, "f")
    ]
)
# Plot our PKN
G.plot()


from corneto.methods.future.carnival import CarnivalFlow, CarnivalILP

samples = {
    "input_example": {
        "rec2": {
            "value": 1,
            "mapping": "vertex",
            "role": "input"
        },
        "tf1": {
            "value": -2,
            "mapping": "vertex",
            "role": "output"
        },
        "tf2": {
            "value": 1,
            "mapping": "vertex",
            "role": "output"
        }
    }
}

data = cn.Data.from_cdict(samples)
data

Data(n_samples=1, n_feats=[3])
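
Each entry of the sample dictionary marks a vertex either as an input (a perturbation target) or as an output (a measured TF). As a plain-Python sketch that does not use the CORNETO API, the hypothetical helper `split_by_role` below separates one sample into two flat dictionaries, one for inputs and one for outputs. These have the same shape as the two dictionaries passed to `milp_carnival` in the old implementation section.

```python
def split_by_role(sample):
    """Split one sample into {vertex: value} dicts for inputs and outputs."""
    inputs, outputs = {}, {}
    for vertex, annotation in sample.items():
        role = annotation["role"]
        if role == "input":
            inputs[vertex] = annotation["value"]
        elif role == "output":
            outputs[vertex] = annotation["value"]
        else:
            raise ValueError(f"Unknown role {role!r} for vertex {vertex!r}")
    return inputs, outputs


# Same content as samples["input_example"] above, repeated so that the
# snippet runs on its own.
sample = {
    "rec2": {"value": 1, "mapping": "vertex", "role": "input"},
    "tf1": {"value": -2, "mapping": "vertex", "role": "output"},
    "tf2": {"value": 1, "mapping": "vertex", "role": "output"},
}

inputs, outputs = split_by_role(sample)
print(inputs)   # {'rec2': 1}
print(outputs)  # {'tf1': -2, 'tf2': 1}
```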

CarnivalFlow#

CarnivalFlow is a generalization of Carnival to multiple samples. It uses a structured-sparsity-inducing penalty to regularize solutions while taking all samples into account. In the single-sample case it is equivalent to the original Carnival, although it uses a completely different formulation.
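
To give an intuition for a sparsity penalty that is shared across samples, the sketch below counts an edge once if it is used in at least one sample, so that reusing the same edge in several samples is not penalized again. This is only an illustration under that assumption: the function `shared_edge_penalty` and the edge selections are hypothetical and do not show how CarnivalFlow implements its penalty.

```python
def shared_edge_penalty(selection_per_sample):
    """Count the edges that are selected in at least one sample."""
    used = set()
    for selected_edges in selection_per_sample:
        used.update(selected_edges)
    return len(used)


# Hypothetical edge selections for two samples on the same network.
sample_1 = {("rec2", "tf2"), ("rec1", "a"), ("a", "tf1")}
sample_2 = {("rec2", "tf2"), ("rec1", "a"), ("rec2", "d")}

# Penalizing each sample separately counts shared edges twice.
independent = len(sample_1) + len(sample_2)
# Penalizing the union favours solutions that reuse edges across samples.
shared = shared_edge_penalty([sample_1, sample_2])
print(independent, shared)
```

With a single sample both counts coincide, which is in line with the single-sample equivalence stated above.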

c = CarnivalFlow(lambda_reg=1e-3)
P = c.build(G, data)
P.solve(verbosity=0, solver="scipy")
for o in P.objectives:
    print(o.value)

c.processed_graph.edge_subgraph(np.flatnonzero(P.expr.edge_has_signal.value)).plot()

Unreachable vertices for sample: 0
0.0
5.0


# Extract the values from the problem
pd.DataFrame(P.expr.edge_value.value, index=c.processed_graph.E, columns=["edge_activity"]).astype(int)

		edge_activity
(rec1)	(a)	1
(rec1)	(b)	0
(rec1)	(c)	0
(rec2)	(b)	0
(rec2)	(tf2)	1
(b)	(g)	0
(g)	(d)	0
(rec2)	(d)	0
(a)	(c)	0
(a)	(d)	0
(c)	(d)	0
(c)	(e)	0
(e)	(a)	0
(d)	(c)	0
(e)	(tf1)	0
(a)	(tf1)	-1
(d)	(tf2)	0
(c)	(tf2)	0
(tf1)	(tf2)	0
(tf1)	(rec2)	0
(tf2)	(rec1)	1
(tf1)	()	0
(tf2)	()	0
()	(rec2)	1

pd.DataFrame(P.expr.vertex_value.value, index=c.processed_graph.V, columns=["node_activity"]).astype(int)

	node_activity
d	0
rec2	1
tf1	-1
b	0
c	0
tf2	1
a	1
rec1	1
e	0
g	0
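
The reported activities can be checked by hand: starting from the perturbed input, the state of each downstream vertex should equal the state of its parent multiplied by the sign of the connecting edge. The sketch below does this in plain Python for the four network edges with non-zero activity in the edge table above, using their signs from the prior knowledge network. It is an independent sanity check written for this example, not part of CORNETO.

```python
# Edges with non-zero activity in the solution, as (source, sign, target),
# with the sign taken from the prior knowledge network.
selected_edges = [
    ("rec2", 1, "tf2"),
    ("tf2", 1, "rec1"),
    ("rec1", 1, "a"),
    ("a", -1, "tf1"),
]

# Start from the perturbed input and propagate until nothing changes.
state = {"rec2": 1}
changed = True
while changed:
    changed = False
    for source, edge_sign, target in selected_edges:
        if source in state and target not in state:
            state[target] = state[source] * edge_sign
            changed = True

print(state)
```

The propagated states agree with the non-zero rows of the node_activity table, and the signs obtained for tf1 and tf2 match the signs of their measurements (-2 and 1).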

c.processed_graph.plot_values(vertex_values=P.expr.vertex_value.value, edge_values=P.expr.edge_value.value)


Carnival ILP#

For completeness, CORNETO also provides a version of the original Carnival that is not based on the modern formulation of CarnivalFlow. It cannot be used with multiple samples and is slower on larger problems.

c = CarnivalILP(beta_weight=1e-3)
P = c.build(G, data)

P.solve(verbosity=0, solver="scipy");

for o in P.objectives:
    print(o.value)

0.0
4.0

c.processed_graph.edge_subgraph(np.flatnonzero(P.expr.edge_values.value)).plot()


# Extract the values from the problem
pd.DataFrame(P.expr.edge_values.value, index=c.processed_graph.E, columns=["edge_activity"]).astype(int)

		edge_activity
(rec1)	(a)	1
(rec1)	(b)	0
(rec1)	(c)	0
(rec2)	(b)	0
(rec2)	(tf2)	1
(b)	(g)	0
(g)	(d)	0
(rec2)	(d)	0
(a)	(c)	0
(a)	(d)	0
(c)	(d)	0
(c)	(e)	0
(e)	(a)	0
(d)	(c)	0
(e)	(tf1)	0
(a)	(tf1)	-1
(d)	(tf2)	0
(c)	(tf2)	0
(tf1)	(tf2)	0
(tf1)	(rec2)	0
(tf2)	(rec1)	1
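
The two edge tables can be compared directly. The sketch below copies their non-zero rows by hand and counts them; the counts of 5 and 4 coincide with the second objective values printed for CarnivalFlow and CarnivalILP. Reading that value as the number of active edges is an interpretation of these outputs, not a documented guarantee.

```python
# Non-zero rows of the CarnivalFlow edge table as (source, target, activity).
# The empty string stands for the empty endpoint shown as () in the table.
flow_active = [
    ("rec1", "a", 1),
    ("rec2", "tf2", 1),
    ("a", "tf1", -1),
    ("tf2", "rec1", 1),
    ("", "rec2", 1),
]

# Non-zero rows of the CarnivalILP edge table.
ilp_active = [
    ("rec1", "a", 1),
    ("rec2", "tf2", 1),
    ("a", "tf1", -1),
    ("tf2", "rec1", 1),
]

only_in_flow = sorted(set(flow_active) - set(ilp_active))
print(len(flow_active), len(ilp_active), only_in_flow)
```

The only difference is the extra edge entering rec2 from an empty source in the CarnivalFlow table; the edges between network vertices are the same in both solutions.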

pd.DataFrame(P.expr.vertex_values.value, index=c.processed_graph.V, columns=["node_activity"]).astype(int)

	node_activity
d	0
rec2	1
tf1	-1
b	0
c	0
tf2	1
a	1
rec1	1
e	0
g	0

c.processed_graph.plot_values(vertex_values=P.expr.vertex_values.value, edge_values=P.expr.edge_values.value)


Old implementation#

An older implementation used in previous versions of CORNETO is still available in the corneto.methods.carnival package. It uses a simpler interface and a formulation more similar to the original CarnivalR:

from corneto.methods.carnival import milp_carnival

P = milp_carnival(
    G,
    {"rec2": 1},
    {"tf1": -2, "tf2": 1},
    beta_weight=1e-3,
) 
P.solve(solver="scipy");

for o in P.objectives:
    print(o.value)

0.0
4.0

G.edge_subgraph(np.flatnonzero(P.expr.edge_values.value)).plot()
