pymc-labs / pymc-marketing Goto Github PK

Bayesian marketing toolbox in PyMC. Media Mix (MMM), customer lifetime value (CLV), buy-till-you-die (BTYD) models and more.

Home Page: https://www.pymc-marketing.io/

License: Apache License 2.0

Makefile 0.06% Python 99.90% Dockerfile 0.05%

clv data-science marketing mmm python btyd customer-lifetime-value media-mix-modeling buy-till-you-die

pymc-marketing's Introduction

PyMC-Marketing: Bayesian Marketing Mix Modeling (MMM) & Customer Lifetime Value (CLV)

Marketing Analytics Tools from PyMC Labs

Unlock the power of Marketing Mix Modeling (MMM) and Customer Lifetime Value (CLV) analytics with PyMC-Marketing. This open-source marketing analytics tool empowers businesses to make smarter, data-driven decisions for maximizing ROI in marketing campaigns.

This repository is supported by PyMC Labs.

For businesses looking to integrate PyMC-Marketing into their operational framework, PyMC Labs offers expert consulting and training. Our team is proficient in state-of-the-art Bayesian modeling techniques, with a focus on Marketing Mix Models (MMMs) and Customer Lifetime Value (CLV). For more information see here.

Explore these topics further by watching our video on Bayesian Marketing Mix Models: State of the Art.

Community Resources

Quick Installation Guide for Marketing Mix Modeling (MMM) & CLV

To dive into MMM and CLV analytics, set up a specialized environment, marketing_env, via conda-forge:

conda create -c conda-forge -n marketing_env pymc-marketing
conda activate marketing_env

For a comprehensive installation guide, refer to the official PyMC installation documentation.

Docker

We provide a Dockerfile to build a Docker image for PyMC-Marketing so that is accessible from a Jupyter Notebook. See here for more details.

In-depth Bayesian Marketing Mix Modeling (MMM) in PyMC

Leverage our Bayesian MMM API to tailor your marketing strategies effectively. Based on the research Jin, Yuxue, et al. “Bayesian methods for media mix modeling with carryover and shape effects.” (2017), and integrating the expertise from core PyMC developers, our API provides:

Custom Priors and Likelihoods: Tailor your model to your specific business needs by including domain knowledge via prior distributions.
Adstock Transformation: Optimize the carry-over effects in your marketing channels.
Saturation Effects: Understand the diminishing returns in media investments.
Time-varying Intercept: Capture time-varying baseline contributions in your model (using modern and efficient Gaussian processes approximation methods).
Visualization and Model Diagnostics: Get a comprehensive view of your model's performance and insights.
Out-of-sample Predictions: Forecast future marketing performance with credible intervals. Use this for simulations and scenario planning.
Budget Optimization: Allocate your marketing spend efficiently across various channels for maximum ROI.
Experiment Calibration: Fine-tune your model based on empirical experiments for a more unified view of marketing.

MMM Quickstart

import pandas as pd
from pymc_marketing.mmm import DelayedSaturatedMMM

data_url = "https://raw.githubusercontent.com/pymc-labs/pymc-marketing/main/data/mmm_example.csv"
data = pd.read_csv(data_url, parse_dates=['date_week'])

mmm = DelayedSaturatedMMM(
    date_column="date_week",
    channel_columns=["x1", "x2"],
    control_columns=[
        "event_1",
        "event_2",
        "t",
    ],
    adstock_max_lag=8,
    yearly_seasonality=2,
)

Initiate fitting and get a visualization of some of the outputs with:

X = data.drop("y",axis=1)
y = data["y"]
mmm.fit(X,y)
mmm.plot_components_contributions();

Once the model is fitted, we can further optimize our budget allocation as we are including diminishing returns and carry-over effects in our model.

Explore a hands-on simulated example for more insights into MMM with PyMC-Marketing.

Essential Reading for Marketing Mix Modeling (MMM)

Unlock Customer Lifetime Value (CLV) with PyMC

Understand and optimize your customer's value with our CLV models. Our API supports various types of CLV models, catering to both contractual and non-contractual settings, as well as continuous and discrete transaction modes.

Explore our detailed CLV examples using data from the lifetimes package:

Examples

	Non-contractual	Contractual
Continuous	Buying groceries	Audible
Discrete	Cinema ticket	Monthly or yearly subscriptions

CLV Quickstart

import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sns
from pymc_marketing import clv

data_url = "https://raw.githubusercontent.com/pymc-labs/pymc-marketing/main/data/clv_quickstart.csv"
data = pd.read_csv(data_url)
data["customer_id"] = data.index

beta_geo_model = clv.BetaGeoModel(data=data)

beta_geo_model.fit()

Once fitted, we can use the model to predict the number of future purchases for known customers, the probability that they are still alive, and get various visualizations plotted.

See the Examples section for more on this.

Why PyMC-Marketing vs other solutions?

PyMC-Marketing is and will always be free for commercial use, licensed under Apache 2.0. Developed by core developers behind the popular PyMC package and marketing experts, it provides state-of-the-art measurements and analytics for marketing teams.

Due to its open-source nature and active contributor base, new features are constantly added. Are you missing a feature or want to contribute? Fork our repository and submit a pull request. If you have any questions, feel free to open an issue.

Thanks to our contributors!

Marketing AI Assistant: MMM-GPT with PyMC-Marketing

Not sure how to start or have questions? MMM-GPT is an AI that answers questions and provides expert advice on marketing analytics using PyMC-Marketing.

Try MMM-GPT here.

📞 Schedule a Free Consultation for MMM & CLV Strategy

Maximize your marketing ROI with a free 30-minute strategy session with our PyMC-Marketing experts. Learn how Bayesian Marketing Mix Modeling and Customer Lifetime Value analytics can boost your organization by making smarter, data-driven decisions.

We provide the following professional services:

Custom Models: We tailor niche marketing analytics models to fit your organization's unique needs.
Build Within PyMC-Marketing: Our team members are experts leveraging the capabilities of PyMC-Marketing to create robust marketing models for precise insights.
SLA & Coaching: Get guaranteed support levels and personalized coaching to ensure your team is well-equipped and confident in using our tools and approaches.
SaaS Solutions: Harness the power of our state-of-the-art software solutions to streamline your data-driven marketing initiatives.

pymc-marketing's People

Contributors

Stargazers

Watchers

Forkers

oriolabril coltallen ejhortala ricardov94 lucianopaz xalelax coreyabs-db michaelraczycki wd60622 oyassin christianmichelsen techthiyanes giladirim larryshamalama pgnepal schumzy futureprove 42digital ptiwari2407 sandy4321 garve davidqing2000 thipokkub maresb rita-linz stefan-jansen sangamswadik shaoshen-pepsico sgopalandf vishalthatsme mahrukhw cetagostini bkrmint ankursikka songyang0716 chendbox pterameta dok codehornets konkinit karmstrong-wave yijingyidong conef06 giuliacaglia renferpur ameynen speedymillie kalsalw gitosq ekmanib philclarkphd cooliot nialloulton nikisix brites101 kkgit111 mustaphau takechanman1228 tdl77 animenon xhulianothe1 pk1706 robferd dancingdjali ganbayard abdalazizrashid tomthepeach pamant22 caroheymes charlesdtdb charlesdethibault nazarmaidanenko vincent-grosbois eirikbaekkelund manfredwang093 dangoml wajihfabernovel ikitcheng ulfaslak bahgatn stephen137 mvandermeulen rodonn elbbysolocal jiawensunankorstore jcd85 cetagostini-wise morovia markussagen akyldamirov christw castilloraymond robandrewford psrossman jnslns mdemichiel gdberrio ivatams jgowenjr varunchach

pymc-marketing's Issues

Move distributions into CLV submodule

The ones implemented so far are CLV specific

Add `setup.py`

Add setup.py so that we can install the python package via python -m pip install -e .

Improve scaling example mmm notebook

According to https://github.com/google/lightweight_mmm#preparing-the-data it is better to use a min-max scaler for the target variable instead of an usual standard scaler to keep the target positive.

Should prior/posterior predictive plots rely on arviz hdi or xarray symmetric quantiles?

The prior and posterior predictive plots of the MMM base class use arviz.hdi. This can lead to potentially confusing plots like these. Should we default to use xarray.quantile and plot symmetric quantiles around the median instead of the highest density interval?

Implement pre-built shifted beta geometric `sBG` model

Related to #32

We should also add some useful summary stats / plots. If they are not specific to the sBG the better!

Add more MMM notebooks using a Facebook's Robyn dataset

In addition, to the simulated example we want to have an example with "real" data. It seems that Facebook's Robyn has some data sets which we can use.

Initial MMM workflow example

Create a notebook where we show how to use the saturation and adstock transformations in a simple simulated example in a similar manner as in https://juanitorduz.github.io/pymc_mmm/ Let us also see how to scale the variables back-and-forth.

BG/NBD Model in PyMC (base)

We would like to have BG/NBD Model in PyMC. The results should match the estimates from the lifetimes package. See https://juanitorduz.github.io/bg_nbd_pymc/ For this first iteration we do not need external time-invariant regressors, but keep it mind we would like to add them in the future.

Diagnostic plots (as for example the current notebook)

Diagnostic plots from model fit in #49

Add delayed adstock function

Add delayed adstock function from Jin, Yuxue, et al. "Bayesian methods for media mix modeling with carryover and shape effects." (2017).

Add type checking into pre-commit

We would have to add mypy as a dev dependency, a Makefile target and also add it as a pre-commit action

Publish paper about this package in JOSS

It would be good to maximise the benefits of writing this. So we could put together a paper at JOSS https://joss.theoj.org/

add black[jupyter] to the environment

Now we have notebooks, it would be useful to add black[jupyter] to the environment. I have no understanding of how to play with commit hooks, but maybe it's relevant to add in there too?

Sketch MMM user API

In this issue we want to discuss a potential API for the MMM model. For inspiration, take a look into Google's LightWeight MMM https://github.com/google/lightweight_mmm

Should unit tests live outside of the package?

We are shipping our the tests folder inside of the pymmmc package. I don't think is is necessary or desirable. I propose that we move the tests folder to the root directory instead.

Current organization:
root/
└─> pymmmc/
└─> tests/

Proposed:
root/
├─> pymmmc/
└─> tests/

`MMM` should rely on `axes.plot` as much as possible, and resort to seaborn only if really necessary

As far as I can tell, this plot could have been done using ax.plot instead of seaborn.lineplot. I think that for these simple situations, we should rely on matplotlib behavior instead of delegating to seaborn.

Should `mmm` be a subpackage like `clv` instead of a module?

This would make it simpler to have multiple implementations of MMM models without having to resort to a huge monolithic script.

Add CLV Grid and description to README

See #25 and #26

Default choice of hyper priors on MMM parameters based on data where ever possible

The default prior parameter values for the MMM model are hard-coded at the moment. It is desirable to establish their values heuristically from the supplied data whenever possible.

Add `tanh` saturation function

Could be useful to add the 2-parameter saturation function we've got outlined in this blog post https://www.pymc-labs.io/blog-posts/reducing-customer-acquisition-costs-how-we-helped-optimizing-hellofreshs-marketing-budget/ We find this works well because the parameterisation has independent control of the initial slope and the saturation level.

Add study case with MMM + CLV

Create some fake dataset with continuous non-contractual process and build a story around:

Using MMM to infer cost of acquisition across different channels
Using CLV to infer differential lifetime value of customers coming from different channels
Making business decision that takes the two sources of information into account
a. Preferring to invest in a channel with higher CAC because of higher CLV
b. Binary decision to not further invest in channel if CLV is lower than CAC

Requires: #24, #19, #39

We might use this case-study for the initial announcement of the package

MMM `adstock_max_lag` and `control_data` are model specific

If we intend our MMM class to be a suitable generic base for many media mix models, we should have as generic arguments to __init__ as possible. adstock_max_lag is completely specific to a particular way to apply a convolution operation on a vector, and control_data might be handled differently by different MMM subclasses (e.g. it could handle continuous controls differently than categorical controls). We should remove as much model specific arguments from the __init__ signature, and leave the rest as kwargs that are forwarded into _build_model

Decide how to handle `date_column` dtype and transformations

At the moment, the date_column is left untouched in the data. This might be ok, but it would be great to support datetime dtypes as inputs. Those can be easily handled in plots of time series, and they are more natural to reason about as opposed to floating point arrays referenced to some onset event and using some unknown resolution in some unknown time zone. The difficulty of supporting datetime dtypes is that we need to:

Ensure that the date_column is a date time compatible dtype
Convert it to a datetime if it isn't using some string format?
Add a transformation to go from datetimes to floating point arrays and back. This is necessary if we want to have time series with seasonal components or other mathematical dependencies on time.

Add long-term seasonality (via Fourier modes)

Add long-term seasonality (via Fourier modes) to base MMM API model, see #49

Prior predictive checks MMM API

Prior predictive in #49

MMM Example Notebook: Time Varying Coefficients

To prove the flexibility and potential of framework (namely, bayesian stats and pymc), we would like to have an example notebook to illustrate how to extend the base model (based on simulated data) introduced in #41 by allowing time varying coefficients via gaussian processes.

Util function to generate Fourier modes

To model seasonality it would be useful to have a utility function to generate Fourier modes as described in https://docs.pymc.io/en/stable/pymc-examples/examples/time_series/Air_passengers-Prophet_with_Bayesian_workflow.html

update environment + README to enable use of graphviz

Would be good to able to use pm.model_to_graphviz(model) in the Jupyter notebooks.

Migrate MMM plotting tests to test_base

The tests of the plotting functions should be in the base class and operate on a mock inference data result, not on a real result from a fitted model.

Rely on `xarray.DataArray.to_dataframe` more in MMM plotting.

This snippet creates a pandas DataFrame from an xarray DataArray. I think that it's better to rely on xarray builtin functionality to do this, specially if the underlying numpy.array dimensions are not aligned to the expected columns specification.

Instantiate `pymc.Model` after data validation and preprocessing in MMM base class

The MMM base class instantiates an empty pymc.Model during its __init__. It does this before having gone through data preprocessing and validation. We should move the model instantiation later in the __init__, closer to the call to _build_model

Wrap data into `pm.MutableData` containers for out-of-sample predictions.

Wrap data into pm.MutableData containers for out-of-sample predictions in the base model API. Make sure we can also extend coordinates.

Marginalization over dichotomous variable

I know that this is ongoing work raised in issue 21 of AePPL, but is there a quick way to marginalize over a discrete parameter? Akin to the example provided in the issue linked above:

$$p(Y=y | X=0) * p(X=0) + p(Y=y | X=1) * p(X=1)$$

for some continuous $Y$ and dichotomous $X$. In this case, $X$ would be the variable indicating a customer churning or not and the difference between a contractual and non-contractual likelihood would be that one is the marginalized version of the other. I have not revisited the math in several weeks, but I'm fairly certain of this and the ability to marginalize likelihoods as such may provide better model building blocks rather than define a distribution class for each quadrant. Just an idea so far...

Implement MMM model fit

Implement MMM model fit in the base API, see #49

Add study case with alternative to gamma-gamma model

If we don't summarize individual transaction values, there should be much more flexibility in how to model user latent "spend", with e.g, timeseries component, glm predictors, ....

Would be nice to add a study case of such, perhaps motivating new summary/plotting/prediction functionality of the library.

Add external regressors (both continuous and categorical variables)

Add external regressors (both continuous and categorical variables) in MMM API, see #49

Gamma-Gamma Model of Monetary Value in PyMC

Similarly as in #18, we want to have the Gamma-Gamma Model of Monetary Value in PyMC and compare it against the results in lifetimes. See https://juanitorduz.github.io/gamma_gamma_pymc/

Vectorize the random method in `ContNonContract`

The rng_fn in ContNonContract uses a for loop, but it can surely be vectorized although it would require some clever math and perhaps some concepts from order statistics.

Figure out API for incorporating information from lift tests on MMM

@juanitorduz mentioned that in the literature this is usually incorporated into the prior information, but I think that in the HelloFresh project @lucianopaz found a way to incorporate such information via observed variables in the same MMM model?

Continuous contractual notebook looks problematic

The parameters don't seem to be well recovered, suggesting we have a bug somewhere?

https://github.com/pymc-labs/pymc-marketing/blob/3228a4c01dd4411b121f4d9eed9dc3accb388820/notebooks/continuous-contractual.ipynb

CC @larryshamalama

Import `RandomState` type hint from `pymc.util`

The MMM module implements the RandomState type hints. We could import it from pymc.util instead

LTV computation

After having an implementation of #18 and #19 we would like to combine them to get an LTV estimation at user level following the approach in the lifetimes package. Se should be able to reproduce all the results of the documentation https://lifetimes.readthedocs.io/en/latest/Quickstart.html

Implement pre-built `BG/NBD`/`BetaGeoFitter` model

It would be good to add a BetaGeoFitter function that returns a ContNonContract with some default priors. A signature that resembles what is provided in the lifetimes package would be a good idea. Something along the lines of the following snipet.

def BetaGeoFitter(name, a, b, r, alpha, T, T0, *, observed, **kwargs):
    p = pm.Beta(f"{name}_beta", a, b, size=size, shape=shape)
    lam = pm.Gamma(f"{name}_gamma", r, 1/alpha, size=size, shape=shape)
    return ContNonContract(name, lam, p, T, T0, size=size, shape=shape, **kwargs)

We should also add some useful summary stats / plots. If they are not specific to the BG/NBD the better!

Explore variants of the shifted Beta Geometric model

From: Fader, P. S., & Hardie, B. G. (2007). How to project customer retention. Journal of Interactive Marketing, 21(1), 76-90. pdf

They mention this other model derived from the beta-binomial, which is conceptually equivalent:

Their model is based on assumptions simi-
lar to those behind the sBG model: (a) Each person
responds to a direct-mail solicitation with constant
probability p, and (b) p varies across the population
according to a beta distribution. While BM base their
framework on the beta-binomial model, it could have
been derived as an sBG model (e.g., the mailing on
which the prospect responds to the offer is character-
ized by the shifted-geometric distribution). As such, it
is possible to identify clear relationships between
some of the results in this article [e.g., rt and S(t)] and
some quantities of interest in a list-falloff setting.

Then extensions with cohort covariates:

The BM framework was extended by Rao and Steckel
(1995) to incorporate (time-invariant) descriptor
variables such as age, income, and sex. This is accom-
plished using the beta-logistic model (Heckman &
Willis, 1977),

Incorporating the effects of time-
varying covariates (e.g., marketing-mix effects, sea-
sonality) is more complicated. The key is to bring in
all of these factors at the right level; that is, at the
level of the latent parameter of interest (in this case,
�) instead of just “jamming” different covariate effects
into a regression-like model (see Schweidel, Fader, &
Bradlow, 2006, for a discussion of how to do this in a
continuous-time contractual setting.)

And extensions with time effets:

Both the sBG model and its continuous-time analog
(i.e., the EG model) are based on the assumption that
the commonly observed phenomenon of increasing
retention rates is due entirely to heterogeneity;
individual-customer-level retention rates are assumed
to be constant. If we wish to allow for the possibility of
time dynamics at the level of the individual customer,
we can no longer characterize the duration of an indi-
vidual’s relationship with the firm using either the
shifted-geometric or exponential distributions, both of
which have the “memoryless” property (i.e., the proba-
bility of survival to s � t, given survival to t , is the
same as the initial probability of survival to s ). In a
continuous-time setting, we can accommodate this
effect by assuming that individual lifetimes can be
characterized by the Weibull distribution, which allows
for an individual’s risk of canceling a contract to
increase or decrease as the length of the relationship
with the firm increases. In a discrete-time contractual
setting, this leads to the beta-discrete-Weibull (BdW)
model (Fader & Hardie, 2006), which is a generaliza-
tion of the sBG model, while in a continuous-time con-
tractual setting, this leads to a generalization of the EG
model, the Weibull-gamma (WG) model (Hardie et al.,
1998; Morrison & Schmittlein, 1980).

Add ROAS plot

Add a method to compute the return of ad spend for certain channels and plot it. The plot could follow the style of figure 3 from Jin et al 2017, which I'll copy down here just as a reference.

New default sampler for `ContNonContract` when there are no observations

Currently, the intention is to primarily use ContNonContract to perform inference on observational data. However, without the observed= keyword, our samplers will misbehave as value = [t_x, x] for t_x being the time of the xth observation with x being an integer.

A moment method would be beneficial, but careful thought must be put into the sampler.

Adstock transformation without `for` loop

We would like to write a more efficient implementation of the adstock transformations so that we do not use a for loop. An attempt with scan was (unsuccessfully) implemented in #15

Requirements:

We should be able to add the l_max parameter to truncate the size of the effect.
Be vectorised
Should be faster than the current implementation.

Take the CLV grid to the next level

The idea is to follow up to #25, and extend the basic models in interesting ways, e.g., by adding hierarchical effects, time-varying effects... and so on. This should give us a more refined idea not only of what building blocks we need, but how flexible they should be.

This will, potentially, also be the biggest selling point of the package, as we will be doing things that are not really done out there (or at least not published in neat papers / packages), in large part by taking advantage of working with a fully-fledged PPL (PyMC!)

Unlike #25, these squares are not yet fixed, and any cool idea you have can be used.

Continuous Non-contractual + Hierachical structure (#39) (up for grabs)
Continuous Contractual + ??? (up for grabs)
Discrete Non-contractual + ??? (up for grabs)
Discrete Contractual + cohort / temporal effects (suggested to @drbenvincent), see #35

Possible the grid won't be over the 4 types of models, but perhaps over extensions:

Cohort + Temporal effects on lifetime
Cohort + Temporal effects on value
Complex interactions between Lifetime and Value components
- E.g., subscription fee affects churn-rate and value, @juanitorduz brought something that resonates with this
THE NEXT BANG IN MARKETING MODELS ???

Add notebooks to fill the basic CLV grid

The idea is to write a notebook with pure PyMC model(s) for each of these CLV scenarios. We can start with the Lifetime part (not-value yet), but ideally we will include value as well by the end. This might be a constant with time-decay penalty in the simplest cases.

Hopefully this will give us a good picture of the building blocks that are necessary for a minimum viable package, and can also serve as the base documentation. Overtime we would replace the custom PyMC code with imports from the CLV sub-package.

Continuous Non-contractual #16
Continuous Contractual #36
Discrete Non-contractual (up for grabs)
Discrete Contractual #32

Publicise by writing blog posts

After #30, publicise by writing post at

https://towardsdatascience.com/
PyMC Labs