HomeDocsAPI Reference
Kumo.ai
Docs

Customer Lifetime Value (LTV) Prediction

Solution Background and Business Value

Predicting customer lifetime value (LTV) is a pivotal strategy for businesses aiming to optimize their marketing efforts, enhance customer relationships, and maximize profitability. By accurately forecasting the future revenue a customer is likely to generate, companies can tailor their marketing strategies, allocate resources more effectively, and focus on high-value customers who contribute the most to the bottom line.

LTV prediction can be joined together with other ML models, like churn prediction and coupon affinity models to enables businesses to identify and retain their most valuable customers, reduce churn rates, and enhance customer satisfaction through personalized experiences.

Data Requirements and Kumo Graph

Although you can build an LTV prediction model with a small set of core tables (customers, orders), you can also include many other data sources in your graph. The following is a non-exhaustive list of example tables you can use.

Core tables

  1. Customers: This table contains basic information about each customer.
    1. customer_id (Primary Key)
    2. name
    3. email
    4. phone
    5. registration_date
    6. address
  2. Orders: This table captures details about each order placed by customers.
    1. order_id (Primary Key)
    2. customer_id (Foreign Key referencing Customers)
    3. Product_id (Foreign Key referencing Products)
    4. order_date
    5. quantity
    6. price

Additional Table Suggestions

You can extend the graph with more Optional tables:

  1. (optional) Products: This table holds information about the products sold.
    1. product_id (Primary Key)
    2. product_name
    3. category
    4. price
    5. cost
  2. (optional) Order_events: This table logs events associated with an order.
    1. order_id (Foreign Key referencing Orders)
    2. event (payment method, delivery status etc.)
    3. event_date
    4. amount
  3. (optional) Customer_Interactions: This table records customer interactions with the company (e.g., support tickets, feedback).
    1. interaction_id (Primary Key)
    2. customer_id (Foreign Key referencing Customers)
    3. interaction_date
    4. interaction_type (Support, Feedback, Inquiry, etc.)
    5. interaction_details
  4. (optional) Returns: This table tracks returns made by customers.
    1. return_id (Primary Key)
    2. order_id (Foreign Key referencing Orders)
    3. product_id (Foreign Key referencing Products)
    4. return_date
    5. return_reason
    6. refund_amount
  5. (optional) Customer_Loyalty: This table includes data on customer loyalty programs and points.
    1. loyalty_id (Primary Key)
    2. customer_id (Foreign Key referencing Customers)
    3. loyalty_points
    4. membership_level (e.g., Bronze, Silver, Gold)
    5. points_earned
    6. points_redeemed
  6. (optional) Marketing_Campaigns: This table tracks marketing efforts targeted at customers.
    1. campaign_id (Primary Key)
    2. customer_id (Foreign Key referencing Customers)
    3. campaign_type (Email, SMS, Social Media, etc.)
    4. campaign_date
    5. campaign_response (Opened, Clicked, Ignored)

Predictive Query

LTV metrics can be defined in a variety of ways, depending on what KPIs you are interested in. Here are some example definitions of customer LTV:

  • Aggregate total spending per customer from the Orders and Transactions tables in a future timeframe
  • A combination purchase frequency, average order value, and return rates.

We can extend the LTV definition by:

  • Incorporating data on customer engagement from Customer_Interactions and Marketing_Campaigns.
  • Considering loyalty data from the Customer_Loyalty table.

Kumo allows you to flexibly define targets and train predictive models—here are some example predictive queries that predict a variant of customer LTV:

  1. Predict customer spend in next 6 months (LTV)

    PREDICT SUM(Orders.price, 0, 180, days)
    FOR EACH Customers.customers_id
    
  2. Predict Volume of transactions in next 6 months for active customers

    PREDICT COUNT(Orders.price, 0, 180, days)
    FOR EACH Customers.customers_id
    WHERE COUNT(Orders.*, -30,0, days) > 0
    

Deployment

Several downstream applications of LTV models are possible, the most common being a combination of LTV and churn prediction for identifying high value customers at risk of churning. A growth machine learning strategy might consist of the following:

  1. Using Kumo to predict LTV and churn probabilities for all active customers
  2. Writing a SQL view on the predictions, considering a combination of estimated user value and churn probability to identify a subset of users to email.
  3. Exporting these user's IDs to a cross-channel marketing orchestration platform (e.g., Braze or Marketo).
  4. Sending a personalized email or coupon to these users to spur re-engagement.
  5. Orchestrating all of these steps to happen automatically (e.g., on a weekly basis) using an existing orchestration tool like Airflow or Dagster.