Multi-stage calculations #8486

igorlukanin · 2024-07-17T22:18:54Z

Cube provides rich data modeling capabilities and supports various use cases.

We would like to level up Cube's data modeling with the post-aggregation engine that would allow for further manipulations with already aggregated data, supporting more sophisticated analytics use cases or providing a way to express them in the data model in a streamlined way:

Period-to-date calculations, such as year-to-date (YTD), quarter-to-date (QTD), or month-to-date (MTD) analyses.
Differences or changes: finding the difference between two aggregated measures, like year-over-year sales growth.
Fixed vs. relative comparison, which are useful when you need to compare individual items to a broader dataset.
Ratio and percent of total calculations, calculations of ratios or percent totals that need specific control over the numerator and the denominator.
Segmentation and grouping, calculations of advanced segmentation or grouping that is independent of the view’s granularity.

It's currently planned for October 2024.

Out of scope:

Pre-aggregation support in multi-stage calculations #8487

HMVarshney · 2025-04-21T15:44:53Z

Hi.
This would be a very useful feature for me. Any updated timeline of its release?

AlexisBocuze · 2025-05-05T16:57:50Z

That looks like a great feature!

Testing it, I have what I think is an unexpected behaviour when I select weekly granularity.
On a measure summing revenues, the result for revenue_prior_year on the 1st week of 2025 does not match the result for revenue for the 1st week of 2024.

I can get the expected behaviour if I set

- name: revenue_last_year_weekly
  sql: revenue
  type: number
  rolling_window:
      trailing: 52 week
      leading: -51 week
      offset: start

date	revenue	revenue_last_year_weekly
2024-01-01 W1	409,001	NA
...	...	...
2024-12-30 W1	230,002	409,001

But with the recommended implementation of multi-stage calculations, the numbers don't match anymore.

- name: revenue_prior_year
  multi_stage: true
  sql: "{revenue}"
  type: number
  time_shift:
     - time_dimension: date
        interval: 1 year
        type: prior

date	revenue	revenue_prior_year
2024-01-01 W1	409,001	NA
...	...	...
2024-12-30 W1	230,002	558,208

That approach returns matching results when the time dimension is set at yearly or monthly level, but not for weekly granularity.
I could keep the first one but it'd mean to have a dedicated measure for each time dimension granularity which is not ideal.
By the way, the first approach stops working when setting cubejs_tesseract_sql_planner = true. It return the following error:

SQL compilation error: syntax error line 19 at position 6 unexpected 'VALUES'.

Am I misinterpreting the expected behaviour? Is there a way we can align comparison vs last year on weekly granularity too?

igorlukanin added enhancement New feature proposal data modeling labels Jul 17, 2024

igorlukanin assigned paveltiunov Jul 17, 2024

This was referenced Jul 17, 2024

Pre-aggregation support in multi-stage calculations #8487

Open

Post-aggregate measure type #8445

Closed

Cube Core roadmap #8492

Open

igorlukanin added the Roadmap: Q3 FY'25 (Aug-Oct 2024) label Jul 18, 2024

igorlukanin changed the title ~~Post-aggregation engine~~ Multi-stage calculations Oct 11, 2024

itestyoy mentioned this issue Oct 11, 2024

[Feature request] Post-aggregate measure filters #8804

Open

igorlukanin mentioned this issue Nov 8, 2024

Support window functions in Cube Store #8932

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multi-stage calculations #8486

Multi-stage calculations #8486

igorlukanin commented Jul 17, 2024 •

edited

Loading

HMVarshney commented Apr 21, 2025

AlexisBocuze commented May 5, 2025

Multi-stage calculations #8486

Multi-stage calculations #8486

Comments

igorlukanin commented Jul 17, 2024 • edited Loading

HMVarshney commented Apr 21, 2025

AlexisBocuze commented May 5, 2025

igorlukanin commented Jul 17, 2024 •

edited

Loading