Arithmetic mean vs average linear/step interpolation

Hi folks,

I’m trying to achieve the following: I want to compare different calculations from my persistence service (InfluxDB). openhab-js offers a nice API for calculating time-weighted averages.

In HomeAssistant there are two properties that do something similar: average_linear and average_step. You can also query average_timeless in HomeAssistant.

Is there something similar for openHAB? And which interpolation is currently used for averageSince, averageBetween and averageUntil? According to this implementation, I assume it is step interpolation, right?

I saw that there are internalMean[since|until|between] and internalMedian[since|until|between] in openHAB core, but I guess those are not exposed to the JS API?

Thanks,
Daniel

OH only provides one type of average calculation, and based on the code it looks like OH implements something close to what HA calls a step average.

From the docs: Persistence | openHAB

Time-weighted averages take into consideration not only the numerical levels of a particular variable, but also the amount of time spent on it. For instance, if you are measuring the temperature in a room, you acknowledge the differences in the amounts of time until it changes. A brief example: 18 °C for 13 hours a day, 21 °C for 7 hours a day, and 16.5 °C for 4 hours a day.

1. Multiply each value by its duration: 18 °C × 13 h, 21 °C × 7 h and 16.5 °C × 4 h (234, 147, and 66, respectively).
2. Sum the values that you obtained. In this case, 447 °C hours.
3. Add together the time weights to get the total weight. In our example, the total weight is 13 h + 7 h + 4 h = 24 h.
4. Divide the value in step 2 by the total weight in step 3, to get an average of 447 °C hours / 24 h = 18.625 °C.
As I understand it, it therefore uses weights rather than interpolation, but I’m not sure it matches any of the types of averages HA does based on their descriptions.
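For reference, that doc arithmetic condenses to a few lines of plain JS (values taken from the quote above):

// Time-weighted average = sum(value × hours) / sum(hours)
const samples = [[18, 13], [21, 7], [16.5, 4]];                  // [°C, h]
const weightedSum = samples.reduce((s, [v, h]) => s + v * h, 0); // 234 + 147 + 66 = 447 °C·h
const totalHours = samples.reduce((s, [, h]) => s + h, 0);       // 13 + 7 + 4 = 24 h
console.log(weightedSum / totalHours);                           // 18.625 °C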

Other types of averages could of course be added, I suspect, but we also run the risk of adding a whole lot of confusion if we have too many choices. I’d be surprised if the average OH user (pun intended) knew the differences between these and which is appropriate to use in which circumstances. So that needs to be weighed against the complexity of adding a bunch of new average functions.

Of course, using any of the “getAllStates*” actions gives you the raw data to calculate the average however you wish.

I agree, with getAllStates you could implement any algorithm you can imagine. I tried that, but it is terribly slow compared to the Java code running behind the JS wrappers.

The use case is this: one calculates an integral to get the area under a curve, using the average function and multiplying it by the elapsed time. Whether a linear or a stepwise approach fits best (i.e. which approximation is closest to the truth) then depends on the persistence strategy and filters.
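A minimal sketch of that idea with the openhab-js persistence API (the item name is a placeholder; assumes the Item is persisted in W and has data in the period):

const start = time.toZDT('00:00');
const end = time.toZDT();
// energy [Wh] ≈ time-weighted average power [W] × elapsed time [h]
const avgW = items.getItem('MyPower_W').persistence.averageBetween(start, end).numericState;
const energyWh = avgW * (time.Duration.between(start, end).toMillis() / 3600000);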

More specifically: for example, I measure the power draw of my heat pump and want to calculate the energy consumption. The heat pump runs intermittently, so the recording always contains power values of 0 W, but I only persist those every 15 minutes (all values above 0 W every 5 seconds). The step variant is currently the right fit, but there are use cases where linear interpolation makes more sense.
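To make the difference concrete, here is the area a single segment between two samples contributes under each scheme (plain JS, hypothetical values):

// Segment between samples v0 and v1, dt hours apart:
const stepArea = (v0, v1, dt) => v0 * dt;                // step/LOCF: hold v0 until the next sample
const linearArea = (v0, v1, dt) => ((v0 + v1) / 2) * dt; // linear: trapezoid between the samples
// e.g. v0 = 0 W, v1 = 1200 W, dt = 0.25 h → step: 0 Wh, linear: 150 Wh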

The Median functions are available in JS scripting, just like the Average functions. The Average functions correspond to the step averages in Home Assistant. There are no Mean functions at all (not even internally). While they could be added, they were considered less useful. But if you are interested, open an issue in the core repo, and I may look into it when I have time.
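For reference, a quick sketch of a median call (item name is a placeholder; requires a core/openhab-js version recent enough to expose the median functions):

const median = items.getItem('MyTemperature').persistence.medianSince(time.toZDT().minusHours(24));
console.log('24 h median: ' + median.numericState);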

I doubt JS adds that much overhead. I suspect it’s the persistence itself that makes it slower.

But you can use Rules DSL, or in JS skip the wrappers entirely by using var Persistence = Java.type("org.openhab.core.persistence.extensions.PersistenceExtensions");. That imports the raw Java class that implements the persistence actions. Of course, that means anything you get from calls to that class will also be raw Java, so you’ll have to treat it accordingly. For example, you can’t use JS methods to iterate and map/reduce a java.util.List.
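If you do want JS-style iteration, GraalJS (which JS Scripting is built on) can copy a Java list into a real JS array; a hedged sketch with placeholder names:

var Persistence = Java.type('org.openhab.core.persistence.extensions.PersistenceExtensions');
var ZonedDateTime = Java.type('java.time.ZonedDateTime');
var start = ZonedDateTime.now().minusHours(24);
var javaList = Persistence.getAllStatesBetween(items.getItem('MyItem').rawItem, start, ZonedDateTime.now());
var jsArray = Java.from(javaList); // shallow JS copy: map/filter/reduce now work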

Here is my script. The JavaScript approach (LOCF - last observation carried forward weighting), which follows the same calculation approach, takes almost 15 times as long.

(function() {
  const logger = log('org.openhab.rule.' + ctx.ruleUID);
  
  const calcEnergy = function(item) {
    const midnight = time.toZDT('00:00');
    const now = time.toZDT();
    const avg = items.getItem(item).persistence.averageBetween(midnight, now).numericState;
    const hours = time.Duration.between(midnight, now).toNanos() / 3600000000000; // 3.6e12 ns in an hour
    const sum = avg * hours;
    logger.info('AVG: ' + sum + ' (took: ' + time.Duration.between(now, time.toZDT()).toMillis() / 1000 + ' secs)');
    return sum;
  };
  
  const calcEnergyAlt = function(item) {
    // LOCF - last observation carried forward weighting
    const midnight = time.toZDT('00:00');
    const now = time.toZDT();
    const states = items.getItem(item).persistence.getAllStatesBetween(midnight, now);
    logger.info('ALT (persistence call) took ' + time.Duration.between(now, time.toZDT()).toMillis() / 1000 + ' secs');
    let sum = 0;
    let previousTimestamp = midnight;
    let previousValue = 0;
    for (const state of states) {
      const dt = time.Duration.between(previousTimestamp, state.timestamp).toNanos() / 3600000000000;
      sum += previousValue * dt;
      previousTimestamp = state.timestamp;
      previousValue = state.numericState;
    }
    logger.info('ALT: ' + sum + ' (took: ' + time.Duration.between(now, time.toZDT()).toMillis() / 1000 + ' secs)');
    return sum;
  };
  
  calcEnergy('espaltherma_electrical_power_space');
  calcEnergyAlt('espaltherma_electrical_power_space');
})();

Output:

2024-10-28 20:00:19.996 [INFO ] [ion.openhab-js.org.openhab.rule.test] - AVG: 2878.4258649361104 (took: 1.247 secs)
2024-10-28 20:00:21.543 [INFO ] [ion.openhab-js.org.openhab.rule.test] - ALT (persistence call) took 1.533 secs
2024-10-28 20:00:34.781 [INFO ] [ion.openhab-js.org.openhab.rule.test] - ALT: 2878.1063099333337 (took: 14.772 secs)

It’s definitely the JavaScript code that makes it slow. The persistence call is just as fast as the AVG calculation.

All I can say is I still doubt it’s the JS wrappers causing problems. But you can prove one way or the other by using the raw Java.

var PersistenceExtensions = Java.type("org.openhab.core.persistence.extensions.PersistenceExtensions");
...

var states = PersistenceExtensions.getAllStatesBetween(items.getItem(item).rawItem, midnight, now);
...
states.forEach(hi => {
  // hi is the Java HistoricItem, not a JS PersistedState
  ...
});

I think you can use a js-joda ZonedDateTime here. If not, you’ll need to import and use a java.time.ZonedDateTime.

I’m 100% sure, as you can see from the “time tracking” in the script. The loop consumes 14 seconds. If I slice the states array down to 5 elements, the loop consumes less than 0.1 seconds.

I found out that the access to state.timestamp consumes most of the time, so handling js-joda ZonedDateTime objects is pretty slow.

OK, but I’ve also provided code above that lets you use raw Java for everything inside JS. If you use the Java above, you won’t be calling state.timestamp at all. You’ll be calling hi.getInstant(), which returns a java.time.Instant. That completely bypasses the JS wrappers and everything else that can cause such slowdowns.
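A minimal sketch of that loop, staying in java.time end to end (the item name is a placeholder; assumes a core version where HistoricItem exposes getInstant() as described above):

var PersistenceExtensions = Java.type('org.openhab.core.persistence.extensions.PersistenceExtensions');
var Duration = Java.type('java.time.Duration');
var ZonedDateTime = Java.type('java.time.ZonedDateTime');
var LocalTime = Java.type('java.time.LocalTime');

var midnight = ZonedDateTime.now().with(LocalTime.MIDNIGHT);
var states = PersistenceExtensions.getAllStatesBetween(items.getItem('MyPowerItem').rawItem, midnight, ZonedDateTime.now());

var sum = 0;
var prevInstant = midnight.toInstant();
var prevValue = 0;
states.forEach(hi => {
  var inst = hi.getInstant();  // java.time.Instant, no js-joda wrapper involved
  sum += prevValue * (Duration.between(prevInstant, inst).toNanos() / 3600000000000); // ns → h
  prevInstant = inst;
  prevValue = parseFloat(hi.getState());
});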

Yes, it’s the js-joda wrapper that’s damn slow.

Here’s the full example as proof that the JS wrappers, especially the js-joda ZonedDateTime wrapper, are extremely slow. I mean: extremely.

(function() {
  const logger = log('org.openhab.rule.' + ctx.ruleUID);
  
  const calcEnergy = function(item) {
    const midnight = time.toZDT('00:00');
    const now = time.toZDT();
    const avg = items.getItem(item).persistence.averageBetween(midnight, now).numericState;
    const hours = time.Duration.between(midnight, now).toNanos() / 3600000000000; // 3.6e12 ns in an hour
    const sum = avg * hours;
    logger.info('AVG: ' + sum + ' (took: ' + time.Duration.between(now, time.toZDT()).toMillis() / 1000 + ' secs)');
    return sum;
  };
  
  const calcEnergyAlt = function(item) {
    // LOCF - last observation carried forward weighting
    const midnight = time.toZDT('00:00');
    const now = time.toZDT();
    const states = items.getItem(item).persistence.getAllStatesBetween(midnight, now);
    logger.info('ALT (persistence call) took ' + time.Duration.between(now, time.toZDT()).toMillis() / 1000 + ' secs');
    let sum = 0;
    let previousTimestamp = midnight;
    let previousValue = 0;
    for (const state of states) {
      const dt = time.Duration.between(previousTimestamp, state.timestamp).toNanos() / 3600000000000;
      sum += previousValue * dt;
      previousTimestamp = state.timestamp;
      previousValue = state.numericState;
    }
    logger.info('ALT: ' + sum + ' (took: ' + time.Duration.between(now, time.toZDT()).toMillis() / 1000 + ' secs)');
    return sum;
  };
  
  const calcEnergyPureJava = function(item) {
    const PersistenceExtensions = Java.type('org.openhab.core.persistence.extensions.PersistenceExtensions');
    const Duration = Java.type('java.time.Duration');
    const ZonedDateTime = Java.type('java.time.ZonedDateTime');
    const LocalTime = Java.type('java.time.LocalTime');
    const midnight = ZonedDateTime.now().with(LocalTime.MIDNIGHT);
    const now = ZonedDateTime.now();
    const states = PersistenceExtensions.getAllStatesBetween(items.getItem(item).rawItem, midnight, now);
    logger.info('JVA (persistence call) took ' + Duration.between(now, ZonedDateTime.now()).toMillis() / 1000 + ' secs');
    let sum = 0;
    let previousTimestamp = midnight;
    let previousValue = 0;
    states.forEach(state => { // state is a raw Java HistoricItem
      const dt = Duration.between(previousTimestamp, state.timestamp).toNanos() / 3600000000000;
      sum += previousValue * dt;
      previousTimestamp = state.timestamp;
      previousValue = parseFloat(state.state);
    });
    logger.info('JVA: ' + sum + ' (took: ' + Duration.between(now, ZonedDateTime.now()).toMillis() / 1000 + ' secs)');
    return sum;
  };
  
  calcEnergy('espaltherma_electrical_power_space');
  calcEnergyAlt('espaltherma_electrical_power_space');
  calcEnergyPureJava('espaltherma_electrical_power_space');
})();

Output:
2024-10-28 21:32:27.077 [INFO ] [ion.openhab-js.org.openhab.rule.test] - AVG: 3047.0991170055563 (took: 1.182 secs)
2024-10-28 21:32:28.361 [INFO ] [ion.openhab-js.org.openhab.rule.test] - ALT (persistence call) took 1.275 secs
2024-10-28 21:32:41.922 [INFO ] [ion.openhab-js.org.openhab.rule.test] - ALT: 3047.0716656027776 (took: 14.835 secs)
2024-10-28 21:32:43.071 [INFO ] [ion.openhab-js.org.openhab.rule.test] - JVA (persistence call) took 1.146 secs
2024-10-28 21:32:43.338 [INFO ] [ion.openhab-js.org.openhab.rule.test] - JVA: 3047.0716656027776 (took: 1.412 secs)

Fine, it’s slow. But again, you can use the raw Java.

It’s ridiculously slow. I think it’s worth investigating here.

But back to the original topic: it would be nice if openhab-js could provide convenience functions for such calculations. I’m also missing a Riemann approach.

openhab-js just wraps the functions provided by OH core. Any new calculations should be implemented there. It’s definitely worth an issue as @Mherwege suggested. It’s not hard to implement by any means.

As for the slowness, I’m not seeing the same slowdown, but I’m running on a relatively fast machine so I wouldn’t necessarily notice. That is worth an issue on the openhab-js repo, though, to see if anything can be done. Historically, messing around with date-times tends to be expensive, so there may not be anything that can be done there, but where performance is an issue you can use the raw Java.

I have not filed issues yet, but I implemented a pretty accurate calculation for power consumption in a JS rule. This could be a nice addition to openHAB core and the rule APIs (JS and DSL). Input is an Item with a series of power readings. It calculates the Riemann sum with the midpoint strategy.

https://www.statisticshowto.com/calculus-problem-solving/riemann-sums/

  const calcRiemannSumMidpoint = function(item) {
    const PersistenceExtensions = Java.type('org.openhab.core.persistence.extensions.PersistenceExtensions');
    const Duration = Java.type('java.time.Duration');
    const ZonedDateTime = Java.type('java.time.ZonedDateTime');
    const LocalTime = Java.type('java.time.LocalTime');
    const midnight = ZonedDateTime.now().with(LocalTime.MIDNIGHT);
    const now = ZonedDateTime.now();
    const states = PersistenceExtensions.getAllStatesBetween(items.getItem(item).rawItem, midnight, now);
    if (states.size() === 0) {
      return 0;
    }
    // Half-interval widths: the first sample also covers the span back to midnight,
    // the last one the span up to now.
    let dtPrev = Duration.between(midnight, states[0].getTimestamp()).toNanos() / 3600000000000;
    const dtTail = Duration.between(states[states.size() - 1].getTimestamp(), now).toNanos() / 3600000000000;
    let sum = 0;
    for (let index = 0; index < states.size(); index++) {
      const curr = states[index];
      const prev = index > 0 ? states[index - 1] : null;
      const next = index < states.size() - 1 ? states[index + 1] : null;
      if (prev) {
        dtPrev = Duration.between(prev.getTimestamp(), curr.getTimestamp()).toNanos() / 3600000000000 / 2;
      }
      // The last sample has no successor, so it extends up to now (dtTail).
      const dtNext = next
        ? Duration.between(curr.getTimestamp(), next.getTimestamp()).toNanos() / 3600000000000 / 2
        : dtTail;
      sum += (dtPrev + dtNext) * curr.getState().floatValue();
    }
    return sum;
  };
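Hypothetical usage, assuming the Item persists power in W (the sum then comes out in Wh):

  const energyWh = calcRiemannSumMidpoint('espaltherma_electrical_power_space');
  console.log('Midpoint Riemann sum: ' + energyWh + ' Wh');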