
Conversation

@lich2000117
Contributor

@lich2000117 lich2000117 commented Mar 6, 2025

Blueprints Updates

  1. Fixed issues with Gemini model selection

    • The default model should be empty to prevent incorrect behavior.
  2. Added a default prompt to enhance the "memory" function

    • This ensures better AI assistance when identifying persons/pets.
  3. Tried running stream_analyzer in parallel with the image_analyzer that classifies "important", but had no luck, so the "important" and "image_analyzer" features were removed.

  • This ensures no initial clips/frames are missed; previously, stream_analyzer only started processing after image_analyzer finished classifying "important or not".

Media Handler Changes

  1. Prioritized the first frame of the stream as the most important by hardcoding its similarity score (see the sketch after this list)
    • Since the first frame triggers the automation, it should always be considered important.
    • This reduces edge cases where objects that appear for a short duration might otherwise be missed.

1. Saves the event to the timeline regardless of how important it is.
2. Starts analysis in parallel to avoid missing initial clips.
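
For illustration, a minimal sketch of the first-frame prioritisation (function and variable names are illustrative, not the integration's actual media handler API):

```python
# Sketch only: select_key_frames and compute_similarity are stand-in names.

def select_key_frames(frames, compute_similarity, max_frames=3):
    """Pick frames to send for analysis, always including the first frame."""
    if not frames:
        return []

    # The first frame triggered the automation, so it is always kept,
    # regardless of its similarity score.
    selected = [frames[0]]

    # Score the remaining frames; here a lower similarity to the previous
    # frame is assumed to mean "more new information".
    scored = sorted(
        ((compute_similarity(frames[i], frames[i - 1]), i) for i in range(1, len(frames))),
        key=lambda pair: pair[0],
    )
    selected.extend(frames[i] for _, i in scored[: max_frames - 1])
    return selected
```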
@lich2000117
Contributor Author

Added as a draft, as I'm still doing some real-life testing with the new version.

@valentinfrlch
Owner

Appreciate all your work!
Just wanted to check if you still want to contribute to this repository or maintain your own fork. Both are fine, of course. Also let me know if you require any assistance!

@lich2000117
Contributor Author

Appreciate all your work! Just wanted to check if you still want to contribute to this repository or maintain your own fork. Both are fine, of course. Also let me know if you require any assistance!

I'd absolutely love to, but I'm keen to hear your thoughts on hardcoding the initial frame as a key frame.

The existing "important" feature currently adds a significant 1-2 second delay to the stream analyzer unless it is handled concurrently, either in the Blueprint or in Python.
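
Roughly what I mean by handling it concurrently on the Python side (a sketch only; analyze_stream and classify_important are stand-in names, not real functions from this integration):

```python
import asyncio

async def analyze_stream(frames):
    # Stand-in: start summarizing frames immediately so the initial clips
    # are not skipped while waiting on the importance check.
    return f"summary of {len(frames)} frames"

async def classify_important(first_frame):
    # Stand-in: decide whether the event warrants a notification at all.
    return True

async def handle_event(frames):
    # Run the stream analysis and the "important" check at the same time,
    # instead of starting the analysis only after the check has returned.
    summary, important = await asyncio.gather(
        analyze_stream(frames),
        classify_important(frames[0]),
    )
    return summary, important
```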

Lmk your thoughts!

@valentinfrlch
Owner

I think always including the first frame for analysis is a great idea, and using the first frame as the keyframe as well! Also, really appreciate the translation!

You're right, the important feature adds some delay. I don't use it myself. Maybe it could be improved, though since sending the actual notification depends on what "important" returns, I'm not sure it can be parallelized. The feature is a bit of a gimmick (which is why it is labelled 'experimental'), but I like the idea. It is optional, after all.

Let me know what you think, and thank you for your work, it is much appreciated!

@lich2000117
Contributor Author

lich2000117 commented Mar 19, 2025

That's great to hear! To personalise it further, I believe running some basic CV before sending the image and prompt to the LLM could greatly improve accuracy and usability.

It would also be good if error handling could be added to the blueprints (my free Gemini model usually exceeds its quota).

Edit:
BTW, the link and author information in manifest.json were changed for my local testing of the integration (I set up my own HACS integration to pull updates from my forked branch). I have reverted them and will help create a PR soon.

@valentinfrlch
Owner

I'm sure using CV before sending would increase accuracy, but it would also increase latency. I think it's best to keep the integration minimal and focused on one thing. Personally, I use Frigate (which does all the CV processing, e.g. detecting people), and the automation only triggers if Frigate detects something.

The other problem with more sophisticated preprocessing is raw performance. A big part of CV is optimizing it to run well on different hardware, with all the vendor-specific hardware acceleration. In my opinion, it's better to offer tighter integration with tools like Frigate that already do that.

I 100% agree on the error handling. I'm actually pretty new to blueprints and so I don't know if there is a proper way to catch errors. So far I haven't found anything. Graceful error handling is definitely something that would be nice!

The Gemini issue you mentioned doesn't seem limited to this integration; it also affects the native 'Google Generative AI' integration. There is an issue here: #262 (another reason error handling would indeed be great!).
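
On the integration side, graceful handling would probably look something like this (a sketch only; call_provider and notify_user are made-up names, not the actual API):

```python
# Sketch: call_provider and notify_user are illustrative stand-ins.

async def analyze_gracefully(call_provider, notify_user, payload):
    try:
        return await call_provider(payload)
    except Exception as err:  # e.g. Gemini returning a quota-exceeded error
        # Surface the failure instead of letting the automation fail silently.
        await notify_user(f"LLM Vision request failed: {err}")
        return None
```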

@lich2000117
Contributor Author

lich2000117 commented Mar 19, 2025 via email

@lich2000117 lich2000117 changed the title from "Optimise Event Processing: Parallel Stream Analysis, Improved Memory Prompt, and Gemini Fixes" to "Optimise Event Processing: First Frame as Key Frame, Improved Memory Prompt, and Gemini Error catching" on Mar 19, 2025
@lich2000117 lich2000117 marked this pull request as ready for review March 19, 2025 23:29
@valentinfrlch
Owner

Sure! valentinfrlch on Discord.

@lich2000117
Contributor Author

lich2000117 commented Mar 23, 2025 via email

@valentinfrlch valentinfrlch changed the base branch from main to 1.4.2-beta March 25, 2025 10:58
Owner

@valentinfrlch valentinfrlch left a comment


Thank you! LGTM.

Unrelated, but while I was looking through this, I realized there are way too many consts. One API_KEY would be enough (instead of every provider having their own const). I'll fix that soon.
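
Roughly what I have in mind (a sketch of const.py, not the current file):

```python
# const.py (sketch): one shared config key for every provider ...
CONF_API_KEY = "api_key"

# ... instead of a separate const per provider, e.g.:
# CONF_OPENAI_API_KEY = "openai_api_key"
# CONF_GOOGLE_API_KEY = "google_api_key"
# CONF_ANTHROPIC_API_KEY = "anthropic_api_key"
```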

@valentinfrlch valentinfrlch merged commit a43d126 into valentinfrlch:1.4.2-beta Mar 25, 2025
4 checks passed