I’m surprised there isn’t more momentum on this. It would be neat to modify this to support a local LLM, and maybe use txtai to build semantic knowledge graphs from the screenshots to auto-detect the “topics” of your desktop activity.
I wonder if there is a sensible API where you can track everything without storing a video file. Something that exposes window positions, classes, names, and maybe even the GTK/Qt cache?
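
For what it's worth, on X11 you can get part of the way there by polling the window manager instead of recording the screen. Below is a rough sketch, assuming `wmctrl` is installed and that `wmctrl -lGx` prints one window per line as id, desktop, x, y, width, height, WM_CLASS, host, title. The `WindowInfo`, `parse_wmctrl_line` and `snapshot` names are made up for illustration, and this will not work as-is on Wayland.

```python
import subprocess
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class WindowInfo:
    window_id: str
    desktop: int
    x: int
    y: int
    width: int
    height: int
    wm_class: str
    title: str


def parse_wmctrl_line(line: str) -> WindowInfo:
    # Assumed column order for `wmctrl -lGx`:
    # id, desktop, x, y, width, height, WM_CLASS, host, title
    parts = line.split(None, 8)
    if len(parts) < 8:
        raise ValueError(f"unexpected wmctrl line: {line!r}")
    # A window with an empty title yields only 8 fields.
    title = parts[8].strip() if len(parts) > 8 else ""
    return WindowInfo(
        window_id=parts[0],
        desktop=int(parts[1]),
        x=int(parts[2]),
        y=int(parts[3]),
        width=int(parts[4]),
        height=int(parts[5]),
        wm_class=parts[6],
        title=title,
    )


def snapshot() -> List[WindowInfo]:
    # Requires an X11 session with wmctrl available on PATH.
    out = subprocess.run(
        ["wmctrl", "-lGx"], capture_output=True, text=True, check=True
    ).stdout
    return [parse_wmctrl_line(l) for l in out.splitlines() if l.strip()]
```

Calling `snapshot()` every few seconds and logging only the windows that changed would give you a text-only activity log, which is much smaller than video and easier to feed to an LLM.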