Solves a bug for the scenario when multiple processes create a context and a stream into a single GPU but withou activity.

Adds trace size to timing.
Bump to dev version 0.6.0-dev202601201