evaluate

Evaluates the model's loss and accuracy on testSamples, using the settings in spec.

Thread-safe. Operations on testSamples are blocked while the evaluation runs.

Return

The pair (loss, accuracy).
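
As an illustration of the contract described here, the following is a minimal Python sketch, not the library's implementation. Everything in it is an assumption made for the example: the `EvalSpec` class, the `cross_entropy` helper, the module-level lock, the sample layout (features, label), and the choice of mean cross-entropy as the loss. It only shows how a thread-safe evaluation that holds the samples for its whole duration could return a (loss, accuracy) pair.

```python
import math
import threading
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

# Hypothetical types: a sample is (features, label); the model maps
# features to a sequence of class probabilities.
Sample = Tuple[Sequence[float], int]
Model = Callable[[Sequence[float]], Sequence[float]]


def cross_entropy(probs: Sequence[float], label: int) -> float:
    """Negative log-probability of the true label, clamped to avoid log(0)."""
    return -math.log(max(probs[label], 1e-12))


@dataclass
class EvalSpec:
    """Hypothetical stand-in for `spec`: here it only names the loss function."""
    loss_fn: Callable[[Sequence[float], int], float]


# Hypothetical lock guarding the shared sample list.
_samples_lock = threading.Lock()


def evaluate(model: Model, test_samples: List[Sample], spec: EvalSpec) -> Tuple[float, float]:
    """Return (mean loss, accuracy) over test_samples."""
    # Holding the lock makes concurrent callers wait, so the samples
    # cannot change while the evaluation runs.
    with _samples_lock:
        if not test_samples:
            raise ValueError("test_samples must not be empty")
        total_loss = 0.0
        correct = 0
        for features, label in test_samples:
            probs = model(features)
            total_loss += spec.loss_fn(probs, label)
            # Predicted class is the index of the highest probability.
            predicted = max(range(len(probs)), key=probs.__getitem__)
            correct += int(predicted == label)
        n = len(test_samples)
        return total_loss / n, correct / n


if __name__ == "__main__":
    samples = [([0.0], 0), ([1.0], 1)]
    loss, accuracy = evaluate(lambda x: [0.5, 0.5], samples, EvalSpec(loss_fn=cross_entropy))
    print(loss, accuracy)
```

In this sketch any other code that touches the samples would need to acquire the same lock. That is one way to read "blocks operations on testSamples"; the actual mechanism is not stated in this entry.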