Dry Run
batchling allow users to declare that they want to launch a dry run for their batching.
This feature exists for users to be able to debug and better understand what WILL happen when they ultimately disable the flag, giving them the transparency required to be confident in the library.
In practice, the dry run feature deactivates all batch submissions, but everything is done virtually, which means we can count incoming requests, number of batch we would have created, etc..
To put it simply, it provides users with an exact breakdown of what their batched inference run would have been for real.
Activating dry run
Dry run is activated by setting up a flag in the CLI or SDK:
-
dry_run=Trueif using the SDK -
--dry-runif using the CLI
Next Steps
- See how cache is saved and for how long it is kept.