Avoid splitting outputs in batches when nbatches == 1 #406

lantiga · 2020-06-08T10:02:28Z

Fixes #401

TORCH and TFLITE didn't have the issue, while ONNX and TF did.

It might also be a good idea to add a test targeting this use case (not with full Yolo to avoid including a 200+MB model, only with a tiny model simulating a 1-batch input and 3-batch output).

codecov · 2020-06-08T12:13:43Z

Codecov Report

Merging #406 into master will increase coverage by 0.13%.
The diff coverage is 65.82%.

@@            Coverage Diff             @@
##           master     #406      +/-   ##
==========================================
+ Coverage   72.46%   72.60%   +0.13%     
==========================================
  Files          21       21              
  Lines        4384     4424      +40     
==========================================
+ Hits         3177     3212      +35     
- Misses       1207     1212       +5

Impacted Files	Coverage Δ
src/model.c	`69.20% <0.00%> (ø)`
src/redisai.h	`0.00% <0.00%> (ø)`
src/tensor.c	`83.50% <ø> (ø)`
src/script.c	`63.78% <47.36%> (ø)`
src/backends/tensorflow.c	`65.34% <70.58%> (-0.16%)`	⬇️
src/backends/onnxruntime.c	`68.53% <79.31%> (+0.52%)`	⬆️
src/backends/torch.c	`84.66% <100.00%> (+0.28%)`	⬆️
src/redisai.c	`77.09% <100.00%> (ø)`
... and 3 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update fcdf0d2...3a1bf0b. Read the comment docs.

src/backends/torch.c

lantiga · 2020-06-10T08:51:40Z

@filipecosta90 I don't understand the failing codecov/patch, what's the issue there?

Probably it's the batch size check with TF and ONNX in 7ad0d18

filipecosta90 · 2020-06-10T09:23:48Z

@filipecosta90 I don't understand the failing codecov/patch, what's the issue there?

Probably it's the batch size check with TF and ONNX in 7ad0d18

I believe the reason it fails is due to patch coverage ( added code is 65% ) even tough on overall the project coverage will increase. IMO we can merge and address this in the future via unit testing ( same as other corner cases that are difficult to test on flows )

* Avoid splitting outputs in batches when nbatches == 1 * Add batch size checks * Fix batch checks * Update readies * Add bad batching test

* Introduced readies submodule (#377) * Introduced readies submodule * Fix in paella * Updated get_deps.sh and docs * [WIP] add C API reference to docs using auto tool (#368) * [add] add C API reference to docs using auto tool ( python3 generate_llapi_reference.py ) * [add] generating C API reference on the fly * [add] moving api_reference under Developer Nav Co-authored-by: Luca Antiga <[email protected]> * [WIP] Enable AI.SCRIPTRUN on AI.DAGRUN* (#383) * [add] decoupled scriptrun command parsing from runtime scriptrun. * [add] split positive/negative tests on pytorch scriptrun. * [add] refactor AI.DAGRUN_RO and AI.DAGRUN to use the same code base (with read/write modes) * [add] added positive and negative tests for dagrun with scriptrun * [add] updated documentation to reflect scriptrun support on dagrun * [add] added example enqueuing multiple SCRIPTRUN and MODELRUN commands within a DAG * Add support for variadic arguments to SCRIPT (#395) * Add support for variadic arguments to SCRIPT * Add negative errors * atomic ref count (#403) * Multi-platform build (#398) * CircleCI: increased GPU test timeout to 40m (#404) * CI: platform-build-defs -> on-master-and-version-tags (#405) * Llapi updates (#400) * added low level api return redis types * added variadic to llapi * fixed memory issue for params array re-alloc * Avoid splitting outputs in batches when nbatches == 1 (#406) * Avoid splitting outputs in batches when nbatches == 1 * Add batch size checks * Fix batch checks * Update readies * Add bad batching test * Update README.md * CircleCI: fixed redis installation in coverage builds (#423) * Docker6.0.5 (#427) * Update Dockerfile * Update Dockerfile.arm * Update Dockerfile.gpu * submodule pull moved up (#428) * Update mkdocs.yml (#432) * [add] Add relevant RedisAI config entries to the INFO output, so that standard monitoring systems would be able to monitor them. (#396) * Updated support email and Orobix to Tensorwork on ramp file (#436) * [add] updated support email and Orobix to Tensorwork on ramp file Co-authored-by: Sherin Thomas <[email protected]> * CircleCI: Fixed problem with GPU testing (#440) * Fixed platforms build problem (#441) * Fixed platforms build problem * Build: updated Redis versions * Build: another Redis version update * Dependencies in ramp.yml (#444) * Fixed flagged as "getkeys-api" during the registration ( AI.DAGRUN, AI.DAGRUN_RO, AI.MODELRUN, AI.SCRIPTRUN ) (#438) * [wip] wip on fixing the commands being flagged as getkeys-api during the registration. ( AI.MODELRUN, AI.SCRIPTRUN, AI.DAGRUN, AI.DAGRUN_RO ) * [add] pytorch oss cluster tests passing * [add] TensorFlow lite tests passing on oss cluster * [add] ONNX tests passing on oss cluster * [add] TensorFlow tests passing on oss cluster * [wip] wip on dag * [add] DAG tests passing on oss cluster * [add] enabling oss cluster tests on CI * [add] bumping rmbuilder version from 6.0.1 to 6.0.5 on circleci * [fix] fixed potential invalid mem accesses on RedisAI_DagRunSyntaxParser, RAI_parseDAGPersistArgs * [fix] fixed RAI_FreeDagOp wrongfully calling RedisModule_Free on dagOp->inkeys, dagOp->outkeys * [fix] fixed dictKey allocation on RAI_parseDAGLoadArgs * [add] alter Sanitizer tests to accommodate keys on the same slot. * [add] extracted the methods to respond to RedisModule_IsKeysPositionRequest() from main code of dagrun,modelrun, and scriptrun to specific methods * [fix] Fixes per PR review * [fix] fixes per PR review * [fix] fix per CI ascii conversion error on test_dagro_modelrun_financialNet_no_writes_multiple_modelruns * Safely add to arrays + fix for #443 (#449) * Make sure we reassign the pointer in array_append * Fix case-sensitive comparison for devicestr * Fix sanitizer tests (cherry picked from commit 5c7813e) * fixed tests messages * Shallow copy persisted tensor * Resolve log format warnings * Fix error message and test * Fix memory management in local context dict * Fixed artifacts handling + added tests logs aggregation (#445) * Snapshot packages are placed into http://redismodules.s3.amazonaws.com/redisai/snapshots. Private branches (i.e., non-master) are also supported, just need to enable deploy-snapshot in config.yml to fire on on-any-branch rather than just on-master. * Release packages are placed into http://redismodules.s3.amazonaws.com/redisai, and it's going to be crowded in there, so we may want to consider putting each version into its own directory. * `BB` in-source breakpoints are now supported (in `DEBUG=1` builds): put `BB;` inside the code and it will stop if you're running under gdb. * Tests results in CircleCI are now aggregated as artifacts. * Build: setuptools-related fix (#455) (cherry picked from commit d1fcd0d) Co-authored-by: Rafi Einstein <[email protected]> Co-authored-by: Luca Antiga <[email protected]> Co-authored-by: DvirDukhan <[email protected]> Co-authored-by: Guy Korland <[email protected]> Co-authored-by: Sherin Thomas <[email protected]>

lantiga requested a review from filipecosta90 June 8, 2020 10:02

filipecosta90 previously approved these changes Jun 8, 2020

View reviewed changes

lantiga dismissed filipecosta90’s stale review via 80908e9 June 8, 2020 10:18

lantiga force-pushed the batch_fix branch from 80908e9 to d66f936 Compare June 8, 2020 10:19

lantiga marked this pull request as draft June 8, 2020 10:32

lantiga requested a review from filipecosta90 June 8, 2020 12:29

lantiga marked this pull request as ready for review June 8, 2020 12:30

filipecosta90 reviewed Jun 8, 2020

View reviewed changes

src/backends/torch.c Show resolved Hide resolved

lantiga added 4 commits June 10, 2020 09:47

Avoid splitting outputs in batches when nbatches == 1

1eeb776

Add batch size checks

7ad0d18

Fix batch checks

9312026

Update readies

f0011c9

lantiga force-pushed the batch_fix branch from 71d9add to f0011c9 Compare June 10, 2020 07:52

Add bad batching test

3a1bf0b

filipecosta90 self-requested a review June 10, 2020 09:23

filipecosta90 approved these changes Jun 10, 2020

View reviewed changes

lantiga merged commit a5d1524 into master Jun 10, 2020

lantiga deleted the batch_fix branch June 10, 2020 09:26

filipecosta90 pushed a commit that referenced this pull request Aug 24, 2020

Avoid splitting outputs in batches when nbatches == 1 (#406)

b360284

* Avoid splitting outputs in batches when nbatches == 1 * Add batch size checks * Fix batch checks * Update readies * Add bad batching test

filipecosta90 mentioned this pull request Aug 24, 2020

v1.0 cherry picks #450

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Avoid splitting outputs in batches when nbatches == 1 #406

Avoid splitting outputs in batches when nbatches == 1 #406

Uh oh!

lantiga commented Jun 8, 2020

Uh oh!

codecov bot commented Jun 8, 2020 •

edited

Loading

Uh oh!

Uh oh!

lantiga commented Jun 10, 2020 •

edited

Loading

Uh oh!

filipecosta90 commented Jun 10, 2020

Uh oh!

Uh oh!

Avoid splitting outputs in batches when nbatches == 1 #406

Avoid splitting outputs in batches when nbatches == 1 #406

Uh oh!

Conversation

lantiga commented Jun 8, 2020

Uh oh!

codecov bot commented Jun 8, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

lantiga commented Jun 10, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

filipecosta90 commented Jun 10, 2020

Uh oh!

Uh oh!

codecov bot commented Jun 8, 2020 •

edited

Loading

lantiga commented Jun 10, 2020 •

edited

Loading