Run model from script #787

alonre24 · 2021-06-16T11:28:26Z

This PR extends the external commands that torch script already supports through RedisAI (executing unblocking Redis commands), to support the execution of RedisAI models as well by adding a new redisAI.execute_model command to torch script. This command receives a model name that is stored in RedisAI, a list of the input tensors and the outputs number, and returns the output tensors.

This is tested by running models from torch script in all 3 backends.
This PR includes a unified arrangement of the device_id field in dl_tensor - this should be -1 whenever tensor is created for default CPU, otherwise torch interpreters it as if it should be sent to a different device (for device_id=0, the device is considered as CPU:0 insread of CPU).

Todo: figure out how (or if) to pass error message from torch extension back to RedisAI, to have more informative failing (currently a generic error message is returned)

… multiple devices). Tests pass (new feature hasn't tested yet)

…w device is introduced and use rwlock to synchronise.

…'t overflows than the tensor type in TENSORSET.

…aces that are affected)

…into ONNX_kill_switch

…om backend len (supported only for onnx now) in INFO MODULES command.

- Add a state flag to every entry in the onnx run sessions array and update it atomically, to avoid situations where main threads and bg thread both access the runOptions field. - Refactor the info_modules section, and change AI.INFO command so it must receive a module/script key. The other info will be accessible as part of the info modules command.

… instance). test info command with AI fields

codecov · 2021-06-17T06:49:48Z

Codecov Report

Merging #787 (10988d3) into master (0000228) will increase coverage by 6.28%.
The diff coverage is 81.75%.

@@            Coverage Diff             @@
##           master     #787      +/-   ##
==========================================
+ Coverage   74.11%   80.40%   +6.28%     
==========================================
  Files          39       52      +13     
  Lines        6081     7883    +1802     
==========================================
+ Hits         4507     6338    +1831     
+ Misses       1574     1545      -29

Impacted Files	Coverage Δ
src/redis_ai_types/model_type.c	`70.00% <ø> (ø)`
src/redis_ai_types/script_type.c	`70.00% <ø> (+5.00%)`	⬆️
src/redis_ai_types/tensor_type.c	`73.33% <ø> (ø)`
src/serialization/AOF/rai_aof_rewrite.c	`0.00% <0.00%> (ø)`
tests/module/DAG_utils.c	`88.23% <ø> (ø)`
tests/module/LLAPI.c	`74.46% <ø> (ø)`
tests/unit/rmalloc.h	`100.00% <ø> (ø)`
tests/unit/unit_tests_err.cpp	`100.00% <ø> (ø)`
src/backends/libtflite_c/tflite_c.cpp	`57.60% <35.71%> (ø)`
src/backends/tflite.c	`66.01% <57.14%> (+2.19%)`	⬆️
... and 77 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 7a5d18d...10988d3. Read the comment docs.

DvirDukhan

looks very good
few comments

src/backends/backends.c

src/backends/libtorch_c/torch_c.cpp

src/backends/libtorch_c/torch_extensions/torch_redis.cpp

src/redisai.h

tests/flow/includes.py

tests/flow/test_data/redis_scripts.py

DvirDukhan

few comments

src/backends/backends.c

src/backends/libtorch_c/torch_extensions/torch_redis.cpp

src/serialization/RDB/decoder/current/v2/decode_v2.c

- Add the option the include the backends api from multiple sources in c++ project.

…disAI into run_model_from_script

src/backends/libtorch_c/torch_c.cpp

alonre24 added 30 commits May 26, 2021 10:41

Introduce kill switch mechanism for onnxruntime sessions (not ready yet)

040043d

Putting login in onnx backend (not ready yet)

0961e66

WIP

9d5fbf8

Refactor background workers + add support to kill switch in onnx (for…

22662bd

… multiple devices). Tests pass (new feature hasn't tested yet)

Refactor - do not use rax, extend onnxRunSessions array whenever a ne…

c3a45e9

…w device is introduced and use rwlock to synchronise.

Refactor backends loading

2684852

Start testing - not finished

cd9baa1

Support bool type tensor

5c09106

Support tensors of type bool. Add validation that a input value doesn…

5d3dd2c

…'t overflows than the tensor type in TENSORSET.

Merge branch 'Support_BOOL_type_for_tensors' into ONNX_kill_switch

fa14217

Support tensor of type bool in ONNX, Add tests for kill switch

04dac08

Add load time config for ONNX_TIMEOUT. Parallel tests seems not to work.

1d6b3ed

Some fixes

ea3c174

Merge master (resolve conflicts in backends.c)

05c2a39

Remove debug print

4bbfbcd

Merge master with updated changes of supporting tensor of type bool

cd2936c

Some fixes and documentation complement.

4aed8ca

Refactor load time config

6cd9652

Remove redundant include

42059b8

Merge branch 'master' into ONNX_kill_switch

6c906aa

PR fixes part 1: refactor config and run queue info files (and all pl…

23749c4

…aces that are affected)

Merge branch 'ONNX_kill_switch' of https://github.com/RedisAI/RedisAI …

342afbb

…into ONNX_kill_switch

linter...

697faf9

Merge branch 'master' into ONNX_kill_switch

ee02cc0

linter...

1201cb2

Merge branch 'master' into ONNX_kill_switch

4360679

More PR fixes, add the option to get the global run sessions array fr…

21737e6

…om backend len (supported only for onnx now) in INFO MODULES command.

Minor fixes

73f2a91

Add the ability to execute model through torch script - WIP

6babc11

alonre24 added 9 commits June 14, 2021 15:48

Return error if onnx is executed in a non async manner (via gears for…

78da23e

… instance). test info command with AI fields

Merge branch 'master' into run_model_from_script

a67da55

Merge branch 'ONNX_kill_switch' into run_model_from_script

3545f7e

basic test passes - running torch model from torch script is enabled.

ab505ba

Extend tests to include onnx and tf as well.

c4d8b55

Fix device id - always use -1 when creating tensors for default CPU.

fedc508

Resolve conflicts after merging master

dd0d797

Remove test added for debug

56f815f

Remove debug additions and add comments and documentation.

fbee927

alonre24 requested review from lantiga and DvirDukhan June 16, 2021 11:28

Change device id of default CPU to -1 in RDB loading as well.

d57e598

Fix and test error raising when a redis torch script operation fails.

525acc4

DvirDukhan reviewed Jun 17, 2021

View reviewed changes

Some PR fixes.

e1b3745

DvirDukhan reviewed Jun 17, 2021

View reviewed changes

alonre24 added 4 commits June 20, 2021 11:39

Update device_id to -1 in older rdb versions

4869acf

Merge branch 'master' into run_model_from_script

f02e0b5

- Move ownership on the output tensor to torch instead of copying it.

d47d04a

- Add the option the include the backends api from multiple sources in c++ project.

Merge branch 'run_model_from_script' of https://github.com/RedisAI/Re…

9d6b409

…disAI into run_model_from_script

DvirDukhan reviewed Jun 21, 2021

View reviewed changes

src/backends/libtorch_c/torch_c.cpp Show resolved Hide resolved

Added comment

8a7cac9

alonre24 added the ci-test label Jun 22, 2021

DvirDukhan approved these changes Jun 22, 2021

View reviewed changes

Merge branch 'master' into run_model_from_script

10988d3

alonre24 added ci-test and removed ci-test labels Jun 24, 2021

alonre24 merged commit 406b337 into master Jun 24, 2021

alonre24 deleted the run_model_from_script branch June 24, 2021 10:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Run model from script #787

Run model from script #787

Uh oh!

alonre24 commented Jun 16, 2021 •

edited

Loading

Uh oh!

codecov bot commented Jun 17, 2021 •

edited

Loading

Uh oh!

DvirDukhan left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

DvirDukhan left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Run model from script #787

Run model from script #787

Uh oh!

Conversation

alonre24 commented Jun 16, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Jun 17, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

DvirDukhan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

DvirDukhan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

alonre24 commented Jun 16, 2021 •

edited

Loading

codecov bot commented Jun 17, 2021 •

edited

Loading