
Control flows #2738

Open
mattiasmar opened this issue Apr 17, 2019 · 14 comments

Comments

@mattiasmar

Hi,
When can Glow be expected to support control flows? I would like to see LSTM implemented in Glow.
Thanks!

@nadavrot
Contributor

Hi @mattiasmar! We support LSTMs by unrolling the network. For example, the char-rnn and English-to-German examples use LSTM structures. At some point we may extend Glow to support more control-flow features, but this is not something that anyone is actively working on.

@mattiasmar
Author

For the purpose of extending Glow to support LSTM structures (not unrolled), could you provide some pointers? Design thoughts? Links to work in progress? Core elements that would need to be implemented?
Thanks!

@nadavrot
Contributor

To support general control flow we would need to extend Glow's internal data structures: rename Function to BasicBlock, add a new top-level function, and allow BasicBlocks to be terminated with some "branch" instruction that jumps to a different basic block.

This is going to be a major change, and I am not sure how to implement things like control-flow-aware automatic differentiation (that we have today for the DAG).

I think that the best path forward right now would be to rely on the high-level framework (for example, PyTorch) to implement the control flow and call into Glow for the DAG sections of the compute. Moving forward we can gradually implement full support for control flow, based on the semantics of PyTorch.

@mattiasmar
Author

So how would you approach a model like GNMT? Unrolling the network wouldn't be very processing-efficient. How could we best benefit from Glow without making a major change to the code base (while keeping the option of some minor changes)?

@jfix71
Contributor

jfix71 commented Apr 17, 2019

We have some initial support for Predication, which would allow us to skip unnecessary computation in an unrolled network. That would probably be the shortest path to a highly performant NMT. Note that this requires the backend to also support predication.

@ajayanto

ajayanto commented Sep 9, 2019

Our backend supports LSTM as-is. So if I add code for loading the LSTM operator in Glow, could we implement something similar to https://github.com/microsoft/onnxruntime/blob/master/onnxruntime/core/providers/cpu/rnn/deep_cpu_lstm.cc in BoundInterpreterFunction for profiling?

@jfix71
Contributor

jfix71 commented Sep 9, 2019

@ajayanto We already have basic LSTM support, just not as a single node (it's currently implemented directly via its component pieces rather than as a Node that's lowered; see Function::createLSTM()). We could add an LSTM Node, lower it using essentially the logic from Function::createLSTM(), and your backend could then prevent lowering for it.

This could be useful for you if you are going to run an unrolled RNN that uses LSTMs, or some other network with an LSTM that does not use control flow. But just to be clear, we still do not have support for control flow needed for general RNNs, e.g. ONNX's Scan or C2's RecurrentNetwork, which we would need to unroll.

@ajayanto

ajayanto commented Sep 9, 2019

@jfix71 As a workaround, instead of lowering the LSTM into multiple nodes, if we add an LSTM node (as-is) in Glow and implement its functionality (similar to the ONNX Runtime implementation) in BoundInterpreterFunction::fwdLSTMInst, would it be possible to run inference and generate a profile using the interpreter backend?

@jfix71
Contributor

jfix71 commented Sep 10, 2019

@ajayanto You don't need to provide an implementation for fwdLSTMInst in order to get a profile. We will want to lower the LSTMNode to its subnodes for the Interpreter backend, with very similar logic to that found currently inside Function::createLSTM().

If your backend does not want to lower the LSTM then it does not need to -- it would return false for LSTMNode inside YourBackend::shouldLower(). It would then see the LSTMNode passed to it inside YourBackend::compile().

In order to do this you'd need to add an LSTMNode, roughly following the instructions in docs/NewOperators.md. You'd follow the instructions for the case where the Node does not need low-level IR because it will be lowered (see the third bullet there). The only major difference from the steps there is that a Function::createLSTM() already exists; it would instead return a newly created LSTMNode, and the logic currently inside Function::createLSTM() would move to its own case in Lower.cpp.

@hgarg5072

@jfix71 The current implementation of LSTM in Function::createLSTM() doesn't have an option to provide weights as input. Is this function training-specific only? What would I have to do in order to have an inference node?
Also, couldn't we have LSTM as a node instead of just a function?

@jfix71
Contributor

jfix71 commented Oct 18, 2019

@hgarg5072 For historical reasons we didn't have an LSTMNode due to lowering/quantization-related issues, but those issues have been resolved, so it could be added now. And yes we should also add an API for creating an LSTM with already-trained weights regardless of whether we add an LSTMNode -- there's no real reason we don't have one yet. Are you interested in working on either of these issues?

@ksaurabh-cadence
Contributor

@jfix71 @nadavrot Just checking if there is any progress on supporting loop/control flow? It's been a while :-) Any approaches which have been considered and rejected?

@jfix71
Contributor

jfix71 commented Sep 30, 2020

Hi @ksaurabh-cadence -- I don't think much has changed here, actually. I still think the easiest path forward for now would be to allow e.g. PyTorch (or whatever is driving runtime execution, e.g. a C wrapper with a main for executing an AOT bundle) to call into Glow for high-performance execution of each iteration of the model, assuming the control flow is mostly there to wrap around e.g. an LSTM until an end token is seen. Alternatively, we could improve support for predication and then unroll the model to some max length. We could still do the large refactor Nadav mentioned, but I am not clear on the cost-benefit there: it would require a decent amount of work, and I think we could likely get most of the benefit from the first approach.

@ksaurabh-cadence
Contributor

Hi @jfix71 Is there an example I can look at for the first approach you suggest, using a C wrapper? If not, is there an example that could be added?

What kind of improvement for predication do you have in mind?


6 participants