CRF layer v3.0 #1733

Closed
wants to merge 17 commits into from
Conversation

@gabrieldemarmiesse (Member) commented Apr 26, 2020

With a subclassing approach, we have a nicer API and it's very flexible.

Works only with TF 2.2+

@howl-anderson for the review and the CLA

The plan is to show users how to do the subclassing for the CRF. We shouldn't provide an API to save them some code there, because it would become very complex to design a good API and to maintain it later on.

So the CRF layer is a public API, and for the CRF loss we give a good tutorial about subclassing.

Quick tutorial right now:

import tensorflow as tf
from tensorflow_addons.layers.crf import CRF
from tensorflow_addons.text.crf import crf_log_likelihood

def unpack_data(data):
    if len(data) == 2:
        return data[0], data[1], None
    elif len(data) == 3:
        return data
    else:
        raise TypeError("Expected data to be a tuple of size 2 or 3.")


class ModelWithCRFLoss(tf.keras.Model):
    """Wrapper around the base model for custom training logic."""

    def __init__(self, base_model):
        super().__init__()
        self.base_model = base_model

    def call(self, inputs):
        return self.base_model(inputs)

    def compute_loss(self, x, y, sample_weights, training=False):
        y_pred = self(x, training=training)
        _, potentials, sequence_length, chain_kernel = y_pred

        crf_loss = -crf_log_likelihood(potentials, y, sequence_length, chain_kernel)[0]

        if sample_weights is not None:
            crf_loss = crf_loss * sample_weights

        return tf.reduce_mean(crf_loss), sum(self.losses)

    def train_step(self, data):
        x, y, sample_weight = unpack_data(data)

        with tf.GradientTape() as tape:
            crf_loss, internal_losses = self.compute_loss(
                x, y, sample_weight, training=True
            )
            total_loss = crf_loss + internal_losses

        gradients = tape.gradient(total_loss, self.trainable_variables)
        self.optimizer.apply_gradients(zip(gradients, self.trainable_variables))

        return {"crf_loss": crf_loss, "internal_losses": internal_losses}

    def test_step(self, data):
        x, y, sample_weight = unpack_data(data)
        crf_loss, internal_losses = self.compute_loss(x, y, sample_weight)
        return {"crf_loss_val": crf_loss, "internal_losses_val": internal_losses}


x_np, y_np = get_test_data()

x_input = tf.keras.layers.Input(shape=x_np.shape[1:])
crf_outputs = CRF(5)(x_input)
base_model = tf.keras.Model(x_input, crf_outputs)
model = ModelWithCRFLoss(base_model)

model.compile("adam")
model.fit(x=x_np, y=y_np)
model.evaluate(x_np, y_np)
model.predict(x_np)
model.save("my_model.tf")

If some users want to try this feature before it's merged, we have some wheels available.

@googlebot

All (the pull request submitter and all commit authors) CLAs are signed, but one or more commits were authored or co-authored by someone other than the pull request submitter.

We need to confirm that all authors are ok with their commits being contributed to this project. Please have them confirm that by leaving a comment that contains only @googlebot I consent. in this pull request.

Note to project maintainer: There may be cases where the author cannot leave a comment, or the comment is not properly detected as consent. In those cases, you can manually confirm consent of the commit author(s), and set the cla label to yes (if enabled on your project).

ℹ️ Googlers: Go here for more info.

@seanpmorgan added and then removed the blocked (Pending something else's completion) label Apr 26, 2020
        return tf.reduce_mean(crf_loss), sum(self.losses)

    def train_step(self, data):
        x, y, sample_weight = unpack_data(data)

Inconsistent naming - it's sample_weights, plural, in compute_loss()

    def test_step(self, data):
        x, y, sample_weight = unpack_data(data)
        crf_loss, internal_losses = self.compute_loss(x, y, sample_weight)
        return {"crf_loss_val": crf_loss, "internal_losses_val": internal_losses}

The val_ prefix is already added automatically by Keras, so the _val suffix on these keys is redundant.
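
A sketch of the suggested change (relying on Keras automatically prefixing metrics returned from test_step):

    def test_step(self, data):
        x, y, sample_weight = unpack_data(data)
        crf_loss, internal_losses = self.compute_loss(x, y, sample_weight)
        # Keras reports these as val_crf_loss / val_internal_losses during fit().
        return {"crf_loss": crf_loss, "internal_losses": internal_losses}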

@YanZhu1105

Hi,
Is it possible to include an example with sample weights?
Also an example where the input of model.fit is a generator which yields (x, y, sample_weight) for each batch?

Thanks!
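
For reference, a minimal sketch of such a generator (the names batch_generator and w_np are hypothetical; it assumes the ModelWithCRFLoss wrapper above, whose unpack_data accepts 3-tuples):

import numpy as np

def batch_generator(x_np, y_np, w_np, batch_size=32):
    # Yields (x, y, sample_weight) tuples, which unpack_data() forwards
    # to compute_loss() so the per-sample weights scale the CRF loss.
    while True:
        idx = np.random.choice(len(x_np), size=batch_size, replace=False)
        yield x_np[idx], y_np[idx], w_np[idx]

# model.fit consumes the generator directly; steps_per_epoch is required
# because the generator is infinite.
model.fit(batch_generator(x_np, y_np, w_np), steps_per_epoch=10)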


    def mask_to_sequence_length(self, mask):
        """compute sequence length from mask."""
        sequence_length = tf.cast(tf.reduce_sum(tf.cast(mask, tf.int8), 1), tf.int64)
@ndrewl commented Jun 2, 2020

Here the sums are computed in tf.int8, which will overflow on sequences longer than 127 and produce negative sequence lengths. Maybe we can cast the mask to tf.int64 right away, so the outer cast becomes unnecessary?
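
A sketch of what that suggestion could look like:

    def mask_to_sequence_length(self, mask):
        """compute sequence length from mask."""
        # Casting to tf.int64 before the sum avoids int8 overflow on long
        # sequences and makes the outer cast unnecessary.
        sequence_length = tf.reduce_sum(tf.cast(mask, tf.int64), axis=1)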

@luozhouyang

Any updates?

@ndrewl commented Jul 12, 2020

@gabrieldemarmiesse, are you still going to work on this PR?

@gabrieldemarmiesse (Member Author)

Sorry, I'm very busy nowadays; somebody else is more than welcome to take this branch and open a new pull request with it :)

@jaspersjsun (Contributor)

@gabrieldemarmiesse Hey! I've been working on a model with a CRF layer recently, and your solution here is very helpful. I'd gladly help finish this PR if you're not available currently ;)

@gabrieldemarmiesse (Member Author)

I'm glad it helped you! Feel free to pull this branch into your fork and open a new PR :)

@gabrieldemarmiesse (Member Author)

Closing in favor of #1999

@DachuanZhao

[quotes the tutorial from the original comment above]

Hi ~ How would I use this code to build a BiLSTM-CRF model? Like this?

x_input = tf.keras.layers.Input(shape=x_np.shape[1:])
bilstm_output = bi_lstm_model(x_input)
crf_outputs = CRF(5)(bilstm_output)
base_model = tf.keras.Model(x_input, crf_outputs)
model = ModelWithCRFLoss(base_model)
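
For illustration, one way bi_lstm_model could be defined (a sketch; the layer size is a placeholder):

bi_lstm_model = tf.keras.layers.Bidirectional(
    # return_sequences=True keeps per-timestep outputs, which the CRF needs.
    tf.keras.layers.LSTM(64, return_sequences=True)
)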

@xuxingya

I built a CRF tool for TF2; you can refer to tf2crf, and pip install tf2crf.

@DachuanZhao

What's the difference between your CRF layer and tensorflow_addons.layers.crf?

@xuxingya

xuxingya commented Mar 15, 2021 via email
