
Conversation


@jbischof jbischof commented Aug 9, 2022

Following the general design pattern of keras-cv.

  • Make Bert the first model in the models/ folder
  • Build the Bert model class with the functional API
  • Configure BertBase in code
  • Add the BertClassifier API (see the sketch below for the intended layering)

Test plan: unit tests + ran examples/bert/* by hand.
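
To make the intended layering concrete, here is a rough sketch of the pattern described in the bullets above: a functional-API backbone, a preset configured in code, and a task model wrapping the backbone. All names, shapes, and signatures are placeholders for illustration, not the final keras_nlp API.

```python
# Illustrative sketch only; names, shapes, and signatures are placeholders,
# not the final keras_nlp API.
from tensorflow import keras


def make_backbone(vocab_size=1000, hidden_dim=64, seq_length=128):
    """Functional-API backbone: token ids in, per-token features out."""
    token_ids = keras.Input(shape=(seq_length,), dtype="int32")
    x = keras.layers.Embedding(vocab_size, hidden_dim)(token_ids)
    x = keras.layers.Dense(hidden_dim, activation="relu")(x)
    return keras.Model(token_ids, x, name="backbone")


def make_base_backbone():
    """A "base" sized preset, configured in code rather than loaded from a file."""
    return make_backbone(vocab_size=30522, hidden_dim=768, seq_length=512)


class Classifier(keras.Model):
    """Task model that wraps a backbone and adds a classification head."""

    def __init__(self, backbone, num_classes, **kwargs):
        super().__init__(**kwargs)
        self.backbone = backbone
        self.pooling = keras.layers.GlobalAveragePooling1D()
        self.head = keras.layers.Dense(num_classes)

    def call(self, inputs):
        return self.head(self.pooling(self.backbone(inputs)))
```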

@mattdangerw mattdangerw left a comment

Thanks! Left some comments. Just focusing on the high level for now.

@jbischof jbischof changed the title Move Bert to applications folder Move Bert to models/ folder Aug 10, 2022
@jbischof jbischof changed the title Move Bert to models/ folder Move Bert to models folder Aug 10, 2022
@mattdangerw mattdangerw left a comment

Left a more detailed pass of comments!

@mattdangerw mattdangerw left a comment

Few more comments.

mattdangerw commented Aug 11, 2022

Axes we should test this on (though they need not all land on this PR):

  1. Directly calling the model, and directly calling the model with a classification head (maybe just a shape assertion on the output).
  2. Compiling a classification model with a loss and running a very small fit() (e.g. two batches of two examples).
  3. Doing 2) but compiling the model with jit_compile=True.
  4. Saving a model with model.save(some_path), restoring it with keras.models.load_model(), and asserting that the outputs of the original and restored models are verbatim equal.

This PR has an example of how to test with and without jit_compile (with XLA and without XLA) in a parameterized test.
https://github.com/keras-team/keras-nlp/pull/271/files
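
For reference, a minimal sketch of a fit() test parameterized over jit_compile, along the lines of the linked PR. The tiny Sequential model is a stand-in assumed for illustration; the real test would build the Bert classifier instead.

```python
# Minimal sketch of a fit() test parameterized over jit_compile (XLA on/off).
# The tiny Sequential model is a stand-in; swap in the real classifier.
import tensorflow as tf
from absl.testing import parameterized


class ClassifierFitTest(tf.test.TestCase, parameterized.TestCase):
    @parameterized.named_parameters(
        ("no_xla", False),
        ("xla", True),
    )
    def test_fit(self, jit_compile):
        model = tf.keras.Sequential([
            tf.keras.layers.Embedding(100, 8),
            tf.keras.layers.GlobalAveragePooling1D(),
            tf.keras.layers.Dense(2),
        ])
        model.compile(
            loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
            optimizer="adam",
            jit_compile=jit_compile,
        )
        # Two batches of two examples, per point 2) above.
        x = tf.random.uniform((4, 16), maxval=100, dtype=tf.int32)
        y = tf.constant([0, 1, 0, 1])
        model.fit(x, y, batch_size=2, epochs=1)


if __name__ == "__main__":
    tf.test.main()
```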

We can figure out weight-loading tests, and what (if any) correctness testing we want to do in unit tests, after we actually have some checkpoints to load.

jbischof commented Aug 12, 2022

I added example usage to the docstring for keras_nlp.models.Bert(), but it doesn't appear to be tested automatically. I verified it in Colab.

@jbischof jbischof marked this pull request as ready for review August 12, 2022 03:48
@mattdangerw mattdangerw left a comment

Looking good, last few comments on this round of the API I think!

@mattdangerw

@jbischof re docstrings.

The >>> style docstrings will get tested. The fenced-style docstrings will not, but IMO they are still much more readable for larger code blocks. We could work on adding fenced docstring testing (TF core has some code for it), but no project is actually using that yet.

Anyway, most of the testing should live in the unit tests; docstring testing is just a good way to make sure our docstrings don't get stale.
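
For illustration, this is the `>>>` style that the docstring checks pick up (a toy function, not the actual Bert docstring); fenced blocks inside a docstring are skipped by that machinery.

```python
def tokenize(text):
    """Split text on whitespace.

    This `>>>` example runs under doctest-style docstring checks:

    >>> tokenize("the quick fox")
    ['the', 'quick', 'fox']
    """
    return text.split()
```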

@mattdangerw mattdangerw left a comment

Another couple of spots I just noticed!

@mattdangerw

Axes we should test this on (though they need not all land on this PR):

  1. Directly calling the model, and directly calling the model with a classification head (maybe just a shape assertion on the output).
  2. Compiling a classification model with a loss and running a very small fit() (e.g. two batches of two examples).
  3. Doing 2) but compiling the model with jit_compile=True.
  4. Saving a model with model.save(some_path), restoring it with keras.models.load_model(), and asserting that the outputs of the original and restored models are verbatim equal.

Of these, I think 1) and 4) are probably the ones we should make sure to have on this PR. @chenmoneygithub, are you OK reviewing the testing code on this when it is ready (probably quite shortly)?
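
A minimal sketch of what tests for axes 1) and 4) could look like, again with a small stand-in model assumed in place of the actual Bert classes:

```python
# Sketch of axis 1) (direct call + output shape assertion) and axis 4)
# (save, reload, and compare outputs). The model here is a stand-in.
import os

import tensorflow as tf


class SaveAndCallTest(tf.test.TestCase):
    def _build_model(self):
        return tf.keras.Sequential([
            tf.keras.layers.Embedding(100, 8),
            tf.keras.layers.GlobalAveragePooling1D(),
            tf.keras.layers.Dense(2),
        ])

    def test_call_output_shape(self):
        model = self._build_model()
        x = tf.random.uniform((2, 16), maxval=100, dtype=tf.int32)
        self.assertEqual(model(x).shape, (2, 2))

    def test_saved_model_outputs_match(self):
        model = self._build_model()
        x = tf.random.uniform((2, 16), maxval=100, dtype=tf.int32)
        original_output = model(x)

        path = os.path.join(self.get_temp_dir(), "model")
        model.save(path)
        restored = tf.keras.models.load_model(path)

        # Outputs should match exactly, per the "verbatim equal" point above.
        self.assertAllEqual(restored(x), original_output)


if __name__ == "__main__":
    tf.test.main()
```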

@mattdangerw mattdangerw left a comment

LGTM from me! With the understanding that testing will come in here after I'm on vacay.

Thanks for the huge amount of work here. This is big!!
