Implement the DLRM model #344

Merged
merged 8 commits into from
Mar 25, 2020

Conversation

jordannad
Contributor

No description provided.

@googlebot

Thanks for your pull request. It looks like this may be your first contribution to a Google open source project (if not, look below for help). Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

📝 Please visit https://cla.developers.google.com/ to sign.

Once you've signed (or fixed any issues), please reply here with @googlebot I signed it! and we'll verify it.


@jordannad
Contributor Author

@googlebot I signed it!

@googlebot

CLAs look good, thanks!


@xihui-wu xihui-wu self-assigned this Feb 26, 2020
@xihui-wu
Contributor

Thanks for contributing this, @jordannad. Two general questions regarding the model, based on looking into the PyTorch implementation:

  1. For the MLP, they have activations between the dense layers; any reason we omit them? (See the sketch after this list.)
  2. It looks like there are two other ops besides concat; should we support them?
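
For reference, a minimal sketch of what question 1 suggests, assuming the Swift for TensorFlow `Dense` layer; the struct name and layer sizes here are illustrative placeholders, not the PR's actual code:

```swift
import TensorFlow

// Illustrative MLP with ReLU activations between the Dense layers.
// The sizes are placeholders, not the DLRM model's real dimensions.
struct MLPWithActivations: Layer {
    var dense1 = Dense<Float>(inputSize: 13, outputSize: 512, activation: relu)
    var dense2 = Dense<Float>(inputSize: 512, outputSize: 256, activation: relu)
    var dense3 = Dense<Float>(inputSize: 256, outputSize: 64, activation: relu)

    @differentiable
    func callAsFunction(_ input: Tensor<Float>) -> Tensor<Float> {
        input.sequenced(through: dense1, dense2, dense3)
    }
}
```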

@shabalind shabalind requested a review from xihui-wu March 11, 2020 17:44
As part of ensuring the train test is reliable, I have matched the initialization
of the reference implementation for the Embedding layers. Additionally, the DLRM
model is susceptible to a "bad initialization" that doesn't perfectly memorize
the single test minibatch. Although this is infrequent (~1 out of 50 test runs),
I have modified the tests to randomly re-initialize up to 5 times, so the test
is flaky with probability of only about 3.2e-9 (0.02^5) while still maintaining
the quality of the test (e.g. testing random initialization, etc.). Finally,
instead of checking that the loss drops below a particular value, the test checks
that the accuracy is 100%. This gives a faster stopping condition, so the
convergence test often runs in under 300ms on a laptop.
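
A hedged sketch of the retry strategy described above, using XCTest; `trainsToPerfectAccuracy()` is a hypothetical stand-in for one training run (fresh random initialization, train on the single minibatch, report whether accuracy reaches 100%), not the PR's actual test helper:

```swift
import XCTest

// Hypothetical stand-in: build a freshly initialized DLRM model, train it on
// the single test minibatch, and report whether accuracy reaches 100%.
func trainsToPerfectAccuracy() -> Bool {
    // Placeholder body; the real training loop lives in the PR's test target.
    return true
}

final class DLRMConvergenceTests: XCTestCase {
    func testMemorizesMinibatch() {
        // Retry with a fresh random initialization up to 5 times. With a ~1/50
        // per-attempt failure rate, all 5 attempts failing has probability
        // about 0.02^5 = 3.2e-9. `contains(where:)` stops at the first success.
        let succeeded = (1...5).contains { _ in trainsToPerfectAccuracy() }
        XCTAssertTrue(succeeded, "DLRM failed to memorize the minibatch in 5 attempts")
    }
}
```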
@jordannad
Contributor Author

Please take a look. I believe I've addressed the comments. Thank you!

  sparseInput: [Tensor<Int32>]
) -> Tensor<Float> {
  precondition(denseInput.shape.last! == nDense)
  assert(sparseInput.count == latentFactors.count)
Contributor

use precondition as well?
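
A sketch of the suggested change, wrapped in a hypothetical helper so it stands alone; the parameter names mirror the diff above, and both checks use `precondition` so they also fire in release builds:

```swift
import TensorFlow

// Hypothetical helper showing both input checks as preconditions
// (names mirror the diff above; the surrounding model code is not shown).
func validateInputs(
    denseInput: Tensor<Float>,
    sparseInput: [Tensor<Int32>],
    nDense: Int,
    embeddingCount: Int
) {
    precondition(denseInput.shape.last! == nDense,
                 "Expected \(nDense) dense features, got \(denseInput.shape.last!)")
    precondition(sparseInput.count == embeddingCount,
                 "Expected \(embeddingCount) sparse inputs, got \(sparseInput.count)")
}
```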

Contributor

@xihui-wu xihui-wu left a comment

One minor comment. Rest LGTM!

@dabrahams dabrahams merged commit 713bb8d into tensorflow:master Mar 25, 2020