tensorflow
diff --git a/‎README.md
+12-1 b/‎README.md
+12-1
diff --git a/‎quantization/.babelrc
+18 b/‎quantization/.babelrc
+18
diff --git a/‎quantization/.gitignore
+4 b/‎quantization/.gitignore
+4
diff --git a/‎quantization/README.md
+150 b/‎quantization/README.md
+150
diff --git a/‎quantization/data_housing.js
+176 b/‎quantization/data_housing.js
+176
@@ -91,7 +91,7 @@ to another project.
     <td></td>
     <td>Building a tf.data.Dataset using a generator</td>
     <td>Regression</td>
-    <td>Multilayer perceptron</td>
+    <td>Browser</td>
     <td>Browser</td>
     <td>Layers</td>
     <td></td>
@@ -250,6 +250,17 @@ to another project.
     <td>Core (Ops)</td>
     <td></td>
   </tr>
+  <tr>
+    <td><a href="./quantization">quantization</a></td>
+    <td></td>
+    <td>Various</td>
+    <td>Demonstrates the effect of post-training weight quantization</td>
+    <td>Various</td>
+    <td>Node.js</td>
+    <td>Node.js</td>
+    <td>Layers</td>
+    <td></td>
+  </tr>
   <tr>
     <td><a href="./sentiment">sentiment</a></td>
     <td><a href="https://storage.googleapis.com/tfjs-examples/sentiment/dist/index.html">🔗</a></td>
 
@@ -0,0 +1,18 @@
+{
+  "presets": [
+    [
+      "env",
+      {
+        "esmodules": false,
+        "targets": {
+          "browsers": [
+            "> 3%"
+          ]
+        }
+      }
+    ]
+  ],
+  "plugins": [
+    "transform-runtime"
+  ]
+}
@@ -0,0 +1,4 @@
+data-fashion-mnist/
+data-mnist/
+imagenet-1000-samples/
+models/
@@ -0,0 +1,150 @@
+# TensorFlow.js Example: Effects of Post-Training Weight Quantization
+
+Post-training quantization is a model-size reducing technique useful for
+deploying model on the web and in storage-limited environments such as
+mobile devices. TensorFlow.js's
+[converter module](https://github.com/tensorflow/tfjs-converter)
+supports reducing the numeric precision of weights to 16-bit and 8-bit
+integers after the completion of the model training, which leads to
+approximately 50% and 75% reduction in model size, respectively.
+
+The following figure provides an intuitive understanding of the degree
+to which weight values are discretized under the 16- and 8-bit quantization
+regimes. The figure is based on a zoomed-in view of a sinusoidal wave.
+
+![Weight quantization: 16-bit and 8-bit](./quantization.png)
+
+This example focuses on how such quantization of weights affect the
+model's predicton accuracy.
+
+## What's in this demo
+
+This demo on quantization consists of four examples:
+1. housing: this demo evaluates the effect of quantization on the accuracy
+   of a multi-layer perceptron regression model.
+2. mnist: this demo evaluates the effect of quantization on the accuracy
+   of a relatively small deep convnet trained on the MNIST handwritten digits
+   dataset. Without quantization, the convnet can achieve close-to-perfect
+   (i.e., ~99.5%) test accuracy.
+3. fashion-mnist: this demo evaluates the effect of quantization on the
+   accuracy of another small deep convnet traind on a problem slightly harder
+   than MNIST. In particular, it is based on the Fashion MNIST dataset. The
+   original, non-quantized model has an accuracy of 92%-93%.
+4. MobileNetV2: this demo evaluates quantized and non-quantizd versions of
+   MobeilNetV2 (width = 1.0) on a sample of 1000 images from the
+   [ImageNet](http://www.image-net.org/) dataset. This subset is based on the
+   sampling done by https://github.com/ajschumacher/imagen.
+
+In the first three demos, quantizing the weights to 16 or 8 bits does not
+have any significant effect on the accuracy. In the MobileNetV2 demo, however,
+quantizing the weights to 8 bits leads to a significant deterioration in
+accuracy, as measured by the top-1 and top-5 accuracies. See example results
+in the table below:
+
+| Dataset and Mdoel      | Original (no-quantization) | 16-bit quantization | 8-bit quantization |
+| ---------------------- | -------------------------- | ------------------- | ------------------ |
+| housing: multi-layer regressor  |  MAE=0.311984     | MAE=0.311983        | MAE=0.312780       |
+| MNIST: convnet         | accuracy=0.9952            | accuracy=0.9952     | accuracy=0.9952    |
+| Fashion MNIST: convnet | accuracy=0.922             | accuracy=0.922      | accuracy=0.9211    |
+| MobileNetV2            | top-1 accuracy=0.618; top-5 accuracy=0.788 | top-1 accuracy=0.624; top-5 accuracy=0.789 | top-1 accuracy=0.280; top-5 accuracy=0.490 |
+
+MAE Stands for mean absolute error.
+
+They demonstrate different effects of the same quantization technique
+on different problems.
+
+## Running the housing quantization demo
+
+In preparation, do:
+
+```sh
+yarn
+```
+
+To run the train and save the model from scratch, do:
+```sh
+yarn train-housing
+```
+
+If you are running on a Linux system that is [CUDA compatible](https://www.tensorflow.org/install/install_linux), try installing the GPU:
+
+```sh
+yarn train-housing --gpu
+```
+
+To perform quantization on the model saved in the `yarn train` step
+and evaluate the effects on the model's test accuracy, do:
+
+```
+yarn quantize-and-evaluate-housing
+```
+
+## Running the MNIST quantization demo
+
+In preparation, do:
+
+```sh
+yarn
+```
+
+To run the train and save the model from scratch, do:
+```sh
+yarn train-mnist
+```
+
+or with CUDA acceleration:
+
+```sh
+yarn train-mnist --gpu
+```
+
+To perform quantization on the model saved in the `yarn train` step
+and evaluate the effects on the model's test accuracy, do:
+
+```
+yarn quantize-and-evaluate-mnist
+```
+
+## Running the Fashion-MNIST quantization demo
+
+In preparation, do:
+
+```sh
+yarn
+```
+
+To run the train and save the model from scratch, do:
+```sh
+yarn train-fashion-mnist
+```
+
+or with CUDA acceleration:
+
+```sh
+yarn train-fashion-mnist --gpu
+```
+
+To perform quantization on the model saved in the `yarn train` step
+and evaluate the effects on the model's test accuracy, do:
+
+```
+yarn quantize-and-evaluate-fashion-mnist
+```
+
+## Running the MobileNetV2 quantization demo
+
+Unlike the previous three demos, the MobileNetV2 demo doesn't involve
+a model training step. Instead, the model is loaded as a Keras application
+and converted to the TensorFlow.js format for quantization and evaluation.
+
+The non-quantized and quantized versions of MobileNetV2 are evaluated
+on a sample of 1000 images from the [ImageNet](http://www.image-net.org/)
+dataset. The image files are downloaded from the hosted location on the
+web. This subset is based on the sampling done by
+https://github.com/ajschumacher/imagen.
+
+All these steps can be performed with a single command:
+
+```sh
+yarn quantize-and-evaluate-MobileNetV2
+```
@@ -0,0 +1,176 @@
+/**
+ * @license
+ * Copyright 2019 Google LLC. All Rights Reserved.
+ * Licensed under the Apache License, Version 2.0 (the "License");
+ * you may not use this file except in compliance with the License.
+ * You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ * =============================================================================
+ */
+
+import * as tf from '@tensorflow/tfjs';
+
+const HOUSING_CSV_URL = 'https://storage.googleapis.com/learnjs-data/csv-datasets/california_housing_train_10k.csv';
+
+export const featureColumns = [
+  'longitude', 'latitude', 'housing_median_age', 'total_rooms',
+  'total_bedrooms', 'population', 'households',  'median_income'];
+const labelColumn = 'median_house_value';
+
+/**
+ * Calculate the column-by-column statistics of the housing CSV dataset.
+ *
+ * @return An object consisting of the following fields:
+ *   count {number} Number of data rows.
+ *   featureMeans {number[]} Each element is the arithmetic mean over all values
+ *     in a column. Ordered by the feature columns in the CSV dataset.
+ *   featureStddevs {number[]} Each element is the standard deviation over all
+ *     values in a column. Ordered by the columsn in the in the CSV dataset.
+ *   labelMean {number} The arithmetic mean of the label column.
+ *   labeStddev {number} The standard deviation of the albel column.
+ */
+export async function getDatasetStats() {
+  const featureValues = {};
+  featureColumns.forEach(feature => {
+    featureValues[feature] = [];
+  });
+  const labelValues = [];
+
+  const dataset = tf.data.csv(HOUSING_CSV_URL, {
+    columnConfigs: {
+      [labelColumn]: {
+        isLabel: true
+      }
+    }
+  });
+  const iterator = await dataset.iterator();
+  let count = 0;
+  while (true) {
+    const item = await iterator.next();
+    if (item.done) {
+      break;
+    }
+    featureColumns.forEach(feature => {
+      if (item.value.xs[feature] == null) {
+        throw new Error(`item #{count} lacks feature ${feature}`);
+      }
+      featureValues[feature].push(item.value.xs[feature]);
+    });
+    labelValues.push(item.value.ys[labelColumn]);
+    count++;
+  }
+
+  return tf.tidy(() => {
+    const featureMeans = {};
+    const featureStddevs = {};
+    featureColumns.forEach(feature => {
+      const {mean, variance} = tf.moments(featureValues[feature]);
+      featureMeans[feature] = mean.arraySync();
+      featureStddevs[feature] = tf.sqrt(variance).arraySync();
+    });
+
+    const moments = tf.moments(labelValues);
+    const labelMean = moments.mean.arraySync();
+    const labelStddev = tf.sqrt(moments.variance).arraySync();
+    return {
+      count,
+      featureMeans,
+      featureStddevs,
+      labelMean,
+      labelStddev
+    };
+  });
+}
+
+/**
+ * Get a dataset with the features and label z-normalized,
+ * the dataset is split into three xs-ys tensor pairs: for training,
+ * validation and evaluation.
+ *
+ * @param {number} count Number of rows in the CSV dataset, computed beforehand.
+ * @param {{[feature: string]: number}} featureMeans Arithmetic means of the
+ *   features. Use for normalization.
+ * @param {[feature: string]: number} featureStddevs Standard deviations of the
+ *   features. Used for normalization.
+ * @param {number} labelMean Arithmetic mean of the label. Used for
+ *   normalization.
+ * @param {number} labelStddev Standard deviation of the label. Used for
+ *   normalization.
+ * @param {number} validationSplit Validation spilt, must be >0 and <1.
+ * @param {number} evaluationSplit Evaluation split, must be >0 and <1.
+ * @returns An object consisting of the following keys:
+ *   trainXs {tf.Tensor} training feature tensor
+ *   trainYs {tf.Tensor} training label tensor
+ *   valXs {tf.Tensor} validation feature tensor
+ *   valYs {tf.Tensor} validation label tensor
+ *   evalXs {tf.Tensor} evaluation feature tensor
+ *   evalYs {tf.Tensor} evaluation label tensor.
+ */
+export async function getNormalizedDatasets(
+    count, featureMeans, featureStddevs, labelMean, labelStddev,
+    validationSplit, evaluationSplit) {
+  tf.util.assert(
+      validationSplit > 0 && validationSplit < 1,
+      () => `validationSplit is expected to be >0 and <1, ` +
+            `but got ${validationSplit}`);
+  tf.util.assert(
+      evaluationSplit > 0 && evaluationSplit < 1,
+      () => `evaluationSplit is expected to be >0 and <1, ` +
+            `but got ${evaluationSplit}`);
+  tf.util.assert(
+      validationSplit + evaluationSplit < 1,
+      () => `The sum of validationSplit and evaluationSplit exceeds 1`);
+
+  const dataset = tf.data.csv(HOUSING_CSV_URL, {
+    columnConfigs: {
+      [labelColumn]: {
+        isLabel: true
+      }
+    }
+  });
+
+  const featureValues = [];
+  const labelValues = [];
+  const indices = [];
+  const iterator = await dataset.iterator();
+  for (let i = 0; i < count; ++i) {
+    const {value, done} = await iterator.next();
+    if (done) {
+      break;
+    }
+    featureColumns.map(feature => {
+      featureValues.push(
+          (value.xs[feature] - featureMeans[feature]) /
+          featureStddevs[feature]);
+    });
+    labelValues.push((value.ys[labelColumn] - labelMean) / labelStddev);
+    indices.push(i);
+  }
+
+  const xs = tf.tensor2d(featureValues, [count, featureColumns.length]);
+  const ys = tf.tensor2d(labelValues, [count, 1]);
+
+  // Set random seed to fix shuffling order and therefore to fix the
+  // training, validation, and evaluation splits.
+  Math.seedrandom('1337');
+  tf.util.shuffle(indices);
+
+  const numTrain = Math.round(count * (1 - validationSplit - evaluationSplit));
+  const numVal = Math.round(count * validationSplit);
+  const trainXs = xs.gather(indices.slice(0, numTrain));
+  const trainYs = ys.gather(indices.slice(0, numTrain));
+  const valXs = xs.gather(indices.slice(numTrain, numTrain + numVal));
+  const valYs = ys.gather(indices.slice(numTrain, numTrain + numVal));
+  const evalXs = xs.gather(indices.slice(numTrain + numVal));
+  const evalYs = ys.gather(indices.slice(numTrain + numVal));
+
+  return {trainXs, trainYs, valXs, valYs, evalXs, evalYs};
+
+}