|
1 | 1 | <?xml version="1.0" encoding="utf-8"?> |
2 | 2 | <doc> |
3 | 3 | <members> |
4 | | - <!-- |
5 | | - The following text describes the FastTree algorithm details. |
6 | | - It's used for the remarks section of all FastTree-based trainers (binary, regression, ranking) |
7 | | - --> |
8 | | - <member name="FastTree_remarks"> |
9 | | - <remarks> |
10 | | - <para> |
11 | | - FastTree is an efficient implementation of the <a href='https://arxiv.org/abs/1505.01866'>MART</a> gradient boosting algorithm. |
12 | | - Gradient boosting is a machine learning technique for regression problems. |
13 | | - It builds each regression tree in a step-wise fashion, using a predefined loss function to measure the error at each step and correct for it in the next. |
14 | | - The resulting prediction model is therefore an ensemble of weaker prediction models: at each step, the tree that most reduces an arbitrary differentiable loss function is added to the series. |
15 | | - </para> |
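One way to make the step-wise construction above concrete is the standard gradient-boosting update. This formulation is general background on MART rather than something stated in this file; with loss L, learning rate \eta, and ensemble F_m after m steps:

    F_0(x) = \arg\min_c \sum_i L(y_i, c)
    r_{i,m} = -\left[ \frac{\partial L(y_i, F(x_i))}{\partial F(x_i)} \right]_{F = F_{m-1}}
    F_m(x) = F_{m-1}(x) + \eta \, h_m(x)

Here h_m is the regression tree fitted at step m to the pseudo-residuals r_{i,m}, so each new tree corrects the errors left by the ensemble built so far.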
16 | | - <para> |
17 | | - MART learns an ensemble of regression trees; a regression tree is a decision tree with scalar values in its leaves. |
18 | | - A decision (or regression) tree is a binary tree-like flow chart, where at each interior node one decides which of the two child nodes to continue to based on one of the feature values from the input. |
19 | | - At each leaf node, a value is returned. In the interior nodes, the decision is based on the test 'x &lt;= v' where x is the value of the feature in the input sample and v is one of the possible values of this feature. |
20 | | - The functions that can be produced by a regression tree are all the piece-wise constant functions. |
21 | | - </para> |
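The piece-wise constant behaviour described above follows directly from how a single tree is evaluated. A minimal C# sketch, with illustrative type and field names that are not part of the library:

    // Illustrative only: one node of a regression tree using the 'x <= v' split test.
    sealed class TreeNode
    {
        public int FeatureIndex;       // which feature x is tested at an interior node
        public double Threshold;       // the value v in the test 'x <= v'
        public TreeNode Left, Right;   // child nodes; both null at a leaf
        public double LeafValue;       // scalar value returned at a leaf

        public double Evaluate(double[] features)
        {
            if (Left == null && Right == null)
                return LeafValue;                       // leaf: constant output
            return features[FeatureIndex] <= Threshold  // interior: route to a child
                ? Left.Evaluate(features)
                : Right.Evaluate(features);
        }
    }

Because every input ends at exactly one leaf and each leaf holds a constant, the function computed by the tree is piece-wise constant.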
22 | | - <para> |
23 | | - The ensemble of trees is produced by computing, at each step, a regression tree that approximates the gradient of the loss function, and adding it to the previous ensemble with a coefficient that minimizes the loss of the new model. |
24 | | - The output of the ensemble produced by MART on a given instance is the sum of the tree outputs. |
25 | | - </para> |
26 | | - <list type='bullet'> |
27 | | - <item><description>In case of a binary classification problem, the output is converted to a probability by using some form of calibration.</description></item> |
28 | | - <item><description>In case of a regression problem, the output is the predicted value of the function.</description></item> |
29 | | - <item><description>In case of a ranking problem, the instances are ordered by the output value of the ensemble.</description></item> |
30 | | - </list> |
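A short sketch of the sum-of-trees output and the three interpretations listed above, reusing the illustrative TreeNode type from the earlier sketch; the sigmoid used for the probability is only a stand-in for "some form of calibration":

    using System;
    using System.Collections.Generic;

    static class EnsembleScoring
    {
        // MART output on an instance = sum of the tree outputs.
        public static double Score(IReadOnlyList<TreeNode> trees, double[] features)
        {
            double sum = 0;
            foreach (var tree in trees)
                sum += tree.Evaluate(features);
            return sum;
        }

        // Binary classification: map the raw score to a probability
        // (a plain sigmoid stands in for the calibration step).
        public static double Probability(double rawScore) => 1.0 / (1.0 + Math.Exp(-rawScore));

        // Regression: the raw score itself is the predicted value.
        // Ranking: instances are ordered by descending raw score.
    }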
31 | | - <para>For more information see:</para> |
32 | | - <list type="bullet"> |
33 | | - <item><description><a href='https://en.wikipedia.org/wiki/Gradient_boosting#Gradient_tree_boosting'>Wikipedia: Gradient boosting (Gradient tree boosting).</a></description></item> |
34 | | - <item><description><a href='https://projecteuclid.org/DPubS?service=UI&amp;version=1.0&amp;verb=Display&amp;handle=euclid.aos/1013203451'>Greedy function approximation: A gradient boosting machine.</a></description></item> |
35 | | - </list> |
36 | | - </remarks> |
37 | | - </member> |
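For context, training one of these FastTree-based trainers from ML.NET typically looks roughly like the following. The input schema (HousingData) is hypothetical, and the option names (numberOfLeaves, numberOfTrees, learningRate) are assumptions that should be checked against the Microsoft.ML.FastTree package in use:

    using Microsoft.ML;
    using Microsoft.ML.Data;

    // Hypothetical input schema, for illustration only.
    class HousingData
    {
        [LoadColumn(0)] public float Label;
        [LoadColumn(1)] public float Size;
        [LoadColumn(2)] public float Rooms;
    }

    class Program
    {
        static void Main()
        {
            var mlContext = new MLContext();
            IDataView trainingData = mlContext.Data.LoadFromTextFile<HousingData>("train.tsv", hasHeader: true);

            // Concatenate the input columns into a single feature vector, then train.
            var pipeline = mlContext.Transforms.Concatenate("Features", "Size", "Rooms")
                .Append(mlContext.Regression.Trainers.FastTree(
                    labelColumnName: "Label",
                    numberOfLeaves: 20,
                    numberOfTrees: 100,
                    learningRate: 0.2));

            var model = pipeline.Fit(trainingData);
        }
    }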
38 | | - |
39 | | - <!-- |
40 | | - The following text describes the FastForest algorithm details. |
41 | | - It's used for the remarks section of all FastForest-based trainers (regression) |
42 | | - --> |
43 | | - <member name="FastForest_remarks"> |
44 | | - <remarks> |
45 | | - Decision trees are non-parametric models that perform a sequence of simple tests on inputs. |
46 | | - This decision procedure maps an input to outputs that, in the training dataset, were associated with similar inputs. |
47 | | - At each node of the binary tree, a decision based on a measure of similarity routes the instance recursively through the branches of the tree until the appropriate leaf node is reached and its output decision is returned. |
48 | | - <para>Decision trees have several advantages:</para> |
49 | | - <list type='bullet'> |
50 | | - <item><description>They are efficient in both computation and memory usage during training and prediction. </description></item> |
51 | | - <item><description>They can represent non-linear decision boundaries.</description></item> |
52 | | - <item><description>They perform integrated feature selection and classification. </description></item> |
53 | | - <item><description>They are resilient in the presence of noisy features.</description></item> |
54 | | - </list> |
55 | | - <para>Fast forest is a random forest implementation. |
56 | | - The model consists of an ensemble of decision trees, and each tree outputs a Gaussian distribution as its prediction. |
57 | | - An aggregation is performed over the ensemble of trees to find a Gaussian distribution closest to the combined distribution for all trees in the model.</para> |
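One plausible reading of this aggregation is to moment-match a single Gaussian to the equal-weight mixture of the per-tree Gaussians. This is an illustrative sketch, not the library's exact procedure:

    using System.Collections.Generic;
    using System.Linq;

    // Illustrative only: combine per-tree (mean, variance) outputs into one Gaussian.
    static class ForestAggregation
    {
        public static (double Mean, double Variance) Aggregate(
            IReadOnlyList<(double Mean, double Variance)> treeOutputs)
        {
            // Mean of the mixture: average of the per-tree means.
            double mean = treeOutputs.Average(t => t.Mean);

            // Variance of the mixture: average within-tree variance
            // plus the spread of the per-tree means around the overall mean.
            double variance = treeOutputs.Average(t => t.Variance + t.Mean * t.Mean) - mean * mean;

            return (mean, variance);
        }
    }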
59 | | - <para>Generally, ensemble models provide better coverage and accuracy than single decision trees.</para> |
61 | | - <para>For more information see:</para> |
62 | | - <list type='bullet'> |
63 | | - <item><description><a href='https://en.wikipedia.org/wiki/Random_forest'>Wikipedia: Random forest</a></description></item> |
64 | | - <item><description><a href='http://jmlr.org/papers/volume7/meinshausen06a/meinshausen06a.pdf'>Quantile regression forest</a></description></item> |
65 | | - <item><description><a href='https://blogs.technet.microsoft.com/machinelearning/2014/09/10/from-stumps-to-trees-to-forests/'>From Stumps to Trees to Forests</a></description></item> |
66 | | - </list> |
67 | | - </remarks> |
68 | | - </member> |
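As with the FastTree sketch above, a fast forest trainer is created from the ML.NET regression catalog; the option names here are likewise assumptions to verify against Microsoft.ML.FastTree:

    using Microsoft.ML;

    var mlContext = new MLContext();

    // Rough usage sketch of the random-forest style trainer.
    var trainer = mlContext.Regression.Trainers.FastForest(
        labelColumnName: "Label",
        numberOfLeaves: 20,
        numberOfTrees: 100);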
69 | | - |
70 | | - <!-- |
71 | | - The following text describes the GAM algorithm details. |
72 | | - It's used for the remarks section of all GAM-based trainers (regression, binary classification) |
73 | | - --> |
74 | | - <member name="GAM_remarks"> |
75 | | - <remarks> |
76 | | - <para> |
77 | | - Generalized Additive Models, or GAMs, model the data as a sum of independent per-feature contributions, |
78 | | - similar to a linear model. For each feature, the GAM trainer learns a non-linear function, |
79 | | - called a "shape function", that computes the response as a function of the feature's value. |
80 | | - (In contrast, a linear model fits a linear response (e.g. a line) to each feature.) |
81 | | - To score an example, the outputs of all the shape functions are summed; the score is this total. |
82 | | - </para> |
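The additive scoring just described can be sketched as follows; the shape functions are represented here as plain delegates, which is an illustration rather than how the trained model stores them:

    using System;
    using System.Collections.Generic;

    // Illustrative only: a GAM score is an intercept plus one shape-function output per feature.
    static class GamScoring
    {
        public static double Score(
            double intercept,                                   // average prediction (see the next paragraph)
            IReadOnlyList<Func<double, double>> shapeFunctions, // one learned function per feature
            double[] features)
        {
            double score = intercept;
            for (int i = 0; i < shapeFunctions.Count; i++)
                score += shapeFunctions[i](features[i]);        // each feature contributes independently
            return score;
        }
    }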
83 | | - <para> |
84 | | - This GAM trainer is implemented using shallow gradient boosted trees (e.g. tree stumps) to learn nonparametric |
85 | | - shape functions, and is based on the method described in Lou, Caruana, and Gehrke. |
86 | | - <a href='http://www.cs.cornell.edu/~yinlou/papers/lou-kdd12.pdf'>"Intelligible Models for Classification and Regression."</a> KDD'12, Beijing, China. 2012. |
87 | | - After training, an intercept is added to represent the average prediction over the training set, |
88 | | - and the shape functions are normalized to represent the deviation from the average prediction. This results |
89 | | - in models that are easily interpreted simply by inspecting the intercept and the shape functions. |
90 | | - See the sample below for an example of how to train a GAM model and inspect and interpret the results. |
91 | | - </para> |
92 | | - </remarks> |
93 | | - </member> |
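The text above refers to a sample for training and inspecting a GAM model. As a rough stand-in, training and inspection from ML.NET might look like the following; the member names (Gam, Bias, GetBinUpperBounds, GetBinEffects) and option names are recalled from the Microsoft.ML.FastTree API and should be verified before use:

    using System;
    using Microsoft.ML;

    static class GamInspection
    {
        // trainingData is assumed to have a "Label" column and a "Features" vector column,
        // loaded as in the FastTree sketch above.
        public static void TrainAndInspect(MLContext mlContext, IDataView trainingData)
        {
            // Train a regression GAM (option names are assumptions to verify).
            var model = mlContext.Regression.Trainers.Gam(maximumBinCountPerFeature: 16)
                .Fit(trainingData);

            // The intercept (bias) represents the average prediction over the training set.
            var gam = model.Model;
            Console.WriteLine($"Intercept: {gam.Bias}");

            // Each shape function is stored as bin boundaries plus a per-bin
            // deviation from the average prediction.
            var binUpperBounds = gam.GetBinUpperBounds(0);  // bin boundaries for feature 0
            var binEffects = gam.GetBinEffects(0);          // shape-function value in each bin
        }
    }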
94 | | - |
95 | 4 | <member name="TreeEnsembleFeaturizerTransform"> |
96 | 5 | <summary> |
97 | 6 | Trains a tree ensemble, or loads it from a file, then maps a numeric feature vector to outputs. |
|